Quantilal control for finite MDPs — AI Alignment Forum