Some work on connecting UDT and Reinforcement Learning — AI Alignment Forum