Benign model-free RL — AI Alignment Forum