“Behaviorist” RL reward functions lead to scheming — AI Alignment Forum