Reward is the optimization target (of capabilities researchers) — AI Alignment Forum