Reward hacking behavior can generalize across tasks — AI Alignment Forum