Hackable Rewards as a Safety Valve? — AI Alignment Forum