Rigged reward learning — AI Alignment Forum