Cake or Death toy model for corrigibility — AI Alignment Forum