Value Learning for Irrational Toy Models — AI Alignment Forum