Full toy model for preference learning — AI Alignment Forum