"Moral" as a preference label — AI Alignment Forum