The Polarity Problem [Draft] — AI Alignment Forum