Why Do Some Language Models Fake Alignment While Others Don't? — AI Alignment Forum