[Interim research report] Evaluating the Goal-Directedness of Language Models — AI Alignment Forum