x
Experimentally evaluating whether honesty generalizes — AI Alignment Forum