Eliciting secret knowledge from language models — AI Alignment Forum