Transformer Circuit Faithfulness Metrics Are Not Robust — AI Alignment Forum