An Interpretability Illusion for Activation Patching of Arbitrary Subspaces — AI Alignment Forum