Subsets and quotients in interpretability — AI Alignment Forum