Conditions under which misaligned subagents can (not) arise in classifiers — AI Alignment Forum