Simple probes can catch sleeper agents — AI Alignment Forum