Latent Adversarial Training — AI Alignment Forum