The Self-Hating Attention Head: A Deep Dive in GPT-2 — AI Alignment Forum