Failure modes in a shard theory alignment plan — AI Alignment Forum