Goodhart in RL with KL: Appendix — AI Alignment Forum