Towards Developmental Interpretability — AI Alignment Forum