Safe exploration and corrigibility — AI Alignment Forum