Online Learning 2: Bandit learning with catastrophes — AI Alignment Forum