Counterfactually uninfluenceable agents — AI Alignment Forum