Predictors that don't try to manipulate you(?) — AI Alignment Forum