Making it harder for an AGI to "trick" us, with STVs — AI Alignment Forum