AI ALIGNMENT FORUM
AF

RLHFAI
Frontpage

17

[Link] Why I’m excited about AI-assisted human feedback

by janleike
6th Apr 2022
1 min read
0

17

RLHFAI
Frontpage
New Comment
Moderation Log
More from janleike
View more
Curated and popular this week
0Comments

This is a link post for https://aligned.substack.com/p/ai-assisted-human-feedback

I'm writing a sequence of posts on the approach to alignment I'm currently most excited about. This first post argues for recursive reward modeling and the problem it's meant to address (scaling RLHF to tasks that are hard to evaluate).