AI ALIGNMENT FORUM
AF

Value LearningWireheadingAI
Frontpage

11

Value extrapolation vs Wireheading

by Stuart_Armstrong
17th Jun 2022
1 min read
1

11

Value LearningWireheadingAI
Frontpage
New Comment
Moderation Log
More from Stuart_Armstrong
View more
Curated and popular this week
0Comments

Talk given by Rebecca Gorman and Stuart Armstrong at the CHAI 2022 Asilomar Conference. We present an example of AI wireheading (an AI taking over its own reward channel), and show how value extrapolation can be used to combat it.

https://www.youtube.com/watch?v=REUanSy0SgU

Mentioned in
27Benchmark for successful concept extrapolation/avoiding goal misgeneralization