Promoted to curated: These additions are really great, and they fill in a lot of the most confusing parts of the original Embedded Agency sequence, which was already one of my favorite pieces of content on all of LessWrong. So it seems fitting to curate this update to it, which improves it even further.
Abram Demski and Scott Garrabrant's "Embedded Agency" has been updated with quite a bit of new content from Abram. All the changes are live today and can be found at any of these links:
Abram says, "I'm excited about this new version because I feel like in a lot of cases, the old version gestured at an idea but didn't go far enough to really explain. The new version feels to me like it gives the real version of the problem in cases where the previous version didn't quite make it, and explains things more thoroughly."
This diff shows all the changes to the blog version. Changes include (in addition to many added or tweaked illustrations)...
Changes to "Decision Theory":
Changes to "Embedded World-Models":
Changes to "Robust Delegation":
Changes to "Subsystem Alignment":