Deconfusion

Written by Abram Demski, last updated 17th Mar 2021

Narrowly, deconfusion is a specific branch of AI alignment research, discussed in MIRI's 2018 research update. More broadly, the term applies to any domain. Quoting from the research update:

By deconfusion, I mean something like “making it so that you can think about a given topic without continuously accidentally spouting nonsense.”

Posts tagged Deconfusion

26 Looking Deeper at Deconfusion

Adam Shimi

44 Builder/Breaker for Deconfusion

Abram Demski

16 Traps of Formalization in Deconfusion

Adam Shimi

8 1. A Sense of Fairness: Deconfusing Ethics

Roger Dearnaley

42 Deconfusing Direct vs Amortised Optimization

Beren Millidge

35 Modelling Transformative AI Risks (MTAIR) Project: Introduction

David Manheim, Aryeh Englander

25 Applications for Deconfusing Goal-Directedness

Adam Shimi

21 Musings on general systems alignment

Alex Flint

15 Open problem: how can we quantify player alignment in 2x2 normal-form games?

Alex Turner, Vanessa Kosoy

12 A review of "Agents and Devices"

Adam Shimi

11 Goal-Directedness and Behavior, Redux

Adam Shimi

10 Approaches to gradient hacking

Adam Shimi

12 Alex Turner's Research, Comprehensive Information Gathering

Adam Shimi

10 Power-seeking for successive choices

Adam Shimi

137 Simulators

janus
