Eliezer Yudkowsky

GPTs are Predictors, not Imitators

(Related text posted to Twitter; this version is edited and has a more advanced final section.) Imagine yourself in a box, trying to predict the next word - assign as much probability mass to the next token as possible - for all the text on the Internet. Koan: Is this...

Apr 8, 2023 · 427

Eliezer Yudkowsky's Shortform

Apr 1, 2023 · 14

Alexander and Yudkowsky on AGI goals

This is a lightly edited transcript of a chatroom conversation between Scott Alexander and Eliezer Yudkowsky last year, following up on the Late 2021 MIRI Conversations. Questions discussed include "How hard is it to get the right goals into AGI systems?" and "In what contexts do AI systems exhibit 'consequentialism'?"....

Jan 24, 2023 · 179

A challenge for AGI organizations, and a challenge for readers

(Note: This post is a write-up by Rob of a point Eliezer wanted to broadcast. Nate helped with the editing, and endorses the post’s main points.) Eliezer Yudkowsky and Nate Soares (my co-workers) want to broadcast strong support for OpenAI’s recent decision to release a blog post ("Our approach to...

Dec 1, 2022 · 303

Let's See You Write That Corrigibility Tag

The top-rated comment on "AGI Ruin: A List of Lethalities" claims that many other people could've written a list like that. "Why didn't you challenge anybody else to write up a list like that, if you wanted to make a point of nobody else being able to write it?" I...

Jun 19, 2022 · 124

AGI Ruin: A List of Lethalities

Preamble: (If you're already familiar with all basics and don't want any preamble, skip ahead to Section B for technical difficulties of alignment proper.) I have several times failed to write up a well-organized list of reasons why AGI will kill you. People come in with different ideas about why...

Jun 5, 2022 · 977

Six Dimensions of Operational Adequacy in AGI Projects

Editor's note: The following is a lightly edited copy of a document written by Eliezer Yudkowsky in November 2017. Since this is a snapshot of Eliezer’s thinking at a specific time, we’ve sprinkled reminders throughout that this is from 2017. A background note: It’s often the case that people are...

May 30, 2022 · 323

AI ALIGNMENT FORUM