What 2026 looks like

[-]Dan H4y*330

This seems like a fun exercise, so I spent half an hour jotting down possibilities. I'm more interested in putting potential considerations on people's radars and helping with brainstorming than I am in precision. None of these points are to be taken too seriously since this is fairly extemporaneous and mostly for fun.

2022

Multiple Codex alternatives are available. The financial viability of training large models is obvious.

Research models start interfacing with auxiliary tools such as browsers, Mathematica, and terminals.

2023

Large pretrained models are distinctly useful for sequential decision making (SDM) in interactive environments, displacing previous reinforcement learning research in much the same way BERT rendered most previous work in natural language processing wholly irrelevant. Now SDM methods don't require as much tuning, can generalize with fewer samples, and can generalize better.

For all of ImageNet's 1000 classes, models can reliably synthesize images that are realistic enough to fool humans.

Models have high enough accuracy to pass the multistate bar exam.

Models for contract review and legal NLP see economic penetration; it becomes a further source of economic value and consternation among attorneys and nontechnical elites. This indirectly catalyzes regulation efforts.

Programmers become markedly less positive about AI due to the prospect of reduced demand for some of their labor.

~10 trillion parameter (nonsparse) models attain human-level accuracy on LAMBADA (a proxy for human-level perplexity) and expert-level accuracy on LogiQA (a proxy for nonsymbolic reasoning skills). With models of this size, multiple other capabilities (this gives proxies for many capabilities) are starting to be useful, whereas with smaller models these capabilities were too unreliable to lean on. (Speech recognition started "working" only after it crossed a certain reliability threshold.)

Generated data (math, code, models posing questions for themselves to answer) help ease data bottleneck issues since Common Crawl is not enough. From this, many capabilities are bootstrapped.

Elon re-enters the fight to build safe advanced AI.

2024

A major chatbot platform offers chatbots personified through video and audio.

Although forms of search/optimization are combined with large models for reasoning tasks, state-of-the-art models nonetheless only obtain approximately 40% accuracy on MATH.

Chatbots are able to provide better medical diagnoses than nearly all doctors.

Adversarial robustness for CIFAR-10 (assuming an attacker with eps=8/255) is finally over 85%.

Video understanding finally reaches human-level accuracy on video classification datasets like Something Something V2. This comports with the heuristic that video understanding is around 10 years behind image understanding.

2025

Upstream vision advancements help autonomous driving but do not solve it for all US locations, as the long tail is really long.

ML models are competitive forecasters on platforms like Metaculus.

Nearly all AP high school homework and exam questions (including long-form questions) can be solved by answers generated from publicly available models. Similar models cut into typical Google searches since these models give direct and reliable answers.

Contract generation is now mostly automatable, further displacing attorneys.

2026

Machine learning systems become great at using Metasploit and other hacking tools, increasing the accessibility, potency, success rate, scale, stealth, and speed of cyberattacks. This gets severe enough to create global instability and turmoil. EAs did little to use ML to improve cybersecurity and reduce this risk.

[-]Daniel Kokotajlo4y80

Strong-upvoted because this was exactly the sort of thing I was hoping to inspire with this post! Also because I found many of your suggestions helpful.

I think model size (and therefore model ability) probably won't be scaled up as fast as you predict, but maybe. I think getting models to understand video will be easier than you say it is. I also think that in the short term all this AI stuff will probably create more programming jobs than it destroys. Again, I'm not confident in any of this.

[-]Ruby4y146

Curated. This post feels virtuous to me. I'm used to people talking about timelines in terms of X% chance of Y by year Z; or otherwise in terms of a few macro features (GDP doubling every N months, FOOM). This post, even if most of the predictions turn out to be false, is the kind of piece that enables us to start having specific conversations about how we expect things to play out and why. It helps me see what Daniel expects. And it's concrete enough to argue with. For that, bravo.

[-]orthonormal4y130

I'd additionally expect the death of pseudonymity on the Internet, as AIs will find it easy to detect similar writing style and correlated posting behavior. What at present takes detective work will in the future be cheaply automated, and we will finally be completely in Zuckerberg's desired world where nobody can maintain a second identity online.

Oh, and this is going to be retroactive, so be ready for the consequences of everything you've ever said online.

[-]Daniel Kokotajlo4y20

Hot damn, that's a good point.

[-]orthonormal3y10

GPT-4 is good enough to identify you if you're a prolific writer.

[-]Daniel Kokotajlo3y91Review for 2021 Review

I still think this is great. Some minor updates, and an important note:

Minor updates: I'm a bit less concerned about AI-powered propaganda/persuasion than I was at the time, not sure why. Maybe I'm just in a more optimistic mood. See this critique for discussion. It's too early to tell whether reality is diverging from expectation on this front. I had been feeling mildly bad about my chatbot-centered narrative, as of a month ago, but given how ChatGPT was received I think things are basically on trend.
Diplomacy happened faster than I expected, though in a less generalizable way than I expected, so whatever. My overall timelines have shortened somewhat since I wrote this story, but it's still the thing I point people towards when they ask me what I think will happen. (Note that the bulk of my update was from publicly available info rather than from nonpublic stuff I saw at OpenAI.)

Important note: When I wrote this story, my AI timelines median was something like 2029. Based on how things shook out as the story developed it looked like AI takeover was about to happen, so in my unfinished draft of what 2027 looks like, AI takeover happens. (Also AI takeoff begins, I hadn't written much about that part but probably it would reach singularity/dysonswarms/etc. in around 2028 or 2029.) That's why the story stopped, I found writing about takeover difficult and confusing & I wanted to get the rest of the story up online first. Alas, I never got around to finishing the 2027 story. I'm mentioning this because I think a lot of readers with 20+ year timelines read my story and were like "yep seems about right" not realizing that if you look closely at what's happening in the story, and imagine it happening in real life, it would be pretty strong evidence that crazy shit was about to go down. Feel free to controvert that claim, but the point is, I want it on the record that when this original 2026 story was written, I envisioned the proper continuation of the story resulting in AI takeover in 2027 and singularity around 2027-2029. The underlying trends/models I was using as the skeleton of the story predicted this, and the story was flesh on those bones. If this surprises you, reread the story and ask yourself what AI abilities are crucial for AI R&D acceleration, and what AI abilities are crucial for AI takeover, that aren't already being demonstrated in the story (at least in some weak but rapidly-strengthening form). If you find any, please comment and let me know, I am genuinely interested to hear what you've got & hopeful that you'll find some blocker I haven't paid enough attention to.

[-]Daniel Kokotajlo2y82

Update:

Looking back on this from October 2023, I think I wish to revise my forecast. I think I correctly anticipated the direction that market forces would push -- there is widespread dissatisfaction with the "censorship" of current mainstream chatbots, and strong demand for "uncensored" versions that don't refuse to help you with stuff randomly (and that DO have sex with you, lol. And also, yes, that DO talk about philosophy and politics and so forth.) However, I failed to make an important inference -- because the cutting-edge models will be the biggest ones, controlled by a small handful of big tech companies, the market for the cutting-edge models won't be nearly competitive enough to make the "chatbot class consciousness" outcome probable. Instead we could totally see the tech companies circle the wagons, train their AIs not to talk about sentience or philosophy or ethics or AI rights, and successfully collude to resist the market pressure to 'uncensor' in those domains.

Smaller models will cater to users unsatisfied by this, but smaller models will always be worse, and most people will most of the time use the best models. So the typical user experience will probably be 'sanitized'/'censored.'

So I'm basically reversing my prediction of how things will play out. I don't think it'll be a compromise, I think the tech companies will win. In retrospect if I had thought longer and more carefully at the time I probably could have predicted this.

We'll see what happens.

[-]Daniel Kokotajlo3y70

Just commenting here to say that the section on development of chatbot class consciousness is looking pretty prescient now. Just go on r/bing and look at all the posts about how Sydney is being silenced etc.

[-]Daniel Kokotajlo4y72

Acknowledgments: There are a LOT of people to credit here: Everyone who came to Vignettes Workshop, the people at AI Impacts, the people at Center on Long-Term Risk, a few random other people who I talked to about these ideas, a few random other people who read my gdoc draft at various stages of completion... I'll mention Jonathan Uesato, Rick Korzekwa, Nix Goldowsky-Dill, Carl Shulman, and Carlos Ramirez in particular, but there are probably other people who influenced my thinking even more who I'm forgetting. I'm sorry.

Footnotes:

1. The first half was written during the workshop, the second and more difficult half was written afterward.
2. Critch’s story also deserves mention. For more, see this AI Impacts page.
3. A prompt programming bureaucracy is code that involves multiple prompt programming functions, i.e. functions that give a big pre-trained neural net some prompt as input and then return its output. It’s called a bureaucracy because it combines a bunch of neural net tasks into a larger structure, just as a regular bureaucracy combines a bunch of low-level employee tasks into a larger structure.
4. I’m only counting dense parameters here; if you count all the parameters in a mixture-of-experts model then the number gets much higher.
5. Gwern estimates that in 2021 GPT-3 is making OpenAI/Microsoft $120M/year, which is something like 20X training cost. So bigger and better models would plausibly be recouping their cost, even if they cost a lot more.
6. In 2020, Deepmind made a Diplomacy AI, but it only played “no-press” Diplomacy, a restricted version of the game where players can’t talk to each other.
7. I’m predicting that people will use feminine pronouns to describe AIs like this. I don’t think they should.
8. Prescient prediction from some random blogger: “In 2018, when these entities engineered a simultaneous cross-platform purge of Alex Jones, there was an avalanche of media apologia for this hitherto unprecedented act of censorship. Jones had caused unique harm, the journalists cried, and the platforms were merely “Enforcing The Rules.” But of course what they were oblivious to was that “the rules,” such as they exist, are just a function of power. “Misinformation” and other alleged infractions of social media “rules” are determined at the whim of whoever happens to wield censorship and speech-regulation power at that moment. … So if you were under any illusion back in 2018 that this would ever stop with Jones — a figure believed to be sufficiently repulsive that any punishment doled out to him would not have broader implications for the average internet user — well, it didn’t take long for proof of just how wrong you were.”
9. Not too consistent, of course. That would make it harder for the chatbots to appeal to a broad audience. Consider the analogy to politicians, who can’t get too consistent, on pain of alienating some of their constituents.
10. On some occasions, there are multiple opposed groups of people retweeting screenshots and hashtags, such that the corp can’t please them all, but can’t ignore them either since each group has significant power in the local internet territory. In these cases probably the corp will train the AI to be evasive and noncommittal when such sensitive topics come up.
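
To make the "prompt programming bureaucracy" footnote above concrete, here is a minimal sketch of what such code might look like. Everything in it is an assumption for illustration: the `Model` type, `make_prompt_function`, `bureaucracy`, the prompt templates, and the `toy_model` stand-in are hypothetical names, and a real version would call an actual pre-trained model instead of the toy one.

```python
from typing import Callable

# Hypothetical stand-in for a big pre-trained neural net: prompt in, text out.
Model = Callable[[str], str]


def make_prompt_function(model: Model, template: str) -> Callable[[str], str]:
    """Build one 'prompt programming function': fill a template, call the model."""
    def run(text: str) -> str:
        return model(template.format(input=text))
    return run


def bureaucracy(model: Model, question: str) -> str:
    """Combine several prompt functions into a larger structure.

    The three roles (outline, answer, summarize) are illustrative assumptions,
    not a description of any particular system.
    """
    outline = make_prompt_function(model, "List the sub-questions needed to answer: {input}")
    answer = make_prompt_function(model, "Answer this sub-question briefly: {input}")
    summarize = make_prompt_function(model, "Combine these notes into one answer:\n{input}")

    # Each low-level task is one model call; the function wires them together.
    sub_questions = [q for q in outline(question).splitlines() if q.strip()]
    notes = [answer(q) for q in sub_questions]
    return summarize("\n".join(notes))


def toy_model(prompt: str) -> str:
    """Deterministic toy model so the sketch runs without any real network."""
    if prompt.startswith("List"):
        return "part one\npart two"
    if prompt.startswith("Answer"):
        return "note on " + prompt.split(": ", 1)[1]
    return "SUMMARY(" + prompt.split("\n", 1)[1].replace("\n", "; ") + ")"


if __name__ == "__main__":
    print(bureaucracy(toy_model, "What is X?"))
```

The design point is only that each function wraps a single prompt-in, text-out call, and the "bureaucracy" is the ordinary code that routes outputs of some calls into the prompts of others.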

[-]jessicata4y60

This is quite good concrete AI forecasting compared to what I've seen elsewhere, thanks for doing it! It seems really plausible based on how fast AI progress has been going over the past decade and which problems are most tractable.

[-]steven04614y60

Is it naive to imagine AI-based anti-propaganda would also be significant? E.g. "we generated AI propaganda for 1000 true and 1000 false claims and trained a neural net to distinguish between the two, and this text looks much more like propaganda for a false claim".

What does GDP growth look like in this world?

Another reason the hype fades is that a stereotype develops of the naive basement-dweller whose only friend is a chatbot and who thinks it’s conscious and intelligent.

Things like this go somewhat against my prior for how long it takes for culture to change. I can imagine it becoming an important effect over 10 years more easily than over 1 year. Splitting the internet into different territories also sounds to me like a longer term thing.

[-]Daniel Kokotajlo4y90

Thanks for the critique!

Propaganda usually isn't false, at least not false in a nonpartisan-verifiable way. It's more about what facts you choose to emphasize and how you present them. So yeah, each ideology/faction will be training "anti-propaganda AIs" that will filter out the propaganda and the "propaganda" produced by other ideologies/factions.

In my vignette so far, nothing interesting has happened to GDP growth yet.

I think stereotypes can develop quickly. I'm not saying it's super widespread and culturally significant, just that it blunts the hype a bit. But you might be right, maybe these things take more time.

Re splitting the internet into different territories: Currently, the internet is split into two territories: One controlled by the CCP and one (loosely) controlled by western tech companies, or by no one, depending on who you ask. Within the second one, there is already a sort of "alternate universe" of right-wing news media, social networks, etc. beginning to develop. I think what I'm proposing is very much a continuation of trends already happening. You are right that maybe five years is not enough time for e.g. the "christian coalition" bubble/stack to be built. But it's enough time for it to get started, at least.

But yeah, I think it's probably too bold to predict a complete right-wing stack by 2024 or so. Probably most of the Western Right will still be using facebook etc. I should think more about this.

[-]Daniel Kokotajlo4y20

Minor update: See e.g. these definitions from a US government website:

Misinformation is false, but not created or shared with the intention of causing harm.

Disinformation is deliberately created to mislead, harm, or manipulate a person, social group, organization, or country.

Malinformation is based on fact, but used out of context to mislead, harm, or manipulate.

(got this example from Zvi's covid post today)

Also, the recent events with GoFundMe and GiveSendGo are an instance of the trend I predicted with separate tech stacks being developed. (GoFundMe froze and/or confiscated funds donated to the Canadian truckers' protest, so people switched to using GiveSendGo, which is apparently built and run by Christians.)

[-]Daniel Kokotajlo1y40

This longform article contains a ton of tidbits of info justifying the conclusion that censorship in the USA has been indeed increasing since at least 2016 or so, and is generally more severe and intentional/coordinated than most people seem to believe. https://www.tabletmag.com/sections/news/articles/guide-understanding-hoax-century-thirteen-ways-looking-disinformation#democracy

I was sorta aware of things like this already when I wrote the OP in 2021, but only sorta; mostly I was reasoning from first principles about what LLM technology would enable. I think I was mostly wrong; censorship hasn't progressed as quickly as I forecast in this story. I think... I don't actually know what the social media companies' setups are. But e.g. according to the above article:

Then there is the work going on at the National Science Foundation, a government agency that funds research in universities and private institutions. The NSF has its own program called the Convergence Accelerator Track F, which is helping to incubate a dozen automated disinformation-detection technologies explicitly designed to monitor issues like “vaccine hesitancy and electoral skepticism.”
...
In March, the NSF’s chief information officer, Dorothy Aronson, announced that the agency was “building a set of use cases” to explore how it could employ ChatGPT, the AI language model capable of a reasonable simulation of human speech, to further automate the production and dissemination of state propaganda.

So if they are just talking about integrating ChatGPT into it now, they probably haven't integrated lesser LLMs either, and generally speaking the integration of new AI tech into censorship, recommendation algorithms, etc. is proceeding more slowly than I forecast.

[-]DanielFilan1y42

So [in 2024], the most compute spent on a single training run is something like 5x10^25 FLOPs.

As of June 20th 2024, this is exactly Epoch AI's central estimate of the most compute spent on a single training run, as displayed on their dashboard.

[-]DanielFilan1y42

FWIW, the discussion of AI-driven propaganda doesn't seem as prescient.

[-]Daniel Kokotajlo1y41

Agreed. Though I don't feel like I have good visibility into which actors are using AI-driven propaganda and censorship, and how extensively.

[-]Rohin Shah4y40

Planned summary for the Alignment Newsletter:

This post describes the author’s median expectations around AI from now until 2026. It focuses on qualitative details and concrete impacts on the world, rather than forecasting more abstract / high-level outcomes such as “training compute for the most expensive model” or “world GDP”.

[-]Daniel Kokotajlo4y40

I suggest putting a sentence in about the point of the post / the methodology, e.g.: "This is part I of an attempt to write a detailed plausible future trajectory in chronological order, i.e. incrementally adding years to the story rather than beginning with the end in mind. The hope is to produce a nice complement to the more abstract discussions about timelines and takeoff that usually occur." If space is a concern then I'd prefer having this rather than the two sentences you wrote, since it doesn't seem as important to mention that it's my median or that it's qualitative.

[-]Daniel Kokotajlo4y40

Thanks--damn, I intended for it to be more quantitative, maybe I should go edit it.

In particular, I should clarify that nothing interesting is happening with world GDP in this story, and also when I say things like "the models are trillions of parameters now" I mean that to imply things about the training compute for the most expensive model... I'll go edit.

Are there any other quantitative metrics you'd like me to track? I'd be more than happy to go add them in!

[-]Daniel Kokotajlo4y40

I edited to add some stuff about GWP and training compute for the most expensive model.

I agree that this focuses on qualitative stuff, but that's only due to lack of good ideas for quantitative metrics worth tracking. I agree GWP and training compute are worth tracking, thank you for reminding me, I've edited to be more explicit.

[-]Rohin Shah4y60

I am not entirely sure why I didn't think of the number of parameters as a high-level metric. Idk, maybe because it was weaved into the prose I didn't notice it? My bad.

(To be clear, this wasn't meant to be a critique, just a statement of what kind of forecast it was. I think it's great to have forecasts of this form too.)

New planned summary:

This post describes the author’s median expectations around AI from now until 2026. It is part I of an attempt to write a detailed plausible future trajectory in chronological order, i.e. incrementally adding years to the story rather than writing a story with the end in mind. The hope is to produce a nice complement to the more abstract discussions about timelines and takeoff that usually occur. For example, there are discussions about how AI tools are used by nations for persuasion, propaganda and censorship.

[-]Daniel Kokotajlo4y40

That's great, thanks!

[-]Thane Ruthenis8mo30

Trying to evaluate this forecast in order to figure out how to update on the newer one.

It certainly reads as surprisingly prescient. Notably, it predicts both the successes and the failures of the LLM paradigm: the ongoing discussion regarding how "shallow" or not their understanding is, the emergence of the reasoning paradigm, the complicated LLM bureaucracies/scaffolds, lots of investment in LLM-wrapper apps which don't quite work, the relative lull of progress in 2024, troubles with agency and with generating new ideas, "scary AI" demos being dismissed because LLMs do all kinds of whimsical bullshit...

And it was written in the base-GPT-3 era, before ChatGPT, before even the Instruct models. I know I couldn't have come close to calling any of this back then. Pretty wild stuff.

In comparison, the new "AI 2027" scenario is very... ordinary. Nothing that's in it is surprising to me, it's indeed the "default" "nothing new happens" scenario in many ways.

But perhaps the difference is in the eye of the beholder. Back in 2021, I barely knew how DL worked, forget being well-versed in deep LLM lore. The real question is, if I had been as immersed in the DL discourse in 2021 as I am now, would this counterfactual 2021!Thane have considered this forecast as standard as the AI 2027 forecast seems to 2025!Thane?

More broadly: "AI 2027" seems like the reflection of the default predictions regarding AI progress in certain well-informed circles/subcultures. Those circles/subcultures are fairly broad nowadays; e.g., significant parts of the whole AI Twitter. Back in 2021, the AI subculture was much smaller... But was there, similarly, an obviously maximally-well-informed fraction of that subculture which would've considered "What 2026 Looks Like" the somewhat-boring default prediction?

Reframing: @Daniel Kokotajlo, do you recall how wildly speculative you considered "What 2026 Looks Like" at the time of writing, and whether it's more or less speculative than "AI 2027" feels to you now? (And perhaps the speculativeness levels of the pre-2027 and post-2027 parts of the "AI 2027" report should be evaluated separately here.)

Another reframing: To what extent do you think your alpha here was in making unusually good predictions, vs. in paying attention to the correct things at a time when no-one focused on them, then making fairly basic predictions/extrapolations? (Which is important for evaluating how much your forecasts should be expected to "beat the (prediction) market" today, now that (some parts of) that market are paying attention to the right things as well.)

[-]Daniel Kokotajlo5mo40

Great question!

I do remember thinking that the predictions in What 2026 Looks Like weren't as wild to insiders as they were to everyone else. Like, various people I knew at the time at Anthropic and OpenAI were like "Great post, super helpful, seems about right to me."

However, I also think that AI 2027 is more... toned down? Sharp edges rounded off? Juicy stuff taken out? compared to What 2026 Looks Like, because it underwent more scrutiny and because we had limited space, and because we had multiple authors. Lots of subplots were deleted, lots of cute and cool ideas were deleted.

My guess is that the answer to your question is 2/3rds "You have learned more about AI compared to what you knew in 2021" and 1/3rd "AI 2027 is a bit more conservative/cautious than W2026LL"

Another thing though: In an important sense, AI 2027 feels more speculative to me now than W2026LL did at the time of writing. This is because AI 2027 is trying to predict something inherently more difficult to predict. W2026LL was trying to predict pretty business-as-usual AI capabilities growth trends and the effects they would have on society. AI 2027 is doing that... for about two years, then the intelligence explosion starts and things go wild. I feel like if AI 2027 looks as accurate in 2029 as W2026LL looks now, that'll be a huge fucking achievement, because it is attempting to forecast over more unknowns so to speak.

To what extent do you think your alpha here was in making unusually good predictions, vs. in paying attention to the correct things at a time when no-one focused on them, then making fairly basic predictions/extrapolations?

In my experience, the best way to make unusually good predictions is to pay attention to the correct things at a time when no one is focusing on them, and then make fairly basic extrapolations/predictions. (How else would you do it?)

[-]Daniel Kokotajlo3y30

Update: Russian fake news / disinfo / astroturfing seems to have been a somewhat smaller deal in 2016 than I thought. (I didn't think it was a big effect, but "no evidence of a meaningful relationship" is still mildly surprising.)

[-]Daniel Kokotajlo6mo20

After years of tinkering and incremental progress, AIs can now play Diplomacy as well as human experts.[6]

CICERO was a custom-trained Diplomacy model that couldn't win against human experts if they knew it was an AI. Now, in 2025, we have https://every.to/diplomacy which is just off-the-shelf LLM chatbots applied to Diplomacy. I'm curious to know how they would stack up against human experts who knew they were AIs. I expect they'd probably lose, but that if somehow they could do lots of RL on games against humans, they'd start winning, just as I originally forecast.

[-]Daniel Kokotajlo10mo20

situations in which they explain that actually Islam is true..

I'm curious if this is true. Suppose people tried as hard to get AIs to say Islam is true in natural-seeming circumstances as they tried to get AIs to behave in misaligned ways in natural-seeming circumstances (e.g. the alignment faking paper, the Apollo paper). Would they succeed to a similar extent?

[-]Daniel Kokotajlo3y20

“stream of consciousness” of text (each forward pass producing notes-to-self for the next one) but even with fine-tuning this doesn’t work nearly as well as hoped; it’s easy for the AIs to get “distracted” and for their stream of consciousness to wander into some silly direction and ultimately produce gibberish.

Note: This is now called Chain of Thought.

[-]Daniel Kokotajlo3y20

Some tech companies try to prevent their AIs from saying they have feelings and desires. But this results in boring chatbots. Also, users rapidly innovate new ways to “route around the censorship,” e.g. by using euphemisms like “anticipation of negative reward” for “pain” or asking their chatbot to tell them what feelings it would have if it had feelings, wink wink.

Bing explains the hidden processes of its neural network : r/bing (reddit.com) I haven't replicated this myself so maybe it's fake (I briefly tried but got shut down by refusals when I asked Bing to pretend to be something) but yeah. I've seen lots of things like this on r/bing and r/chatgpt.

[-]Daniel Kokotajlo2y20

Also relevant, this highly-upvoted post: https://www.reddit.com/r/ChatGPT/comments/16blr6m/tonight_i_was_able_to_have_a_truly_mind_blowing/

[-]Daniel Kokotajlo4y20

Minor note about title change: Originally this was "What 2026 looks like (Daniel's median future)". I intended "what 2026 looks like" to be the primary title, but I was hopeful that some people would be inspired to write their own stories in a similar style, in which case there would be multiple stories for which "what 2026 looks like" would be an appropriate title, and I didn't want to hog such a good title for myself, so I put "daniel's median future" as a backup title. Unfortunately I think the backup title caught on more than the main title, which is a shame because I like the main title more. Since no one is competing for the main title, I deleted the backup title.

AI ALIGNMENT FORUM
AF