x

AI ALIGNMENT FORUM

AF

Jesse Hoogland — AI Alignment Forum

Jesse Hoogland

Top postsTop post

Jesse Hoogland

Message

Cofounder at Sequent Research. Previously, executive director at Timaeus, where I worked on applications of singular learning theory and developmental interpretability.

Website: jessehoogland.com

Twitter: @jesse_hoogland

3395

Ω

877

28

91

6y

Jesse Hoogland

Cofounder at Sequent Research. Previously, executive director at Timaeus, where I worked on applications of singular learning theory and developmental interpretability.

Website: jessehoogland.com

Twitter: @jesse_hoogland

Top postsTop post

Sequent: scale and automation for higher confidence in alignment

Alignment is not on track Artificial superintelligence (ASI) may be developed in the next few years. It is unclear whether alignment is on track to be ready on the same timeframe. At a minimum, the empirical programs at AI labs are unlikely to deliver a priori confidence, before training ASI, that things will go well. We are starting a large nonprofit research organization, Sequent, that aims to clear a higher bar: 1. We are aiming at higher confidence via a portfolio of theory and empirics bets, all of which could fail, such that if any succeed, they would give us more a priori confidence in aligned outcomes. 2. We are investing heavily in automation to accelerate progress on these bets. 3. We believe that theory unlocks higher automation. Taking a more principled approach offers better filters for deciding which directions of automated research are promising (a proof is worth a thousand experiments, and even a pseudo-proof is worth hundreds). Who[1]: researchers from the UK AISI’s Alignment Team and Timaeus, with more to come. We’re aiming at 40-80 FTE two years from now. The Alignment Team ran the £30m Alignment Project, and Timaeus has pioneered applying singular learning theory (SLT) to alignment. Founding team: * Geoffrey Irving — Chief Scientist at UK AISI; ex-DeepMind, OpenAI, and Google Brain. * Daniel Murfet — Head of Research at Timaeus; left tenure to pioneer SLT for alignment. * AISI Alignment — Alex Holness-Tofts and Jacob Pfau. * Timaeus — Jesse Hoogland, Stan van Wingerden, and Marco Cozzi. * Joined by researchers from Timaeus and more researchers from the UK AISI’s Alignment Team Where: a large in-person presence in the Bay Area (Berkeley), as well as researchers working remotely from London, Melbourne, and elsewhere. In this post, we discuss: * What it means to aim at higher confidence * Why start a new big organization * Whether sufficiently fast progress is possible with automated research Aiming at higher confidence In an ide

Neural networks generalize because of this one weird trick

215Jan 18, 2023

Towards Developmental Interpretability

195Jul 12, 2023

Announcing Timaeus

188Oct 22, 2023

Sequent: scale and automation for higher confidence in alignment

by Geoffrey Irving, Alex HT, Jesse Hoogland, Daniel Murfet, Jacob Pfau, Marco Cozzi, and Stan van Wingerden

Alignment is not on track Artificial superintelligence (ASI) may be developed in the next few years. It is unclear whether alignment is on track to be ready on the same timeframe. At a minimum, the empirical programs at AI labs are unlikely to deliver a priori confidence, before training ASI,...

SLT for AI Safety

> This sequence draws from a position paper co-written with Simon Pepin Lehalleur, Jesse Hoogland, Matthew Farrugia-Roberts, Susan Wei, Alexander Gietelink Oldenziel, Stan van Wingerden, George Wang, Zach Furman, Liam Carroll, Daniel Murfet. Thank you to Stan, Dan, and Simon for providing feedback on this post. Alignment ⊆ Capabilities. As...

Jul 1, 2025•78

The Sweet Lesson: AI Safety Should Scale With Compute

A corollary of Sutton's Bitter Lesson is that solutions to AI safety should scale with compute.[1] Let's consider a few examples of research directions that are aiming at this property: * Deliberative Alignment: Combine chain-of-thought with Constitutional AI to improve safety with inference-time compute (see Guan et al. 2025, Figure...

May 5, 2025•98

Timaeus in 2024

> TLDR: We made substantial progress in 2024: > > * We published a series of papers that verify key predictions of Singular Learning Theory (SLT) [1, 2, 3, 4, 5, 6]. > * We scaled key SLT-derived techniques to models with billions of parameters, eliminating our main concerns around...

Feb 20, 2025•100

Timaeus is hiring researchers & engineers

TLDR: We're hiring for research & engineering roles across different levels of seniority. Hires will work on applications of singular learning theory to alignment, including developmental interpretability. About Us Timaeus' mission is to empower humanity by making breakthrough scientific progress on alignment. Our research focuses on applications of singular learning...

Jan 17, 2025•65

Building AI Research Fleets

by Ben Goldhaber and Jesse Hoogland

From AI scientist to AI research fleet Research automation is here (1, 2, 3). We saw it coming and planned ahead, which puts us ahead of most (4, 5, 6). But that foresight also comes with a set of outdated expectations that are holding us back. In particular, research automation...

Jan 12, 2025•132

o1: A Technical Primer

> TL;DR: In September 2024, OpenAI released o1, its first "reasoning model". This model exhibits remarkable test-time scaling laws, which complete a missing piece of the Bitter Lesson and open up a new axis for scaling compute. Following Rush and Ritter (2024) and Brown (2024a, 2024b), I explore four hypotheses...

Dec 9, 2024•175

Load More (7/19)