AI ALIGNMENT FORUM

AXRP

Written by Multicore, DanielFilan, et al. Last updated 30th Dec 2024.

AXRP, the AI X-risk Research Podcast, is a podcast hosted by Daniel Filan.

See also: Audio, Interviews

Posts tagged AXRP

AXRP Episode 31 - Singular Learning Theory with Daniel Murfet (37 karma, 1y, 0 comments)
AXRP Episode 27 - AI Control with Buck Shlegeris and Ryan Greenblatt (38 karma, 1y, 6 comments)
AXRP Episode 24 - Superalignment with Jan Leike (19 karma, 2y, 3 comments)
AXRP Episode 22 - Shard Theory with Quintin Pope (30 karma, 2y, 4 comments)
AXRP Episode 19 - Mechanistic Interpretability with Neel Nanda (25 karma, 2y, 0 comments)
AXRP Episode 25 - Cooperative AI with Caspar Oesterheld (17 karma, 2y, 0 comments)
AXRP Episode 39 - Evan Hubinger on Model Organisms of Misalignment (25 karma, 4mo, 0 comments)
AXRP Episode 33 - RLHF Problems with Scott Emmons (20 karma, 10mo, 0 comments)
AXRP Episode 38.2 - Jesse Hoogland on Singular Learning Theory (20 karma, 4mo, 0 comments)
AXRP Episode 15 - Natural Abstractions with John Wentworth (18 karma, 3y, 0 comments)
AXRP Episode 14 - Infra-Bayesian Physicalism with Vanessa Kosoy (15 karma, 3y, 10 comments)
AXRP Episode 13 - First Principles of AGI Safety with Richard Ngo (15 karma, 3y, 1 comment)
AXRP Episode 36 - Adam Shai and Paul Riechers on Computational Mechanics (14 karma, 6mo, 0 comments)
AXRP Episode 30 - AI Security with Jeffrey Ladish (11 karma, 1y, 0 comments)
AXRP Episode 34 - AI Evaluations with Beth Barnes (13 karma, 9mo, 0 comments)