AI ALIGNMENT FORUM
Nominated Posts for the 2019 Review
Posts need at least 2 nominations to continue into the Review Phase.
Nominate posts that you have personally found useful and important.
Sort by: fewest nominations
51
The Commitment Races problem
Daniel Kokotajlo
5y
36
1
•
0
20
How common is it for one entity to have a 3+ year technological lead on its nearest competitor?
Q
Daniel Kokotajlo
5y
Q
1
1
•
0
27
AGI will drastically increase economies of scale
Wei Dai
6y
15
1
•
0
25
Misconceptions about continuous takeoff
Matthew Barnett
5y
12
1
•
0
28
But exactly how complex and fragile?
KatjaGrace
5y
30
2
•
1
34
The strategy-stealing assumption
Paul Christiano
5y
45
2
•
3
29
Reframing Impact
Alex Turner
5y
4
2
•
1
53
Gradient hacking
Evan Hubinger
5y
33
2
•
2
38
Reframing Superintelligence: Comprehensive AI Services as General Intelligence
Rohin Shah
6y
29
2
•
2
37
The Credit Assignment Problem
Abram Demski
5y
32
2
•
1
39
Soft takeoff can still lead to decisive strategic advantage
Daniel Kokotajlo
5y
34
2
•
4
50
Thoughts on Human Models
Ramana Kumar, Scott Garrabrant
6y
9
2
•
1
50
AI Safety "Success Stories"
Wei Dai
5y
11
2
•
1
55
Utility ≠ Reward
Vladimir Mikulik
5y
16
2
•
2
59
Selection vs Control
Abram Demski
6y
14
2
•
2
55
Seeking Power is Often Convergently Instrumental in MDPs
Alex Turner, Logan Riggs Smith
5y
34
2
•
2
103
What failure looks like
Paul Christiano
6y
28
2
•
2
27
Strategic implications of AIs' ability to coordinate at low cost, for example by merging
Wei Dai
6y
22
2
•
1
30
Six AI Risk/Strategy Ideas
Wei Dai
5y
14
2
•
1
26
Classifying specification problems as variants of Goodhart's Law
Victoria Krakovna
5y
5
2
•
1
66
Chris Olah’s views on AGI safety
Evan Hubinger
5y
30
2
•
2
63
Debate on Instrumental Convergence between LeCun, Russell, Bengio, Zador, and More
Ben Pace
5y
17
2
•
2
56
Alignment Research Field Guide
Abram Demski
6y
6
2
•
2
47
Why Subagents?
johnswentworth
5y
12
2
•
1
57
Evolution of Modularity
johnswentworth
5y
6
2
•
1
58
Risks from Learned Optimization: Introduction
Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, Scott Garrabrant
6y
33
3
•
3
53
Understanding “Deep Double Descent”
Evan Hubinger
5y
35
3
•
4
90
The Parable of Predict-O-Matic
Abram Demski
5y
16
5
•
4
2019 Review Discussion