AI ALIGNMENT FORUM
CAST: Corrigibility As Singular Target
53
0. CAST: Corrigibility as Singular Target
Max Harms
4mo
4
17
1. The CAST Strategy
Max Harms
5mo
15
24
2. Corrigibility Intuition
Max Harms
5mo
9
12
3a. Towards Formal Corrigibility
Max Harms
5mo
2
10
3b. Formal (Faux) Corrigibility
Max Harms
5mo
12
18
4. Existing Writing on Corrigibility
Max Harms
5mo
8
9
5. Open Corrigibility Questions
Max Harms
5mo
0