[Link] Aligned AI AMA
by Stuart Armstrong
1st Mar 2022
We're doing an AMA for Aligned AI here. All questions welcome.