DeepMind
Applied to The GDM AGI Safety+Alignment Team is Hiring for Applied Interpretability Research by Arthur Conmy 1mo ago
Applied to AGI Safety & Alignment @ Google DeepMind is hiring by Rohin Shah 1mo ago
Applied to MONA: Managed Myopia with Approval Feedback by Seb Farquhar 2mo ago
Applied to Addressing doubts of AI progress: Why GPT-5 is not late, and why data scarcity isn't a fundamental limiter near term. by LDJ 2mo ago
Applied to Meta AI (FAIR) latest paper integrates system-1 and system-2 thinking into reasoning models. by happy friday 5mo ago
Applied to AGI Safety and Alignment at Google DeepMind: A Summary of Recent Work by Rohin Shah 7mo ago
Applied to "AI achieves silver-medal standard solving International Mathematical Olympiad problems" by Multicore 8mo ago
Applied to I'm a bit skeptical of AlphaFold 3 by Oleg Trott 9mo ago
Applied to On DeepMind’s Frontier Safety Framework by Tobias D. 9mo ago
Applied to Paper: "The Ethics of Advanced AI Assistants" -Google DeepMind by Tobias D. 1y ago
Applied to Notes on Dwarkesh Patel’s Podcast with Demis Hassabis by Tobias D. 1y ago
Applied to The One and a Half Gemini by Tobias D. 1y ago
Applied to Skepticism About DeepMind's "Grandmaster-Level" Chess Without Search by Arjun Panickssery 1y ago
Applied to Explaining grokking through circuit efficiency by Jason Gross 1y ago
Applied to Review of Alignment Plan Critiques- December AI-Plans Critique-a-Thon Results by Kabir Kumar 1y ago
Applied to AI #41: Bring in the Other Gemini by Tobias D. 1y ago