This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Wikitags
AF
Login
Literature Reviews
Settings
Applied to
Shallow review of technical AI safety, 2024
by
jordine
3mo
ago
Applied to
Should you have children? All LessWrong posts about the topic
by
Sherrinford
4mo
ago
Applied to
Current safety training techniques do not fully transfer to the agent setting
by
Simon Lermen
5mo
ago
Applied to
AI for Bio: State Of The Field
by
Kaj Sotala
7mo
ago
Applied to
Compute Governance Literature Review
by
sijarvis
9mo
ago
Applied to
Transfer Learning in Humans
by
niplav
1y
ago
Applied to
Paper review: “The Unreasonable Effectiveness of Easy Training Data for Hard Tasks”
by
Vassil Tashev
1y
ago
Applied to
Neural uncertainty estimation review article (for alignment)
by
Charlie Steiner
1y
ago
Applied to
AISC project: How promising is automating alignment research? (literature review)
by
Bogdan Ionut Cirstea
1y
ago
Applied to
Shallow review of live agendas in alignment & safety
by
technicalities
1y
ago
Applied to
Appendices to the live agendas
by
technicalities
1y
ago
Applied to
Elicit: Language Models as Research Assistants
by
Mayao, Cheslie Nica H.
1y
ago
Applied to
Paper digestion: "May We Have Your Attention Please? Human-Rights NGOs and the Problem of Global Communication"
by
Klara Helene Nielsen
2y
ago
Applied to
Some Summaries of Agent Foundations Work
by
Matt MacDermott
2y
ago
Applied to
A Study of AI Science Models
by
Eleni Angelou
2y
ago
Applied to
How To Get Startup Ideas: A Brief Lit Review and Analysis
by
Adam Zerner
2y
ago
Applied to
Literature review of TAI timelines
by
Raymond Arnold
2y
ago
Applied to
Scaling Laws Literature Review
by
Raymond Arnold
2y
ago
Applied to
[Link] Childcare : what the science says
by
Raymond Arnold
3y
ago