METR (org)
Contributors: Ruben Bloom
METR was formerly known as ARC Evals.
Posts tagged METR (org)
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes