This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Wikitags
AF
Login
Subscribe
Discussion
0
METR (org)
Ruben Bloom
METR (org)
Subscribe
Discussion
0
Written by
Ruben Bloom
last updated
1st Jul 2024
Summaries
Cancel
Submit
Formerly ARC Evals
Posts tagged
METR (org)
Most Relevant
0
77
METR: Measuring AI Ability to Complete Long Tasks
Zach Stein-Perlman
11d
18
1
70
ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks
Beth Barnes
2y
4
0
61
Clarifying METR's Auditing Role
Beth Barnes
11mo
0