This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
AI ALIGNMENT FORUM
Tags
AF
Login
AI Benchmarking
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Edit
History
Subscribe
Discussion
(0)
Help improve this page
AI Benchmarking
Random Tag
Contributors
Posts tagged
AI Benchmarking
Most Relevant
1
15
Improving Model-Written Evals for AI Safety Benchmarking
Sunishchal Dev
,
Marius Hobbhahn
1mo
0
0
5
Auto-Enhance: Developing a meta-benchmark to measure LLM agents’ ability to improve other agents
Sam Brown
,
Basil Labib
,
Codruta Lugoj
,
Sai Sasank Y
4mo
0
0
5
MMLU’s Moral Scenarios Benchmark Doesn’t Measure What You Think it Measures
corey morris
1y
2