Featured Research
Our AI evaluations research focuses on assessing broad autonomous capabilities and the ability of AI systems to accelerate AI R&D. We also study potential AI behavior that threatens the integrity of evaluations and mitigations for such behavior.
View all research
Measuring the Self-Reported Impact of Early-2026 AI on Technical Worker Productivity
A survey of 349 technical workers finds a median 1.4–2x self-reported change in value of work due to AI tools, expected to grow over time, though there are reasons to be skeptical of the magnitude.
Read more
Early Work on Monitorability Evaluations
We show preliminary results on a prototype evaluation that tests monitors' ability to catch AI agents doing side tasks, and AI agents' ability to bypass this monitoring.
Read more
How Does Time Horizon Vary Across Domains?
We build on our time-horizon work and analyze 9 benchmarks for scientific reasoning, math, robotics, computer use, and self-driving in terms of time-horizon trends; we observe generally similar rates of improvement to the 7-month doubling time in our original time-horizon work.
Read more
MALT
A dataset of natural and prompted examples of behaviors that threaten evaluation integrity, like generalized reward hacking or sandbagging
Read more
Hawk
Our open-source platform for running AI agent evaluations at scale, built upon Inspect AI
Read moreMeasuring autonomous AI capabilities — resource collection
An index of our research and guidance on how to measure AI systems' ability to autonomously complete a wide range of multi-hour tasks
Read moreCommon Elements of Frontier AI Safety Policies
An analysis of the shared components across twelve published frontier AI safety policies, including capability thresholds, model weight security, and deployment mitigations
Read more PDFFrontier AI Safety Policies
A list of AI companies' frontier safety policies intended to evaluate and manage severe AI risks
Read moreWhat should companies share about risks from frontier AI models?
We describe areas for risk transparency and specific technical questions that a frontier AI developer could answer.
Read moreRisk Assessment
Our work assessing risks from frontier AI systems — including the Frontier Risk Report, independent reviews of AI developers' risk assessments, and capability evaluations of frontier models.
Frontier Risk Report (Feb–Mar 2026)
19 May 2026
•
Partnership
Review of Anthropic Risk Report: February 2026 § Risks from automated R&D
8 May 2026
•
Partnership
Red-Teaming Anthropic's Internal Agent Monitoring Systems
26 March 2026
•
Partnership
Review of the Anthropic Sabotage Risk Report: Claude Opus 4.6
12 March 2026
•
Partnership
Review of the Anthropic Summer 2025 Pilot Sabotage Risk Report
28 October 2025
•
Partnership
Summary of our gpt-oss methodology review
23 October 2025
•
Partnership
GPT-5.1-Codex-Max
19 November 2025
•
Partnership
GPT-5
7 August 2025
•
Partnership
DeepSeek and Qwen
27 June 2025
•
No company involvement
OpenAI o3 and o4-mini
16 April 2025
•
Partnership
Claude 3.7
4 April 2025
•
Partnership
DeepSeek-R1
5 March 2025
•
No company involvement
GPT-4.5
27 February 2025
•
Partnership
DeepSeek-V3
12 February 2025
•
No company involvement
Claude 3.5 Sonnet and o1
31 January 2025
•
Partnership
Claude 3.5 Sonnet (original)
30 October 2024
•
Partnership
o1-preview
12 September 2024
•
Partnership
GPT-4o
7 August 2024
•
Partnership
GPT-4 and Claude
17 March 2023
•
Partnership
METR does not accept compensation for this work.
Frontier AI Safety Policies
We advise AI developers and governments on implementing risk assessment methodologies for AI. For example, we have advised developers on Frontier AI Safety Policies.
Resources on FSPs