METR updates

11 March 2025

Why it’s good for AI reasoning to be legible and faithful

Why legible and faithful reasoning is valuable for safely developing powerful AI

8 February 2025

List of frontier safety policies published by AI companies, including Amazon, Anthropic, Google DeepMind, G42, Meta, Microsoft, OpenAI, and xAI.

17 January 2025

AI models can be dangerous before public deployment

Why pre-deployment testing is not an adequate framework for AI risk management

11 October 2024

Response to Bureau of Industry and Security’s proposed AI reporting requirements

Red-teaming and security suggestions regarding the Bureau of Industry and Security’s proposed rule, “Establishment of Reporting Requirements for the Development of Advanced Artificial Intelligence Models and Computing Clusters.”

9 October 2024

New Support Through The Audacious Project

Funding for Canary will enable research and implementation at scale

8 September 2024

Response to U.S. AISI Draft “Managing Misuse Risk for Dual-Use Foundation Models”

Suggestions for expanded guidance on capability elicitation and robust model safeguards in the U.S. AI Safety Institute’s draft document “Managing Misuse Risk for Dual-Use Foundation Models” (NIST AI 800-1).

Updates

Non-research updates from the METR team.