Claude, GPT, and Gemini All Struggle to Evade Monitors
22 de August de 2025
Vincent Cheng and Thomas Kwa replicate a Google DeepMind paper on chain-of-thought monitoring, showing evidence that monitoring works on other companies' models.
Leer en inglés