
METR

Company

METR (Model Evaluation and Threat Research) is a Berkeley-based nonprofit that runs empirical studies and benchmarks to assess the real-world capabilities and risks of frontier AI systems. Its recent work includes auditing AI-generated code that passes the SWE-bench benchmark and showing that many such patches are rejected in human code review.

4 stories | First: 2026-03-08 | Last: 2026-04-03 | Wikipedia

Stories

Completed digest stories linked to this service.
