CLAUDE-45-OPUS
30 days · UTC
LIVE_DATA_STREAM // APRIL_14_2026
Synchronizing with global intelligence nodes...
DENSITY_RATIO: MAX
METR
MAR_12 // 07:40
METR study challenges SWE-bench wins as Sonar touts 79.2% "Verified" score
A new METR review finds many SWE-bench "passes" aren’t merge-worthy, casting recent leaderboard wins like Sonar’s 79.2% in a different light. Researc...
CISCO
FEB_10 // 18:36
Cisco donates CodeGuard to CoSAI as research exposes persistent LLM code vulnerabilities
Cisco donated its model-agnostic CodeGuard security ruleset to CoSAI while new research shows LLM code generators reliably repeat exploitable patterns...
CISCO
FEB_10 // 10:44
Cisco open-sources CodeGuard as research flags predictable LLM code flaws
Cisco donated its CodeGuard security framework to OASIS’s Coalition for Secure AI as new research shows LLM code assistants repeat predictable vulnera...