30 days · UTC
Synchronizing with global intelligence nodes...
Anthropic launches Project Glasswing, using unreleased Claude Mythos to harden critical software with industry partners
Anthropic unveiled Project Glasswing, a defense-focused program using its unreleased Claude Mythos model to find and fix critical software vulnerabili...
SWE-bench scores are spiking, but variant mix-ups make the leaderboard noisy for real-world tool choices
Vendors are touting big SWE-bench jumps, but versions differ and scores alone won’t pick your coding copilot. SWE-bench measures fail-to-pass bug fix...
Anthropic launches Project Glasswing, giving controlled access to Claude Mythos for vulnerability discovery
Anthropic formed Project Glasswing and is withholding its Claude Mythos Preview model for controlled, defensive use after it found thousands of high‑s...
Meta launches Muse Spark, a small, fast model built for real-world app deployment
Meta introduced Muse Spark, a smaller, faster model powering Meta AI with an API in private preview aimed at efficient, product-ready deployments. Ac...
Anthropic previews Claude Mythos and launches Project Glasswing to weaponize defense against zero‑days
Anthropic previewed Claude Mythos and launched Project Glasswing, claiming the model can autonomously find high‑severity bugs across major OSes and br...
Anthropic’s Mythos and Project Glasswing push AI into real-world vuln discovery, with tight access and strong benchmark signals
Anthropic launched Project Glasswing and a Mythos Preview model that finds serious software bugs, pairing industry partners with restricted access and...
Anthropic launches Project Glasswing and restricts Claude Mythos Preview to harden critical software
Anthropic launched Project Glasswing and a restricted Claude Mythos Preview, a model that reportedly finds thousands of serious software vulnerabiliti...
Google tests AI-searchable Play Store reviews, shifting how apps get discovered
Google is testing AI-powered review search in the Play Store, which could change how users discover and evaluate apps. WebProNews reports that Google...
Open agents grow up: Gemma 4, Qwen 3.6 Plus, and a cost-savvy runtime pattern you can use now
Open-source-grade agents just got more practical with Gemma 4, Qwen 3.6 Plus, and a cost‑savvy agent runtime update. Google’s new Gemma 4 brings Apac...
AI talent arms race collides with data-scraping lawsuits and workforce push
Big Tech is overpaying for scarce AI talent while lawsuits and workforce programs reshape how companies source data and skills. A new class-action ta...
Microsoft ships in-house MAI models for speech, voice, and images, aiming for lower GPU cost and enterprise scale
Microsoft launched three in-house MAI models for transcription, voice, and images, targeting better accuracy, speed, and cost than current options.
Gemini API adds Flex and Priority inference tiers; OSS client ships circuit breaker for Gemini 503s
Google introduced Flex and Priority inference tiers for the Gemini API to trade cost for reliability, and an OSS client added circuit breakers for Gem...
AI-first mobile platforms meet an AI app flood: get your APIs and data ready
Android and Apple are shifting to AI-first mobile platforms while AI-generated apps surge, which will stress backend APIs, privacy controls, and telem...
Real-time AI gets faster and less forgetful: Google bumps Gemini Live to Flash 3.1 as SSMs gain steam
Google upgraded Gemini Live to the Flash 3.1 model, tightening real-time voice latency and context handling while state-space models offer a path to l...
Google’s agentic dev stack: Gemini 3.1 long-context and ADK 2.0 deterministic graphs move from hype to practice
Google is consolidating its AI coding bet around Gemini 3.1 and a new ADK 2.0 graph workflow, pushing agentic, deterministic software delivery. A Web...
AI model training isn’t your biggest cost center anymore—the exploration, data, and eval work are
New research suggests final training runs are a small share of AI model costs, with exploration, data work, and evaluation dominating spend. Epoch AI...
Cheaper coding LLMs and subagent stacks are here—time to re-architect your model routing
Production-ready, cheaper models plus subagent patterns are shifting AI economics for coding and document workflows. Z.ai’s new GLM-5.1 posts a 45.3 ...
From Pilot Purgatory to Platform: Shipping AI That Actually Works
Many AI pilots are stuck as demos; production success needs a real platform, guardrails, and workflow automation. Analyses flag a widening execution ...
Google donates llm-d LLM inference gateway to CNCF Sandbox
Google open-sourced llm-d, a Kubernetes-native LLM inference gateway, into the CNCF Sandbox with backing from IBM, Red Hat, NVIDIA, and Anyscale. llm...
AI is reshaping hiring and org charts: judgment up, agents in
AI is changing who you hire and how you staff: judgment matters more, and agents are taking real seats. Hiring signals are shifting from speed of cod...
AI moves from chat to execution: MCP-powered automation and Google Stitch’s design-to-code push
Two concrete signals show AI shifting from chat to tool execution: an MCP-driven Notion CLI and Google Stitch’s design-to-code workflow.
Top LLMs split on tiers and naming: what that means for cost, routing, and long jobs
Vendors now expose high‑end LLMs with different tiers and names, which changes how you budget, route jobs, and handle long or tool‑heavy tasks. A dee...
AI dev tooling shift: Copilot CLI hits GA, Antigravity leans into agentic IDEs, and teams share what works
AI coding assistants are evolving into full workflows, with Copilot CLI at GA and Google’s Antigravity pushing agentic, plan-first IDEs. A hands-on r...