CODING-AGENTS
30 days · UTC
Synchronizing with global intelligence nodes...
OpenAI Codex shifts to per-task compute-unit pricing; plan for quotas, rate limits, and ops
OpenAI’s Codex coding agent now charges per task in compute units, changing how teams budget and operate AI-assisted development. OpenAI’s newly surf...
MCP-powered coding agents hit real tooling (Chrome DevTools, ABL in Windsurf) as typosquatting targets IDEs
MCP-based coding agents are moving into serious dev workflows while IDE extension typosquatting raises fresh supply chain risk. Google’s open-source ...
Agentic coding grows up: pipelines, persistence, and cost control land in open source
Agentic coding just took a step from hype to operations with new releases, persistent workflows, and cost-aware controls. The open-source agent stack...
Coding agents in production: architecture choices, reliability budgets, and hitting the brakes
A wave of practitioner write-ups agrees: shipping coding agents is about reliability budgets and the right architecture, not flashy demos. At the AAA...
Production reality check for coding agents: reliability over benchmarks
AI coding agents are hitting production walls where reliability, latency, and evaluation—not raw benchmarks—decide whether they help or hurt teams. A...
Agentic SDLC gets real: LangWatch Skills launch + agentic-qe adds code–test hypergraph
Agent-focused SDLC tooling leveled up this week with LangWatch Skills and agentic-qe’s hypergraph CLI, making agents observable, testable, and safer t...
Claude Code vs Cursor Composer 2: pick by workflow surface, watch Cursor stability
Claude Code and Cursor Composer 2 solve different problems—cross-surface coding agent versus Cursor-native model—so your rollout plan, governance, and...
Always-on coding agents are arriving; reliability math and monitoring decide if they’re production-ready
Coding agents just became always-on, and the blockers are compounded error rates and the lack of production-grade monitoring. Anthropic shipped /loop...
Open-weight coding agents hit 60%+ SWE-Bench and get easier to run on-prem
Open-weight coding agents leaped forward as NVIDIA’s Nemotron 3 Super tops SWE-Bench and new research streamlines on‑prem and local runs. NVIDIA unve...
Subagents: Scaling coding agents beyond context limits
A new guide explains the subagents pattern for coding agents, using Claude Code’s Explore subagent to work around LLM context limits. [Simon Willison...
Claude Code 2.1.78 lands reliability and sandbox hardening; LangChain adds Anthropic prompt caching
Anthropic shipped a Claude Code update focused on reliability, sandbox safety, and faster feedback, while LangChain added first-class Anthropic prompt...
Agentic coding needs a harness: ship the guardrails before the agents
Coding agents are useful, but without a real harness and governance they’ll break prod faster than they help you ship. Simon Willison explains how co...