AGENTIC-CODING
Claude Code v2.1.90: faster streaming, sturdier long sessions, onboarding “/powerup,” and tighter Windows tool permissions
Anthropic shipped Claude Code v2.1.90 with faster streaming, long-session stability, onboarding lessons, and tightened Windows tool permissions. The ...
Antigravity Awesome Skills v9.4.0 hardens the agentic coding stack
The Antigravity Awesome Skills library shipped v9.4.0 focused on validation, CI guardrails, and marketplace sync reliability instead of new skills. T...
Anthropic’s three-agent harness keeps long-running coding agents on track
Anthropic details a three-agent harness that keeps Claude coherent on multi-hour autonomous coding tasks by decomposing work and grading outputs. Ant...
Antigravity Skills v8.9 ships a Snowflake engineering skill and tighter GitHub/refactor workflows
Antigravity Awesome Skills v8.9 adds a Snowflake engineering skill and sharper GitHub/refactor workflows for agentic coding tools. The v8.9.0 release...
Testing agents grow up: Diffblue launches orchestration as benchmarks cap AI code review at ~40%
Diffblue launched an autonomous testing agent while new research finds current AI code reviewers solve only about 40% of review tasks. [Diffblue Test...
Cursor Composer 2 lands with agentic coding gains, cost claims, and questions about provenance and safety
Cursor launched Composer 2, a MoE-based agentic coding model claiming strong multi-file performance at lower cost, but its base model and stability ar...
Swift.org documents Cursor support as users report 2.6.13 instability — a reality check for AI IDE rollouts
Swift.org now shows how to use Cursor for Swift, while Cursor forum posts flag crashes and stuck chats in version 2.6.13. Swift’s official docs expla...
CLI coding agents rise, with Docker isolation to tame risk
Open-source, CLI-first coding agents are getting easier to use while new tools add Docker isolation to reduce security risk in real projects. Develope...
Cursor Automations brings policy-driven agents to your repo and Slack
Cursor launched Automations, a policy-driven system that triggers coding agents on commits, Slack messages, or schedules and loops humans in only when...
Cursor Automations + Copilot CLI hooks push agentic coding into your pipeline
Agentic coding is moving from hype to practical reality as Cursor ships always-on Automations and JetBrains support, and GitHub Copilot CLI adds workf...
Open-source CodeBuff brings multi-agent coding to complex repos
Open-source CodeBuff advances a multi-agent approach to coding that decomposes complex repo work, addressing the single-model bottleneck seen in tools...
MiniMax-M2.5 launches with SOTA coding claims; verify SWE-bench results
MiniMax launched MiniMax-M2.5, a fast, low-cost coding and agentic model, but teams should validate its headline SWE-bench gains with internal tests g...
Graph-structured dependency navigation fixes missed-file failures in repo-scale coding agents
New results show that wiring coding agents to traverse a code dependency graph outperforms expanding context or keyword/vector retrieval on architectu...
Claude Opus 4.6 adds agent teams, 1M context, and fast mode; GPT-5.3-Codex counters
Anthropic’s Claude Opus 4.6 ships multi-agent coding, a 1M-token context window, and a 2.5x fast mode, while OpenAI’s GPT-5.3-Codex brings faster agen...
Codex 5.3 vs Opus 4.6: agentic speed vs long‑context depth
OpenAI's GPT-5.3 Codex and Anthropic's Claude Opus 4.6 arrive with distinct strengths—Codex favors faster agentic execution while Opus excels at long-...
OpenAI’s GPT-5.3-Codex rolls out to Copilot with faster, agentic workflows
OpenAI's GPT-5.3-Codex is a 25% faster, more agentic coding model built for long-running, tool-driven workflows and is now rolling out across Codex su...
Opus 4.6 Agent Teams vs GPT-5.3 Codex: multi‑agent coding arrives for real SDLC work
Anthropic's Claude Opus 4.6 brings multi-agent "Agent Teams" and a 1M-token context while OpenAI's GPT-5.3-Codex counters with faster, stronger agenti...
OpenAI Codex agent loop goes from suggestions to sandboxed, auditable code changes
OpenAI’s Codex now uses an iterative agent loop that plans, calls tools, and executes in air‑gapped containers with quotas—returning JSON‑logged diffs...
Microsoft pilots Claude Code at scale as Anthropic’s agentic coder hits an inflection
Microsoft is rolling out [Claude Code](https://www.theverge.com/tech/865689/microsoft-claude-code-anthropic-partnership-notepad)[^1] across major engi...
Claude Code + Remotion: AI-written React renders promo videos
Developers are using Remotion with Claude Code to generate fully rendered promo videos by having the agent write React components and export to MP4, e...
Microsoft pilots Claude Code across core teams as agentic coding inflects
Anthropic’s Claude Code is hitting a real agentic-coding inflection: developers report step-function gains with the Claude Opus 4.5 model, and the pro...
IDE agents mature; TPUs tilt inference economics for 2026
Cursor Agent Mode and Windsurf Cascade push agentic, multi-file coding in IDEs, while Copilot adds Anthropic and Google models and Google previews the...
ABC-Bench: End-to-end benchmark for agentic backend coding
ABC-Bench evaluates LLM agents on real backend tasks from repo exploration through Dockerization, service deployment, and end-to-end API testing. It i...