AGENTIC-CODING

30 days · UTC

LIVE_DATA_STREAM // APRIL_14_2026

Synchronizing with global intelligence nodes...

DENSITY_RATIO: MAX

OPEN AGENTS GROW UP: GEMMA 4, QWEN 3.6 PLUS, AND A COST-SAVVY RUNTIME PATTERN YOU CAN USE NOW

Open-source-grade agents just got more practical with Gemma 4, Qwen 3.6 Plus, and a cost‑savvy agent runtime update. Google’s new Gemma 4 brings Apac...

ANTHROPIC

APR_02 // 06:23

Claude Code v2.1.90: faster streaming, sturdier long sessions, onboarding “/powerup,” and tighter Windows tool permissions

Anthropic shipped Claude Code v2.1.90 with faster streaming, long-session stability, onboarding lessons, and tightened Windows tool permissions. The ...

ANTIGRAVITY

APR_01 // 06:40

Antigravity Awesome Skills v9.4.0 hardens the agentic coding stack

The Antigravity Awesome Skills library shipped v9.4.0 focused on validation, CI guardrails, and marketplace sync reliability instead of new skills. T...

ANTHROPIC

MAR_26 // 07:34

Anthropic’s three-agent harness keeps long-running coding agents on track

Anthropic details a three-agent harness that keeps Claude coherent on multi-hour autonomous coding tasks by decomposing work and grading outputs. Ant...

ANTIGRAVITY

MAR_26 // 07:23

Antigravity Skills v8.9 ships a Snowflake engineering skill and tighter GitHub/refactor workflows

Antigravity Awesome Skills v8.9 adds a Snowflake engineering skill and sharper GitHub/refactor workflows for agentic coding tools. The v8.9.0 release...

GITHUB-COPILOT

MAR_25 // 07:28

Testing agents grow up: Diffblue launches orchestration as benchmarks cap AI code review at ~40%

Diffblue launched an autonomous testing agent while new research finds current AI code reviewers only solve about 40% of review tasks. [Diffblue Test...

CURSOR-IDE

MAR_24 // 07:27

Cursor Composer 2 lands with agentic coding gains, cost claims, and questions about provenance and safety

Cursor launched Composer 2, a MoE-based agentic coding model claiming strong multi-file performance at lower cost, but its base model and stability ar...

ANTHROPIC

MAR_24 // 07:25

ANTHROPIC BRINGS COMPUTER USE AND CHAT CHANNELS TO CLAUDE CODE

Anthropic is rolling out Computer Use and chat Channels for Claude Code, adding OS control and Discord/Telegram texting to the coding agent. Claude C...

CLAUDE-CODE

CRITICAL_LEVEL // MAR_12 // 07:34

CLAUDE CODE 2.1.74 STOPS NODE STREAMING MEMORY LEAKS AND ADDS ENTERPRISE-GRADE MODEL ROUTING

Anthropic shipped Claude Code 2.1.73–2.1.74 with a key Node.js memory leak fix, better provider routing, and sturdier enterprise auth. The 2.1.74 rel...

CURSOR

MAR_09 // 07:24

Swift.org documents Cursor support as users report 2.6.13 instability — a reality check for AI IDE rollouts

Swift.org now shows how to use Cursor for Swift, while Cursor forum posts flag crashes and stuck chats in version 2.6.13. Swift’s official docs expla...

AIDER

MAR_08 // 07:23

CLI coding agents rise, with Docker isolation to tame risk

Open-source, CLI-first coding agents are getting easier to use while new tools add Docker isolation to reduce security risk in real projects. Develope...

CURSOR-AUTOMATIONS

MAR_06 // 10:14

Cursor Automations brings policy-driven agents to your repo and Slack

Cursor launched Automations, a policy-driven system that triggers coding agents on commits, Slack messages, or schedules and loops humans in only when...

CURSOR

MAR_05 // 19:21

Cursor Automations + Copilot CLI hooks push agentic coding into your pipeline

Agentic coding is moving from hype to practical reality as Cursor ships always-on Automations and JetBrains support, and GitHub Copilot CLI adds workf...

CODEBUFF

MAR_04 // 21:06

Open-source CodeBuff brings multi-agent coding to complex repos

Open-source CodeBuff advances a multi-agent approach to coding that decomposes complex repo work, addressing the single-model bottleneck seen in tools...

MINIMAX-M25

MAR_04 // 20:48

MiniMax-M2.5 launches with SOTA coding claims; verify SWE-bench results

MiniMax launched MiniMax-M2.5, a fast, low-cost coding and agentic model, but teams should validate its headline SWE-bench gains with internal tests g...

CLAUDE-CODE

MAR_04 // 20:41

CLAUDE CODE V2.1.68 SETS OPUS 4.6 TO MEDIUM BY DEFAULT AND REINTRODUCES ONE-TURN "ULTRATHINK"

Claude Code v2.1.68 changes default model behavior to Opus 4.6 at medium effort, re-enables a one-turn high-effort "ultrathink" switch, and migrates a...

QWEN-35

CRITICAL_LEVEL // MAR_03 // 23:22

CODING BENCHMARKS SHAKE-UP: QWEN 3.5, MINIMAX M2.5, AND A SWE-BENCH REALITY CHECK

Open models like Alibaba’s Qwen 3.5 and MiniMax M2.5 post strong coding-agent results, but OpenAI’s audit of SWE-bench Verified shows contamination an...

CODECOMPASS

FEB_24 // 21:13

Graph-structured dependency navigation fixes missed-file failures in repo-scale coding agents

New results show that wiring coding agents to traverse a code dependency graph outperforms expanding context or keyword/vector retrieval on architectu...

ANTHROPIC

FEB_10 // 18:19

Claude Opus 4.6 adds agent teams, 1M context, and fast mode; GPT-5.3-Codex counters

Anthropic’s Claude Opus 4.6 ships multi-agent coding, a 1M-token context window, and a 2.5x fast mode, while OpenAI’s GPT-5.3-Codex brings faster agen...

OPENAI

FEB_10 // 10:43

Codex 5.3 vs Opus 4.6: agentic speed vs long‑context depth

OpenAI's GPT-5.3 Codex and Anthropic's Claude Opus 4.6 arrive with distinct strengths—Codex favors faster agentic execution while Opus excels at long-...

OPENAI

FEB_10 // 10:33

OpenAI’s GPT-5.3-Codex rolls out to Copilot with faster, agentic workflows

OpenAI's GPT-5.3-Codex is a 25% faster, more agentic coding model built for long-running, tool-driven workflows and is now rolling out across Codex su...

ANTHROPIC

FEB_10 // 10:31

Opus 4.6 Agent Teams vs GPT-5.3 Codex: multi‑agent coding arrives for real SDLC work

Anthropic's Claude Opus 4.6 brings multi-agent "Agent Teams" and a 1M-token context while OpenAI's GPT-5.3-Codex counters with faster, stronger agenti...

OPENAI-CODEX

JAN_26 // 22:46

OpenAI Codex agent loop goes from suggestions to sandboxed, auditable code changes

OpenAI’s Codex now uses an iterative agent loop that plans, calls tools, and executes in air‑gapped containers with quotas—returning JSON‑logged diffs...

REMOTION

JAN_23 // 16:44

REMOTION + CLAUDE CODE: REACT-TO-MP4 VIA AI AGENTS

A developer shows how [Remotion turned Claude Code into a video production tool](https://jpcaparas.medium.com/remotion-turned-claude-code-into-a-video...

CLAUDE-CODE

CRITICAL_LEVEL // JAN_23 // 16:44

MICROSOFT PILOTS CLAUDE CODE BROADLY AS ARR TOPS $1B AND SAFETY MATURES

Microsoft is encouraging thousands of employees across Windows, M365, and Teams to use Claude Code—even alongside GitHub Copilot—and is counting Anthr...

CLAUDE-CODE

JAN_23 // 16:11

Microsoft pilots Claude Code at scale as Anthropic’s agentic coder hits an inflection

Microsoft is rolling out [Claude Code](https://www.theverge.com/tech/865689/microsoft-claude-code-anthropic-partnership-notepad)[^1] across major engi...

CLAUDE-CODE

JAN_23 // 15:39

Claude Code + Remotion: AI-written React renders promo videos

Developers are using Remotion with Claude Code to generate fully rendered promo videos by having the agent write React components and export to MP4, e...

CLAUDE-CODE

JAN_23 // 15:39

Microsoft pilots Claude Code across core teams as agentic coding inflects

Anthropic’s Claude Code is hitting a real agentic-coding inflection: developers report step-function gains with the Claude Opus 4.5 model, and the pro...

CURSOR

JAN_21 // 19:38

IDE agents mature; TPUs tilt inference economics for 2026

Cursor Agent Mode and Windsurf Cascade push agentic, multi-file coding in IDEs, while Copilot adds Anthropic and Google models and Google previews the...

QWEN3

JAN_21 // 19:38

ABC-Bench: End-to-end benchmark for agentic backend coding

ABC-Bench evaluates LLM agents on real backend tasks from repo exploration through Dockerization, service deployment, and end-to-end API testing. It i...