GITHUB
30 days · UTC
Claude Code ships native CLI, tighter sandboxing, and a desktop redesign for parallel agent work
Anthropic pushed rapid Claude Code updates and a desktop redesign that tighten security, speed up reviews, and make multi-session agent work practical...
Claude Code 2.1.111 lands Opus 4.7 xhigh, Auto mode upgrades, and cloud ultrareview; 2.1.112 hotfix follows
Anthropic shipped a sizable Claude Code update with smarter model controls, fewer permission nags, and a new multi-agent cloud code review. The 2.1.1...
Copilot turbulence: Pro trials paused while Copilot CLI ships 1.0.29–1.0.31 with agent/MCP quality fixes
GitHub paused new Copilot Pro trials due to abuse while Copilot CLI shipped three rapid releases with agent/MCP and terminal stability fixes. GitHub ...
Windsurf 2.0 ships “Agent Command Center” and brings Devin into the IDE
Windsurf 2.0 adds an Agent Command Center and “Devin in Windsurf,” turning the IDE into a stronger agent hub versus Cursor. Windsurf’s new release hi...
OpenAI’s Agents SDK grows up: model-native harness + safe sandboxes, with SDKs and Codex shipping reliability and security polish
OpenAI expanded its Agents SDK with a model-native harness and built-in sandbox execution, plus companion reliability/security updates in openai-pytho...
Cloudflare Agent Cloud + Codex: enterprise-ready agents on GPT-5.4, with some early quirks
OpenAI and Cloudflare made it easier to run enterprise-grade coding and workflow agents with GPT-5.4 and Codex, while early users report a few glitche...
Agents get real: Gemini CLI adds remote subagents; Snowflake leans into agentic Snowpark with Cortex Code
Gemini CLI now speaks to remote subagents over A2A, while Snowflake’s Cortex Code pushes agentic Snowpark coding into everyday data engineering. A de...
Copilot CLI 1.0.24 ships; Pro+ model glitches and surprise PRs surface
GitHub Copilot CLI 1.0.24 landed with practical agent fixes, while users flag model entitlement glitches and unexpected repo activity. GitHub shipped...
RAG quality and reliability: cross-encoder reranking and vector storage recall gotchas
RAG quality jumps with cross-encoder reranking, while some teams report recall issues in OpenAI’s vector storage. This deep dive shows why two-stage ...
Lean agentic coding: add a memory layer and make skills portable
Practitioners are converging on lean, memory‑equipped agents and cross‑platform skills as the practical way to use AI for coding. A hands‑on guide ar...
Claude Code leak prompts clean-room clones; Anthropic says no sensitive data exposed
A public Claude Code leak triggered clean-room reimplementations and community scrutiny while Anthropic claims no sensitive data was exposed. A popul...
From AI Chat to Agentic Layer: Orchestrate the SDLC, Not Just Prompts
An essay argues teams should build an agentic layer that orchestrates SDLC workflows, not just bolt chat onto editors. Chat helps individuals, but de...
Copilot CLI 1.0.22 tightens agent control, simplifies MCP config, and pairs well with “synthetic user” doc testing
GitHub Copilot CLI 1.0.22 brings safer, more predictable agents and a single .mcp.json config, while teams apply agents to continuously test docs. Th...
Copilot CLI 1.0.21 ships MCP support; safer agent limits land in 1.0.22-0 pre-release, while Copilot updates data-training policy for individuals
GitHub Copilot CLI now manages MCP servers, adds agent safety limits in pre-release, and GitHub updated Copilot’s data training policy for individual ...
Cursor 3 breaks from VS Code; Windsurf doubles down on agentic IDEs
Cursor 3 is moving off the VS Code base while Windsurf pushes an agentic IDE, forcing real AI editor choices against VS Code + Copilot. Cursor 3 is r...
Claude Code v2.1.97 tightens safety, fixes reliability pain points, and surfaces live subagents
Anthropic shipped Claude Code v2.1.97 with stronger permission hardening, better retry logic, MCP leak fixes, and an indicator for live subagents. Th...
Copilot CLI ships MCP management and OTel docs; experimental “Rubber Duck” reviewer lands; Copilot data-training defaults change
GitHub updated Copilot CLI with ops-focused fixes, added an experimental second-model reviewer, and changed Copilot data-training defaults for individ...
Claude Code 2.1.94 ships Bedrock (Mantle) support; 2.1.96 hotfixes Bedrock auth regression
Anthropic’s Claude Code added Amazon Bedrock (Mantle) support in 2.1.94 and fixed a Bedrock auth regression in 2.1.96 amid reliability debate. The [v...
Claude Code after Opus 4.6: new defaults, noisy regressions, npm change, and a brief outage
Claude Code flipped key defaults with Opus 4.6, prompting mixed results as install paths changed and Claude had a brief outage.
Agentic coding hits the reliability phase: this week’s updates focus on state, ops, and safety
Multiple agentic coding stacks shipped reliability-first updates, signaling a shift from model flash to harness quality, state handling, and operator ...
Claude-mem v11.0.1 makes semantic memory injection opt-in to cut latency and context noise
The claude-mem tool now disables semantic memory injection by default to reduce latency and irrelevant context during prompts. Per the v11.0.1 releas...
Copilot CLI adds a “Critic” agent as agent skills land in VS Code
GitHub is pushing Copilot from autocomplete to agent workflows across IDEs and the CLI, with a new Critic reviewer. The Copilot feature matrix shows ...
Code agents grow up: CI-scale benchmarking, structured patch checks, and cheaper eval runs
Code agent evaluation is shifting to long-run maintainability, execution-free patch checks, and leaner, cheaper benchmark runs. A new benchmark, [SWE...