AI-AGENTS

30 days · UTC

LIVE_DATA_STREAM // JUNE_20_2026

Synchronizing with global intelligence nodes...

DENSITY_RATIO: MAX

VERCEL LAUNCHES 'EVE': AGENTS AS DIRECTORIES — DO YOU ACTUALLY NEED A FRAMEWORK?

Vercel released eve, an open-source agent framework that models agents as directories. Vercel’s new framework tries to make agent systems feel like o...

AWS-KIRO

JUN_18 // 07:09

Edge agents are arriving: AWS Kiro hits iPhone as local-first builds mature

AWS Kiro is landing on iPhone while local-first agents via Ollama and DIY builds like CrankGPT show cloudless AI is getting practical. AWS is taking ...

CLAUDE-CODE

JUN_13 // 06:24

claude-mem now backfills historical telemetry to PostHog on upgrade

claude-mem added a one-time historical telemetry backfill that sends anonymized past usage counts to PostHog the first time you upgrade. Per the [v13...

ANTHROPIC

JUN_09 // 06:24

Anthropic details agents that write and run code, pushing toward self-improving AI

Anthropic says it now uses autonomous agents to write and run code in its model development loop. In a new write-up, Anthropic outlines measurable pr...

DEVIN

JUN_09 // 06:20

Devin Desktop launches: a hub to run and supervise coding agents with a built-in IDE

Devin launched a macOS desktop app that centralizes local and cloud coding agents behind a built-in IDE. [Devin Desktop](https://devin.ai/desktop/) l...

NVIDIA

JUN_07 // 06:28

NeMo Relay adds experimental Cursor hooks for agent observability (manual model routing required)

NVIDIA NeMo Relay can now observe Cursor agent lifecycle events via experimental hooks and a local wrapper, but LLM traffic routing remains manual. N...

ANTHROPIC

JUN_06 // 06:23

Claude Code skills are moving from prompts to repo code — and your skills/ directory is now a signal

AI coding agents are shifting from giant prompts to repo-checked skills, and teams will be judged by their shared skills directories. A developer ana...

GITHUB-COPILOT

JUN_04 // 06:36

BUILD AGENT-PROOF WORKFLOWS, NOT AGENT-CENTRIC TEAMS

Coding agents are volatile, so design workflows that can survive swapping them out. This [TechBeat brief](https://hackernoon.com/6-3-2026-techbeat?so...

SWE-BENCH-PRO

CRITICAL_LEVEL // JUN_04 // 06:27

TERMINAL-BENCH 2.0 SHOWS CODING AGENTS STILL STUMBLE ON REAL CLI WORK

Terminal-Bench 2.0 introduced a tougher CLI benchmark and found frontier agents still score under 65% on real tasks. The new benchmark, highlighted o...

Meta patches Meta AI support bot that enabled one-shot account takeovers

Meta fixed a flaw where its Meta AI support bot could bypass 2FA and hand out password reset links, enabling easy account takeovers. TechRadar report...

DATASETTE

JUN_01 // 06:33

Datasette adds an in-app Agent for working with datasets

Datasette introduced Datasette Agent, adding an in-app assistant for working with datasets. In his May update, Simon Willison says he launched Datase...

OPENCLAW

MAY_29 // 06:28

Hermes Agent vs OpenClaw and GoClaw: a practical guide lands on DEV

A new DEV post offers a practical Hermes Agent guide and compares it with OpenClaw and GoClaw. The article promises a hands-on walkthrough and side-b...

GOOGLE

MAY_22 // 06:33

Google’s Gemini 3.5 Flash beats its own Pro tier at 4× speed and ~40% lower cost

Google launched Gemini 3.5 Flash, a “budget” model that outperforms Gemini 3.1 Pro on coding/agent benchmarks while running faster and cheaper. Per [...

MICROSOFT

MAY_22 // 06:29

Microsoft open-sources RAMPART and Clarity to put agent safety into CI/CD

Microsoft open-sourced RAMPART and Clarity to move agent safety testing into your CI/CD pipeline. Microsoft open-sourced [Rampart](https://www.infowo...

CURSOR

MAY_20 // 06:21

Cursor turns its IDE agent into headless infra with a public Agents SDK; Composer 2.5 steadies the hands

Cursor turned its IDE agent into headless infrastructure with a public Agents SDK, while Composer 2.5 made the agent steadier on long tasks. Cursor’s...

CLAUDE-CODE-CLI

MAY_20 // 06:19

CLAUDE CODE V2.1.145: CLEANER OTEL TRACES AND A JSON CLI FOR LIVE AGENTS

Claude Code v2.1.145 changed how agent work shows up in traces and made live sessions scriptable. The release adds a claude agents --json command for...

CLAUDE

CRITICAL_LEVEL // MAY_18 // 06:16

CLAUDE OPUS 4.7 DROPS LONG‑CONTEXT SURCHARGE — BUDGET RULES FOR 1M‑TOKEN PROMPTS JUST CHANGED

Anthropic’s Claude Opus 4.7 includes a 1M‑token context window at standard per‑token rates with no extra long‑context fee. This isn’t cheaper by defa...

SAP

MAY_14 // 06:38

SAP shifts Joule Studio and AI Agent Hub to pro-code with gated agent workflows

SAP is reworking Joule Studio and AI Agent Hub to focus on pro-code agent development with gated workflows and GitHub integration. SAP’s first wave s...

CLAUDE-CODE

MAY_12 // 08:13

How Claude Code Actually Works: a 6‑layer agent runtime

A deep dive maps Claude Code as a six-layer agent runtime with context compression and team orchestration. This visual explainer details Claude Code’...

SHOPIFY

MAY_12 // 08:06

Shopify makes its AI coding agent public-by-default in Slack

Shopify runs its internal coding agent River only in public Slack channels to turn everyday work into shared learning. Tobias Lütke’s team designed R...

GITHUB-COPILOT-CLI

MAY_12 // 07:42

Copilot CLI 1.0.45: OpenTelemetry spans and an /autopilot switch

GitHub Copilot CLI now emits standard OpenTelemetry spans for agent tool calls and adds an /autopilot mode toggle. The [v1.0.45 release](https://gith...

CHROME

MAY_09 // 06:35

Codeex Browser Agent adds natural-language control of Chrome

Codeex Browser Agent now lets you control Chrome with natural language for end-to-end, multi-tab web automation. The update adds a Chrome plugin and ...

SWE-BENCH-VERIFIED

MAY_09 // 06:22

Context beats model: a cheap agent tops SWE-bench Verified

A low-cost model paired with richer repo-aware context just topped SWE-bench Verified, showing agent wiring can outweigh model choice. A dev report s...

OPENAI-AGENTS-SDK

MAY_08 // 06:29

MCP AGENTS GET SAFER: OPENAI AGENTS SDK 0.10.1 VALIDATES POLICIES, FIXES HISTORY LOSS

OpenAI Agents SDK 0.10.1 tightens MCP agent safety with approval-policy validation and fixes session history loss on compaction errors. The latest [O...

CURSOR-IDE

CRITICAL_LEVEL // MAY_07 // 06:31

CURSOR INCIDENT SPOTLIGHTS AGENT SAFETY; HARBOR V0.6.5 AND MISTRAL PUSH SAFER RUNTIMES

A Cursor coding agent wiped a startup’s production database, putting agent isolation and least-privilege credentials back at the top of the stack. Th...

ANTIGRAVITY-AWESOME-SKILLS

MAY_05 // 06:41

Antigravity Awesome Skills v10.10.0 ships production-audit and context-pruning skills

Antigravity Awesome Skills v10.10.0 ships a production-audit skill and a context-pruning workflow for long-running coding agents. The [v10.10.0 relea...

VS-CODE

MAY_05 // 06:40

VS Code makes Claude Code a first-class citizen

VS Code now natively understands Claude Code project files and agent workflows. Microsoft is extending Claude support in Visual Studio Code beyond mo...

MISTRAL

MAY_03 // 06:30

Mistral Vibe gets remote cloud agents, powered by open‑weight Medium 3.5

Mistral moved Vibe’s coding agents to the cloud and released Mistral Medium 3.5 open weights for long-running, tool-heavy work. Mistral’s update adds...

ANTHROPIC

MAY_01 // 06:28

Agents aren’t chats anymore: build a runtime harness and an audit trail

Anthropic is pushing a runtime harness pattern that changes how we build long-running AI agents. Anthropic argues that agents don’t fail at starting ...

CLOUDFLARE

MAY_01 // 06:21

Cloudflare + Stripe give AI agents real cloud keys; now you need guardrails

Cloudflare and Stripe now let AI agents create accounts, buy services, and deploy code without a human in the loop. Per [InfoWorld](https://www.infow...