Synchronizing with global intelligence nodes...
Poke raises $4M to make AI agents work like group chats
Poke raised $4M to build business AI agents that work like group chats instead of complex workflow builders. A two-person startup called Poke raised ...
SonarQube Cloud adds Agentic Analysis (beta) to verify AI-generated code at PR speed
SonarQube Cloud introduced a beta Agentic Analysis that delivers CI-level static checks on pull requests in seconds. Agentic Analysis is the Verify s...
AI security pivots to defense: restricted LLMs, risky code assistants, and practical guardrails
Vendors are shifting from open access to locked-down, defense-first AI as code assistants prove easy to abuse. A report says OpenAI is prepping a res...
Shipping time‑to‑event churn models needs survival analysis plus point‑in‑time correct real‑time features
Use survival analysis for churn forecasting, then back it with a point-in-time correct real-time feature pipeline to avoid leakage and ship it safely....
Meta launches Muse Spark, a small, fast model built for real-world app deployment
Meta introduced Muse Spark, a smaller, faster model powering Meta AI with an API in private preview aimed at efficient, product-ready deployments. Ac...
Oracle-SWE dissects the “oracle hints” behind SWE-bench wins, challenging headline coding benchmarks
New research isolates which “oracle” hints actually move SWE-bench agent scores, explaining why headline results often don’t match real coding impact....
AWS rolls out Agent Registry in Bedrock, pushing enterprises toward real agent governance
AWS launched an Agent Registry in Amazon Bedrock to catalog and govern enterprise AI agents across stacks. Reports say the new registry gives teams a...
Anthropic previews Claude Mythos and launches Project Glasswing to weaponize defense against zero‑days
Anthropic previewed Claude Mythos and launched Project Glasswing, claiming the model can autonomously find high‑severity bugs across major OSes and br...
Anthropic launches Claude Managed Agents: production-grade agent orchestration as a service
Anthropic launched Claude Managed Agents, a hosted stack that runs long-lived, tool-using AI agents with sandboxing, tracing, and scoped permissions.
Claude Code 2.1.98 lands Vertex AI setup, Linux sandboxing, trace propagation, and key Bash safety fixes
Anthropic shipped Claude Code 2.1.98 with a Vertex AI setup wizard, Linux subprocess sandboxing, OpenTelemetry trace propagation, and several importan...
Copilot CLI 1.0.22 tightens agent control, simplifies MCP config, and pairs well with “synthetic user” doc testing
GitHub Copilot CLI 1.0.22 brings safer, more predictable agents and a single .mcp.json config, while teams apply agents to continuously test docs. Th...
Cursor 3 ships Bugbot learned rules and MCP support, but early 3.0 users report stability hiccups
Cursor 3 brings smarter AI code review with learned rules and MCP integration, while some users hit stability issues on 3.0 clients. Cursor’s update ...
Claude‑mem 12.1 ships "Knowledge Agents" with HTTP APIs; MassGen 0.1.74 hardens MCP — local agent stacks get production legs
Two open-source releases make private, queryable knowledge bases and agent workflows far easier to stand up and operate. Claude‑mem’s latest release ...
Agentic LLMs move from hype to patterns: draft, parse, verify — with logs and guardrails
Three new studies show agentic LLMs can draft code, parse scientific data, and verify claims—if you add structure, provenance, and human oversight. A...
Detection is hard: calibrate AI text checks and harden code-quality scoring with adversarial tests
AI detectors look confident, but their math and calibration can mislead unless you account for base rates and validate with adversarial tests. A clea...
Hardening LLM Backends: LangChain Sanitization, Contextual PII Redaction, and a Practical RAG Playbook
LLM app security got a lift: LangChain tightened prompt sanitization, researchers advanced contextual PII redaction, and a clear RAG blueprint dropped...
Agentic coding goes long‑haul: open models, on‑the‑job memory, and S3 as a file system
Agentic AI for software and data workflows is solidifying, with longer‑running models, practical memory systems, and AWS wiring S3 in as an agent file...
Copilot CLI 1.0.21 ships MCP support; safer agent limits land in 1.0.22-0 pre-release, while Copilot updates data-training policy for individuals
GitHub Copilot CLI now manages MCP servers, adds agent safety limits in pre-release, and GitHub updated Copilot’s data training policy for individual ...
Cursor 3 breaks from VS Code; Windsurf doubles down on agentic IDEs
Cursor 3 is moving off the VS Code base while Windsurf pushes an agentic IDE, forcing real AI editor choices against VS Code + Copilot. Cursor 3 is r...
Claude Code v2.1.97 tightens safety, fixes reliability pain points, and surfaces live subagents
Anthropic shipped Claude Code v2.1.97 with stronger permission hardening, better retry logic, MCP leak fixes, and an indicator for live subagents. Th...
Anthropic’s Mythos and Project Glasswing push AI into real-world vuln discovery, with tight access and strong benchmark signals
Anthropic launched Project Glasswing and a Mythos Preview model that finds serious software bugs, pairing industry partners with restricted access and...
Claude Opus 4.6 pricing isn’t one thing: seats vs tokens, very different bills
Anthropic splits Claude Opus 4.6 access between seat-based app plans and token-metered API usage, which leads to very different costs in practice. [T...
Nvidia buys SchedMD (Slurm), putting the de facto AI/HPC scheduler under one GPU vendor’s roof
Nvidia’s acquisition of SchedMD hands Slurm’s roadmap to a single GPU vendor, triggering concerns about neutrality for mixed-hardware clusters. Per [...