Agents are now a real workload: Uber’s b…

AGENTIC-WORKFLOWS PUB_DATE: 2026.06.08

AGENTS ARE NOW A REAL WORKLOAD: UBER’S BILL AND 1,000 DEPLOYS/MONTH FORCE OPS, COST, AND CONTROL REWRITES

AI agents moved from experiments to production-scale work, exposing gaps in pipelines, cost controls, and access governance. Teams are hitting deployment veloc...

AI agents moved from experiments to production-scale work, exposing gaps in pipelines, cost controls, and access governance.

Teams are hitting deployment velocities their delivery stacks weren’t built for, with AI changes landing continuously—up to 1,000 times a month—per The New Stack.

Uber reportedly blew through its 2026 AI budget early while struggling to tie token spend to customer value, a cautionary signal detailed in this Substack briefing. Meanwhile, practitioners are asking how to put hard financial guardrails around agents in the OpenAI forum.

Guidance is coalescing around treating agents like non-human actors with least privilege and human-in-the-loop gates, not chatbots—see this VentureBeat piece. Reliability still bites too, with real-world hiccups like Responses API 520s and fresh focus on outcome-centric evaluation from this agents-in-the-enterprise write-up on AI economics and benchmarking.

[ WHY_IT_MATTERS ]

01.

Agent workflows are creating near-continuous deploys and spend; legacy CI/CD, RBAC, and cost models won’t keep up.

02.

Without token-to-outcome visibility, CFOs cap usage while engineering loses leverage.

[ WHAT_TO_TEST ]

terminal
Run a canary for an internal agent with strict token budgets, cost alerts, and per-action approvals; measure impact on lead time vs. incidents.
terminal
Version and test prompts/skills like code in CI; enforce reproducible runs, idempotency, and rollbacks; track token-to-KPI attribution.

[ BROWNFIELD_PERSPECTIVE ]

Legacy codebase integration strategies...

01.
Wrap agents with least-privilege service identities, read/write separation, circuit breakers, and rate limits; require human approval for destructive ops.
02.
Add token metering to your observability stack; emit per-request spend, model, latency, and outcome tags into logs/metrics.

[ GREENFIELD_PERSPECTIVE ]

Fresh architecture paradigms...

01.
Design for 1,000+ deploys/month: ephemeral envs, progressive delivery, auto-rollback, and policy-as-code for agent permissions.
02.
Build an attribution layer from day one: map tokens and actions to business KPIs so cost scales with value.

Enjoying_this_story?

Get daily AGENTIC-WORKFLOWS + SDLC updates.

Practical tactics you can ship tomorrow
Tooling, workflows, and architecture notes
One short email each weekday

arrow_back

PREVIOUS_DATA_LOG

Copilot goes metered: usage-based billing lands as GitHub ships an agent desktop and Microsoft swaps in a new coding model

Initialize_Return_to_Core

LINK_STATUS: 127.0.0.1 (SECURE)

NEXT_DATA_LOG

Claude Code gets native installers; treat it like a production agent

arrow_forward