The Agentic Enterprise -- June 25, 2026
The Agentic Enterprise AK · Thu, Jun 25, 2026 · 8 min
Thursday, June 25, 2026
The job stopped being the prompt.
In two weeks, "loop engineering" went from a blog post to the way the best AI teams describe their work. The leverage in building with AI just moved up a level, and so did the bill.
Stop prompting your agent by hand. Design the loop that prompts it for you. It sounds like developer fashion, but it is the same story running under every headline this week: the model is becoming the commodity, and the value is moving to the system around it, the chips, the compute, the talent, and now the loops.
The Lead
Two weeks ago a Google engineer published an essay called "Loop Engineering," and a phrase that had been floating around developer chat suddenly had a name.

The idea is small enough to fit in a sentence, which is exactly why it spread: stop prompting your AI agent by hand, and start designing the loop that prompts it for you. Boris Cherny, who leads Claude Code at Anthropic, put it more bluntly. He does not prompt Claude anymore. He writes the loops that prompt Claude.

This sounds like an inside-baseball developer trend, and on the surface it is. But underneath it is the same story that ran through every other headline this week. Capability is commoditizing. The frontier labs are racing to own the chips, the compute, and the talent precisely because the model itself is becoming the cheap part. Loop engineering is that shift seen from the practitioner's desk: when generating code or text is nearly free, the scarce skill is no longer writing the prompt. It is designing the system that decides what to do next, knows when it is done, and does not set your cloud bill on fire in the process.

The Big Story Research
Loop engineering, and the leverage that just moved up the stack again.
T wo weeks ago the AI-engineering conversation reorganized itself around one idea. Addy Osmani, a Google Chrome engineering lead, published an essay titled "Loop Engineering" that named and structured a practice many teams were already doing without a word for it. The compression that made it spread came from developer Peter Steinberger: stop prompting your agents, start designing the loops that prompt them. And the quote that gave it authority came from Anthropic's Boris Cherny, who runs Claude Code.
I don't prompt Claude anymore. I have loops running that prompt Claude and figuring out what to do. My job is to write loops.
Boris Cherny · Anthropic

A loop, here, is a system that runs an agent in a cycle: act, observe the result, decide the next move, repeat, until a defined goal is met. You set the objective and a stopping condition, and the loop does the iterating. Cherny's point is not that coding got easier. It is that the highest-value work moved. The leverage is no longer in the prompt. It is in the loop.

This is the fourth move in a now-familiar progression. Prompt engineering owned 2022 to 2024. Context engineering, curating everything the model sees at inference, was formalized by Anthropic in late 2025. Harness engineering, building the scaffolding around the agent, took early 2026. Loop engineering is the June 2026 layer, and each step has pushed the human further from the keystroke and closer to the system design.

For enterprise leaders, three consequences matter. First, generation is becoming nearly free, which means engineering value concentrates in judgment: defining the right goal, the right termination condition, the right checks. That changes who you hire and what you reward. Second, the economics invert. A single agent in a loop consumes roughly four times the tokens of a normal chat session, and multi-agent loops up to fifteen times, with parallel loops at scale reaching seven-figure monthly bills. FinOps is no longer downstream of the AI strategy; it is the AI strategy. Third, governance gets harder, because a loop that runs unattended for an hour makes hundreds of decisions no human reviewed in real time.

The Spearhead Take
Budget and govern loops, not prompts. Require every production loop to ship with a testable termination condition, a hard cost ceiling, and full step-level observability before it runs unattended, the same way you would not deploy a cron job with no timeout. And reconsider the hire: the scarce person on your team is no longer the one who writes the cleverest prompt, it is the one who can precisely define what "done" looks like.
The Obvious & The Overlooked
What everyone saw, and what they did not.
The Obvious
The vocabulary moved from prompts to loops.
Osmani named the practice and Cherny says he writes loops, not prompts; the term took over developer discourse in two weeks. Addy Osmani
The leverage shifted from generation to judgment.
When producing code or text is nearly free, the scarce skill is deciding what to build and when it is finished. O'Reilly Radar
The labs are racing to own the full stack.
OpenAI's first chip, multi-billion compute deals, and a talent war all say the model is becoming the commodity. CNBC
The Overlooked
Loops blow up the bill.
A single agent burns roughly 4x the tokens of a chat, multi-agent loops up to 15x, and parallel loops reach seven-figure months. The constraint is now FinOps. Unblocked
"Loopmaxxing" is the new failure mode.
Pointing a loop at an unmeasurable goal makes it run forever, converting your cloud budget into API bills without converging. TechTalks
The real artifact is the termination condition.
"Make the tests pass" is a good loop goal because success is checkable; "improve the code" is not. The skill to hire is defining done. Oracle
The talent selloff and the loop meme are one story.
A two-résumé drop at Alphabet and a developer reframing the job around loops both say value is leaving the model for the system around it. Axios
Moving Pieces
Five developments worth a CIO's attention.
Workforce
The résumés that moved a trillion-dollar stock

Alphabet had its worst day in roughly a year after Bloomberg reported that Jonas Adler and Alexander Pritzel, both seen internally as core contributors to Gemini, are leaving for Anthropic, with the broader talent drain erasing as much as $200 billion in market value. The departures follow Noam Shazeer's move to OpenAI and Nobel laureate John Jumper's to Anthropic. The mechanism is pre-IPO equity: as Anthropic and OpenAI approach public listings, joining early can be life-changing, and Big Tech can match cash but not that. The enterprise read is about vendor risk. When you standardize on a model provider, you are betting on a roadmap that is downstream of a research team that can be poached. The best model is not a fixed property of a vendor, it is a temporary outcome of who is currently in the building.

Deals
OpenAI ships its first chip, and the full-stack land grab gets literal

OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom processor, an inference-only ASIC that reached tape-out in roughly nine months and is slated to deploy in late 2026. OpenAI says the design was accelerated by its own models and that early samples already run production workloads, including a Codex variant, at materially better performance per watt than current state-of-the-art parts. The strategic read is straightforward: inference is where enterprise AI bills actually accumulate, and owning the chip is how OpenAI attacks its own cost structure rather than renting margin to Nvidia. For buyers, the near-term implication is competitive pressure on inference pricing and one more sign that the frontier labs intend to own every layer between your prompt and the transistor.

Sources: CNBC · TechCrunch
Infrastructure
Reflection signs a $6.3B compute deal, and compute keeps eating the income statement

Reflection, an open-source AI startup, agreed to pay SpaceX roughly $150 million a month beginning July 1, 2026, through 2029, a contract worth about $6.3 billion that grants immediate access to Nvidia's top-tier GB300 chips via SpaceX's Colossus capacity. The deal echoes a pattern becoming the defining feature of this market: compute is no longer a line item, it is the business model. A relatively young company is committing billions in future cash to secure capacity, which only makes sense if you believe demand for inference will outrun supply for years. For enterprises, the signal is about scarcity. When startups lock in multi-year, multi-billion-dollar compute floors, the spot capacity you assume will be there for your own workloads is being pre-committed by someone with deeper pockets.

Sources: CNBC
Product
Anthropic puts Claude inside Slack channels with Claude Tag

Anthropic launched Claude Tag, a beta that lets teams tag @Claude directly into Slack channels to delegate tasks, connect tools and data, and hand off asynchronous work, with context memory and proactive updates across an Enterprise or Team workspace. The shift here is from a single-user assistant to a participant in the collaboration surface where work already happens. That is a meaningfully different product to govern: a tool that reads channels, retains context, and acts across connected systems raises the same data-access and audit questions that desktop agents do, just inside the messaging layer most enterprises treat as low-stakes. The capability is genuinely useful and the governance footprint is larger than it looks. Treat an agent in your channels with the same scrutiny you would give an agent on your desktops.

Policy
California and Colorado start writing AI layoffs into law

The regulatory response to AI-driven job cuts is moving from rhetoric to statute. California's proposed SB 951, the Worker Technological Displacement Act, would require 90 days' advance notice before AI-driven layoffs and disclosure of which AI systems were used, while Governor Newsom's May executive order directs the state to reform displacement protections within 180 days. Colorado's AI Act takes effect June 30, 2026, requiring employers to guard against algorithmic discrimination in employment decisions. No federal law yet requires disclosing AI's role in a layoff, which means the compliance map is fragmenting state by state. For any enterprise operating across jurisdictions, the practical takeaway is that "we used AI to become more efficient" is becoming a phrase with legal consequences attached, and the documentation requirements are arriving before the federal framework that would standardize them.

Sources: ABC7 · Foley & Lardner
On the Radar
Quick hits, sharpened.
Product Alteryx hands agent-building to the analysts. At Inspire 2026, Alteryx introduced Agent Studio and an MCP Server that let business analysts turn existing data workflows into autonomous agents without central IT, accelerating citizen-developer automation and the governance debt that comes with it. Build Fast with AI
Deals Getty bets its stock on being inside ChatGPT. A multi-year partnership lets OpenAI surface Getty's licensed photo and editorial library directly in ChatGPT search, and Getty shares soared more than 200% in a session, repricing what verified content distribution is worth. explainx.ai
Compute Google's eighth-gen TPUs are built for agents. The newest TPU generation spans two distinct chips and systems engineered specifically for agentic workloads, reinforcing that hyperscalers are now designing silicon around the agent loop, not just training. Google Cloud
Markets The AI trade got a reality check. The Nasdaq 100 fell 3.3% and the S&P 500 closed 1.4% lower amid talent-flight headlines and chip-stock jitters, reviving the "one big bubble" debate even as token consumption keeps climbing. NPR
Product MiniMax M2.5 sharpens the China cost alternative. The release is pitched as a cheaper substitute for frontier models from Anthropic, OpenAI, and Google, picking up enterprise interest as buyers look for pricing leverage. Build Fast with AI
The Number
15x
The token consumption a multi-agent loop can reach versus a single chat interaction. A single agent in a loop runs about 4x.
This is the hidden cost of the loop-engineering shift. The moment you stop typing prompts and let a system iterate on its own, you trade human time for machine tokens, and at scale that trade gets expensive fast: parallel loops running unattended can reach seven-figure monthly bills. The productivity story is real. So is the invoice. The teams that win with loops will be the ones who put a cost ceiling on every one of them before turning it loose.
Source: Unblocked
Counter-Signal
Risk
The loop that never stops is not autonomy. It is a runaway bill.

The pitch for loop engineering is seductive: design a system that runs your agent while you sleep. The failure mode arrived almost as fast as the term. Practitioners are already calling it "loopmaxxing," the assumption that pointing an agent at a problem and letting it iterate indefinitely will eventually produce a correct answer. It is the same mistake as the earlier "tokenmaxxing" phase, when teams believed throwing more inference at a task would resolve hard logic on its own.

It does not, and the reason is precise. A loop needs a binary, checkable exit condition to know when it is finished. "Make the tests pass" works because success is verifiable. "Improve the user experience" or "generate a viral strategy" strips the system of any concrete stopping point, so it never converges and never stops spending. The discipline that separates a reliable loop from a budget incinerator is not the loop itself, it is the guardrails around it: an explicit termination condition, plus secondary limits on cost, iterations, wall-clock time, and stall detection that kills the run when the agent makes the same call three times without progress. The industry is selling loops that run while you sleep. The responsible version is loops that stop on their own before they bankrupt you. Convenience says trust the loop. Durability says bound it.

Sources: TechTalks · Oracle
From the Field
Prompt engineering, then context engineering, then harness engineering, and now loop engineering. Four terms in three years, and every one of them moves the human further from the model and closer to the system around it.

That is not a coincidence. It is the whole industry telling you, in its own slightly exhausting vocabulary, where the value is going. And it rhymes with everything else that happened this week. OpenAI is building its own chip. A startup committed $6.3 billion to compute through 2029. Two researchers left Google and a trillion-dollar company lost a year of gains in a session. None of those are model announcements. All of them are bets on the layers that surround the model, because the model is becoming the commodity and the advantage is moving to the chips, the compute, the data, the people, and the loops that orchestrate all of it.

So when your engineers come to you talking about loops, do not file it under developer fashion. It is a preview of how the work itself is changing. The leverage in your AI organization is leaving the keystroke and moving into system design, and the budget is going to follow it whether you plan for that or not. The companies that do well in the back half of 2026 will be the ones that noticed the leverage moved, staffed for judgment over generation, and put a cost ceiling on every loop before it ran.

The skill stopped being how well you can ask. It is how precisely you can define what you actually want, and how you will know when you have it.
Let's get to production,
AK
Talk to Spearhead Forward this edition
This edition references Anthropic, whose Claude models include the one used to produce this newsletter; the Big Story features Anthropic's Boris Cherny and the Claude Code team, though loop engineering spans tools and was named by Google's Addy Osmani, and Anthropic and OpenAI are treated symmetrically here. Loop engineering is an emerging practitioner trend; sourcing leans on primary essays (Osmani), trade analysis (O'Reilly Radar, TechTalks), and vendor engineering writing rather than corporate filings. Token-cost multiples (4x, up to 15x) and seven-figure bill figures are practitioner and vendor estimates that vary by workload. Alphabet's stock-drop magnitude is reported inconsistently (roughly 5% on a closing basis to as much as 7% intraday across several sessions); the $200 billion figure reflects the broader drain, not one session, and is an order-of-magnitude estimate. Startup valuations, funding figures, and the Reflection compute terms are as reported by financial press and trackers. Produced for Spearhead with AI assistance and human editorial direction.
The Agentic Enterprise
Know more about AI than 95% of your peers. By 7 AM.
A daily AI intelligence briefing for enterprise leaders, published by Spearhead. We build AI systems that work. Strategy. Engineering. Production. Outcomes.
© 2026 Spearhead. All rights reserved.

Keep Reading