Analysis · Cost of AI

Why agentic AI costs up to 30x more, and how to budget for it

An AI agent is not a single model call. It is a chain of them: the agent plans, calls a tool, reads the result, re-sends the accumulating context, reasons again, and repeats until the task is done. Each loop pays for all the context before it, so a task that a chatbot answers in one call can cost an agent many times more. By one EY estimate, an orchestrated agentic interaction in 2026 can cost around thirty times a simple 2023 workflow. The number that matters is not the price per token; it is the cost of one completed task.

Agent cost, in one line
Cost per task = Σ over steps of (tokens at that step × price) + tool and retrieval calls + human review, all divided by the task success rate.
Why agents are different
5-30×
more tokens per task than a single chatbot call, because agents re-send accumulating context at every step.
EY, 2026
~$1.20
estimated cost of one orchestrated agentic interaction in 2026, versus a few cents for a 2023 linear workflow.
EY, 2026
per task
the right unit of cost is one completed, validated task, not one token or one call.
Practitioner consensus, 2026

Figures are attributed and illustrative of the direction of travel; confirm against your own agent traces before budgeting. The structural point, that context is re-sent and so cost compounds, is not in dispute.

Where the cost hides

The single biggest hidden driver is re-sent context. At each step an agent typically re-supplies the conversation so far, the tool definitions, and the intermediate results. By the final step of a long task, most of the tokens being paid for are not new work, they are the same context sent again and again. Analysts estimate re-sent context can be the majority of an agent bill. Add to that the cost of tool calls, retrieval, and the reasoning tokens that newer models generate before they answer, and a single task fans out into dozens of paid calls.

The second hidden driver is failure. An agent that completes a task on the second or third attempt has paid for the failed attempts too. If a class of task succeeds only 60 percent of the time, the true cost per completed task is the cost of the attempts divided by 0.6, plus the human time spent checking and re-running. A cost-per-token view never sees this; a cost-per-completed-task view makes it the headline.

Why flat pricing breaks

Because agent cost is variable and task-driven, any AI feature sold at a flat monthly price carries margin risk the moment a heavy user arrives. Through 2025, several coding-tool vendors repriced away from flat plans for exactly this reason: a power user running an agent all day could consume far more than the subscription covered. The lesson for any company embedding agents is that the pricing of the feature and the cost of the feature have to be designed together, which requires knowing the cost per task before you set the price.

HOW AGENT COST COMPOUNDS PER STEP

Illustrative. Each step re-sends the context before it, so the token cost of a multi-step task grows faster than the number of steps. The failed attempts that precede a success are paid for too.

With an agent, you are not paying per answer. You are paying per attempt, and the context is on the meter every step of the way.

Common questions

How much do AI agents cost to run?
Far more per task than a single model call. Agents re-send accumulating context and loop through many steps and tool calls, consuming five to thirty times more tokens per task than a chatbot by industry estimates. The cost also depends on how often the agent succeeds: failed attempts are paid for too. The only reliable figure is your own cost per completed task, measured from real agent traces.
Why are AI agents so much more expensive than a chatbot?
Because a chatbot answers in roughly one call, while an agent plans, calls tools, reads results and reasons over many steps, re-sending the context each time. Re-sent context can be the majority of the bill. Newer reasoning models also generate extra tokens before they answer. The cost compounds with the length of the task.
What is cost per completed task?
It is the total token, tool and human-review cost of getting one task done, including the attempts that failed before it succeeded, divided by the success rate. It is the honest unit of agent cost, because it captures retries and oversight that a cost-per-token figure ignores.
How should we budget for agentic AI?
Budget by task, not by token. Estimate the cost of one completed task for each agent use case, multiply by expected volume, and add a margin for the failure rate and human review. Then check that figure against the value the task creates. This is Time-Driven Activity-Based Costing applied to agents, and it is how you avoid an end-of-quarter surprise.

Know what your AI agents cost per task.

The Profit Check shows where your cost to serve, agents included, is hiding, in five minutes.

Take the Profit Check
M
Ask us anything
usually replies in minutes
Hi. I can answer the quick questions about cost, method and timing right here. For anything specific to your business, I'll hand you to Miguel on WhatsApp.
Free. No bot loops. Straight to a person.