engineering notes

What AI-agent development costs and what drives the price

What makes up the cost of AI-agent development: process, integrations, human control, operation and token cost.

In brief for executives. There is no fixed price for “an AI agent” — for the same reason there is no price for “an employee”: cost depends on the work it does. The key thing for budgeting: the main cost of an AI agent is not development but operation and the cost of models in operation. A project costed by development alone almost guarantees a budget overrun in production. The right question is not “what does an agent cost” but “what is the full cost of ownership of the process it closes, and over what period does it pay off”.

“How much does it cost to make an AI agent” is the most frequent question on the first call and the most meaningless one detached from the process. It can be answered honestly only by breaking down what the price is made of and which part usually stays invisible until launch.

An agent’s main cost is not development, but operation.

Hypothesis: the price is set by the process and operation, not “building a bot”

An AI agent’s cost is not the cost of writing a prompt and connecting a model. It is the cost of designing a process, embedding it into your systems, ensuring control, and paying for its work every day after launch. Development is the visible part; operation is the part that determines the final figure.

Problem: people expect a price list for “an agent”

A request for a fixed price for “an agent” comes from the model “an agent is a product with a price tag”. In practice one “agent” can be a one-day automation of a single decision, and another a process with a dozen integrations, human control and a reliability requirement. Naming one price for both means deceiving in one of the cases. So a serious contractor first analyzes the process and then names a figure; a figure named without analysis is either a padded insurance or a future overrun.

Why the usual approaches don’t work

A “by development” estimate misleads because it ignores the largest line item — the system’s work in production. An AI agent, unlike ordinary software, costs money on every request: each step is tokens. A system without cost control can work correctly while burning the budget: long context, repeat calls, silent loops. This is invisible in a demo with dozens of requests and becomes the main line item in production with millions. An “by analogy” estimate also fails: agents that look alike can differ in cost of ownership by multiples due to different error cost and control requirements.

data

Model inference price falls year over year

9–900×

annual inference-price drop — depending on the task

$20 → $0.07

per 1M tokens at GPT-3.5 level in ~18 months

The per-token price is collapsing — but that does not remove the need to manage cost: the token gets cheaper, not the habit of stuffing everything into context and calling the model needlessly.

Source: Stanford HAI, AI Index Report 2025 https://hai.stanford.edu/ai-index/2025-ai-index-report

(This fact cuts both ways: models get cheaper, but that doesn’t remove the need to count cost — the per-token price falls, not the architectural habit of stuffing everything into context.)

Engineering model: what the price is made of

The cost of an AI agent decomposes into clear drivers.

Process complexity. How many decisions the agent makes, how many steps and branches it has, how variable the input is. One clear step and a ten-step process with exceptions are different orders of effort.

Integrations. Every external system (CRM, mail, internal services) is work to connect, to handle its failures and to isolate from its instability. Often this is more work than the “AI” itself.

Control and reliability requirements. Handoff to a human, decision audit, guarantees on exceptions cost money — and the more, the higher the error cost. This is not overhead — it is exactly what separates a system from a demo.

Operation and model cost. Tokens on every step, monitoring, support, re-tuning to a changed process. This is a recurring, not a one-off, line item, and it is designed into the architecture (context budgets, per-step model choice), not optimized after the bill.

Practical takeaway for business

Count cost of ownership, not development. Request the estimate in two figures: what it costs to build and what it will cost to run per month at your volume. A project without the second figure is not estimated.

Tie the price to the process and the payback period. The correct framing is “automate this process, effect such-and-such, payback in so much time”, not “develop an agent for a fixed sum”. If a contractor is ready to name a price without analyzing the process — that is a risk, not a convenience.

Design cost control in from the start. Per-step model routing, context budgets, breaking silent loops and spend tracing cost a few days of work if done in advance, and weeks of investigation plus temporary feature shutdown if done after the bill. This is the most manageable part of the budget — provided it was thought of before launch.

Apply this to your processes — .

Open questions

Fixed price or time-and-materials for AI systems is an open question: the process is often refined during the pilot, and a hard fixed price shifts risk into quality. How to correctly factor model price decline into a long-term budget is debatable: the per-token price falls, but context volume in a growing system grows, and these trends partly cancel out. How to estimate cost of ownership before the pilot — only by a range; the precise figure appears after measuring on the real flow.

If you have a process you’re considering an AI agent for — name it and your request volume, and the cost of ownership can be estimated as a range before the start. — we’ll break down the price drivers and the payback period for your process.