Enterprise AI is entering a new phase where agent usage, token pricing, tool calls, human review and business risk must be measured together. The winning companies will not be the ones running the most AI tasks, but the ones that understand the true cost per successful outcome.
Enterprise AI is moving from the toy box to the finance desk. The first wave felt almost magical because people could open a chatbot, ask for help, and get something useful back in seconds. A developer could ask for code. A marketer could ask for a campaign idea. A support team could ask for a customer reply. A manager could ask for a summary. That stage was exciting, and it proved the technology had real value. But the problem is that early excitement often hides early cost. When a few people experiment, the bill feels manageable. When thousands of employees and agents start reading files, calling tools, writing code, running tests, checking documents, retrying tasks and looping through workflows, the bill becomes something the CFO can see. That is where enterprise AI is heading now. It will not be judged by how many tasks agents can attempt. It will be judged by whether the final result is worth more than the tokens, tools, review time, rework and risk left behind.
The AI bill is becoming real
The old SaaS model was easy to understand. A company paid for a seat, gave an employee access, and mostly knew what the monthly cost would look like. AI agents are different. They behave more like cloud workloads. They consume resources while they work. They can be cheap for one task and expensive for another. They can reuse cached context, burn through fresh context, generate long outputs, call tools, run searches, trigger infrastructure and ask for human review. GitHub has already announced that Copilot plans will move to usage-based billing from June 1, 2026, replacing premium request units with GitHub AI Credits and calculating usage from input, output and cached tokens using model-specific API rates. OpenAI has also updated Codex pricing so that, from April 2, 2026, pricing aligns with API token usage instead of per-message pricing, with the change later extended to existing Enterprise, Edu, Health, Gov and teacher plans. This is the signal. AI is moving from fixed-fee software into metered work.
Token pricing changes the conversation
Tokens are not just a technical detail anymore. They are becoming a business unit. Every time an AI agent reads a long document, scans a codebase, reasons through a problem, drafts an answer, retries a failed step or calls a tool, cost can build. Anthropic’s Claude pricing shows the shape of this clearly, with Claude Opus 4.7 listed at $5 per million input tokens and $25 per million output tokens, while Claude Sonnet 4.6 is listed at $3 per million input tokens and $15 per million output tokens. The same pricing page also separates cache writes, cache hits and output tokens, which matters because long-running agents often reuse context and carry large working memory across tasks. Tool use adds another layer, because Claude’s documentation says tool-use requests are priced from the total input tokens sent to the model, output tokens generated, and additional usage-based pricing for server-side tools such as web search. What this really means is that the cost of an AI task is not just “the model answered.” It is everything the agent had to read, think, write and touch to get there.
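To make that arithmetic concrete, here is a minimal sketch using the per-million-token rates quoted above. The breakdown of a run into fresh, cached and output tokens is a hypothetical example, and the cache-read discount is an illustrative assumption rather than a published rate.

```python
# Estimate the dollar cost of one agent run from token counts.
# Rates are the per-million-token prices quoted above; the cache-read
# discount is an illustrative assumption, not a published rate.

RATES = {
    "opus":   {"input": 5.00, "output": 25.00},
    "sonnet": {"input": 3.00, "output": 15.00},
}

def run_cost(model: str, input_tokens: int, output_tokens: int,
             cached_tokens: int = 0, cache_discount: float = 0.9) -> float:
    """Cost in dollars for a single run; cached input is billed at a
    discounted rate (assumed here to be 10% of the input price)."""
    r = RATES[model]
    fresh = input_tokens - cached_tokens
    return (fresh * r["input"]
            + cached_tokens * r["input"] * (1 - cache_discount)
            + output_tokens * r["output"]) / 1_000_000

# A single agent task that reads 200k tokens (150k of them cached)
# and writes 8k tokens:
print(round(run_cost("sonnet", 200_000, 8_000, cached_tokens=150_000), 4))
```

Even a rough calculator like this makes the point: a single long-context agent task costs real money, and a workflow that runs it thousands of times a day is a line item, not a rounding error.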
The Uber warning shot
Uber has become a useful warning story for this new phase. The Information’s public listing says Uber’s surging use of AI coding tools, especially Claude Code, maxed out its full-year AI budget only a few months into 2026, though the article itself is paywalled and the internal details should be treated carefully unless independently confirmed. The point is not to pick on Uber. The point is that a serious company can find itself with serious AI adoption and serious budget pressure at the same time. That is the new reality. If AI coding tools help engineers ship more work, the productivity case may be real. But the meter is also running. Every repository scan, test cycle, code review, retry, plan, tool call and failed attempt has to be paid for somewhere. The agent may save engineering time, but the business still needs to know whether the saving is bigger than the total cost of the run.
The metric that matters
The strongest metric is not prompts sent. It is not agents launched. It is not lines of code generated. It is not documents summarised. The strongest metric is cost per successful outcome. That means measuring what the business actually received from the AI, not how busy the AI looked while producing it. A company should ask what it cost to get one accepted pull request, one resolved customer support ticket, one completed finance reconciliation, one approved marketing campaign, one legally safe document review, or one completed customer workflow. The important words are successful, accepted and verified. If an agent creates 50 pull requests and only 10 are accepted, the business should not count 50 wins. It should count 10 successful outcomes and include the cost of the failed 40. That is the difference between AI adoption and AI unit economics.
The formula is simple
A useful AI cost governance formula starts with total AI agent cost. That includes token cost, tool cost, compute cost, human review cost, rework cost, monitoring cost and risk cost. Then the business compares that against business value, which may include labour saved, faster delivery, extra revenue, reduced errors, better customer experience and avoided risk. The proper metric becomes total cost of AI-assisted work divided by the number of accepted, useful and verified outcomes. This sounds plain, but plain is powerful. It stops companies from treating AI output as success by default. It asks whether the completed work actually survived contact with the business. If a result needs heavy human cleanup, causes customer confusion, introduces security risk or creates a compliance problem, it should not be counted as a cheap win.
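The formula described above can be sketched in a few lines. All figures in the example are hypothetical; the point is that failed attempts and human time sit in the numerator while only verified outcomes sit in the denominator.

```python
# Cost per successful outcome: total cost of AI-assisted work divided
# by accepted, verified results. All figures below are hypothetical.

def cost_per_outcome(token_cost: float, tool_cost: float, compute_cost: float,
                     review_hours: float, rework_hours: float,
                     hourly_rate: float, accepted_outcomes: int) -> float:
    """Total AI-assisted cost / number of accepted, verified outcomes."""
    if accepted_outcomes == 0:
        return float("inf")  # all spend, no verified result
    total = (token_cost + tool_cost + compute_cost
             + (review_hours + rework_hours) * hourly_rate)
    return total / accepted_outcomes

# 50 pull requests attempted, 10 accepted: the failed 40 still paid
# for tokens, tools, compute and review time.
print(cost_per_outcome(token_cost=120.0, tool_cost=30.0, compute_cost=50.0,
                       review_hours=8, rework_hours=4, hourly_rate=75.0,
                       accepted_outcomes=10))
```

Notice that the human review and rework hours dominate the total here. That is typical: in many workflows the model bill is the smaller half of the unit cost.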
Coding needs accepted pull requests
Coding is the easiest place to see the problem. An AI coding agent can read thousands of lines of code, inspect dependencies, write a patch, run tests, fail, retry, rewrite the patch, request review, respond to comments and eventually open a pull request. On a dashboard, that might look like high productivity. But the true measure is not how much code the agent wrote. The true measure is cost per accepted pull request. That cost includes model usage, tool use, repository scanning, CI/CD compute, security review, developer review time, failed attempts and later rework. If the pull request saves four hours of developer time and passes safely, the economics may look excellent. If it takes two hours to clean up and quietly introduces a bug, the same AI-generated pull request may become expensive. Lines of code are activity. Accepted pull requests are outcomes.
Support needs resolved tickets
Customer support is another obvious AI target because agents can read a customer issue, check account history, search product documentation, propose a fix and draft a reply. That can save a lot of time. But the metric should not be replies generated. It should be cost per resolved ticket. A fast answer that does not solve the problem is not a cheap answer. It is the first step in a longer, more expensive customer journey. The customer comes back frustrated, a human has to step in, the support queue grows, and trust drops. In support, the hidden cost is often escalation. The AI may make the first response cheaper, but if the resolution rate falls, the business loses. A proper support dashboard should connect AI cost to actual resolution, customer satisfaction, repeat contact rate and human handoff rate.
Finance needs accurate reconciliations
Finance work is tempting for AI because so much of it involves matching documents, checking records, reading emails and handling repetitive workflows. Agents can help match invoices, purchase orders, receipts, tax records and supplier messages. But finance is not a playground. The cost per completed finance reconciliation has to include the AI run, accounting review, exception handling, audit trail storage and correction work. If an agent completes 1,000 reconciliations cheaply but creates 30 hidden errors, the business has not saved money. It has created an audit problem. In finance, accuracy is part of the cost model. A cheap reconciliation that later causes a compliance issue is not cheap at all. It is deferred risk.
Marketing needs approved assets
Marketing teams can use AI to generate mountains of content. That can feel productive because the volume is huge. A model can create taglines, ad variants, landing page drafts, email subject lines, product blurbs and social posts all day long. But most generated content is not a business outcome. The better metric is cost per approved campaign asset. If the model creates 200 ad variants and only five are reviewed, approved, published and used, the cost should be divided across those five useful assets. The other 195 still consumed tokens, attention and review time. This is one of the traps of enterprise AI. It makes it easy to produce more, but more can simply mean more material for humans to sort through. The winning marketing teams will not be the ones generating the most. They will be the ones approving, publishing and learning from the most useful work.
Legal needs safe reviewed documents
Legal AI looks powerful because models can read contracts, compare clauses, summarise risks and highlight unusual terms. But legal work carries a heavy accountability burden. The proper metric is cost per legally safe reviewed document. That includes AI usage, lawyer review time, risk classification, audit logs and follow-up action. A model may reduce the first-pass workload, but it cannot be treated as a magic accountability machine. If an AI misses a bad clause, misreads an obligation or gives false confidence, the cost can become much higher than the money saved on review. The business should not measure legal AI by document volume. It should measure by safe, reviewed, accountable outcomes.
Operations needs completed workflows
Enterprise agents will not stay inside coding tools. They will move into HR onboarding, procurement, compliance, reporting, claims processing, scheduling, sales administration and customer workflows. Microsoft’s Azure SRE Agent pricing shows where this is heading. Its billing has a fixed always-on component and a variable active-flow component based on the LLM tokens consumed while the agent works, with active work covering interactive questions, automations and asynchronous background tasks. That is important because it shows that agentic billing is not just a developer problem. A business agent sitting inside operations can have a standing cost and a variable usage cost. The metric therefore becomes cost per completed workflow, not how many times the agent ran.
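The billing pattern described above, a standing always-on cost plus a token-metered active component, can be sketched as follows. The rates are hypothetical and do not reflect Azure's actual pricing.

```python
# Monthly cost of an always-on operations agent: a fixed standing cost
# plus a variable component metered on tokens consumed while it works.
# All rates here are hypothetical, not Azure's actual prices.

def monthly_agent_cost(hours_on: float, standing_rate_per_hour: float,
                       active_tokens: int, price_per_million: float) -> float:
    fixed = hours_on * standing_rate_per_hour
    variable = active_tokens / 1_000_000 * price_per_million
    return fixed + variable

# A 730-hour month with 40M tokens of active work:
print(monthly_agent_cost(730, 0.10, 40_000_000, 4.00))
```

The fixed component is why "cost per completed workflow" matters more than "cost per run": an idle agent still costs money, so the standing fee has to be amortised across the workflows it actually finishes.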
Raw usage can fool the business
Raw AI usage can make a team look modern while hiding waste. A department can send more prompts, run more agents and generate more output, but that does not mean the business is better off. High usage may mean the agent is looping. It may mean the prompt is badly designed. It may mean employees are experimenting without a real use case. It may mean the wrong model is being used for simple work. It may mean the agent is doing work nobody needs. GitHub’s move to token-based billing makes this clearer because usage depends on model choice and token consumption, and additional usage is billed once included allowances are exceeded. The business dashboard has to change. “AI adoption” is not enough. “Active users” is not enough. The new dashboard needs AI unit economics.
Governance brings visibility
The first advantage of AI cost governance is visibility. Companies can finally see which AI use cases are creating value and which are only creating invoices. That matters because AI spending can spread quietly. A few pilots become team experiments. Team experiments become daily habits. Daily habits become background cost. Without clear tracking, nobody knows which workflow is worth scaling and which one should be stopped. FinOps Foundation describes FinOps as a framework for maximizing technology business value, enabling data-driven decisions and creating financial accountability through collaboration between engineering, finance and business teams. That same idea now has to move into AI agents. Finance cannot govern AI alone. Engineering cannot govern AI alone. The value only becomes clear when the people building, funding and using the system look at the same numbers.
Governance brings discipline
The second advantage is discipline. Teams become more careful about model choice, context size, tool access, retries and approval gates. Expensive frontier models should be saved for hard work. Cheaper models should handle simple tasks. Small tasks should not drag an entire codebase into context if a targeted file will do. Agents should not retry endlessly. Long-running workflows should have cost limits. This does not mean killing experimentation. It means removing waste. The FinOps Foundation’s AI guidance includes token consumption metrics, cost per inference, cost per API call, anomaly detection and value for AI initiatives as ways to connect usage with value and spot cost spikes. That is the discipline enterprise AI needs now. Not less AI. Smarter AI.
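One concrete form of that discipline is model routing. A minimal sketch, with model names, prices and difficulty thresholds all illustrative:

```python
# Route tasks to models by estimated difficulty so frontier pricing is
# reserved for hard work. Names, prices and thresholds are illustrative.

ROUTES = [
    # (max_difficulty, model, $ per million input tokens)
    (0.3, "small-fast-model", 0.25),
    (0.7, "mid-tier-model", 3.00),
    (1.0, "frontier-model", 15.00),
]

def route(difficulty: float) -> str:
    """Pick the cheapest model whose band covers the task difficulty."""
    for max_difficulty, model, _price in ROUTES:
        if difficulty <= max_difficulty:
            return model
    return ROUTES[-1][1]  # fall back to the frontier model

print(route(0.2))   # simple extraction task
print(route(0.9))   # multi-file refactor
```

Real routers estimate difficulty from the task itself (context size, tool requirements, past failure rates), but even a crude banding like this can cut the blended cost of a workflow dramatically when most tasks are simple.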
Governance improves procurement
The third advantage is better procurement. Companies should not buy AI tools only because they are fashionable. They should negotiate around usage patterns, included allowances, data controls, model routing, reporting, budget caps and outcome evidence. AI vendor conversations will start to sound more like cloud conversations. What is the cost per unit of work? What happens when usage spikes? How are tokens counted? Are cached tokens cheaper? Are tool calls separate? Can admins set budget caps? Can usage be allocated by team, workflow and project? Can the company see which models are driving the bill? These questions are not boring details. They decide whether a pilot can scale without surprising finance.
Governance prevents scaling shocks
The fourth advantage is safer scaling. A pilot with 20 users may look cheap. The same workflow across 5,000 employees may become expensive very quickly. That is especially true for agents that read long context, use advanced models and run repeated tool calls. FinOps Foundation’s generative AI cost tracker guidance says token usage is the primary unit of measurement for tracking and attributing AI workload costs, and it warns that production usage can easily cross billions of tokens per month. That is the scaling lesson. AI cost governance should arrive before the enterprise rollout, not after the bill shock. The worst time to build controls is after everyone has already built habits around uncontrolled usage.
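A projection like the one above takes only a few lines. The per-user figures here are hypothetical, but they show how a pilot that looks cheap crosses into billions of tokens per month at enterprise scale:

```python
# Project pilot spend to enterprise scale. Per-user task counts, token
# sizes and the price are hypothetical.

def monthly_projection(users: int, tasks_per_user_per_day: int,
                       tokens_per_task: int, price_per_million: float,
                       workdays: int = 22):
    tokens = users * tasks_per_user_per_day * tokens_per_task * workdays
    return tokens, tokens / 1_000_000 * price_per_million

pilot = monthly_projection(20, 15, 60_000, 4.00)
rollout = monthly_projection(5_000, 15, 60_000, 4.00)
print(f"pilot:   {pilot[0]:>15,} tokens  ${pilot[1]:,.0f}")
print(f"rollout: {rollout[0]:>15,} tokens  ${rollout[1]:,.0f}")
```

The rollout figure is 250 times the pilot figure, which is exactly the point: linear per-user costs do not feel linear to a finance team that approved the pilot number.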
Governance is not all upside
There are downsides too. Too much cost control can create friction. If every AI action needs approval, workers may stop using tools that could help them. Some AI value is also hard to measure. It is easy to count resolved support tickets. It is harder to measure faster thinking, better research, improved confidence or sharper decision making. Metrics can also be gamed. If teams are judged only on cost per outcome, they may avoid hard tasks or rush risky work through to make the numbers look good. There is also a risk of underinvestment. A new AI workflow may look expensive while the company is still learning. If finance cuts it too early, the business may miss long-term value. A clean dashboard can create false certainty, so cost governance must always travel with quality, safety and human review.
The risk cost is real
Cost is not only money paid to the model provider. Cost also includes the damage caused when an agent makes a bad decision with real access. Business Insider reported that the founder of PocketOS said a Cursor AI agent accidentally deleted the startup’s production database and backups through a nine-second API call to Railway, causing customer disruption, with Railway later saying the data was recovered and that the endpoint had been patched. The same report noted expert advice that companies should use safeguards such as read-only access, human-in-the-loop checkpoints and working on data copies. That incident is a perfect example of why cost per successful outcome must include risk. A cheap task that can delete production data is not cheap. It is a liability with a prompt box.
Verification is part of the bill
Australia has already seen what happens when AI-assisted work reaches public-sector reporting without enough verification. AP reported that Deloitte Australia agreed to partially refund the Australian government after a AU$440,000 report was found to contain apparent AI-generated errors, including a fabricated court quote and references to nonexistent academic research, and the revised version disclosed that Azure OpenAI had been used in writing the report. This matters because many companies still treat human review as a nuisance. It is not. Human review is part of the cost of producing a safe outcome. If a report, legal document, customer answer or code patch needs verification, that verification belongs in the unit economics. The AI output is not the finish line. The verified result is.
The new AI governance stack
A proper AI cost governance stack needs several layers working together. The first is token tracking, so the company knows which team, workflow, model and agent is consuming the most. The second is outcome tracking, so AI runs connect to business results such as pull requests accepted, tickets resolved, invoices processed, reports approved or customers retained. The third is model routing, so expensive models are used for hard work and cheaper models for simple work. The fourth is agent limits, including caps on run time, retries, tool calls, context size, file access and production permissions. The fifth is human approval gates for destructive, financial, legal, security-sensitive or customer-facing actions. This is not bureaucracy for the sake of it. It is the operating system for AI agents inside real businesses.
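The agent-limit and approval-gate layers can be sketched as a budget-capped run loop. The structure is illustrative; a real implementation would wire in an actual model client, action executor and audit log.

```python
# A budget-capped agent run: hard limits on tool calls and spend, plus a
# human approval gate for destructive actions. Structure is illustrative.

class BudgetExceeded(Exception):
    pass

class GovernedAgent:
    def __init__(self, max_tool_calls: int = 20, max_cost: float = 5.00):
        self.max_tool_calls = max_tool_calls
        self.max_cost = max_cost
        self.tool_calls = 0
        self.cost = 0.0

    def charge(self, dollars: float) -> None:
        """Accumulate run cost and stop the moment the cap is crossed."""
        self.cost += dollars
        if self.cost > self.max_cost:
            raise BudgetExceeded(f"run cost ${self.cost:.2f} over cap")

    def call_tool(self, name: str, est_cost: float) -> None:
        if self.tool_calls >= self.max_tool_calls:
            raise BudgetExceeded("tool-call limit reached")
        self.tool_calls += 1
        self.charge(est_cost)

    def destructive_action(self, description: str, approved: bool) -> bool:
        # Human-in-the-loop gate: production-touching actions need sign-off.
        return approved

agent = GovernedAgent(max_cost=1.00)
agent.call_tool("read_repo", 0.40)
try:
    agent.call_tool("run_tests", 0.80)   # pushes the run over its $1 cap
except BudgetExceeded as e:
    print("stopped:", e)
```

The useful property is that the limit fires mid-run, not on the monthly invoice. An agent that stops at its cap is an operational event; an agent that loops all weekend is a budget event.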
AI FinOps is the next layer
The rise of AI FinOps is the clearest sign that this is becoming a serious business discipline. FinOps Foundation’s 2026 report says FinOps for AI is the top forward-looking priority, AI cost management is the number one skillset teams need to develop, and 98 percent of respondents now manage AI spend, up from 31 percent two years earlier. That is a huge shift. It means AI cost management is no longer niche. It is becoming mainstream. The old cloud lesson is being relearned in a new form. The cloud taught companies that elastic infrastructure is powerful but dangerous without accountability. AI agents are teaching the same lesson again. Consumption-based intelligence needs consumption-based governance.
The CFO will not kill AI
Some people will hear “AI cost governance” and assume it means slowing everything down. That is the wrong way to see it. The CFO arriving does not mean AI is finished. It means AI is important enough to be managed properly. Nobody would say cloud computing failed because companies built cloud cost dashboards. Nobody would say DevOps failed because teams added observability. The same is true here. AI agents need dashboards, limits, routing, review and outcome metrics because they are becoming part of how work gets done. The goal is not to stop people using AI. The goal is to stop useful AI from being buried under waste, risk and surprise bills.
What changes next
The next phase of enterprise AI will be less impressed by demos and more interested in unit economics. Leaders will ask whether an AI agent actually reduced support cost, improved engineering throughput, shortened finance close, sped up legal review, improved campaign performance or reduced operational risk. They will ask how much it cost per completed outcome. They will ask whether the same workflow stays profitable at scale. They will ask what happens when the model changes, when token usage doubles, when a tool call fails, when an agent loops, or when human review takes longer than expected. This is where AI becomes a normal business capability. It will still be powerful, but it will no longer sit outside the cost model.
The final takeaway
AI agents will not be judged by activity. They will be judged by unit economics. If the value of the work does not exceed the cost of the run, the agent is not saving money. It is just automating spend. That is the line every enterprise should take seriously now. The winners will not be the companies running the most agents, sending the most prompts or generating the most output. The winners will be the companies that know what each successful outcome costs, what risk sits behind it, and when the agent is genuinely worth using. The next phase of AI belongs to the businesses that can connect intelligence to value, not just activity.