3 Comments
User's avatar
Kenny Fraser's avatar

This all makes sense but it is also going to be a passing phase. AI will get more efficient at using tokens and the level of use will become more predictable as we learn.

Azeem Azhar's avatar

Yeah, agree. That's what we say.

Scott's avatar

This is the part CFOs will feel before most AI teams have language for it.

The price per token can keep falling while total AI spend keeps climbing, because agentic work is mostly hidden context loading, tool calls, retries, checks and evaluation. The unit of cost is no longer β€œone prompt.” It is a workflow.

The next layer after observability is reuse.

If every agent has to re-read the same repo, rediscover the same policy, repeat the same failed path, and relearn the same human correction, monitoring only tells you how expensive your forgetting is.

The companies that get this right will track token spend by team, workflow and customer, yes. But they will also capture which context, corrections and decisions actually improved outcomes, then make those reusable by the next run.

That is where the unit economics start to change: fewer ghost tokens, fewer repeated tool calls, fewer cold starts.

This is exactly why we are building Memco as shared memory for agentic work. One agent learns. The next one should not pay to rediscover it.