📈 Why AI bills rise as costs fall

May 25

Boo 👻 it's the ghost token

3 Comments

This all makes sense but it is also going to be a passing phase. AI will get more efficient at using tokens and the level of use will become more predictable as we learn.

Reply (1)

Azeem Azhar

May 25

Yeah, agree. That's what we say.

Scott

May 27

This is the part CFOs will feel before most AI teams have language for it.

The price per token can keep falling while total AI spend keeps climbing, because agentic work is mostly hidden context loading, tool calls, retries, checks and evaluation. The unit of cost is no longer “one prompt.” It is a workflow.

The next layer after observability is reuse.

If every agent has to re-read the same repo, rediscover the same policy, repeat the same failed path, and relearn the same human correction, monitoring only tells you how expensive your forgetting is.

The companies that get this right will track token spend by team, workflow and customer, yes. But they will also capture which context, corrections and decisions actually improved outcomes, then make those reusable by the next run.

That is where the unit economics start to change: fewer ghost tokens, fewer repeated tool calls, fewer cold starts.

This is exactly why we are building Memco as shared memory for agentic work. One agent learns. The next one should not pay to rediscover it.

Exponential View

📈 Why AI bills rise as costs fall