🔮 Sunday edition #525: Integrating agents; token crunch; China’s EV victory; flame-throwing robots, big brains & Claude controls++
An insider’s guide to AI and exponential technologies
Hi, it’s Azeem.
This week, we explore the shift from raw AI horsepower to systemic integration. Models are being wired into feedback loops, infrastructure and ecosystems. From Claude 4’s autonomous coding sprints to the rise of open agent protocols, the “agentic web” is no longer a theory – it’s being built. Across sectors, the race is on to operationalize AI.
Let’s go!
The agentic web is arriving
2025 was billed as the year of AI agents and in many ways it has arrived. Google, OpenAI and Anthropic ship agents that code on-demand and fetch citations in minutes. Codex can spin up a bug fix faster than you can draft a ticket. But the question is no longer how smart they are – it’s whether they can run unattended across systems. Today, they can’t - even a 1% hallucination rate can unravel a long task chain.
Still, progress is unmistakable. Rakuten, a Japanese tech conglomerate, let Claude 4 refactor code for seven hours with zero intervention. Each win is still shadowed by lethal slips – Claude 4, for all its impressiveness, still bungled “What’s 9.9 minus 9.11” – proof that 1 percent error still matters.
Microsoft CTO Kevin Scott calls this build-out the “agentic web,” a mesh of AIs working through shared protocols. Apple’s forthcoming Intelligence SDK will hand those sockets to every developer, and emerging standards – MCP for tools, Agent2Agent for AI-to-AI chat – are becoming the thread tape.
The pipes are going in, the journeyman is still on probation, but every new coupling cuts leaks and gets us closer to “hands-free” AI.
See also:
Sam Altman and Jony Ive’s AI device collaboration targets late 2025 – will new interfaces push agent potential further?
Google unveiled Veo 3, an AI model that generates talking video. Its realism adds to worries about a flood of synthetic media.
Ravenous reasoning agents
Google’s inference load soared 50-fold in just 12 months, from 10 trillion tokens in April 2024 to 480 trillion in April 2025. More users played a role, but the bigger driver is the rise of reasoning models, which consume about 17 times as many tokens as predecessors because they run long internal chains of thought.

This exponential growth will continue as agentic workloads become more commonplace.
For instance, I spent 2 million tokens building a small video game with Claude in under an hour; a typical chatbot session might use only 10,000 tokens in the same time. Chatbots wait for humans to type, while agents run continuously, limited only by available compute. That translates into steep prices: Google AI Ultra costs $250 per month, Claude Max $100–200, and ChatGPT Pro $200. OpenAI is reportedly thinking of a $20,000-per-month tier.
Last week, we showed how training costs keep sliding down predictable curves, but inference is different. Although Moore’s Law and better algorithms cut the cost per token, demand still grew 50 times last year, far faster than efficiency gains. Whether the economics of inference can keep up with our appetite for longer contexts, deeper reasoning, and always-on agents is an open question.
We may need to apply hard budgets to autonomous agents – just like the robot police in THX 1138, who abandon a chase once the cost crosses a set threshold. This kind of cost-governed autonomy where agents must justify, cap, or cancel actions based on compute limits, could become a defining constraint. The inference bottleneck won’t kill agentic AI, but it might force it to act with surgical precision.
Grid strain or grid brain?
AI consumes energy; everything does. The real issue is whether the electricity we spend on AI will help us fight climate change. I believe it will.
RAND projects that by 2030 AI could draw about 327 GW – roughly 3.6–4 percent of today’s global generating capacity. That load is significant, but AI can also discover ways to shrink its own footprint and accelerate climate solutions.
Consider Microsoft’s Discovery platform, which designed a PFAS-free data-center coolant in hours instead of years and FutureHouse’s Robin AI, which identified promising age-related macular degeneration drug candidates in weeks. The same processors driving up power bills can uncover breakthroughs that cut them.
Local strain is real. Collectively, around a dozen hyperscaler projects in Nevada have asked NV Energy for nearly 6 GW of new electricity capacity – about 40 percent of the state’s entire grid. Governments should insist that new data centers run as much as possible on clean power and recycle waste heat.
Is the trade-off worth it? Yes. If AI expansion is paired with aggressive clean energy build-outs, the technology will help decouple economic growth from emissions. Limiting AI today to save energy would throttle a tool that can slash energy use tomorrow.
See also:
Rapid “event-attribution” studies now use AI to link extreme weather to climate change in weeks. The World Weather Attribution group has run more than forty such analyses, giving hard numbers on how warming raised the odds or severity of events.
Shenzhen’s hardware hegemony
In recent years, automakers have fought for the industry’s future. We previously argued that Germany’s automotive sector faced an existential threat. Today it is clear that China has won the electric vehicle hardware race.
Evidence is everywhere. BYD claims its batteries can recharge in five minutes. Xiaomi delivers Ferrari-like styling at Volkswagen prices and Morgan Stanley projects Xiaomi will earn $32 billion in automotive revenue by 2027— about the size of Tesla’s entire auto business in 2020. China also controls the entire supply chain – batteries, motors and electronics. The advantage is not only speed but an ecosystem density the West cannot replicate.
Hardware dominance, however, is only half the story. Morgan Stanley analyst Adam Jonas notes that value is migrating from hardware to software. Tesla’s pivot from building the “best car” to achieving the “best autonomy” shows that once hardware commoditizes, software differentiates. Profit pools now lie in over-the-air updates, data services, and robotaxi networks. Algorithms and regulatory approvals, not sheet metal, create the new moats.
China owns the hardware victory, but the software endgame remains wide open. Either Tesla wins here or it doesn’t win at all.
See also:
Xiaomi will invest $6.9 billion in chipmaking, pursuing vertical integration that reaches beyond its electric-car unit.
Analyst
argues China is moving ahead of the United States in key technologies — progress, he says, accelerated during Trump’s trade war.China now runs a fleet of 100 autonomous mining trucks, showing how industrial AI can scale up heavy equipment.
- warns the United States looks like a “late-stage republic” and must rebuild technological and cultural strength to compete with China. See my conversation about the state of the US here.
Elsewhere
Separate AI models trained on different data still learn the same hidden map of meaning. This hints that concepts share an underlying universal structure.
Berkeley’s automated A-Lab runs robots around the clock. It tests 50–100 times more material samples per day than a human team.
Stephen Wolfram says bigger brains could tap pockets of simple computation. These pockets would let them form higher-level ideas and handle larger mental spaces.
Anthropic has switched on AI Safety Level-3 safeguards for Claude Opus 4 because the company can’t yet rule out the possibility that it could materially aid CBRN1 weapon projects.
A statistical study of Myanmar, Sri Lanka, Thailand, and Singapore finds that state privilege – not Buddhist belief – drives Buddhist violence against minorities.
AI executives now call data centers “AI factories.” The term seeks subsidies and casts the sites as strategic assets even though they employ few people.
By choosing Robert Prevost as Pope Leo XIV, the Church signals an alternative model of American global leadership grounded in humility.
Ukraine has approved the Krampus robot, which fires thermobaric rounds (i.e., a flamethrower). This brings its roster of armed ground robots to more than 80.
Olivier Blanchard’s new NBER paper reviews 40 years of mainstream macroeconomics. He gives the field a maturity score of 7.5 out of 8 but notes its weak forecasting record.
Thanks for reading!
CBRN weapon projects, involving chemical, biological, radiological and nuclear weapons, encompass various initiatives aimed at developing, testing, deploying and mitigating the risks associated with such weapons.
I‘d love to see a dedicated deep dive on AI vs climate change, both sides of the medal
The Claude 4 mess-up of arithmetic subtraction is a bit of a non-issue, isn't it? Obviously, if instead Clause had been asked: "write a python script to work out 9.11-9.9 & execute it", it would have had no trouble. It is presumably not hard for Claude to have a script running in the background with some logic going: "does this look like a maths question? if so, write a bit of code, execute it and slot in the answer". A few more tokens to identify the likelihood a problem is a maths problem, but not many. Hardly a step to full-blown reasoning.