š® Exponential View #553: The story of 100 trillion tokens; Chinaās chip win; superhuman persuasion, Waymo ethics, Polymarket & hydrology++
An insiderās view of AI and exponential technologies
I havenāt read anyone as thorough as you on the AI bubble and related topics. ā Miguel O., a paying member
Hi all,
In this edition:
Superhuman persuasion. New data shows that AI is more effective than TV ads at changing minds, even when it lies.
The chip sanctions failed. How āstackingā old tech is neutralizing the US blockade (and validating my 2020 thesis).
Solving the unsolved. Terence Tao calls AI āroutineā and autonomous agents are finally cracking the backlog of neglected science.
The end of Hollywood. Financialization, not AI, may be the root cause.
In my latest video, I break down how I think about the overlapping technology S-curves that are driving the market upheaval:
There are more exponentials in this AI wave doing their work than just ChatGPT and large language models. These new technologies make new things possible: we start to do things we didnāt do before, either because they werenāt possible or because they were too expensive.
Listen on Apple Podcasts or Spotify
RIP chatbot, hello glass slipper
OpenRouter, an aggregator routing traffic across 300+ AI models for five million developers, released an analysis of 100 trillion tokens of usage. Their data offers a unique, cross-platform look at the market. Itās worth a deep read, but for now Iād highlight two directions:
First, the āglass slipperā effect ā retention is driven by āfirst-to-solve,ā not āfirst-to-market.ā When a model is the first to unlock a specific, high-friction workload (like a complex reasoning chain), the cohort that adopts it shows effectively zero churn, even when cheaper or faster competitors emerge. This confirms my long-held view: customers donāt buy benchmarks; they buy solutions. Once a model fits the problem, like a glass slipper, switching costs become irrelevant.
Second, the shift to agentic inference is undeniable. In less than 12 months, reasoning-optimised models have surged from negligible to over 50% of all token volume. Consequently, average prompt lengths have quadrupled to over 6,000 tokens, while completion lengths have tripled. The insight here is that users arenāt becoming more verbose; the systems are. We are seeing the mechanical footprint of agentic loops iterating in the background.
Chinaās chip tipping point
Chinaās drive for semiconductor independence is accelerating faster than predicted. The recent Shanghai IPO of Moore Threads, a leading AI chipmaker, surged 425% on debut, signaling voracious domestic capital support for āChinaās Nvidiaā alternatives. This aligns with a bold forecast from Bernstein, that China is on track to produce more AI chips than it consumes by 2026, effectively neutralizing the intended chokehold of US export controls.
