The Supply and Demand of AI Tokens | Dylan Patel Interview
Dylan Patel of SemiAnalysis details an explosive shift in AI token economics: his firm's annual spending exploded from tens of thousands to $7M annually in just months as non-technical team members discovered Claude for coding and other high-value tasks. As implementation costs collapse and execution becomes trivially easy, the strategic advantage now lies entirely in identifying *which ideas are worth the capital spend*—and those with early access to frontier models like Anthropic's Opus and Mythos will capture outsized economic value before token demand outpaces supply infrastructure.
Key takeaways
- • Ideas, not execution, now drive competitive advantage: execution on AI has become so cheap that the constraint shifts to choosing which ideas justify token spend; this inverts the old startup adage and means founders must get comfortable with massive token bills on the critical path to growth.
- • Token demand is exploding far faster than supply can scale: a single analyst spent $6,000/day for weeks building a US power grid model that would have required 100 people a decade to create; CPUs, memory (DRAM), and logic chips are all sold out with lead times extending to 2027–28, creating a multi-year supply crunch that will drive margins up across the entire hardware stack.
- • Frontier model access is becoming a competitive moat: Anthropic's Mythos (unavailable to most users) represents a 2-month jump from L4 to L6 engineer capability; those with enterprise contracts and capital to burn will get earlier, unfettered access and will be able to "crush" competitors who rely on cheaper, older models or rate-limited tiers.
- • Phantom GDP masks real economic value creation: traditional metrics don't capture the massive deflationary benefit of AI-accelerated work; a $7M annual token spend is generating multiples more in economic value than it costs, but that value doesn't show up in conventional GDP measurement, making it hard for CFOs and executives to justify the spend upward.
- • Don't use slower or cheaper models once you've experienced frontier capability: users shown a 10% better model immediately abandon previous versions; this "Claude psychosis" behavior means cheaper alternatives (like GPT-4 class models) can't capture demand even if they hit the same capability tier as prior frontier models, because users don't want to work slower.
- • Robotics represents a second, massive demand curve still in early innings: few tokens are currently consumed by physical systems, but once vision language action models and reinforcement learning improve via easy implementation, robot adoption and token consumption will explode, extending the supply shortage years further.
Recommendations (6)
Mentioned (10)
More from these creators
GLP-1s, Peptides, and The Trillion-Dollar Health Revolution
From SpaceX to Founders Fund to Solving America's Nuclear Fuel Problem
What 100 Years of American Finance Tells Us About Today
World's Top Researcher on AI, LLMs, and Robot Intelligence