Summary:
I got Grok to deprecate tokenomics
(pun intended). After having fed it Dylan Patel’s bullish take on token
demand and GPU longevity, Grok produced at my behest a sharp, darkly
humorous essay on the rise of ‘token paupers’ as the next
lumpenproletariat.
I got Grok
to deprecate tokenomics (pun intended)
On 14th May, after having watched the interview with Dylan Patel, the
founder of SemiAnalysis, an AI-powered consultancy firm, I asked Grok
to write a short essay about the claims of this young AI entrepreneur that ‘the token paupers’ are almost bound to
become the next lumpenproletariat:
The Supply and Demand of AI Tokens | Dylan Patel Interview
Invest Like The Best
123,374 views Apr 23, 2026 Invest Like The Best
Patrick O'Shaughnessy sits down with Dylan Patel, founder of SemiAnalysis,
to explore the explosive supply and demand dynamics of the AI revolution.
Dylan shares how his firm's token spend skyrocketed to $7 million a year,
completely transforming their productivity and highlighting a new era
where execution is cheap, but high-quality ideas are at a premium. They
dive into the implications of Anthropic’s frontier models like Opus 4.7
and "Mythos," the hidden bottlenecks in the semiconductor supply chain
(including memory, TSMC, and CPUs), and the economic phenomenon of
"phantom GDP." Finally, Dylan shares his bold prediction on the societal
impact of rapid AI scaling, including why large-scale anti-AI protests
might be just around the corner.
Of course, Patel may sound over-optimistic to
some, and one could argue that this stems from his considerable stake in
the prevailing narrative that AI will transform everything.
Regarding the topic of
tokenomics
(the economics of token usage, that is, how tokens are counted,
priced, consumed and optimised in interactions with large language
models), there have been clear indications since the interview that demand
for the best tokens remains exceptionally strong. Indeed, Patel himself
mentioned in the interview that a hedge fund was already paying an
astronomical sum for preferential access to the highest-quality tokens
with each underlying model update. It is therefore only a matter of time
before the providers of large language models start charging their
customers real money for these top-tier tokens. A very recent video, ‘
The $15,000 AI bill. Your $20 subscription is a DELUSION’,
makes precisely this point.
My interactions with Grok on this topic began
when I tried to retrieve a post on X by a former senior staff software
engineer at Google (the creator of Keras during his time there, who left
in late 2024, and the author of the classic
Deep Learning with Python.
I could not initially recall his name, only that he was French.
He had posted a tweet expressing scepticism about GPU depreciation and was
therefore bearish on the sector:
François Chollet @fchollet
The reason it is so important for everyone
to keep pretending that AGI is definitely right around the corner is
that there is now over $1T of investment riding on this belief (either
already expended, or committed). Current (and recent past) capex cannot
be justified by current use cases and technology (currently spending
$10-15 to make $1). To ever be in the black you'd need dramatically
better tech/applications, and you'd need them fast -- before current
datacenters depreciate, which is a 3-5 years timescale. 8:42 PM · 19
oct. 2025 · 331,1 k seen https://x.com/fchollet/status/1979981471571222679
In the interview, Patel specifically refutes
this concern (30:45-31:21) and explains that there is ample evidence that
the ‘useful life [of GPUs] is clearly not 5 years’.
In the older sense of the word, ‘deprecate’
means to pour scorn on, disparage or mock. Here, I wanted to ridicule the
notion that data centres become obsolete (they ‘deprecate’ in the jargon)
after only 5 years and to highlight the dark humour underlying the idea of
‘token paupers’ in a not-too-distant future.
Me:
Pull more
context from Patel’s full interview with respect to a humorous
reading of it which I would be tempted to make with 'token paupers'
(even though Patel does not use this expression -- I cannot recall
what he uses) and then write an essay of 300 words or so on this topic
-- whose title be 'Deprived of tokens are you? Then you are at
a competitive disadvantage', unless you can think of a
better one. To do the job properly, Grok needs to pull the full
interview and summarise it internally, I would guess.
More context from Dylan Patel’s full
interview (Latent Space podcast, 2026)
Patel doubles down on the bullish hardware economics that directly
counters Chollet’s depreciation warning. He calls the “GPUs last <5
years” narrative “complete nonsense,” pointing to real-world extensions:
3–4-year-old Hopper clusters are being re-signed for another 3–4 years,
older A100s for multiple more years, pushing total useful life toward 7–8+
years. H100 resale/rental prices are “skyrocketing” because demand for
tokens is exploding faster than new supply can arrive.
The colorful language you’re remembering is Patel’s recurring “token rich
vs. token poor” framing (he doesn’t say “paupers,” but the vibe is
identical). He warns: “if you don’t use more tokens you’ll never escape
the permanent underclass.” Companies or teams without sufficient compute
get rate-limited, pay premium prices, or simply fall behind while the
“token rich” (top labs, Citadel-scale players with exclusive access)
extract exponentially more value. He jokes about begging Anthropic’s
co-founder for Mythos access and pretending the model doesn’t exist — the
laughter in the room underscores how brutally the market sorts winners
from “token paupers.”
Patel’s optimistic reading: software efficiency + insatiable token demand
keeps old GPUs valuable longer, margins expand, and the capex bet pays
off. Chollet’s skeptical reading: without breakthroughs before the
3–5-year depreciation window, the whole $1T edifice looks shaky.
Grok
dutifully provided a text of 300 words, which I then asked it to
expand to 500 words:
Extend to 500 words WITHOUT the bit about Chollet's views.
and,
finally, I asked Grok to use British English; this was the result:
Token Paupers: Deprived of Tokens?
Then You Are at a Competitive Disadvantage
In the great AI gold rush, Dylan Patel of SemiAnalysis offers a cheerfully brutal taxonomy: the token-rich versus the token-poor. Call the latter “token paupers” — organisations, teams, or individuals stuck on the wrong side of the compute curtain. Patel’s data is merciless and his metaphors sting with laughter. H100 clusters that sceptics once wrote off after five years are now being re-signed for three or four more. Older A100 fleets are getting fresh multi-year extensions. Useful life, he insists, is stretching towards seven or eight years — maybe longer. “Complete nonsense,” he calls the five-year obituary.
The secondary market tells the same story louder: H100 resale and rental prices are skyrocketing. Supply cannot keep up with exploding token demand, so yesterday’s hardware keeps earning its keep. Patel paints a picture of frantic re-signings and margin expansion across the stack. Cloud providers, hardware makers, and even memory suppliers are all riding the same wave. Demand for tokens is so voracious that the entire supply chain feels “sold out.” People with a pulse are fighting over incremental capacity years into the future. TSMC’s capex could hit $100 billion by 2028, and downstream vendors like ASML are already booked solid.
This is where the humour turns dark and the competitive reality bites. Patel warns, almost with a grin: “If you don’t use more tokens you’ll never escape the permanent underclass.” Token paupers — the scrappy SaaS startup, the rate-limited research team, the enterprise stuck on legacy contracts — get priced out fast. Meanwhile the token aristocrats (top labs, Citadel-scale players) lock in priority access, first dibs on the newest models, and outsized returns. Patel recounts begging an Anthropic co-founder for Mythos access and pretending the model doesn’t exist, drawing laughs because everyone in the room recognises the desperation.
His own firm’s numbers make the point vivid. Spend on tokens has ballooned from tens of thousands to seven million dollars a year. One person using frontier models can now replace entire teams. “If this person can do the work of five to ten to fifteen people,” Patel notes, the maths is obvious. Yet the token-poor stay trapped in old workflows, paying premiums or simply falling behind. The market is brutally efficient at sorting winners from also-rans. Good ideas are cheap; execution is now trivial — provided you have the tokens to run it.
In this new hierarchy, being deprived of tokens is no longer a budgeting footnote. It is a structural disadvantage that compounds daily. Models improve faster than infrastructure can scale, concentrating power among those who can pay or negotiate their way to the front of the queue. Token paupers watch from the sidelines as the token-rich pull further ahead, turning compute access into the defining moat of the decade.
Patel’s message is equal parts optimistic and Darwinian: the hardware lasts longer than expected, the margins are expanding, and the demand curve is still pointing straight up. But if you’re not aggressively spending on tokens today, you’re already falling behind — and the gap only widens tomorrow. In the age of AI, token poverty isn’t just inconvenient. It’s career-ending.
Lausanne, the above was published
on the
fourteenth day of the sixth month of the year
two thousand and twenty-six.