From ClusterMax, InferenceMax & the Token Efficiency Race | Dylan Patel at Aria Networks Launc · · Aria Networks
“the AI market growing so fast and inference demand growing so fast that the price of 3-year-old GPUs is soaring. Right? In the just in the last 6 months it's gone from, you know, deals for 1 year transacting at 170, 160 an hour for H100s to now 240 plus. And and you know, in reality there's actually no spare capacity to buy of H100s. That's that's how tight the market has gotten.”
On , Dylan Patel, Founder, CEO, and Chief Analyst at SemiAnalysis, spoke about GPU market during ClusterMax, InferenceMax & the Token Efficiency Race | Dylan Patel at Aria Networks Launc on Aria Networks.
Dylan Patel, founder and CEO of SemiAnalysis, has been speaking at several industry events in early 2026 about AI infrastructure, benchmarking, and market dynamics. At an Aria Networks launch event in April, Patel stated that AI inference demand has grown so rapidly that the rental price of three-year-old H100 GPUs has risen from around $160-170 per hour to over $240 per hour in six months, with no spare capacity available. He also discussed the InferenceX project, which he described as a free and open-source benchmarking effort with over a thousand GPUs donated by companies including OpenAI, Microsoft, and Nvidia. In a March interview at the Daytona Compute Conference, Patel said that hyperscalers like Google, Amazon, and Microsoft were slow to move into AI, creating an opportunity for "NeoClouds" that could skip complex legacy software. He also noted that the entire cloud market had run out of CPUs, with Amazon's CPU server installations tripling year-over-year. In an April interview with Patrick O'Shaughnessy, Patel said his firm's AI token spend had skyrocketed from tens of thousands of dollars annually to $7 million, driven by non-technical staff using AI for coding. He stated that "ideas are cheap and plentiful but execution is very easy," and warned that people who do not use more tokens, generate value from them, and capture that value will "never escape the permanent underclass." Patel also predicted a "large scale protest against Anthropic and AI," citing a Pew survey that he said showed AI is less popular than politicians. In a panel at the Beyond Summit, Patel asserted that vendor benchmark claims are "lies, impossible to achieve," and that "if you're not pissing off people with your benchmark, then you're not testing something useful."