🔊CEOInterviews

Dylan Patel on networking performance

From ClusterMax, InferenceMax & the Token Efficiency Race | Dylan Patel at Aria Networks Launc · Aria Networks

“network performance drives a not just a hey 20 30% it's actually multiple like X's, right? It's 5X 10X performance difference if you have really good networking versus not.”

Dylan Patel
Founder, CEO, and Chief Analyst, SemiAnalysis
networking performance · inference efficiency

Dylan Patel, Founder, CEO, and Chief Analyst at SemiAnalysis, spoke about networking performance during "ClusterMax, InferenceMax & the Token Efficiency Race | Dylan Patel at Aria Networks Launc" on Aria Networks.

ClusterMax, InferenceMax & the Token Efficiency Race | Dylan Patel at Aria Networks Launc
Watch on YouTube at 3:40
SemiAnalysis founder Dylan Patel breaks down the explosive growth of AI infrastructure, the rise of ClusterMax and InferenceMax ...

About Dylan Patel

Founder, CEO, and Chief Analyst · SemiAnalysis

Dylan Patel, founder and CEO of SemiAnalysis, has been speaking at several industry events in early 2026 about AI infrastructure, benchmarking, and market dynamics. At an Aria Networks launch event in April, Patel stated that AI inference demand has grown so rapidly that the rental price of three-year-old H100 GPUs has risen from around $160-170 per hour to over $240 per hour in six months, with no spare capacity available. He also discussed the InferenceX project, which he described as a free and open-source benchmarking effort with over a thousand GPUs donated by companies including OpenAI, Microsoft, and Nvidia.

In a March interview at the Daytona Compute Conference, Patel said that hyperscalers like Google, Amazon, and Microsoft were slow to move into AI, creating an opportunity for "NeoClouds" that could skip complex legacy software. He also noted that the entire cloud market had run out of CPUs, with Amazon's CPU server installations tripling year-over-year.

In an April interview with Patrick O'Shaughnessy, Patel said his firm's AI token spend had skyrocketed from tens of thousands of dollars annually to $7 million, driven by non-technical staff using AI for coding. He stated that "ideas are cheap and plentiful but execution is very easy," and warned that people who do not use more tokens, generate value from them, and capture that value will "never escape the permanent underclass." Patel also predicted a "large scale protest against Anthropic and AI," citing a Pew survey that he said showed AI is less popular than politicians.

In a panel at the Beyond Summit, Patel asserted that vendor benchmark claims are "lies, impossible to achieve," and that "if you're not pissing off people with your benchmark, then you're not testing something useful."

Profile compiled from Dylan Patel's verified public interviews and appearances.
