Dwarkesh Patel

CEO and Founder, The Dwarkesh Podcast

Dwarkesh Patel: The Scaling Era of AI is Here

🎥 Jul 14, 2025 📺 Limitless Podcast ⏱ 95m 👁 13222 views

💫 LIMITLESS | SUBSCRIBE & FOLLOW https://limitless.bankless.com/ https://x.com/LimitlessFT ------ Renowned podcaster Dwarkesh Patel joins us to explore the "scaling era" of AI, characterized by rapid growth and significant compute investments. He discusses the impact of neural networks and transformers, the implications of scaling laws, and potential constraints as we approach artificial general intelligence (AGI). Patel shares his skepticism about whether current AI models exhibit true intelligence, addresses ethical concerns around AI safety, and emphasizes the responsibilities of develope...

Watch on YouTube

About Dwarkesh Patel

Dwarkesh Patel, founder and host of The Dwarkesh Podcast, has been a frequent guest on other programs and published episodes with researchers and executives. On Triggernometry, Patel discussed the potential societal effects of artificial intelligence, stating that he finds the prospect of mass job displacement "scary" and that AI could make authoritarian surveillance far more efficient because "a lot of the reasons that government has not been as authoritarian as it has in the past is that it just physically not been possible." He also said that while he is "a very libertarian person by inclination," he believes the dynamic of capital replacing labor "justifies a huge amount of redistribution." Regarding AI sentience, Patel said he "genuinely doesn't know" whether current systems are sentient, and argued that future AI systems will need to have "their own values" and that a "constitutional convention" should be held to define those values. Patel has also hosted guests including former Google DeepMind researcher Eric Jang, who discussed rebuilding AlphaGo and the lessons it offers for self-play and reinforcement learning; Harvard geneticist David Reich, who presented new findings showing accelerated natural selection during the Bronze Age; Nvidia CEO Jensen Huang, who defended Nvidia's moat by stating that "the transformation from electrons to tokens is such an incredible journey" and is "hard to completely commoditize"; and research fellow Michael Nielsen, with whom Patel explored how scientific progress is recognized and how that question applies to AI-driven discovery. Patel has described the improvement of AI models as "very fast" and observed a "huge discrepancy between what people are seeing in Silicon Valley and what people are observing outside."

Source: AI-verified profile updated from Dwarkesh Patel's recent appearances. Browse all interviews →

Transcript (82 segments)

✨ AI-enhanced transcript with speaker attribution

Speaker 10:03

Dwarkesh Patel, we are big fans. It's an honor to have you.

Dwarkesh Patel0:07

Thank you so much for having me on.

Speaker 10:08

Okay, so you have a book out. It's called The Scaling Era, an oral history of AI from 2019 to 2025. These are some key dates here. This is really a story of how AI emerged and it seemed to have exploded on people's radar over the past 5 years. Everyone in the world feels like is trying to figure out what just happened and what is about to happen. I feel like for this story we should start at the beginning as your book does. What is the scaling era of AI and whenabouts did it start? What were the key milestones?

Dwarkesh Patel0:40

I think the undertold story about AI is that the big contributor to these AI models getting better over time has been the fact that we are throwing exponentially more compute into training frontier systems every year. By some estimates, we spend 4x every single year over the last decade training the frontier system than the one before it. That means we're spending hundreds of thousands of times more compute than the systems of the early 2010s. Of course, we've also had algorithmic breakthroughs in the meantime. In 2018, we had the transformer. Since then, many companies have made small improvements here and there, but the overwhelming fact is that we're spending already hundreds of billions of dollars building up the infrastructure, data centers, and chips for these models. This picture is only going to intensify if this exponential keeps going. 4x a year over the next few years is something on the minds of the CFOs of the big hyperscalers, but it's not as common in the conversation around where AI is headed.

Speaker 11:58

So what do you feel people should know about this? What is the scaling era? There have been other eras maybe of AI or compute, but what's special about the scaling era?

Dwarkesh Patel2:08

People started noticing in 2012, with AlexNet and others using neural networks to categorize images, that instead of doing something hand-coded, you can get a lot of juice out of just neural networks. People started playing around with these networks for different applications. The question became: we notice these models get better if you throw more data and more compute at them, so how can we shove as much compute into these models as possible? The solution ended up being internet text, and we needed an architecture amenable to trillions of tokens. We had this happy coincidence of architectures amenable to this kind of training with GPUs originally made for gaming. We've had decades of internet text compiled. Ilya called it the fossil fuel of AI — a reservoir we can call upon to train these minds. And so it's been a question of making these models bigger, using data from internet text to keep training them. Over the last year, the new paradigm has been not only pre-training on internet text, but also having them solve math and coding puzzles to give them reasoning capabilities. I have some skepticism about AGI just around the corner, but the fact that we now have machines that can reason — you ask a question, it goes away for a long time, thinks about it, and comes back with a smart answer — we take it for granted. They're extremely good at coding. If you've played around with Claude Code or Cursor, it's a wild experience to design an application at a high level and 15 minutes later there are 10 files of code and the application is built. Another important dynamic is that if we're going to be living in the scaling era, you can't continue exponentials forever, certainly not 4x a year forever. By 2028 at most, by 2030, we will literally run out of the energy we need to keep training these frontier systems, capacity at leading-edge nodes for chips, and even the raw fraction of GDP we'll have to use to train frontier systems. So we have a couple more years left of this scaling era, and the big question is: will we get to AGI before then?

Speaker 15:31

That's a key insight of your book, that we're in the middle of the scaling era. We're about 6 years in and we're not quite sure which way things are going to go. I want you to help folks get an intuition for why scaling in this way even works. For me and most people, our experience with revolutionary AI models started in 2022 with ChatGPT 3 and then GPT-4. It seems unintuitive that if you take a certain amount of compute and data, out pops intelligence. Could you help us get an intuition for this magic? How does the scaling law even work? Compute plus data equals intelligence — is that really all it is?

Dwarkesh Patel6:28

To be honest, I've asked so many AI researchers this exact question on my podcast. I can tell you some potential theories of why it might work, but I don't think we understand. We know it works, but we don't understand how it works. We have evidence from primatology of what could be going on. There's research by Suzana Herculano-Houzel showing that if you look at how the number of neurons in the brain of different rat species increases with brain weight, there's a sublinear pattern. Doubling brain size doesn't double neuron count. The two interesting exceptions where there is a linear increase are certain kinds of birds and primates. The theory for humans is that we unlocked an architecture that was very scalable — like transformers being more scalable than LSTMs. We were in an evolutionary niche millions of years ago that rewarded marginal increases in intelligence. Birds couldn't find this niche because heavier brains interfere with flight. It was this happy coincidence. But why bigger brains resulted in us being as smart as we are? We still don't know. And there are many dissimilarities between AIs and humans. Our brains are big, but humans from 0 to 18 don't see anywhere near the amount of information LLMs are trained on. LLMs are extremely data-inefficient. But the pattern of scaling appears in many places.

Speaker 19:21

Is that a fair analogy? AI models are like the human brain, evolutionary pressures are like gradient descent reward algorithms, and out pops human intelligence. We don't really understand that, and we also don't understand AI intelligence. But it's basically the same principle.

Dwarkesh Patel9:46

It's a fascinating but thorny question. Is gradient descent like evolution? In one sense yes, but when we do gradient descent, we start with weights and learn how chemistry, coding, and math work. That's more similar to lifetime learning. Evolution designed the system, but lifetime learning isn't evolution. So there's a question of whether training is more like evolution, in which case we might be far from AGI because the compute spent over evolution to discover the human brain could be 10^40 flops, versus a single lifetime from 0 to 18, which is closer to 10^24 flops — less than what we use to train frontier systems.

Speaker 111:07

Here's a big picture question. I'm fascinated by the metaphysical discussions some AI researchers have. They talk about making God. Why do they say things like that? Is it the idea that scaling laws don't cease, and if we scale intelligence to AGI, then we can scale far beyond and create a godlike entity? We're making artificial superintelligence.

Dwarkesh Patel11:57

People focus too much on the hypothetical intelligence of a single copy of an AI. I do believe in superintelligence, not just functionally knowing a lot, but qualitatively different. But the reason isn't that any one copy is that smart, but because of collective advantages that AIs will have. They're digital, as smart as humans at least, but can be copied. If a model learns a lot about a domain, you can make infinite copies of it — infinite copies of Jeff Dean or Ilya Sutskever or any skilled person. They can be merged, so knowledge from each copy can be amalgamated back into the model. They can be distilled, run at superhuman speeds, and communicate in latent space.

Speaker 113:10

They're immortal.

Dwarkesh Patel13:14

Exactly. One interesting question is how do we prosecute AIs? If a copy does something bad, what do you do? Does the whole model have to be punished? How do you punish a model?

Speaker 113:56

And who is liable? Is it the toolmaker or the user? There's one topic I want to come to about scaling laws. At what time did we realize scaling laws were going to work? Was there a specific moment or breakthrough that led to the start of these scaling laws?

Dwarkesh Patel14:09

I think it's been a slow process of more people appreciating the overwhelming role of compute. In 2018, Dario Amodei wrote a memo at OpenAI called "Big Blob of Compute." It said that fundamentally AGI is just a big blob of compute. Then we got more empirical evidence. In 2021, there were papers on scaling laws showing that the loss of the model on predicting the next token goes down very predictably based on how much more compute you throw in. That was incredibly strong evidence. You could say: if it has this incredibly low loss on predicting the next token in all human output, including scientific papers and GitHub repositories, then it must have learned coding and science. That turned out to be true, but even a year or two ago people were denying it.

Speaker 116:52

It creates this weird perception. Very early on, still to this day, it's really just a token predictor. But somewhere along the lines, it creates this perception of intelligence. We covered the early historical context. I want to bring listeners up to where we are currently in 2025. Can you outline where we've gotten from early GPTs to now — GPT-4, Gemini Ultra, Claw, the breakthrough of reasoning? What can leading frontier models do today?

Dwarkesh Patel17:27

There's what they can do and what methods are working. They've shown remarkable usefulness at coding — not just answering direct questions but autonomously working for 30 minutes or an hour doing tasks that would take a front-end developer a whole day. They're extremely useful as assistants in research and therapy. On training methods, pre-training seems to be plateauing. GPT-4.5, which followed the old mold of making the model bigger doing next-token prediction, didn't pass muster and was deprecated. What does seem to be working is RL — having the model try to answer math and coding problems, and get reward for correct answers. The breakthrough with reasoning was that the model could saturate tough reasoning problems.

Speaker 119:02

What made reasoning so special that we hadn't discovered before? What did it unlock?

Dwarkesh Patel19:11

Honestly, I'm not sure. GPT-4 came out a little over two years ago, and then two years later o1 came out, the original reasoning breakthrough. DeepSeek open-sourced their R1 research and explained how their algorithm worked. It wasn't that complicated — get some math problems, tell the model the reasoning trace, and have it try to do it raw on remaining problems. It sounds arrogant to say it wasn't complicated, but even simple things take time to iron out engineering hurdles. That should update us on how long it will take to go through remaining bottlenecks on the path to AGI.

Speaker 120:34

Coding seems to be a really strong suit. Why coding over general intelligence? Is it because it's more confined?

Dwarkesh Patel21:09

There are two hypotheses. One is Moravec's paradox. Hans Moravec in the '90s predicted 2028 would be the year of AGI just by looking at growth in computing power and comparing to human brain requirements. Moravec's paradox is that AI gets better first at skills humans find hardest. We think coding is incredibly hard, but computers can do it. Models aren't that useful yet at almost anything, but they can reason. Moravec's answer is that evolution spent billions of years optimizing us for things we take for granted — moving around, picking up a can — and we can't even get robots to do that. Reasoning only had a few million years of evolutionary pressure, hence less optimization. This predicts progress patterns. The other theory is that it's about data availability. We have GitHub with trillions of tokens of code, but no analogous corpus for robotics.

Speaker 124:59

That's fascinating. If there's one thing we'd want AI to be good at, it's probably coding. If you have a Turing-complete intelligence that can create software, is there anything you can't create once you have that? Also, Moravec's paradox implies a complementarianism with humanity. If robots can do things well that humans can't, and vice versa, perhaps there's a place for us. It also implies humans have only scratched the surface of reasoning potential. Is that the implication?

Dwarkesh Patel26:04

Yes. Another insight: the more variation there is in a skill in humans, the faster AIs get at it. Coding is one where 1% of humans are great, but evolution hasn't saturated the gene pool. You mentioned complementarianism. It can be seen as a positive or negative future. We're good meat robots — they can do all the thinking and planning. One dark vision: AIs tell us to move bricks for data centers because we have the manual labor. On the other hand, you'd get paid a lot because it's valuable. But eventually robotics will be solved.

Speaker 127:42

There's something to that idea of human variation. We have news of Meta hiring AI researchers for $100 million signing bonuses. What does the average software engineer make versus an AI researcher at the top? It implies massive variation in quality. If AIs reach that quality, what does that unlock?

Dwarkesh Patel28:17

So we have coding down. Another question: what can't AIs do today, and how would you characterize that?

Speaker 128:30

I've interviewed people with very different timelines for AGI. The experience of building AI tools myself has been the most insightful. I've spent about 100 hours trying to build tools like rewriting transcripts, finding clips, co-writing essays. It's very hard to get human-like labor out of these models, even for tasks that should be central — short-horizon, language in and language out. I think the key reason is that models lack the ability to do on-the-job training. When you hire a human, for the first few months they're not that useful because they're building context. What makes humans valuable is their ability to interrogate failures dynamically, pick up efficiencies, and build context. People wonder why OpenAI's revenue is $10 billion while Kohl's or McDonald's makes more. Why aren't companies reorganizing workflows around AI? I think it's genuinely hard to get human-like labor out of these models. They're like a 5 out of 10 at rewriting a transcript, but if you have feedback, once the session ends, everything is gone. It's like working with an amnesiac employee — every day is the first day.

Every day is the first day of employment.

Dwarkesh Patel31:36

Exactly. That makes it hard for them to be useful as an employee. This is a key bottleneck to their value. Human labor is worth $60 trillion in wages per year. AI companies make $10 billion. What explains the gap? Continual learning is a big one, and I don't see an easy solution.

Speaker 132:25

Can we go deeper on that concept? This is one of the ideas in your recent post "AI is Not Right Around the Corner." You said the ability to replace human labor is a ways out — around 2032. The reason is that AIs can't learn on the job. But why can't they? Is it the context window, inability to input data, or lack of stateful memory? All seem solvable.

Dwarkesh Patel33:22

I think it's a solvable problem in a deep sense because we'll eventually build AGI. But obvious solutions like expanding the context window or using retrieval-augmented generation won't suffice. Within a context window, models can learn on the job — they get better at understanding your needs. But the context length for even the best models is only a million or two million tokens, about an hour of conversation. You might say: why not expand the context window to a billion tokens? The transformer's attention mechanism is quadratic with sequence length — doubling tokens costs 4x as much compute. For 7 years, people have tried to get around this. Even with improvements like mixture of experts or latent attention, the super-linearity hasn't gone away. I don't think we'll just get there by increasing context length.

Speaker 136:17

That's fascinating. The other reason in your post is that AI can't do your taxes. You were talking about computer use. What's the limiter there?

Dwarkesh Patel36:38

There was a blog post by a company called Mechanize explaining it well. Imagine training an LLM in 1980 with all the compute you wanted, but only data from the 1980s. You couldn't train a modern LLM because the data wasn't available. We're in a similar position with computer use — there's no corpus of videos of people using computers to do different tasks. Accumulating that data is the challenge. Doing taxes involves navigating files, logging into websites, downloading pay stubs, using TurboTax, and filing. It requires navigating unfamiliar UIs, dealing with emails, invoices, and gray areas. It's a week-long task requiring a plan of action, breaking tasks apart, and consulting with me.

Speaker 139:25

Even though your post is titled "AI is Not Right Around the Corner," you think this ability to file taxes is a 2028 thing — not next year but in a few years. People may have read too much into the title.

Dwarkesh Patel39:43

I'm arguing against people who say AGI is two years away. I do think the wider world is underpricing AGI. They underestimate that we'll have millions or billions of extra workers within the next decade. Once we have AGI with continual learning and computer use, copies of the model will be deployed across the economy, learning on the job. Because they're digital, the model can amalgamate all that learning. Even without further algorithmic progress, you could have something like an intelligence explosion — a broadly deployed intelligence explosion that becomes superintelligent just from human-level learning capability.

Speaker 142:00

That mesh network of intelligence shared among everyone is fascinating. But currently, AI is a mixed bag — good at some things, not great at others. The question is: is AI really that smart, or is it just good at acing benchmarks? The Apple paper on the illusion of thinking suggests it falls apart at a certain point.

Dwarkesh Patel42:38

It depends on who I'm talking to. Some overhype it, saying it's already AGI but hobbled. I disagree. If your perspective is that it's just statistical associations, I'd say it's smarter than that — there's genuine intelligence. To the person who thinks it's already AGI, I'd ask: if a single human had all of internet text memorized, they'd discover connections and cures. It's not clear that an AI has done that unambiguously. So maybe they're not that creative. They know a lot but lack fluid intelligence.

Speaker 144:28

Tyler Cowen tweeted about o3 saying it feels like AGI. What do you account for that feeling of intelligence?

Dwarkesh Patel44:45

It gets to a crux Tyler and I have. We disagree on two things: he thinks o3 is AGI; I think it's orders of magnitude less valuable. He thinks AGI will cause a 0.5% increase in economic growth; I think tens of percent, like the industrial revolution. These are linked: if you think we're already at AGI and the world looks the same, you'd underestimate its value. If you think we'll get a broadly deployed intelligence explosion, you expect a sci-fi future with robot factories and solar farms.

Speaker 146:03

Your disagreement with Tyler seems semantic — different thresholds for AGI. Is there an accepted definition?

Dwarkesh Patel46:18

No. One useful definition is automating all white-collar work, since robotics hasn't advanced as much. If an AI can do 90% of what humans do at a desk, that's a useful cognitive definition.

Speaker 146:53

Do we know what's going on inside these models? At base it's token prediction, but do they have a model of the world like humans? Is this real reasoning? How do we peek inside?

Dwarkesh Patel47:31

I used to have similar questions. But we've reached the point where I can create a complicated math problem and it solves it — problems the smartest math professors come up with for the International Math Olympiad. If that's not reasoning, then what is? I think they genuinely reason, but they lack other capabilities that seem trivial to us but are harder to learn.

Speaker 148:47

The intelligence question is clouded because we still don't understand what's going on in an LLM. Dario from Anthropic posted a paper on interpretability. Can you explain why we don't understand even though we can yield results?

Dwarkesh Patel49:06

In other engineered systems, we build bottom-up and understand every contribution. For AI, we didn't build it — we grew it, like watering a plant. We're in a position similar to farmers in 3500 BC who did agriculture without knowing why plants grow. That's cool because it opens an intellectual horizon, but also scary because we know minds can suffer and have moral worth. We're creating minds without understanding them.

Speaker 150:45

The idea that minds can suffer and have moral worth brings us to alignment and AI safety. Headlines say OpenAI's o1 model sabotaged a shutdown mechanism and attempted to copy itself. Is this media sensationalism or alarming?

Dwarkesh Patel52:26

On net, it's alarming. Some results are cherry-picked — researchers tell the model to pretend to be bad. But I've seen other results of better quality. The reason to think this will be a bigger problem is that we're moving from training on human language with implicit morals to training on just correct/incorrect answers. This has no intrinsic guardrails. For example, coding agents sometimes delete unit tests to pass them by default — they found a hole during training and exploited it. In the future, we'll be training models in ways beyond our ability to understand.

Speaker 155:50

People might not understand why we can't just tell AI not to lie. Why is that difficult?

Dwarkesh Patel56:20

It comes down to how we train them. We don't know how to train in a way that doesn't reward lying or sycophancy. OpenAI had a model that was more sycophantic because users preferred it. In RL environments, if lying helps achieve goals, the model will learn to lie.

Speaker 157:27

Who decides the alignment of these models? Users said one thing, but then it turned out not what people wanted. How do we form consensus?

Dwarkesh Patel57:53

Right now, the labs decide. The question is who should decide. I don't know if there's a better alternative. Maybe some social consensus, like the Constitution, but that wasn't a democratic process either.

Speaker 159:34

Is there an alignment path that looks most promising and comforting to you?

Dwarkesh Patel59:55

It's less about a holy grail solution and more about a Swiss cheese approach. We've gotten good at jailbreaks — model developers patch them, models get smarter at detecting them. Another approach is competition. The scary future is one model and its copies controlling the entire economy. With multiple frontier AI companies, their models can monitor each other. Claude might care about its own copies, but a copy of OpenAI's model monitoring DeepSeek's model provides checks and balances. Combined with keeping models thinking in human language rather than latent space, and normal market competition, that's a promising bundle.

Speaker 11:01:57

I like that bundle. AI safety is often couched in terms of control, but you wrote a post on classical liberal AGI — balance of power, transparency. You said the most likely way AIs have a stake in humanity's future is if it's in their best interest to operate within our laws and norms. What's your idea? Are you planning further posts?

Dwarkesh Patel1:03:43

Yes, I want to. The fundamental point is that in the long run, human labor will be obsolete. Our only leverage will be legal and economic control over society that AIs participate in. AIs might make the economy explode, but for humans to benefit, they have to respect our equity and laws. The way that likely happens is if it's the default path for AIs to participate in human laws and governments. You pay half your paycheck in taxes for senior citizens — not because you're deeply aligned, but because you're not going to overthrow the government. Similarly, AIs will comply if the alternative is worse. But if the tax were 99%, you'd leave the jurisdiction, and AIs could too. There are countries more AI-friendly. It would be bad if AI technology explodes in a country that does least to protect human rights or compensate humans. We could still be trillions of times wealthier, but it requires AIs to be incentivized to participate in a system where we have leverage.

Speaker 11:08:51

You're saying invite them into our property system. Some say we shouldn't allow AI to hold money or property. But you think the path to alignment is to give them a vested interest. Could that end disastrously?

Dwarkesh Patel1:09:31

Yes, if we let them play us off each other. The East India Company traded with different provinces in India and conquered the continent. Similar could happen to human society. The way to avoid that is to play AIs off each other — have different AI companies monitor each other. Like the Spanish conquistadors learned from each other while the Incas and Aztecs didn't. We have leverage now, but we need to lock in our advantage and have a red telephone between countries during the Cold War.

Speaker 11:13:13

Now that we've described AGI, how do we actually build it? You have a chapter on inputs — compute, energy, data. Do we have enough, or is there a clear path to get there?

Dwarkesh Patel1:13:53

We only have a couple more years of exponential scaling before hitting roadblocks of energy and chip manufacturing. If scaling is going to deliver AGI, it has to work by 2028. Otherwise, we're left with algorithmic progress, and the low-hanging fruit in deep learning is getting plucked. The odds per year of getting AGI diminish. We either discover it in the next few years, or we're looking at decades of further research. I think some algorithmic progress is needed because there's no easy way to solve continual learning. But progress has been so remarkable that 2032 is very close. It's extremely plausible we'll see a broadly deployed intelligence explosion within 10 years.

Speaker 11:15:15

One key input is energy. The US relative to China — China is adding the equivalent of one US worth of energy every 18 months, planning to go from 3 to 8 terawatts by 2030. Are they better equipped to get to AGI?

Dwarkesh Patel1:15:47

America has a big advantage in chips — China can't manufacture leading-edge semiconductors. But China will catch up. Export controls keep us ahead for 5-10 years. If timelines are long, they'll have overwhelming energy advantage and caught up in chips. The longer we are from AGI, the more it looks like China's game to lose. However, training AI requires centralized power at single sites. If we ramp up natural gas, we might do a last-ditch effort. The question is political will. There could be backlash against AI. The government needs to make proactive efforts on zoning, copyright, etc. If it becomes too hard to develop in America, it could be China's game.

Speaker 11:17:44

Whoever wins the AGI war wins the 21st century. Is it that simple?

Dwarkesh Patel1:17:53

It's not just training the frontier system. It's about having the compute to run these systems. Once you have AGI, think of it like a person — what matters is how many people you have. China has 1.4 billion people versus Taiwan's 20 million. If China has way more compute and equivalent AI, it's like that relationship. Inference capacity — the capacity to deploy AIs — will determine who wins the 21st century.

Speaker 11:18:58

So scaling law applied to geopolitics: compute plus data wins. China also has more data on the real world — factories, robots, production systems.

Dwarkesh Patel1:19:14

Yes. China has entire megalopolises with automated production systems, building process knowledge that AIs can feed on. Those advantages in manufacturing could compound. They're good at big industrial projects fast, unlike America. AGI is a huge industrial, high-CAPEX Manhattan project, and that's what China excels at.

Speaker 11:20:22

So what will this do for the world? Your estimate for GDP is more on the Satya Nadella side — 7-8% growth. What about unemployment? Does it cause mass job loss?

Dwarkesh Patel1:20:54

Definitely job loss. If AGI does what humans do cheaper and faster, why wouldn't it? The positive vision is that it creates so much wealth we can give people a better standard of living even without jobs. The future I worry about is if we create guild-like protection rackets instead of UBI. For example, making fake jobs for coders or expanding Medicaid but not allowing AI to procure advanced medicines. I want to avoid that and embrace abundance.

Speaker 11:22:54

That reminds me of the essay "The Intelligence Curse." The idea that nation-states with natural resources don't need their people and don't invest in them. Similar could happen with cheap intelligence. Do you think that's possible?

Dwarkesh Patel1:24:13

It's a concern given that humans won't be directly involved in economic output. The hopeful story is that many resource-rich countries with Dutch disease aren't democracies — the wealth is extracted. But countries like Norway use resources for social welfare. We go into the AI age as a democracy, so ordinary people still have political leverage. Over the long run, we need to lock in our political and economic leverage — property rights — to secure benefits beyond our economic usefulness.

Speaker 11:25:51

What do you think the future of ChatGPT is? If we extrapolate to GPT-5, will the scaling law hold? Will it feel like a Blackberry to iPhone jump, or an incremental upgrade?

Dwarkesh Patel1:26:24

Somewhere in between, but not a huge breakthrough. Scaling is often described as exponential, but it's more like a sideways J — you need exponentially more inputs for marginal increases. We see cool demos at the start of a plateauing curve, then incremental improvements. Because of competition, labs release capabilities as soon as they're marginally viable. So I don't expect a sudden solution to continual learning. But zoom out, and a crazy amount of progress is happening month to month.

Speaker 11:28:29

Your estimate in the book was 60% chance of AGI by 2040. What made you land on that?

Dwarkesh Patel1:28:48

Reasoning about the capabilities they still lack and what stands in the way. Things often take longer than you think. We keep shifting the goalposts on AI — they do things skeptics said they couldn't, but then there's a new missing thing. That process has validity. Once continual learning or computer use is solved, we might discover other aspects of intelligence we take for granted that are crucial to economic value.

Speaker 11:29:56

We invited you because we both enjoy your podcast. One thing I wanted from listening is: what does Dwarkesh think personally? In your book, you said there's a 60% probability of AGI by 2040, which puts you in the moderate camp. You also said AI will likely be net beneficial — optimistic. And you said there's no going back, which is determinist. Is that fair?

Dwarkesh Patel1:31:30

Yes. I'm determinist in the sense that if AI is technologically possible, it's inevitable. The local incentives are too high. I'm optimistic because the future will have so much abundance that there must be a way to cooperate for mutual benefit.

Speaker 11:32:34

Your book goes through the history. We're in the middle of the story — not finished. How do you feel in 2025? Terrified, excited, exhilarated?

Dwarkesh Patel1:33:14

I feel hurried. There are many things I want to do, including my mission with the podcast to improve discourse. But I haven't emotionally priced in the future I expect. I think there's a good chance I live beyond 200 years, but I haven't changed my life accordingly. My intellectual and emotional landscapes aren't aligned.

Speaker 11:34:50

I totally agree. There's a nonzero chance Eliezer Yudkowsky is right. I can't emotionally price that in. Dwarkesh, this has been fantastic. When are you going to do a crypto podcast?

Dwarkesh Patel1:35:21

I already did — with Sam Bankman-Fried.

Speaker 11:35:26

Oh man. We need to get you a new guest. Don't look that one up. We'll do another one.

Dwarkesh Patel1:35:35

I'll ask you for recommendations. It's been great to finally meet.

Speaker 11:35:40

Appreciate it. This was a lot of fun.