Nvidia & AI Bubble Questions | Sharp Tech with Ben Thompson

🎥 Apr 01, 2024 📺 Sharp Tech Podcast ⏱ 9m 👁 852 views

Link to Episode: https://sharptech.fm/member/episode/a... Links: Stratechery Youtube: / @stratechery Sharp Tech website: https://sharptech.fm Stratechery: https://stratechery.com Sign up for Stratechery Plus: https://stratechery.com/stratechery-plus Submit reader questions at [email protected]

Watch on YouTube

About Ben Thompson

Ben Thompson, founder of Stratechery, has published a series of articles and podcast episodes in the last 60 days analyzing major technology companies and the AI industry. In "The iPhone’s Last Stand," Thompson argued that Microsoft's Project Solera positions it as an enterprise play, and that consumers "don't want to work and don't really care about being productive." He described Apple's Siri as the only service that can pull off personal context across apps, "as long as it's not vaporware." In "The Google Capital Company," Thompson characterized Google's business model as one where "supply is free" and "consumers willfully compete against each other to raise your prices," and discussed the company's use of equity to fund AI capital expenditures. On his Sharp Tech podcast, Thompson discussed the possibility of an AI bubble, stating that "it's going to be a lot of existing companies realizing the AI spends not worth it" and that such companies "are just going to slowly die as new companies come along that actually do use AI." Thompson also wrote about SpaceX's IPO and the concept of data centers in space, expressing concern about "our ability to muster enough compute to fully realize the gains from AI" and describing Musk's proposal as "an alternative path to unlimited compute." He noted that "Musk is the master of memes" and that his companies offer "a dream" to investors. In "Amazon's Durability," Thompson argued that Amazon's focus on long-term investments in the physical world makes it "as sturdy as ever" in the AI landscape. He also covered Anthropic's Mythos model, describing it as a "major security threat" and discussing the company's "opportunity cost problem" regarding compute allocation. In "Tim Cook’s Impeccable Timing," Thompson reviewed Cook's tenure as Apple CEO, noting that revenue increased 303% and profits 354% during his leadership, while also suggesting that Cook may have "created the conditions for a crash out" by forgetting "what makes Apple Apple."

Source: AI-verified profile updated from Ben Thompson's recent appearances. Browse all interviews →

Transcript (2 segments)

✨ AI-enhanced transcript with speaker attribution

Jerry0:01

I'm still trying to process Ben's piece on inflection and its bizarre history and what it means for the broader tech sector, specifically Nvidia's convoluted role as gatekeeper and investor and seller of R of rare jewels, i.e., GPUs. All that is fascinating to me. In this story, it seems to me that the arc of innovation is being shaped in large part by who has privileged access to special hardware, in this case GPUs. That's a tech story we haven't seen in decades, or maybe ever. It seems ripe for unintended consequences. So if big tech and the startups they acquire end up winning the AI race in the short term because of their privileged access to Nvidia's GPUs, it seems like they might be very vulnerable in the long term to outside disruptors who figure out a different and better way to innovate in AI without Nvidia's help. What do you think, Ben? I found this interesting because we talk about hallucinations. It seems like hallucinations are a real problem for enterprise AI applications that could help all this technology transform various sectors of the economy. And at least for the last year, as we said earlier, it seems like the proposed solution to the hallucination problem is to just throw more and more and more compute at the problem. It's like the sledgehammer keeps getting bigger as they just swing more and more GPUs at the hallucination issues across all these different products. And so I'm curious what you think of Jerry's theory because it seems plausible to me that there might be sort of an orthogonal approach that emerges along the way that transforms how we understand all...

Ben Thompson1:49

Yeah, I mean this is a subject of hot debate right. Like, is the Transformer model the end-all be-all? We just need to throw more compute at it, or do we actually need a fundamental algorithmic breakthrough that does stuff differently to actually make decisions well, right, as opposed to, you know, and eliminate this problem? I think that's an open question. I certainly think that big tech generally is betting on just throw more compute at it, which makes sense, they have the capability of throwing more compute at it. And Nvidia is the kingmaker right now, and yes, it absolutely warps the market. And Nvidia, there is one company that sort of dominates the space, making decisions based on are you going to bundle with all our stuff, are you going to do XYZ, oh by the way our usual license agreement says you can't translate CUDA, right? They're using legal agreements, they're using bundling, they're using all this sort of stuff that is theoretically illegal, but we're going to... just a real classic case. I can't wait for the Nvidia antitrust case in 2040 that doesn't matter anymore because actually it turned out there was a completely different approach. Tell you one thing, I know I'm newer to the tech space, Nvidia never happens without the Microsoft antitrust trial of 2001. So I'll just pre-write that complaint for 2040. But the other argument though is that a new approach actually is good for Nvidia because GPUs are relatively programmable. I think actually the question is, when you specialize in this particular sort of approach and getting specialized chips and all this sort of thing, once you start streamlining because you need efficiency, particularly from an energy perspective but also speed, latency, bandwidth, all these sorts of things that are often trade-offs against each other, that means you specialize. The more you specialize, the more rigid you become and the less able you are to adapt to a new paradigm that comes along, whereas GPUs are better and more efficient than CPUs, as we talked about a couple weeks ago, yet also relatively programmable and can be adjusted to new workloads if what you need generally is parallelism, but how that parallelism is leveraged can differ. All these are very much open questions. This is where the energy thing does matter though, because it is a real forcing function that continues to drive pushes to do this different, right? And so I think if there is a better way to do it, someone's going to figure it out because there is tremendous economic incentive to figure it out. And just as there's tremendous economic incentive to keep building this out, and you know, maybe it's going to be a situation... I mentioned the Nvidia event made me feel a little bubbly, right? Like there's some of this stuff that is starting to feel like it's getting a little crazy, like what exactly are we using this stuff for? Right, like how useful is this? But you can imagine a world like after the era where you had all this bandwidth that was built out, all this dark fiber that actually was completely economically ruinous but also created the conditions for the actual sort of Web 2.0, the online world. Will we get a scenario? It is interesting, people still find A100s pretty useful, they're still buying H100s. Are we going to get to a world at some point where there's a bunch of GPUs out in the world that were already paid for and they're just sitting there and, boy, wish someone would use this, and that actually creates the conditions for new use cases and creations where it's actually free, not because we solved energy, because it turned out not that many people wanted to use autocomplete? Yeah. A lot of people went bust. No, I mean, at the top of that Nvidia episode a week or two back, the thinking made a lot of sense to me, or at least the possibility that this is a bubble but it's directionally correct, it's just everyone's a couple years early. And so there's this huge infrastructure buildout and the actual use cases won't materialize at least in time for everybody to monetize their investments now, but it could still get there a couple years down the line or even further down the line. Um, I mean, there's a very famous model that is Carlota Perez, a Venezuelan sort of economist, she lives in the UK now, but she wrote this book called Technological Revolutions and Financial Capital, and this is sort of the core thesis. Going back and tracing through these major technological revolutions, how it impacted society, in all of them in her framework, an essential portion is there's this frenzy period that causes a bubble and the bubble bursts, and it's actually the bursting, it's the frenzy that inspires irrational investment that actually creates the conditions for what comes next. Because, like, you know, it was irrational. We go back to the Apple thing, the Apple Vision Pro, right? There's a bit where Apple and what they missed out on was the irrational building of great experiences by developers. What they needed was to excite developers to build an amazing experience on the Vision Pro even though it was stupid to do so. Why did they do it? Because they were so excited about the Vision Pro, right? And so they missed the irrational era, so they're now in the rational era, they need... so which means they need to pay developers. That's right. And so there's a bit where irrational exuberance, it sucks for the people that buy into it because they all go bankrupt, but it's actually societally speaking, it's very beneficial because you build up infrastructure that never made sense to build, but it got built, and so now that enables sort of what comes next. And I do think, you know, there is some aspect here, I'm not sure that we've... like we are, you know, at what point is this GPU buildout, is it actually being leveraged, is it actually being used? The reality is, I think the safe bet is it will become bubbly. Is it bubbly now? To be debated. You know, it's bubbly now when everyone agrees, "No, this is not a bubble, this is real." The moment people stop saying it's a bubble is the moment you know it's a bubble. And I don't think we're in a bubble yet, but it is feeling a little frothy. Um, but they estimated that the AI industry spent 50 billion on Nvidia chips used to train models and brought in only $3 billion in revenue. So make of that what you will. That was in the Wall Street Journal this past weekend. God bless them. Thank you. Appreciate it. These... this is the frenzy that we're counting on to transform society. This is step one. All right. For...