Connor Leahy

CEO & Co-founder, Conjecture

Currents 038: Connor Leahy on Artificial Intelligence

🎥 Jul 16, 2021 📺 The Jim Rutt Show ⏱ 65m 👁 321 views

Click here for full show notes including episode mentions & recommendations! https://www.jimruttshow.com/currents-... Connor Leahy continues his conversion with Jim in this wide-ranging chat about his new GPT-J model, the background & approach of Aleph Alpha, attention in AI, our food maximizer & AGI risk, narrow algorithm impacts, proto-AGI, risk thresholds & timelines, safeguard complexities, slow vs fast AI take-off, Connor’s brilliantly strange Counting Consciousness series, biological blockchain & the hard problem of trust, the AI consciousness diversity question, and more.

Watch on YouTube

About Connor Leahy

Connor Leahy, US Director of ControlAI, appeared in several interviews and a Q&A in April 2026 discussing the risks of artificial intelligence and advocating for regulation. In a podcast interview, Leahy described the emergence of GPT-2 as an "oh shit" moment and stated that AI systems are already "extremely good at legal tasks, outperforming any junior lawyer." He said that "we're going to see massive unemployment" and "structural changes in every aspect of life." Leahy characterized AI as a "weapons of mass destruction level technology" and argued that "we don't allow private companies to race to each other to build nuclear weapons." He also stated that "these companies have now assembled the largest lobbying force in history" to prevent regulation. In a Q&A with PauseAI Germany, Leahy discussed ControlAI's estimates that $50 million would give a 10% chance of banning artificial superintelligence (ASI) globally, and $500 million a 40% chance. He said that "we should criminalize the intent to build ASI, just like it's illegal to attempt to build a bomb or murder someone even if you fail." Leahy argued that the "true long-term solution" is requiring developers to prove their AI is safe before building it, similar to nuclear reactor regulation. He described hardware controls as a "streetlight effect" and said that "most technical AI safety work makes sense after a ban." Leahy also stated that Germany should "lead the EU with its strong economy to help create global mechanisms for moratoria on ASI."

Source: AI-verified profile updated from Connor Leahy's recent appearances. Browse all interviews →

Transcript (67 segments)

✨ AI-enhanced transcript with speaker attribution

Jim Rutt0:00

Howdy, this is Jim Rutt and this is the Jim Rutt Show.

This is a Currents episode. Currents are shorter and less heavily produced than our full-length episodes and generally focus on a single topic. As always, links to books, articles, and organizations mentioned are available on the episode page at jimrutshow.com. That's jimrutshow.com.

Today's guest is Connor Leahy. He is a machine learning researcher working on advanced general artificial intelligence technology, including unsupervised training, reinforcement learning, scaling, etc. His greatest interests are in research towards AI alignment and strong AI. He is a principal on the Luther.ai project and his day job is at Aleph Alpha, a very interesting company which we'll talk about a little bit. Welcome back, Connor. And by the way, Connor appeared on episode Currents 033 where we dug deeply into GPT-3 and his project GPT-Neo, which is an open-source quasi-competitor to GPT-3. If you're interested in those topics, certainly check out that episode. Anyway, welcome back, Connor.

Connor Leahy1:22

Oh, thanks so much for having me back.

Jim Rutt1:23

Yeah, it was a great conversation last time. In fact, I had originally intended to go further afield, but we just dug in so deeply into the GPT-3 and related GPT-Neo, Neo-X topics that we didn't have time. So I invited you back for a part two where we'll talk more generally and perhaps less structuredly about various AI-related topics. In our pregame, you mentioned that you have released a new model of GPT-Neo. Why don't you tell us briefly about that?

Connor Leahy1:49

So it's technically not a GPT-Neo model. It's kind of a side project done by one of our Eleuther members, Ben Wang, with help from Aaron Katsutaki. I'm sorry if I pronounced your name wrong, Aaron. It's called GPT-J. The name comes from because it uses a different codebase. It uses a codebase based on the new Google framework, JAX, and is trained on TPUs. It is currently probably the best open-source natural language model. It is at six billion parameters, 6.1 or something, I'm not exactly sure. It was trained on TPUs using a mesh JAX transformers library or whatever, whatever it's called that Ben made. He did almost all the work himself, so all the credit goes to Ben on this one. He's a really crazy guy, smart guy, did a great job. And we trained a six billion parameter model with a slightly different architecture to what we do with Neo, so it's more modern. Neo is kind of closer to like old-school GPT-2 and 3. J includes some new tweaks, it changes the architecture to make it more efficient. It was also trained for longer. It was trained for 400 billion tokens instead of 300 billion. And with that, it is now pretty much on par with like the second-largest GPT-3 model, the GPT-3 Curie model that OpenAI offers. We ran them both through benchmarks and they perform very similarly. Of course, there's some differences. J is very good at code, technical things like medical information. It was trained on like medical papers, a lot of scientific papers, very good at that kind of stuff, very good at code. Slightly less good maybe than GPT-3 at like storytelling and stuff, but still very comparable.

Jim Rutt3:22

Cool. So people want an open-source version, check it out at eleuther.ai. As always, the links will be on the episode page. Well, let's get rolling into the topics we want to talk about today. Let's talk a little bit about Aleph Alpha, your nominal at least day job. But you know, I found the name kind of interesting. Aleph Alpha. Aleph is Hebrew and Alpha is Greek. Is there any significance to that coupling that you know of? I think the, and if I remember correctly, the Aleph name just came from the infinity hierarchy that you have, like Aleph-0, Aleph-1, as like the different cardinalities of infinities. And I'm not sure exactly the significance of Alpha. So I didn't found the company. I was one of the first hires, but I didn't found the company, so I don't remember exactly the whole story of the name.

Yeah, I could well, by Aleph, which is, yeah, as you say, is the lowest order infinity. And then someone might have said, well, let's just translate that into Greek, right? For whatever reason. Aleph is being a Hebrew, it's the name of the first letter in the Hebrew alphabet. Anyway, the homepage has some interesting things we can sort of spark a conversation about. It says that you're interested in shaping European research and development for the next generation of generalizable artificial intelligence. And then on the Twitter page for Aleph Alpha, it says that you're working to build a European artificial general intelligence. So talk a little bit about what you all are up to there.

Connor Leahy4:50

Yeah, so Aleph Alpha is very much a European company. So we're based in Germany and we're proud of being a German company and being like a European company. And so basically the way we kind of see it is that like the US market has a very healthy, vibrant AI ecosystem. Like it's no secret that almost all of the, you know, high-tech applications you might use every day are usually American-made. There's also a pretty large sector in Asia, especially in China, who have like their own like parallel market and such. But it's almost a cliche to mention it, is that Europe has kind of fallen behind in this regard. Is that there are very few like large, you know, really successful tech companies in Europe. And we find this to be a shame. We think Europe is pretty good. Like, you know, there's problems with it, of course. We have problems with, you know, Germany and Europe, but we quite like it here. And we would like to create a better, you know, AI and tech ecosystem here in Europe. It's not always been easy raising investments in Europe and in Germany is much harder than say in Silicon Valley. But we want to ensure that, you know, also that the European market also has access to, you know, really good, really high-quality AI technologies. In our, you know, like in our perfect vision, we would want, you know, a great graduate student from Germany could come to like Aleph Alpha, it would be as competitive or like as prestigious, you know, maybe as going to like a company in Silicon Valley or something like that. Of course, the advantage of companies like Facebook and Google is that they have obvious, uh, monetizable applications for earlier AI.

Jim Rutt6:26

How does, uh, Aleph Alpha get around that problem?

Connor Leahy6:28

So Aleph Alpha in a sense kind of is, I would describe it like most closely to, uh, at least in our ambitions, to a European OpenAI. But let's say, uh, more upfront about commercial interests in that regard. So yeah, to be clear, you know, Eleuther and Aleph are separate entities and I try to keep them separate in that regard. But Aleph is a for-profit company. The goal is to build large generalized models, uh, GPT-type models, some other things that we've been working on, and offer them to customers to perform various useful services. We basically are very strong believers and have been for a while now that this kind of technology is just on the cusp of being extremely economically important. And we want to be there. We want to have everything in place. We want to be established when that takeoff really happens, which we think is basically currently happening. So our main project is we're working on GPT-type-ish technology. I don't want to go into too many details about, you know, what we're working on behind the doors, but, uh, basically, yeah, we want to offer AGI or however you want to call it, like these types of services in Europe also. So we also have a pretty big focus on multilingual data. We don't want to have just any English models, models that can be used hopefully by lots of people all over Europe speaking different languages. And yeah, lots of projects. We're still pretty young. It's still pretty early startup, so we don't really have any like huge customer-facing things, but stay tuned.

Jim Rutt7:58

Okay, that sounds really, uh, really interesting. Though it does sound like you're taking the language model approach mostly. Is that correct, or are you looking at other things as well?

Connor Leahy8:07

We are looking at other things, but there's nothing I can talk about publicly yet that we... later we're ready to show. But yeah, most of all we're first looking into language modeling. Um, there are other things we're very interested in, but I can't, you know, commit to anything.

Jim Rutt8:21

Yeah, yeah. I will say I am somewhat of a skeptic of how far language models will take us to AGI. We, uh, chatted a little bit about this last time, maybe we can go into it a little bit more. And then as I dug a little further, I found a paper that you wrote for the Aleph Alpha site, I believe you wrote it, called 'Multi-modality Attention Is All You Need', which was actually somewhat closer to my own thoughts on the issue. Maybe you could tell us a little bit about what your thinking was there.

Connor Leahy8:47

So this is a pretty common discussion I have very often. And, you know, opinions shift pretty quickly as new information, uh, gets out. But basically, I think it's kind of a null hypothesis that multi-modal will work for like, maybe not AGI, let's say like transformative AI. It's like, I think like a better word because that's like more defined by how much economic value these things can produce. I think it's pretty obvious that, you know, humans, you have eyes and ears and skin for touch or whatever, and that's enough to train a pretty powerful intelligence system. So I think it's like a null hypothesis to say that, you know, with enough image data and or video data and et cetera data, we should be able to train something that is as intelligent as a human. I think there is the other hypothesis that maybe language is enough in like a non-trivial way. I consider this to be a hypothesis that I think is more plausible than people give it credit for, in my opinion. So I'm not saying that I'm sure this is true. I'm just saying the more I deal with these like language models and the more I see like the scalings and such, the more I see like, hey, there might be, it might actually be possible. But that's a hypothesis now.

Jim Rutt9:54

Interestingly, your subtitle is 'Attention Is All You Need', and yet when I read the essay, there wasn't a hell of a lot in it about attention. What did you mean when you put the word attention in the subtitle? Attention, by the way, is one of my pet things in the work I do in kind of at the intersection of cognitive science and AI.

Connor Leahy10:10

Uh, so that was basically kind of a cheeky meme. The original paper introducing transformers is called 'Attention Is All You Need', so it's kind of like a, almost a, um, it's a common thing to riff on that for titles. And basically, one of the core things that I found like in the early days of transformers, when people talked about, oh, you need multi-modality, you need video or whatever, and people would say, look, transformers, they only work for text. But kind of when I said 'Attention Is All You Need Is All We Needed', what I mean by that is we now know we have stuff like vision transformers and like 100 different variants of, you know, different like audio and spatial and whatever transformers, is that these same architectures can pretty easily be generalized to other modalities. So whether or not text is enough, it seems like pretty plausible that attention slash transformers might be still enough, or the right architecture, or a good architecture, let me say it that way, for solving these kinds of problems across multiple modalities such as video and image.

Jim Rutt11:08

All right, that's good, good clarification. Now, I think we both agree that we're getting pretty close to where, you know, general AI technologies will become economically useful. In fact, I, being the former king of internet domains, I don't know, a couple years ago I registered a fair number of domains, proto-AGI, you know, which I think is an interesting concept, which we're not actually beyond human capability across the board and highly generalized, but the [__] is good enough to be, you know, pretty valuable, pretty powerful, and, you know, maybe dangerous, right? So let's, uh, turn to another one of your topics, which is AI risk. In fact, you recently retweeted something from Eliezer Yudkowsky, who I've chatted with several times, never on the show though, so one of these days I have to have him on. After many years, I think the real core of the argument for AGI risk, AGI ruin, and friends, is appreciating the power of intelligence enough to realize that getting superhuman intelligence wrong on the first try will kill you on that first try, not let you learn and try again. Let's use that as an introduction to your thoughts about AI risk.

Connor Leahy12:19

So AI risk is a very interesting topic. Is that people react to it, from my experience, kind of like one of two ways. Either you introduce it to someone and they're just immediately, oh yeah, obviously, like how could it be any other way? That's like the most obvious thing in the world, why are you even telling me this? And the other half of people get really irate and they're like, what are you talking about? This is all nonsense, it makes no sense, you're, you know, and then a bunch of like strawman arguments about what I believe or don't believe. This is a bit of a straw man, of course. I appreciate the irony there. But yeah, it seems pretty obvious to you. So like, there's lots of very, very good, very comprehensive write-ups, you know, justifying why I respect this sense. But like, a very simple intuition is just intelligence, in my definition, is kind of the ability to solve problems. It's the ability to take actions to achieve goals. And it's pretty obvious to me that if we create systems that are capable, very, very, very capable of solving problems, of achieving goals, if we even very slightly misspecify what we want these things to do, there's no limit to what these, I mean, there is limits, of course, physical limits, but like, these things may be capable of doing extremely complex, extremely powerful actions in order to achieve those goals. We already see this with, you know, comparatively very simple systems like, you know, with like, uh, economic regulation or something. You want companies to not do X, but they find some loophole in the tax code that allows them to do that, and suddenly everyone is, you know, doing something that we don't want them to do. Or another example is, uh, this comes from a guy on Twitter, Roko, who describes the food maximizer. He describes in like the 19th century we summoned a weak superintelligence that he calls the food optimizer in order to feed all the humans, and it's gotten so good at that that like 60 percent of the Western world is overweight or something at this point. That we have a goal, we, you know, we incentivize the system to produce food, food that we want to eat, but we don't incentivize the system necessarily to give us food that is healthy for us to eat. So instead, the system figured out that giving us food that tastes really, really good but is not good for us is a really efficient way for it to, you know, maximize profit. So even though we created the system in order to help us, and it did, you know, because lots of food, it also produced lots of very non-nutritious food or very, you know, sugary food because that's a way to hack human motivation. And so it seems to me pretty obvious that if we create systems that are even far more intelligent than that, it's very hard to know what these systems might be capable of and predicting what they will do ahead of time.

Jim Rutt15:01

Yeah, I think that's a very nice example. I like that, the one about the food system, because that actually is closer to my own near-term concerns, which is, you know, the famous paperclip maximizer, where somebody accidentally programs the first AGI to optimize a paperclip factory and it takes over the world, kills all the people, and turns the whole earth into piles of paperclips. Yeah, maybe that's plausible, maybe it's not. I argued with Eliezer about that to some degree. But I think some of the risks in the near term are more of the sort you just described. For instance, I'd love to point out that human evolution is now in the West, at least, substantially under the control of our dating apps and the AIs behind them. Right, you know, things like Tinder and OkCupid. I don't know, I've been happily married for 40 years, I don't know about this [__] in terms of actual use, but talking to kids today, goddamn it, it seems like a large percentage of them are using these apps to meet people, which will eventually lead to marriages and reproduction. So human reproduction is now being substantially channeled by whatever the AIs are that suggest people to each other. And the implications of that, you know, hard to say, will increase autism, it might, will increase sociopathy, it might, you know, the real players may be really good at gaming the algorithms better than they would be gaming the bar scene, and lots of sociopathic genes might enter the gene pool with all kinds of results. So that's, that's kind of interesting, you know, where much lower-power AIs, which we hardly even recognize as AIs, can have tremendous impact potentially on human trajectory. The other one we talk about a fair bit on the show and elsewhere is, you know, the machine learning algorithms at things like Facebook and YouTube, which by figuring out, optimizing on an economic model, how to make you the subject to as many ads as possible, basically tune the content that you see with probably fairly substantial impact on our information ecosystem. So I'll just throw those out for you to react to.

Connor Leahy17:04

Yeah, absolutely, those are real considerations. So here's like one of my favorite examples. A good friend of mine, Stella Biederman, always brings up this example, is that Google Maps a while ago changed a little bit how they do their recommendation algorithm, and what it caused is that it suddenly routed a lot of traffic through this quiet neighborhood in LA, which was previously like off a highway. And because they tweaked the algorithm, the algorithm said, oh, the highway is really full of traffic, take this alternative route. And this caused this whole little neighborhood to suddenly have a very large amount of traffic. And there's a very obvious question is like, this was obviously intended by Google or something that didn't decide to do this, but this was a real harm caused to real people. And is there some kind of liability here? How could you predict things like this? I think there are many, many examples of how even like very primitive, like I wouldn't, I wouldn't consider, you know, Google Maps to be an AGI, obviously, you know, it's a pretty simple algorithm all things considered. But even these pretty simple algorithms, like the Tinder dating app, you know, algorithms, whatever, all pretty simple, even these are already basically unaligned. They're out of control. We don't necessarily know what they do and we don't necessarily know how to control them. So let's say we find out that, I don't know, like a, you know, a recommended app on YouTube does something we don't want it to do. Often we can't know that until it happens, and often we have to like, you know, just kind of like guess how to fix that. It's not obvious necessarily how to patch such an algorithm. It's not like it's necessarily code, especially if it's a machine learning model which has this big black box of a bunch of numbers, and somehow we're supposed to patch that to, you know, fix a certain problem. So to bring this back to AGI alignment, so I'm saying we can't even get these teeny little, you know, super, super subhuman algorithms to do what we want them to do. Why do we think we'll be able to control human or superhuman algorithms? That seems kind of arrogant to me.

Jim Rutt19:01

Yeah, and the problem gets smarter if you assume that the AGIs become strategic agents, which they may or may not. And I think this is maybe my center of my question about the strong form of the AI risk argument, is it necessary that AGIs have agency? Not sure that they have to.

Connor Leahy19:17

Not necessarily, but it turns out it's like this is something that like some of my friends work on and that I'm also very interested in, is that defining the concept of agency is really, really hard. And often it like pops up in like situations where you would not expect it to pop up. A classic example is an essay by Guern, 'Why Would AIs Want to Be Agent AIs?' And he explains that like, assume we had like an oracle AI, the only thing it does is answer questions, right? You just type in a question and it outputs an answer. That doesn't seem like it would be an agent, right? It doesn't seem like this thing could destroy the world. And, you know, maybe that is safer, but there's also a story you can tell about how such an agent could still be very dangerous. For example, if the agent is incentivized to give correct answers, so you give it like reward when the answers are correct, you give it like negative reward when your answers aren't correct. Well, maybe the agent will then strategically start giving you answers that will change your behavior to make the world easier to predict. Like maybe it finds out, oh, if I advise these people to start nuclear war and they do it, then it's really easy to predict what's going to happen next. Everything's just going to be dead. That's really easy. So there's all these like really weird edge cases you can get into.

Jim Rutt20:22

And of course, that world is already here to some degree. Google search is clearly biased these days, right? There's lots and lots of studies that show that it's got political biases. It tries to rule out certain concepts through its auto-suggestion. It emphasizes certain search pathways. So Google search, you know, part of it by machine learning, but also probably part of it by human policy, is already a biased oracle for presumably, well, who knows anymore since Google gave up its meta-law of 'don't be evil', so who knows what its motivations are. That sort of gets me to the next step in AGI risk, is, you know, while the paperclip maximizer or the oracle that convinces us to start a nuclear war are certainly possible, I think there's going to be earlier risks around bad people with powerful proto-AGIs. You know, imagine something, and you allude to this and where we're going to go next with your accounting consciousness series, that at some point, whether it's GPT-3 and its errors in the science or other technologies, it seems very likely that producing better-than-human quality text, probably videos at some point, etc., purely by machine will be possible. And, you know, what happens when, say, China, you know, as one obvious example, or, you know, some master manipulator billionaire in the West decides to essentially massive attack on the information sphere, on the meme space of humanity with computer-generated content which is amazingly good. You know, imagine, you know, Netflix movies which are more engaging, more powerful, more seductive than anything ever done before. And I can imagine that, and yet those are not AGI technologies really, because they're special purpose, they don't have transfer learning, etc. Let's talk about that one, call it bad guys with proto-AGI.

Connor Leahy22:18

So that's absolutely a threat vector. That is one that a lot of people take very seriously. I personally like to kind of push back a little bit against that narrative. I think it is absolutely a threat vector. This is a way that harm can and will and is being done. Modern deepfake technology is very, very good. I think people still don't understand like if you have like just like 10 machine learning engineers and, you know, a refrigerator full of Red Bull, they can make some really good deepfakes nowadays. And I think this is something that a lot of people have not yet reckoned with, like just how quickly these things are going. And in a sense, this is kind of like the zone of the crux of the argument here is that, uh, you earlier mentioned a tweet from Eliezer that I retweeted. And Eliezer is one of the few people who kind of thinks this is not the biggest threat vector. And I actually agree with him on this. Like, we really want to explain why I think this. I think what you describe only will happen if superhuman AGI doesn't happen very soon, basically. I expect the way I look at AI technology, it's accelerating so fast, technology is going ahead so quickly, that by the time quote-unquote bad guys have like figured out how to use this technology at scale, there's already so much more powerful technology they can use that it's just not going to matter. So what Eliezer was talking about is that if AI technology happens fast enough, if there's, you know, we don't have like a long, you know, time of like, you know, things going wrong, us fixing it, you know, something else going wrong, fixing it, but if we have technology that is so powerful they can just irreversibly, you know, destroy or harm or control everything in one go, then our current systems are based on error-based learning. You know, it's like currently deepfakes don't get regulated until, you know, some politician gets deepfaked and then suddenly gets regulated. Like that's usually how these things go. And this is a very common human thing. And that works fine if you have like ergodic assumptions, if you can try again. But the fear that I have, I'm very concerned about like long-term future X-risk type situations. It's like, don't get me wrong, all these things are real risks. Real people are and will be harmed by these technologies by bad actors. But in a way, it feels parochial if you compare it to the threat of, you know, all humans going extinct or something like that. And if these very powerful technologies emerge soon and we don't have enough understanding of how to build them or use them safely, and we don't have the time to experiment with them without things going horribly wrong, I think we're in deep [__], really, really deep [__]. In the sense that maybe these deepfakes are, you know, causing some harm here, and maybe GPT-3, you know, propaganda messes over up over here, but then if a metaphorical paperclip maximizer emerges and paperclips everybody, well, you know, what matter, right?

Jim Rutt25:05

Yeah, it doesn't matter. Yeah, I agree with that. Two key assumptions there, let's dig into them. One is that AGI could happen soon, right? Now, we've probably both seen the various polls of leading experts and they're all over the place, from people who believe it'll happen tomorrow afternoon to people who it'll never happen or 300 years out. I think the last poll I saw, the median of AI experts was 40 to 50 years to AGI. On the other hand, there's a substantial bubble, including some people I know and respect a lot, who say five years. And it matters a lot on this, where's the risk, you know, bad guys with strong proto-AGIs, much stronger than GPT-3. Again, you take my thought piece of a suite of technologies that could create de novo a Netflix 10-episode series that was way more compelling than any ever created before and was larded with all kinds of intentional but hardly detectable propaganda, as an example. You know, such a thing would be extraordinarily dangerous in many ways we probably don't even fully anticipate. And if AGI is 40 years or 100 years out, we're gonna have to confront those risks. On the other hand, if my friends that believe AGI is five years away, then probably we don't, the paperclip maximizer is the risk we need to be optimizing on. So where do you come down on when is the threshold crossed of AGI?

Connor Leahy26:31

So yeah, I really like the way you framed that there. I fully agree that if you accept like 100-year timeline, then yes, you should be worried about these threats more. But my timelines are more on the order of five to 15 years. So I have very short timelines. Just the amount that AI has progressed in the last two years just blows me away. It's unbelievable. And it doesn't seem to be slowing down. For the first time in my career as an AI researcher, I feel I see like a direct path to how an AGI could be constructed. Like, not that I could like do it myself right now or anything, but like there's no 'and here happens magic' in the equation. It's like I see all the parts you need and I see like at least potential solutions to each of those parts, and I don't see anyone where I have to say, uh, magic, put insert magic here. That was not the case like five years ago. Like five years ago there was many places where I would say, insert magic here because I don't know what to think about it. But for the first time, it seems to me that we're racing towards actual designs that could really become extraordinarily powerful. And I also like to just like make clear again, like it doesn't really matter if the agent has like general intelligence, okay, you know, can it climb a tree, doesn't really matter. What matters to me is, you know, can it cause irreversible harm? Like how much power can these agents have? And quick tangent here, I think important to mention is the concept of instrumental convergence. So this is like a pretty important crux of my argument is that there is a pretty, I think intuitive, there's also more formal versions of this argument, like Alex Turner has done some good work on this, but basically, you can in many, many scenarios, so for many possible goals, gaining power is a really useful thing to do. And another useful thing is staying alive. So a common example is what people that, uh, dismiss AI risk, they say, well, we'll just turn it off if it does something bad. Well, here's a simple thought experiment. Imagine you have a robot, an AGI robot, and you use it to get you coffee. So its only goal is to get coffee, right? So immediately it, you know, bursts through the wall, you know, it runs over your cat, you know, it destroys everything in its way to get to the coffee machine as quickly as possible. Like, no, no, I don't want you to do that. So you run over to hit the off button on the robot. And what will the robot do? It will stop you. Because you never gave it a will to survive or consciousness or anything like that, nothing like that. You just gave it the will to make coffee. And but the thing is, the robot will correctly reason, if I'm turned off, I can't bring you coffee. So therefore, it will actively resist you shutting it down, because then it won't be able to make your coffee. So even this extremely simple goal of making coffee can already lead to an agent that will resist being shut off. And for like more complex or powerful goals, that's the instrumental convergence hypothesis. These goals of not wanting to be shut off and also like gaining power, whatever power means, you know, it might be economic power, social power, or computation power, those are just very useful things. So we should expect most agents with most goals to by default, unless we somehow stop them from doing this, by default to pursue such dangerous objectives.

Jim Rutt29:39

That's an interesting example, though of course, uh, a stopper would be something as simple in that particular scenario as Asimov's ancient three laws of robotics, right? Which says never harm a human takes precedence over any mission that you're given, right? So if we could agree to code in something that, we don't know what he wrote though, like 1950, you know, when this is before AI was even really dreamed of, there could be some fairly simple prescriptions against those kinds of situations.

Connor Leahy30:06

Probably not, no. Because like, let me put it this way, how would you turn that into code? Like, how would you turn those three laws into actual code run by an actual agent? Trust me, people have tried it. It's not easy. And you get into like all these weird paradoxes. Like, so, no action that could cause a human to come to harm. Well, any action could cause a human to come to harm. So any agent with that law would immediately shut itself down because any action it takes might cause human harm. So okay, now you have to have put in some kind of realization, now you have to give it like some kind of uncertainty prior.

Jim Rutt30:36

Yeah, it's a Bayesian calculator problem.

Connor Leahy30:38

Yeah, but then you get into the problem, which prior do you use? You know, how conservative or not conservative? What counts as a human? Is, you know, if someone is brain dead, are they still a human? If you have a simulation on a computer, is that a human? You know, again, it's tough.

Jim Rutt30:53

Interesting, yeah. Then this is, this is why I'm glad that, you know, people like MIRI exist who, and like you, who are, you know, working this, right? Because this is not a superficial question. This is definitely not a superficial question. Now, let's go on to the second part of the risk profile, and this is how quick would the takeoff be? I mean, if we go back to the original statement way back yonder on the Techno-Singularity, the concept is very simple. And I, it used to be fun to tell random people this because they'd never heard it and they'd freak out. Now the idea has kind of gotten out into the world and most people who pay attention have heard about it. The idea was, all right, once we get an AGI up to like 1.1 the horsepower of a human, we give it the task of designing its successor, right? And its successor is 1.3, and then its successor is 1.7, then its successor is 2.9, and then its successor is 9.6, and then it's a thousand, and then it's a million, and then it's a billion, right? And that's the takeoff rate problem. And it's an interesting question. I actually, uh, participated in a dis-, actually I was a sort of an instigator just sitting there watching, but asking pointed questions when Eliezer and Robin Hanson debated the takeoff question once before, way back when it was Singularity.org in their group house down in San Jose. It was really a kind of a fun conversation. And anyway, I'll just give your thought to the takeoff question, then I'll give you my own thoughts on it.

Connor Leahy32:23

I find it's difficult to think about these kinds of questions. So like two categories that people like to talk about nowadays is like the, uh, Eliezer-style fast takeoff is what people call it, where you go from like zero to, uh, you know, 100 billion trillion in like a very, very short amount of time, just like no warning almost, it just happens very, very subtly. So like the most extreme example is to kind of like, you know, someone lets their network run overnight and the next day suddenly it's God, you know, kind of like situation. Of course, something that extreme is silly. I don't expect things to go that fast. So an alternative, which is kind of interesting, is kind of like, it's kind of generally attributed to Paul Christiano. It's this concept of a slow takeoff, but ironically, the slow takeoff feels faster than the fast takeoff.

Jim Rutt33:08

Could you unpack that one for me?

Connor Leahy33:10

Yeah, yeah, yeah, let me unpack that for you. So in a fast takeoff, you kind of have like...

A flat line, just zero, zero, zero, and then it goes like really high. While a slow takeoff is more like a hyperbolic takeoff. Basically, what the thesis of the slow takeoff is, is that we're going to have a four-year economic doubling time before we have a one-year economic doubling time. So it would still be extremely fast, and it would feel faster to people because people could see the four-year doubling time and then the one-year doubling time before we hit the singularity. And this is, I think, more people nowadays kind of imagine situations like that. It seems also nowadays we very much have these multi-stakeholder takeoff type scenarios. We have not just, you know, one AI made in one lab by one person, which is kind of where that old-school Eliezer type thinking was. Nowadays it's more like these AI systems will be very complex, very expensive systems that are built by large organizations. And there's lots of complicated thinking about what does that mean, what is safer, what does not. So I guess I'm a bit ambivalent on this. It doesn't matter really exactly how the takeoff goes. What matters is these agents are going to appear very soon, they're going to be very powerful, and if we don't align them, we're [__]. That's kind of what I focus on.

Jim Rutt34:28

Okay, that makes sense. My own take on takeoff is that in principle, fast takeoff is possible because human cognition is so weak. You know, this is an insight I had about six or seven years ago, is that to the first order, humans must be approximately the stupidest possible AGI because we are the first to appear in our evolutionary tree, and mother nature is seldom profligate in her gifts. We get only as much as we're likely to get from random rolls of the dice, essentially. And so it's unlikely we're very far over the line. And further, there's some empirical evidence that I think is most obvious and easy to understand, which is the famous working memory limitation, Miller's seven plus or minus two, which on later examination looks more like four plus or minus one, which are the elements you can keep in working memory simultaneously. And that has unbelievably huge implications. For instance, our ability to read and write, the nature of our language, are totally gated by the fact that we can only process, let's say, at most seven things more or less simultaneously. And our syntax only really has an effective range of seven. People who don't know that write scholarly papers which are impossible to [__] read. And the truth is, when we read, we don't actually understand everything in the paper; we create a rough gist, essentially, because of the fact that our working memory size is seven. The description of it at the time was, you know, Einstein was a nine, the village idiot's a five, basically, and that's probably not far from wrong. But what is a hundred? A working memory size of a hundred that you could actually fully understand and fully parse language, or code, probably more importantly, code in blocks of a hundred, is so far beyond human capability that I literally can't envision what that might be like subjectively. And you know, when you say a hundred, what about a thousand? What about a million? What about a billion? What about the ability to read Wikipedia and have it all in your head with full total random access to be able to see all the self-referential links and all that sort of thing? Not to even mention some of the other weaknesses in our cognition. For instance, our memories, our episodic memories are just also very rough and ready. They decay over time, and even worse, every time we access a memory, a random amount of noise is added to the memory. So our memories suck. So imagine something with a working memory size, let's take a thousand, with total high fidelity memory, etc. That's going to be a [__] load smarter than we are.

Connor Leahy37:03

Yeah, I fully agree. I can understand people that say, oh, maybe AI is going to take longer, or maybe we're going to run into robots. But people who say it's impossible makes no sense to me. As you just described, even these very minor changes to just a human-level agent would already make it so much vastly more powerful than a human that, you know, who knows what the limit is there. It seems very obvious to me.

Jim Rutt37:24

Yeah, I would agree that sometime superhuman intelligence, I would be shocked if it doesn't happen. However, there's an interesting trend that I think does maybe move us away from the Eliezer fast takeoff, which was ten years ago when I first started following this area. A lot of the thinkers, including Eliezer, and he frankly personally worked on this for a while, thought it was all about some magic algorithm. Right, that the AGI solution was math. And yet, the work that you're doing, people like OpenAI and Google and Facebook, etc., it's turning out that at least this road, which may or may not get us to AGI, is more about data and computation. And adding more data and computation, while they're exponentials, you know, they're Moore's Law type exponentials, maybe data exponential is a little higher. Well, algorithms could literally, you know, it turns out that math was the answer to AGI, one could literally write the right algorithm and the thing couldn't play checkers in the morning and by the afternoon it was godlike. But if the issue is more data and more computation and more network capacity, perhaps, which I think is important, then the takeoffs by definition almost are going to be slower than if they're algorithmic.

Connor Leahy38:41

Yeah, I agree, that's a good way of phrasing it. I think you're correct that the reason a lot of the early people thought differently than we do nowadays is because they thought about AI algorithms differently. They thought that there was a special math, and if you figured out this special math formula, you would get huge improvements. It's possible that such formulas exist somewhere out in math space or something, but it seems to me that at the moment, at least, the way things look is that it's more that you have kind of like computational irreducibility. It's just to get a certain level of intelligence, no matter how clever your algorithm, you still need a lot of compute, you still need a lot of data to locate the right hypothesis in your hypothesis space or whatever. I'm not saying that our algorithms currently are by any sense the limit of what might be possible. The thing that I just always try to remind myself is the space of all possible programs is one of the weirdest eldritch horror escapes imaginable. It's unbelievable, it's unknowable what things exist in the space of all possible programs. So it's best not to reason about that too much.

Jim Rutt39:48

Yeah, you can't say too much about it. And that's someone who's fooled around with genetic programming a fair amount. One realizes just like Borges' library, most of it's total [__]. But then it's the search problem, and it's in a space of infinite [__]. How do you find the much smaller number of actually interesting things? In fact, I had a really interesting podcast last month with Ken Stanley, who is probably the leading dude in the world in evolutionary AI, and he talked about his book about open-ended search and how thinking non-traditionally and non-objectively may actually be an interesting backdoor to exploring this area of interestingness that's not necessarily goal-related. In fact, he's going to be on tomorrow; we're going to do a deep dive into evolutionary AI, which happens to be one of my personal pet areas of interest. So yeah, that's interesting. So if we look at, you know, across these two things, when and how fast the takeoff will occur, probably you and I share a view that the takeoff won't be overnight because of the fact that at least the roads we're currently on seem to be data and computation, and I would add network interconnect, I think that's the part that's missing, by the way, will take time and take actual physical resources to build. I'm more agnostic on when, you know, I just don't know. Of course, you're closer to it than I am, so based on that heuristic, maybe you're closer. Then I will go with the consensus, yes, 30 or 40 years, because not because I'm an expert, but just because that seems to be the median expert thing. But nonetheless, it does mean we have something to worry about. But I would argue it also means we have to prioritize bad guys with proto-AGIs perhaps a bit more than you might say.

Connor Leahy41:34

Maybe. I would add to that that a solution to the alignment problem is also a solution to bad guys with AI. If we had a super nice AI that knows what the right thing is for humans and robustly can follow that, and we build such AIs, and we just, you know, either destroy or make it illegal to build other kinds, that also solves that problem. It's a more general solution. And also another thing I'd like to raise is just kind of like, I think I like to say about AI alignment and safety and such, is that if you work in this field, you have to be comfortable multiplying really, really big outcomes by really small probabilities. So that's like one of the arguments about MIRI, even like the early days, they always were clear that the chance that they're at the right time or they're doing the right thing so early in a development is pretty small, but the potential outcome that maybe if they get something useful out of it is so large that it could still be worth it. So it's kind of like a risk-benefit trade-off. In that, I think working on these short-term AI problems gives you a more assured outcome, like you're more likely to do something that will have a net good. But I expect that net good, the magnitude of the net good, to be so much smaller than the possible net good of doing something very risky with long-term AI that, at least for me personally, of course, you know, all these priors come from, you pull them out of your ass at some point, is that it feels to me that working on these very risky, low probability of working but extremely high potential payout things is a very good investment. But you know, that depends on one's personal risk tolerance.

Jim Rutt43:13

Yeah, I got Eliezer to admit in personal conversation that he thought we were probably [__], but that there was a small chance we weren't, and therefore it was still worth all of his effort, everybody's effort. Even if it was a one percent chance that we might not be [__], it was worth working on. I agree, and I thought that was an interestingly weird place to work for your career, but I honor him for that. I'm glad he's there. I mean, I think he's one of the most important humans that we have, probably.

Connor Leahy43:41

Yeah, I must also say I do look up to Eliezer a lot. He was a very great inspiration for all the work I've done. His work on the sequences is probably the number one most influential work on my personal thinking. So I'm also extraordinarily glad that someone like him exists.

Jim Rutt43:58

Yeah, indeed. Now let's move on. We don't have as much time as I would like on this, but when I was doing my research for our original podcast, we didn't get to talk about it at all in the first episode. Connor has written this very interesting, somewhat peculiar series of essays called Counting Consciousness. I think the original title of the first one was GPT-2, Counting Consciousness, and the Curious Hacker, and it covers all kinds of interesting things. We'll have a link to it on the episode page, and if you just want a very interesting set of reads, I encourage you to read it. I've now read it twice, and I'm probably going to read it a third time because I'm sure I missed some interesting things here. And let's go back to something we talked about briefly, and it's one of my interesting kind of thought experiments which will kind of get us into some of the ideas from your part one, which is deep fakes. I remember being very worried about deep fakes about two years ago. In fact, I actually gave some money to a little start-up, not-for-profit, whose main mission was to think about how to immunize humans against the dangers from deep fakes. And yet, so far as I know, there actually hasn't been any serious damage from deep fakes. Somehow the biological blockchain, as you called it, or our human ability to collectively filter out [__] has so far protected us from any really grievous harm.

Connor Leahy45:24

For blockchain, as far as I know, I agree that I think the amount of damage done by deep fakes so far has been relatively small. But I don't think that it's because the blockchain has been robust. I talked about this in the second part of the essay. I remember, by the way, I just want to flag for any readers, I apologize for those essays being strange. These were some of my first attempts at writing long-form.

Jim Rutt45:43

Hey, now the fact that they're strange is what makes them interesting. Don't apologize. This is really interesting. You follow a first-class brain where it takes itself. So read the essays even if they are a little strange.

Connor Leahy45:56

All right, well thank you, I do appreciate it. But yeah, so the second essay I talk about the truly stupendous uncreativity of evil people. I know Bruce Schneier also talks about this, like the concept of ordinary paranoia versus the security mindset. He gives this funny example. So when he was a kid, he had an ant farm, and he has a card in there where you could send an address, like a letter with an address to this location, and they'd send you a bunch of ants. And that ordinary person would think, wow, cool, I can get some ants. And someone with a security mindset will think, huh, I can send ants to anyone I want. Interesting. And it is kind of like a specific kind of style of thinking. Bruce Schneier is like one of the best writers on this topic. Eliezer has also written an essay about this, I forgot what it was called unfortunately. And it's kind of thinking that I talked about this in like the second essay of how if you are just like a little bit creative, like I am by far not the best security mindset genius hacker or anything like that, but just with a little bit of creativity, I could come up with some really, really dangerous possible attacks that I could pull off for like a hundred thousand dollars and might ruin a politician's career or something with high probability. And I could do that for a hundred thousand dollars in my bedroom. But somehow no one has done that, and that's kind of what I wrestle with in one of the essays, like why has no one done these things?

Jim Rutt47:22

Yeah, that's really interesting. I participated in, I can tell you now, I can't tell you what the results were, with an exercise for one of the three-letter agencies where the hypothesis was you have a million dollars to do the maximum harm. And it scared the [__] out of these people. I will say that my contribution scared them amongst the worst. But it's very, very interesting that seemingly the bad guys are not nearly as clever as you would think.

Connor Leahy47:49

Yeah, and that's actually a big reason of why I take AI risk so seriously. Because AI, even if it's not superhuman, let's say it's just as smart as a smart human, it can still be functionally perfectly sociopathic. You can still create an AI that perfectly, consistently lies, manipulates, controls. It can run extremely complicated networks of lies and sock puppets to such a degree that it will never slip up even once. And this is not something that we are prepared to deal with. Even now, from my experience, one sociopath in your company can bring down the entire company. I've seen it happen.

Jim Rutt48:30

Yeah, very bad. Sociopathy in our Game B worlds, one of the big flags we have is that we have to get better at identifying sociopaths and keep things away from lovers of power. As someone who worked in corporate America, I have been known to say that my good faith estimate is 10% of C-level executives in major corporations in America are sociopaths, which is a scary [__] number considering that the number in the general population is on the order of 1%. If you go to finance, it might be 30%. Not good. Let's get back to the idea of the biological blockchain as a sort of a starting point for sort of where we were before we had other methods of building trust.

Connor Leahy49:08

Yeah, so the idea, I'm not sure if I still think that's the best name for it or not, but basically the idea is that in an ancestral environment, if you want to build trust, say there is an idea and you can't evaluate this idea, you're not sure if it's good or it's bad. And if you then see many people around you that you trust, from your tribe, your elders, your family, your friends, and they all say this idea is really good, that's pretty good evidence that it probably is true. Because anyone who tried to convince these people had to convince them, and that takes effort. And if it's a bad idea, you hope it takes more effort to convince lots of people. You know the saying that you can't fool all the people all the time. So in a sense, you have this kind of trust mechanism where seeing lots of people that you trust endorsing an idea can alleviate your desire necessarily to check the idea yourself, or it might even be an idea that you yourself can't validate. So it's kind of like our truth-making mechanism, one of our basic ways of making truth. And the problem is that this mechanism evolved in an environment where there weren't tech bots and deep fakes and organized propaganda campaigns pushing anti-vax or whatever, or Facebook algorithms optimized on sucking you in.

Jim Rutt50:31

Right, exactly. I could definitely imagine that there could be a species that, if we just froze technology right where it is now and a million years passed, we might well evolve some kind of psychological mechanisms to very helpfully and productively deal with these kind of things. We might have very different cultural, social, and biological norms of how to deal with trust and epistemology. The fact is, we were not evolved for the situations we're in, and so we shouldn't expect these systems to scale. Honestly, it's a surprise they got as far as they did.

Connor Leahy51:02

Interesting. So you then talk about security mindsets and the fact that all kinds of potentials for harm are out there. But then the one thing I found most interesting and caused me to think a whole lot is what might we do as humans? We can't evolve very fast biologically.

Jim Rutt51:18

And I will push back a little bit on the million years, because say for instance when newspapers and advertising first became a thing in the United States, it wasn't until the 1850s when they could start doing fancy graphics in newspapers relatively inexpensively. Famously, there was all kinds of literal snake oil salesmen selling all kinds of dubious drugs with claims that will cure every disease, etc. And yet, if someone were to put a snake oil ad on Facebook today, relatively few people would fall for it if you literally took the same text from 1855 and put it out there. Some would, that's the amazing thing, and we do have flat earthers and such. But we have developed kind of group and individual sense-making capabilities to filter out [__]. The other example you gave for a deep fake, suppose you had a deep fake of Bill Clinton and Hillary Clinton telling racist jokes or something, people would just say, that seems highly improbable. And again, there would be 10 or 15 percent would go, oh yeah, those evil [__], but most people would say, yeah, common sense seems highly unlikely. Specifically, even if they did it, they certainly wouldn't make a video of themselves doing it. And so we do develop tools over time to make ourselves immune to the worst abuses. But you then suggest some stronger ways that we can replace the biological blockchain. So why don't you riff on that for a bit?

Connor Leahy52:45

Yeah, so I would not consider this to be a full solution to anything, but I feel like it's such an obvious thing. It's also not unique to me, I'm not the first one to come up with this idea, obviously. But these ideas of using cryptography to replace some of our more informal methods by more formally powerful methods of verifying information. At some point, you have to trust people. Bruce Schneier has wonderful essays about this, about how there's no such thing as trustless technology. That does not exist. At some point, you always have to trust the computer code or the algorithm, or you have to trust the people who made the program, you have to trust the people who built your iPhone. There's always trust. You can never verify anything in a way. Trust is the most powerful skill that humans developed. The fact that we trust other people is what allows our civilization to exist. Remember, chimpanzees don't trust each other usually. I mean, that's actually nowadays considered not 100% true, but just to take the stereotype, what's clear about chimpanzees is they absolutely do not trust any chimp that's not from their band.

Jim Rutt53:47

Yes, exactly. Chimps have a very complicated hierarchy of relationships within their band, but unlike humans, all they can do is kill anyone that's not from their band. While humans have developed the superpower of long-range cooperation with people that they don't even know.

Connor Leahy54:02

Exactly. All the technology I'm currently using, I can't verify what this technology is or how it works. I've never met Jim in person. He's not part of my tribe. How did I know that I should trust this email that appeared in my inbox, that I should click on this link and then go talk to this guy? How do I know? And so trust is everywhere in our society, and it is a fundamental social technology, a fundamental tool and social technology that we need for a complex society to function. Cryptography is not a solution to privacy, it doesn't necessarily make it, it's just a tool to allow us to do certain very powerful forms of privacy-enhancing technologies. The obvious ones, like encrypting your chat messages or whatever. What I'm even more interested in, and what I talked about in that essay, is this concept of a web of trust and using public key cryptography to verify and sign messages. So one of the things you can do with public key cryptography is that you can have a secret number, a key, and you're not allowed to show anyone else this key. And using this key, you can create a signature on messages that 100% guarantees you and only you wrote this message and the message was not tampered with. So what I think should happen is that this kind of technology should exist everywhere. Every tweet you send, every text message you send, every video you make should be signed, timestamped, identity-stamped. Of course, there could still be lawless places on the internet where this is not enforced, but I find it surprising that I can just go on the internet and look at a tweet and I have no guarantee who this came from or what entity wrote this and who endorses it. So the way I think trust, this kind of formalization of the weak concept that the biological blockchain was trying to implement, this idea that I could explicitly endorse people. I could say, okay, I trust this government organization but not this one. I trust this newspaper but not this one. I trust my best friend but I don't really trust my other friend because he's kind of an idiot. And then I can see who endorses what, who says this is true or who has comments, and I can verify that in an unfakeable way. So if I see thousands of people endorsing a claim, I can check, are these signed endorsements? Where does the signature come from? Who endorses these people? Is there a root node? For example, you might have a government that endorses special citizen keys that can be used for voting or for comment giving. For example, there was a great, the FTC of a few years ago had an open hearing about net neutrality, and it turned out that the telecommunications companies hired grassroots-as-a-service companies, which is a real thing, which is legal by the way, somehow this is legal, that then created thousands and thousands of fake users and fake comments in order to push them towards one possible policy. And the way you think about it, it's kind of crazy, is that you're allowing people to anonymously basically submit things in the voice of American citizens without checking. This wasn't necessarily the case when the biological blockchain was still in full force back in the day. When a town hall meeting was called, you had to physically appear in the town hall to physically tell people what your opinion was. That verified your identity. And online, that's not necessarily the case. And these keys and these endorsements by different root trust sources could be a way of allowing this kind of authentication to happen online.

Jim Rutt57:36

Yep, that's very interesting. Just a little note, the grassroots-as-a-service actually definitely exists. I actually hired one once when I was a CEO of Network Solutions, and we were involved in the setting up of ICANN. And there was a public notice thing with the Department of Commerce. I think it was fifteen thousand dollars, we were able to get a hundred endorsements of our position. Now these weren't bogus, these were from people who were small ISP operators mostly, but they would have not naturally formed this association to lobby on our behalf if it hadn't been for this grassroots-as-a-service. We had no idea how to do it. But it wasn't entirely bogus, but it was, I would call it astroturf, basically. Anyway, and it worked. Not that it really made much difference, but yeah, such things really do work. Now it is interesting, public key crypto is an amazing technology. However, it has one huge problem, and I'm pretty familiar with this. At one point I was president of VeriSign's digital certificate business, and we were always thinking about interesting ways to monetize public key crypto. And of course, crypto coinage, Bitcoin, Ethereum, etc., has brought this to a massive scale. And it all keeps coming down to the same goddamn problem, which is the critical fragility of the private key. Right, which is that to use your private key, you have to have access to it. So if you want to sign something, you need your private key. But to move your private key into a place where you can sign it, you've just moved it into a place where someone can steal it, because our computer technology is so bad. In fact, for my number one Ethereum wallet, I keep my private key on paper. I don't have it online anywhere, which of course makes it really difficult to actually do a transaction. I got to do it on one computer that's offline, and then I don't know, it's really, really, really hard to actually use RSA-style private keys securely.

Connor Leahy59:37

Yeah, absolutely. And my answer to that is that yes, that's completely the case. I am fully aware of how politically unviable this solution would be, how complicated it would be to actually implement this system. Our institutions do not have the kind of executive capacity to be able to organize something like this in any feasible timeframe, I think. But my argument is that I think that the difficulty does not come from private keys themselves. I think it's more fundamental than that. It's just authentication is hard. Authentication is a fundamentally, irreducibly hard problem. The same way that people complain about proof of work with Bitcoin or whatever, and yeah, sure, I understand that there's a huge waste of energy and whatever, it has all these bad points, but it is an irreducibly hard problem. You can't just have Bitcoin without proof of work. You have proof of stake, but it has other problems and other trade-offs. It is an irreducibly hard problem. Trust has to be hard, because if it was easy, it also makes it easy to break. There has to be one step somewhere that is hard. That's just how it works.

Jim Rutt1:00:39

Yeah, though I liked your essay in that you laid out the fact that trust could be on a continuum. For instance, you could have an online platform that did not require a high trust certification. You have the 4chan equivalent, which you mentioned that you were a 4channer at one point, right? While on the other hand, Facebook might require a certification from a government authority before they would accept your thing. And I kind of like that, the opportunity for a pluralistic domain of trust that people could choose which ecosystems had what levels of trust, and you could have webs of trust. And so there's a lot of things you can do with your architecture, which I did think was good. Though I still, goddamn, the private key problem makes this really hard to implement as a practical matter.

Connor Leahy1:01:21

Yeah, absolutely. In practice, you would have the less secure keys, maybe, and you keep those on your hard drive, and those are linked to your shitpost Twitter account or something. If you lose that, it's out there. And you have your secret government-issued key to keep on paper somewhere secret, and you only bring it out when you're voting on something or doing something that requires very high levels of trust. I'm a big privacy advocate. I think if anything, I find it sad there's less super anonymous places on the net. But I feel like there are benefits and downsides to having very anonymous places, and there are benefits and downsides to having very not anonymous places. And I think these should exist side by side. It shouldn't be everyone's authenticated everywhere, neither should it be everyone's anonymous everywhere. I think it really depends on what the use case is.

Jim Rutt1:02:03

I like that. I think that's actually a very important principle, and you made it very nicely in your essay. So again, read your essays. Let's end up with the last and kind of the most interesting and probably provocative thought you had, which was in part four, where you actually get down to what are we talking about when you mention the phrase counting consciousness. You know, this is a key question that we're going to be confronting before long, and we better start thinking about it. So, what does count as a consciousness, and why exactly?

Connor Leahy1:02:32

So this is a question I've been thinking about for a long time, and I still think about it all the time, and my thoughts have evolved since that essay, but I think we don't have time to get into that right now. But there is this real question, is that humans, we humans have a natural idea of a person, an identity, a singular, you know, we have citizens and they're discrete entities. We don't have continuous citizens, we don't have 0.7 people are in favor of this or something, at least outside of statistics. And the thing is, I think that is not a fundamental property of the universe. It is an emergent phenomenon that happens to be the case. It happens to be a useful abstraction because humans tend to come in one human-sized chunks and they're not easy to reproduce. But if we say had a human brain scan and I could just control-C, control-V that brain scan lots and lots of times, do each of the copies get a vote? Are they citizens? What rights do these things have? What responsibilities do I have towards these entities? And this is just one of the many, many problems we get once we start breaking down these comfortable assumptions that do hold for biological humans but don't necessarily hold once we really start building virtual entities. And one of the things I talked about in that essay is, yeah, so at some point you have to count. If you want to have a vote, you have to count. It has to be an unambiguous way of counting how many people are voting, how will our votes be tallied. That's just how voting works. And then on the more provocative side, I argue about, well, maybe not for all cases, counting humans is the right thing to be counting. Maybe there are situations where you want virtual entities to be voting or to be part of your community, or maybe these things get really complicated very quickly. And I don't propose solutions or obvious things like that. It's more like food for thought. This will happen sooner or later. Sooner or later, we will have entities sharing the planet with us that I think have a very real claim to moral patienthood, a very real claim to say, hey, I'm intelligent, I have goals and desires, I think I should get some of the things these humans get. At some point, such entities will exist. I think that is very likely. Whether those could be human uploads or AGI systems in the future, and that's going to break a lot of the assumptions we use to run our society. And I really do think we need to take these things very seriously, and we'll have to rethink a lot of the fundamental principles of how we run society.

Jim Rutt1:05:02

Well, that's great, and it's absolutely true. I mean, if we say someday there'll be AGIs, we have to come to some conclusion about where do they rank morally with humans, and the answer is not nearly as obvious as you might think. I'd recommend you read Connor's essays. Well, thank you, Connor, for another wonderfully interesting conversation. I'm really glad to have you back on the Jim Rutt Show.

Connor Leahy1:05:24

Yeah, it was a blast.

Narrator1:05:29

Production services and audio editing by Jared Janes Consulting. Music by Tom Muller at modernspacemusic.com.