Satya Nadella31:13
Thank you so much Karan. And so you know when you look at all of these tools that Karan showed, you know what does it do? It inspires you. So recently we had Thanksgiving in the United States. So we had some time off and so what does one do when you have time off? You build and so I built an app. And so this is by the way this is my regular laptop you are seeing. So this is my Azure subscription and this is in fact my app that's deployed right now I think in Canada Central. This is my repo. This is my GitHub repo. So this is my code. In fact the way my standard sort of way I work is I have a Windows 365 setup. So it travels with me everywhere. And then of course I use GitHub and then I use code spaces. So this is I think it'll show up with the code space here. So it's kind of like turtles all the way down, right? So I have my code space running on GitHub running on Windows 365. So that's kind of my virtual environment that I travel with. And of course like as Karan was showing every morning I get up and my favorite place to go is here right which is I go to my repo and I go to my copilot and start assigning it tasks like every day there's some multiple agents I kick off like this morning I said please upgrade this entire app to GPT 5.2 right so and it's actually done and when I'm done with my keynote and I'm on a flight back I'll kind of get there on my flight with Wi-Fi and be able to actually look at the PR and push it if you will. In fact, I'll test it once in my code spaces and then I'll commit. But that's in fact it shows you the power of how you can literally start really taking simple tasks to big updates and start powering through them. So what did I build? So what I built is essentially a new deep research tool. You know, one of my life's ambition is to figure out how to get a job in the copilot team. And so I'm preparing for it. I'm sort of desperately trying how to get competent enough to get hired. And so therefore, I said, okay, let me build a new more sophisticated deep research application. And so the thing that I did is you know, I said, there are all these models available. So why don't I create decision frameworks that work on top of the model and one of those decision frameworks is Andrej Karpathy who's a famous AI researcher came up with this LLM council he called it right so the idea is you'll have like a selection committee you'll have multiple members of the committee right so in this case I can have GPT 5.1 or 5.2 today Claude Gemini Llama Grok whatever right pick your favorite selection committee then you elect even a chairman like you can say who is the chairman is it Claude or Gemini or GPT or what have you pick your chair and then go ahead and issue a query and ask it to do stuff and then another one I did was DXO this is a pretty cool decision framework this comes out of healthcare in fact we first implemented it in fact we've implemented it in healthcare where you have just different roles. So for example, one role is a lead researcher who does let think of it as exhaustive breadth first research. There's a critical reviewer and by the way you can assign different models to these different roles. So I have a lead researcher that happens to be Claude, a critical reviewer that happens to be a GPT. And what does the critical reviewer do? They that particular role finds method problems, right? say recency bias or period bias whatever are the issues which are cognitive biases then you can have domain expert data analyst so different roles and in fact when we applied DXO in healthcare it turned out that having multiple roles work together in a multi-agent system outperformed any single model right it's intuitive right why not have all the smart people working with you on a particular decision versus one and so that was DXO then I implemented even another one called ensemble. And the idea with ensemble is you sort of fire off parallel queries anonymize the responses so there is no bias even in selection. And then synthesize it right in fact last night I said well let's even implement a regular old debate. So this is a new feature I added and thank god it's showed up here. And basically think of it as chain of debate. In fact, I think of all of these as instead of thinking of them as chain of thought, thinking of all of this as chain of debate. And I said, let's actually have a full-on debate. So, the bottom line here is you can have different formats of, you know, even debates. You can have pros and cons, SWOT, you can have risk versus impact, counterfactual versus devil's advocate, right? So, you can even pick your form framework. You can have critiques. So, each sort of each person makes an argument, everybody else critiques.
I can have multiple rounds of it. So I was just kind of I had a lot of time I guess in my hotel room yesterday. But I went on and added features. I said okay let's have, oh by the way, I even added because I was spending so many tokens I said man I might as well actually add cost control to my thing. And so I said do I need fast lower cost or deep higher cost. I'm picking balanced here. I can pick the different research analyst, system architect, risk officer, three types of roles, then have multiple rounds. So you get the point. So of course you build all this. What does one good old South Asian do? Use it to select the best ever Indian test cricket team. I think we need one. And so if I go to my history, you'll see I've been at it. I've been working it. In fact, I have a fantastic MLB lineup as well. But since we are here in Mumbai, I'll show you some of the stuff. So I'll show you the council one. I'll show you what it came back with. In fact, let me increase the font here. So this is the chairman synthesis. It shows me, obviously Sunil opens with Sehwag, Rahul, Sachin, Virat. A man it picked. So this is the synthesis and it shows VVS at six and seven and then it goes on. Here's the interesting thing. The chairman's report says that there was unanimous consensus, Gavaskar, Sehwag, Dravid, Tendulkar, Kohli, Kapil, Ashwin, Bumrah. But the big debate of course was, do you need an additional batsman or not? And thank God they selected, at least being a Hyderabadi, I'm really proud that they made VVS Laxman in there. And here's the interesting thing. It waited 51 and Claude's inclusion. I mean, think about this. These two models bid for my man Laxman, because of his crisis management and made the point about why, and then the other debate was who is the captain. In fact, I think, oh by the way, the Kumble versus Zaheer was also a very good one. It sort of chose Anil versus Zaheer. Zaheer is going to be 12th man, you know, depends. I guess that's where I think there's more work to be done. I would pick Zaheer some days and sometimes depending on where you're playing. Captaincy debate, it picked Kohli. So the key thing is I see the annotations of every chain of debate and it gives me insight. In fact, if I go back to show you one other case, let me go back to my history again. Let me go to the DXO view. I think this is interesting because the DXO view is slightly different. It says here is what the lead researcher did. Then it says here's what the critical reviewer did. But here's the cool thing. Look at the way it found the biases, the error bias. So the critical reviewer's job was to say, okay, when I'm selecting something, are you mixing up your stats? I mean, I always say, you know, sometimes you could say wow the fitness levels of the modern teams are so great and wonderful, but man, watch Gavaskar go in to bat without a helmet or even a, I don't know, Hazare go on uncovered wicket in play in England. How do you really equate for that? So that's the error bias. And so to me, to be able to have all these models debate that issue is what this is all about. So anyway, you get the point. You can use these decision frameworks as the new form of metacognition. So if you have all this abundance of models, the ability to literally do what I did, which is just build your own multi-agent system, that's the new commodity. I mean, the reality is, all of us are going to be doing this work just like how we do spreadsheets or documents. And if we have that power, then the question is what are we going to do with it? And one of the things I think is exercise better judgment, better decision, and in high stake situations in healthcare, in financial services, in insurance, in supply chain. And that's the beauty of it, to be able to think about these as metacognition frameworks for us to apply better judgment in a world where we have all of this. One of the other things I should mention is I built all this. I even made it into a Copilot. So this is all in my Azure private tenant. And I then did a basically an open API to it. And so then I made it into a Copilot agent and then I said great, let me go deploy and guess what, I was caught. So in other words, the Microsoft DLP, data loss prevention, came in and said you can't deploy it because this has to be deployed inside the Microsoft tenant. You just can't point it to an external agent because that's a massive exfiltration problem. And that is thanks to this other part which I think is the runtime of agents we have built called Agent 365. And Agent 365 takes our identity management, our Defender, things like Purview and data loss prevention, all of that, and brings it to the agent world just like what we have done for end-user computing. We're now doing it for agents because if people are going to build powerful agents like the one I built, you want full compliance, you want full visibility, you want full governance of it. And that's what Agent 365 does. Because you can't just run around and say I have thousands and hundreds of thousands of agents without that security and governance and visibility. And that's really what Agent 365 delivers. The beauty of all this is there is significant momentum, significant ambition level to translate this ultimately into people doing unbelievable things. So this morning I had a chance to meet with for example the folks from Adani, who are really doing this invoice to payment and reduce something that took 20 plus days to 4 hours. I had a chance to meet the team from Aditya Birla where they took their mobile app that's doing complex products and said okay, here's a conversational interface that just makes it simpler for them to be able to interact. I had a chance to meet with the folks at LTI Mindtree. They're doing a complete wall-to-wall transformation using things from Copilot to Copilot Studio to Foundry to do every business process and re-look at it from an AI first perspective. Mankind. It's a beautiful use case of really putting that power of AI and the expertise in the hands of the people who are the medical reps who are visiting doctors in rural areas to be able to make sure that the medical rep is confident in answering the questions and then improving care outcomes. Yes Bank has done a fantastic job of really the invoice turnaround times. So they're also doing a wall-to-wall transformation with 40 plus use cases, but really seeing the ROI in each one of these examples. Now, one of the chances I had again was to meet with the team that built this new agentic system called the Maha Crime OS. So this was sponsored by the Chief Minister of Maharashtra who's here with us. And it was really great to meet both the SP as well as the people in the team, and in fact the investigative team, and the pride they had in being able to use this system to deliver justice to a citizen in Nagpur who was unfortunately a victim of a crime. But the ability for them to use this agentic system to speed up the time to justice was just fantastic to see. So let's go ahead and play the video.