Arvind Narayanan
Appearances
The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch
20VC: AI Scaling Myths: More Compute is not the Answer | The Core Bottlenecks in AI Today: Data, Algorithms and Compute | The Future of Models: Open vs Closed, Small vs Large with Arvind Narayanan, Professor of Computer Science @ Princeton
we're not going to have too many more cycles, possibly zero more cycles of a model that's almost an order of magnitude bigger in terms of the number of parameters than what came before and thereby more powerful. And I think a reason for that is data becoming a bottleneck. These models are already trained on essentially all of the data that companies can get their hands on.
On the other hand, if you want to scan all of someone's emails, for instance, right? If a model gets cheaper, you know, you're just going to have it running always on in the background. And then from emails, you're going to get to all their documents, right? And some of those attachments might be many megabytes long.
And so there, even with Moore's law, I think cost is going to be significant in the medium term. And then you get to applications like writing code, where what we're seeing is that it's actually very beneficial to let the model do the same task tens of times, thousands of times, sometimes literally millions of times and pick the best answer.
So in those cases, it doesn't matter how much cost goes down. You're going to just proportionally increase the number of retries so that you can get a better quality of output.
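The arithmetic behind that point can be sketched in a few lines. This is a hypothetical illustration, not anything from the episode: if you respond to a 10x drop in per-call cost by running 10x as many best-of-k attempts, total spend stays flat and only output quality improves.

```python
# Cost of best-of-k sampling: total spend = per-call cost x attempts.
# Hypothetical numbers, kept in micro-dollars so the arithmetic is exact.
def total_cost(cost_per_call_usd_micro: int, attempts: int) -> int:
    return cost_per_call_usd_micro * attempts

# Per-call cost falls 10x, but we run 10x as many attempts:
before = total_cost(10_000, 100)     # $1.00 total
after = total_cost(1_000, 1_000)     # still $1.00 total
assert before == after == 1_000_000  # cost unchanged; quality is what improves
```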
So there is training compute, which is when the developer is building the model. And then there is inference compute, when the model is being deployed and the user is using it to do something. And it might seem like really the training cost is the one we should worry about, since it's trained on all of the text on the internet or whatever.
But it turns out that over the lifetime of a model, when you have billions of people using it, the inference cost actually adds up. And for many of the popular models, that's the cost that dominates. So let's talk about each of those two costs.
With respect to training costs, if you want to build a smaller model at the same level of capability or without compromising capability too much, you have to actually train it for longer. So that increases training costs. But that's maybe okay because you have a smaller model. You can push it to the consumer device or even if it's running on the cloud, your server costs are lower.
So your training cost increases and your inference cost decreases. But because it's the inference cost that dominates, if you have the same workload and a smaller model doing it, the total cost is probably going to come down.
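That trade-off is easy to make concrete. The numbers below are entirely made up for illustration: a smaller model that costs twice as much to train but a third as much per query still wins on lifetime cost once inference dominates.

```python
# Lifetime cost = one-off training cost + per-query inference cost x queries.
# All figures are hypothetical, in dollars.
def lifetime_cost(train: float, per_query: float, queries: int) -> float:
    return train + per_query * queries

queries = 100_000_000_000  # queries over the model's deployed lifetime

# Large model: cheaper to train, pricier per query.
large = lifetime_cost(train=50e6, per_query=0.003, queries=queries)
# Small model: trained for longer (2x the cost), but 3x cheaper per query.
small = lifetime_cost(train=100e6, per_query=0.001, queries=queries)

assert small < large  # higher training cost, lower total cost
```

The crossover point depends on query volume: below roughly 25 billion queries in this toy setup, the large model's cheaper training still wins.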
Sure. I think we are still in a period where, you know, these models have not yet quite become commoditized. There's obviously a lot of progress and there's a lot of demand on hardware as well. Hardware cycles are also improving rapidly. But, you know, there's the saying that every exponential is a sigmoid in disguise. So a sigmoid curve is one that looks like an exponential at the beginning.
So imagine the shape of the letter S. But then after a while it has to taper off, like every exponential has to taper off. So I think that's going to happen both with models and with these hardware cycles. We are, I think, going to get to a world where models do get commoditized.
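The "every exponential is a sigmoid in disguise" point can be checked numerically with a logistic curve (one common sigmoid); the parameters below are arbitrary, chosen only to show the two regimes.

```python
import math

# A logistic (sigmoid) curve L / (1 + exp(-k(t - t0))) is nearly
# indistinguishable from pure exponential growth while t << t0,
# then saturates toward its ceiling L.
def logistic(t, L=1000.0, k=1.0, t0=10.0):
    return L / (1.0 + math.exp(-k * (t - t0)))

def exponential(t, k=1.0):
    # Scaled to match the logistic at t = 0, i.e. in the early regime.
    return logistic(0) * math.exp(k * t)

# Early on, the two curves agree to well within 5%...
assert abs(logistic(3) - exponential(3)) / logistic(3) < 0.05
# ...but later the sigmoid has flattened while the exponential keeps going.
assert exponential(20) > 100 * logistic(20)
```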
A big part of it is this issue of vibes, right? You evaluate LLMs on these benchmarks, and a model seems to perform really well on them, but then the vibes are off. In other words, you start using it and somehow it doesn't feel adequate. It makes a lot of mistakes in ways that are not captured in the benchmark.
And the reason for that is simply that when there is so much pressure to do well on these benchmarks, developers are intentionally or unintentionally optimizing these models in ways that look good on the benchmarks, but don't look good in real world evaluation.
So when GPT-4 came out and OpenAI claimed that it passed the bar exam and the medical licensing exam, people were very excited, and scared, about what this means for doctors and lawyers. And the answer turned out to be approximately nothing. Because it's not like a lawyer's job is to answer bar exam questions all day.
These benchmarks that models are being tested on don't really capture what we would use them for in the real world. So that's one reason why LLM evaluation is a minefield. And there's also just a very simple factor of contamination. Maybe the model has already trained on the answers that it's being evaluated on in the benchmark. And so if you ask it new questions, it's going to struggle.
We shouldn't put too much stock into benchmarks. We should look at people who are actually trying to use these in professional contexts, whether it's lawyers or really anybody else, and go based on their experience of using these AI assistants.
So let's talk for a second about what AGI is. Different people mean different things by it and so often talk past each other. The definition that we consider most relevant is AI that is capable of automating most economically valuable tasks. By this definition, you know, of automating most economically valuable tasks, if we did have AGI, that would truly be a profound thing in our society.
So now for the CEO predictions, I think one thing that's helpful to keep in mind is that there have been these predictions of imminent AGI since the earliest days of AI, for more than half a century, going back to Alan Turing. When the first computers were built or about to be built, people thought the two main things we need for AI are hardware and software. We've done the hard part, the hardware.
Now there's just one thing left, the easy part, the software. But of course, now we know how hard that is. So I think historically what we've seen, it's kind of like climbing a mountain. Wherever you are, it looks like there's just kind of one step to go. But when you climb up a little bit further, the complexity reveals itself. And so we've seen that over and over and over again.
Now it's like, oh, you know, we just need to make these bigger and bigger models. So you have some silly projections based on that. But soon the limitations of that started becoming apparent. And now the next layer of complexity reveals itself. So that's my view. I wouldn't put too much stock into these overconfident predictions from CEOs.
I certainly think the balance is possible. To some extent, every big company does this.
That's fair. And I think it would take disciplined management to pull it off in a way that one part of the company doesn't distract the other too much. And we've seen this happen with OpenAI: the folks focused on superintelligence didn't feel very welcome at the company.
And there has been an exodus of very prominent people, and Anthropic has picked up a lot of them. So it seems like we're seeing a split emerging where OpenAI is more focused on products and Anthropic is more focused on superintelligence. While I can see the practical reasons why that is happening, I don't think it's impossible to have disciplined management that focuses on both objectives.
In the past, they didn't have this balance. They were so enamored by this prospect of creating AGI that they didn't think there was a need to build products at all. And the craziest example for me is when OpenAI put out ChatGPT, there was no mobile app for six months. And the Android app took even longer than that.
You know, there was this assumption that ChatGPT was just going to be a demo to show off the capabilities of the models. OpenAI was in the business of building the models, and third-party developers would take the API and put it into products. And the belief was that AGI was coming so quickly that even the notion of productization seemed obsolete.
This was, you know, I'm not trying to put words in anyone's mouth, but this was kind of a coherent, but in my view, incorrect philosophy that I think a lot of AI developers had. And I think that has changed quite a bit now. And I think that's a good thing. So if they had to pick one, I think they should pick building products.
But it certainly doesn't make sense for a company to be just an AGI company and not try to build products, not try to build something that people want. And just assuming that AI is going to be so general, that it's just going to, you know, do everything that people want, and that the company doesn't actually need to make products.
So I don't know is the short answer. But at the same time, you know, we've been in this kind of historically interesting period where a lot of progress has come from building bigger and bigger models that need not continue in the future. It might. Or what might happen is that the models themselves get commoditized and a lot of the interesting development happens in a layer above the models.
We're starting to see a lot of that happen now with AI agents. And if that's the case, great ideas could come from anywhere, right? It could come from a two-person startup. It could come from an academic lab. And my hope is that we will transition to that kind of mode of progress in AI development relatively soon.
I think that's a very serious possibility. And I think this is actually one area where regulators should be paying attention. You know, what does this mean for market concentration, antitrust, and so forth. And I've been gratified that these are topics that, at least in my experience, US regulators are considering.
And I believe in the UK, the CMA, the Competition and Markets Authority as well, and certainly in the EU. So yeah, in many jurisdictions, now that I think about it, this is something that regulators have been worried about.
So in a sense, AI regulation is a misnomer. Let me give you an example from just this morning. The FTC, the Federal Trade Commission in the US, which is an antitrust and consumer protection authority, has been worried about people writing fake reviews for their products. And this has, of course, been a problem for many years. It's become a lot easier to do that with AI.
So now someone who thinks about this in terms of AI regulation might say, oh, you know, regulators have to ensure that AI companies don't allow their products to be used for generating fake reviews. And I think this is a losing proposition. Like how would an AI model know whether something is a fake review or a real review, right? It just depends on who's writing the review.
But that's not the approach the FTC took. They correctly recognized that it's a problem whether AI is generating the fake review or people are. So what they actually banned is fake reviews. And so what is often thought of as AI regulation is better understood as regulating certain harmful activities, whether or not AI is used as a tool for those harmful activities.
80% of what gets called AI regulation is better seen this way.
I broadly agree with that, and I'll add a couple of points. One is that there are many kinds of harms which we already know about and which are quite serious.
So the use of AI to make non-consensual deepfakes, for instance, deepfake nudes. This has affected thousands, perhaps hundreds of thousands, of people, primarily women, around the world, and governments are finally taking action now. So that's a good thing.
I agree. So we call this the liar's dividend. People have been worried, for instance, about bots creating misinformation with AI and influence in elections and that sort of thing. We're very, very skeptical that that's going to be a real danger.
But you could have created those things without AI.
Yeah, I think that's fair. But I think the reason that might fool a lot of people is because it came from a legitimate media company. So I think the ability to do this, you know, emphasizes some of the things that have always been important, but have now become more important, like source credibility.
That actually is our prediction. We predict people are going to be forced to rely much more on getting their news from trusted sources.
So misinformation is a problem. In a way, I think misinformation is more of a symptom than a cause. Misinformation slots into and affirms people's existing beliefs as opposed to changing their beliefs. And I think the impact on AI here, again, has been tremendously exaggerated. Sure, you can create a Trump deepfake like you were talking about.
But when you look at the misinformation that's actually out there, it's things that are as crude as video game footage. Because again, it's telling people what they want to believe in a situation where they're not very skeptical.
For sure, yeah. But I want to push on: is this really an AI problem? These are deep problems in our society. Take creating an image that makes it look like there were a lot more people at an event than there were. Yeah, it's become easier to do that with AI today, but you could have paid someone $100 to do it with Photoshop even before AI. It's a problem we've had.
It's a problem we have been dealing with, often not very successfully. My worry is that if we treat this as a technology problem and try to intervene on the technology, we're going to miss what the real issues are and the hard things that we need to be doing to tackle those issues, which are, you know, which relate to issues of trust in society.
And to the extent it's a technology problem, it's more of a social media problem, really, than an AI problem, because the hard part of misinformation is not generating it, it's distributing it to people and persuading them. And social media is often the medium for that. And so I think there should be more responsibility placed on social media companies.
And my worry is that treating this as an AI problem is distracting from all of those more important interventions.
Yeah, I think the primary control is being exercised today by social media companies.
So while data is becoming a bottleneck, I think more compute still helps, but maybe not as much as it used to.
So when we were talking about deepfakes, I'm much less worried about misinformation deepfakes and more worried about the deepfake nudes I was talking about, right? Those are things that can destroy a person's life. It's been shocking to me how little attention this got from the press and from policymakers until it happened to Taylor Swift a few months ago. And then it got a lot of attention.
So there were deepfake nudes of Taylor Swift posted on Twitter/X. And after that, policymakers started paying attention. But it has been happening for many years now, even before the latest wave of generative AI tools. So that's the type of misuse that is very clear.
And then there are other kinds of misuses that are not necessarily dangerous in the same way, but impose a lot of costs on society.
So when students are using AI to do their homework, for instance, high school and college teachers everywhere have to revamp how they teach in order to account for the fact that students are doing this, and there is really no way to catch AI-generated text or homework answers. These are costs imposed on society. I'm not saying that the availability of AI makes education worse.
I don't think that's necessarily the case. But it forces a lot of costs upon the education system. And ideally, AI companies should be bearing some of that cost.
Sure. So I don't think you're wrong. I think the reason there is a lot of talk about this goes back to something we've observed over and over, which is that when there are problems with an institution like the medical system, right? The wait times are too long, or it's too costly, or in a lot of countries people don't even have access.
You know, in developing countries, there might be entire villages with no physician. Then this kind of technological band-aid becomes very appealing. So I think that's what's going on here. I think the responsible way to use AI in medicine is for it to be integrated into the medical system. And actually, the medical system has been a very enthusiastic adopter of technology, including AI.
So you can consider CAT scans, for instance, to be a form of AI to be able to reconstruct what's going on inside a person based on certain imaging. And now with generative AI as well, there's a lot of interest from the medical system in figuring out, can this be useful for diagnosis or for more mundane things like summarizing medical notes and so forth. So I think that work is really important.
I think that should continue. It still leaves us with the harder question of, you know, here in America, if it takes me three weeks to get a GP appointment, it's very tempting to ask ChatGPT a question about my symptoms. So what do we do about that? Can that actually be helpful with appropriate guardrails, or should it be discouraged?
I don't know the answer to that.
I think there's different populations of students. There's a small subset of learners who are very self-motivated and will learn very well, even if there's no physical tutor. There are those kinds of learners at all different levels. And then there's the vast majority of learners for whom the social aspect of learning is really the most critical thing.
And if you take that away, they're just not going to be able to learn very well. And I think this is often forgotten, especially because in the AI developer community, there are a lot of these self-taught learners. I'm among them, right? I just paid zero attention throughout school and college and everything that I know literally is stuff that I taught myself. So I grew up in India.
The education system wasn't very good there. Our geography teacher thought that India was in the southern hemisphere. True story. So again, I literally mean it when I say everything that I know I taught myself. And so you have a lot of AI developers who are thinking of themselves as the typical learner, and they're not.
And I think for someone like me, AI is, on a daily basis, an incredible tool for learning. I use generative AI tools for learning. It's a new way of learning compared to a book or really anything else. You know, I can't summarize my understanding of a topic to a book and ask it whether I'm right. These are things I can do with AI.
I'm super excited for this conversation.
But I'm very skeptical that these new kinds of learning are going to get to a point anytime soon where they're going to become the default way in which people learn.
I think for now, they are very much overblown. My favorite example of the thing you said of technology creating jobs is bank tellers. When ATMs became a thing, it would have been reasonable to assume that bank tellers were just going to go away. But in fact, the number of tellers increased. And the reason for that is that it became much cheaper for banks to open regional branches.
And once they did open those regional branches, they did need humans for some of the things that you couldn't do with an ATM. And, you know, the more abstract way of saying that is, as economists would put it, jobs are bundles of tasks, and AI automates tasks, not jobs.
So if there are, you know, 20 different tasks that comprise a job, the odds that AI is going to be able to automate all 20 of them are pretty low. And so there are some occupations certainly that have already been affected a lot by AI like translation or stock photography. But for most jobs out there, I don't think we're anywhere close to that.
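One way to make that intuition concrete is a toy independence calculation. The 20-task figure comes from the conversation; the per-task probability and the independence assumption are purely illustrative, not anything Narayanan claims.

```python
# Toy illustration of "jobs are bundles of tasks": if a job is n_tasks tasks
# and each is automatable with (unrealistically) independent probability
# p_per_task, the chance the WHOLE job is automatable is p_per_task ** n_tasks.
# All numbers here are illustrative assumptions.

def full_automation_probability(p_per_task: float, n_tasks: int = 20) -> float:
    """Probability that every one of n_tasks independent tasks is automated."""
    return p_per_task ** n_tasks

# Even if AI could automate each individual task 90% of the time,
# the whole 20-task job would be fully automatable only ~12% of the time.
print(round(full_automation_probability(0.9), 2))  # → 0.12
```

The independence assumption overstates how fast the probability decays, but the qualitative point survives: automating every task in a bundle is much harder than automating any one task.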
So I think it's a good question to ask, but I think there's a bit of a category error there. I mean, a nuclear weapon is an actual weapon. AI is not a weapon. AI is something that might enable adversaries to do certain things more effectively. For example, find cybersecurity vulnerabilities in critical infrastructure, right?
So that's one way in which AI could be used on the quote-unquote battlefield. That being the case, I think it would be a big mistake to view it analogously to a weapon and to argue that it should be closed up, for a couple of reasons. First of all, it's not going to work at all. We already have close to state-of-the-art AI models that can run on people's personal devices.
Sure. So, I'm a professor of computer science, and I would say I do three things. One is technical AI research, and another is understanding the societal effects of AI, and the third is advising policymakers.
And I think that trend is only going to accelerate. We talked earlier about Moore's law, and it still continues to apply to these models. And even if one country decides that models should be closed, the odds of getting every country to enact that kind of rule are just vanishingly small.
So if our approach to safety with AI is going to be premised on ensuring that, quote-unquote, bad guys don't get access to it, we've already lost, because it's only a matter of time before it becomes impossible to do that.
And instead, I think we should radically embrace the opposite, which is to figure out how we're going to use AI for safety in a world where AI is very widely available, because it is going to be widely available. And when we look at how we've done that in the past, it's actually a very reassuring story.
When we go back to the cybersecurity example, for 10 or 20 years, the software development community has been using automated tools, some of which you could call AI, to improve cybersecurity because software developers can use them to find bugs and fix bugs in software before they put them out there, before hackers even have a chance to take a crack at them.
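As a minimal sketch of the kind of automated bug-finding he describes, here is a toy static check, not any specific tool from the conversation. It scans Python source for a classic defect (mutable default arguments) before the code ever ships, which is the defense-before-offense pattern Narayanan points to.

```python
# Toy static analyzer: flag functions with mutable default arguments,
# a well-known Python bug class. Illustrative only; real tools
# (linters, fuzzers, SAST scanners) are far more sophisticated.
import ast

SOURCE = """
def append(item, bucket=[]):
    bucket.append(item)
    return bucket
"""

def find_mutable_defaults(source: str):
    """Return (function name, line number) for each mutable default found."""
    issues = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.FunctionDef):
            for default in node.args.defaults:
                if isinstance(default, (ast.List, ast.Dict, ast.Set)):
                    issues.append((node.name, node.lineno))
    return issues

print(find_mutable_defaults(SOURCE))  # → [('append', 2)]
```

Running checks like this in CI means the developer, not an attacker, is the first one to exercise the buggy path.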
So my hope is that the same thing is going to happen with AI. We're going to be able to acknowledge the fact that it's going to be widely available and to shape its use for defense more than offense.
Like a lot of people, I was fooled by how quickly GPT-4 came out after GPT-3.5. It was just three months or so, but it had been in training for 18 months. That was only revealed later. So it gave a lot of people, including me, an inflated idea of how quickly AI was progressing.
And what we've seen in the nearly year and a half since GPT-4 came out is that we haven't really had models that have surpassed it in a meaningful way. And this is not based on benchmarks. Again, I think benchmarks are not that useful. It's more based on vibes. When you get people using these things, what do they say? I don't think models have really qualitatively improved on GPT-4.
And I don't think things are moving as quickly as I thought they were 12 months ago.
Making models bigger and bigger doesn't seem to be working anymore. I think new developments have to come from different scientific ideas. Maybe it's agents, maybe it's something else.
I think our intuitions are too powerfully shaped by sci-fi portrayals of AI. And I think that's really a big problem. This idea that AI can become self-aware. When we look at the way that AI is architected today, that kind of fear has no basis in reality. Maybe one day in the future, people are going to build AI systems where that becomes at least somewhat possible. And we should have...
visibility, transparency, monitoring, and regulation around these systems to make sure that developers don't. But that would be a choice. That's a choice that society can make, that governments and companies can make. It's not that, despite our best efforts, AI is going to become conscious and have agency and do things that are harmful to humanity.
That whole line of fear, I think, is completely unfounded.
Because the gap between benchmarks and the real world is big and it's only growing bigger. As AI becomes more useful, it's harder to figure out how useful it is based on these artificial environments.
I would resign. I don't think I would be a good CEO. But if there were one thing I could change about OpenAI, I think the need for the public to know what is going on with AI development overrides the commercial interests of any company. So I think there needs to be a lot more transparency.
So my hope is that the kind of thing we saw in the movie Her, not the sci-fi aspects of it, but the more kind of mundane aspects of it where you give your device a command and it interprets it in a pretty nuanced way and does what you want it to do, right? Book flight tickets, for instance, or really build an app based on what you want it to look like.
So these are things that are potentially automatable and don't have massively dubious societal consequences. Those are the things that I hope can happen.
So I spent years of my time on this. I really believed that decentralization could have tremendous societal impacts. For me the question was always how this was going to make society better; it was not the money angle. But by around 2018, I had started to get really disillusioned, and that was because of a couple of main things.
I do find it interesting that NVIDIA itself has been trying to migrate really, really hard out of hardware into becoming a services company.
A lot of technologists kind of have a disdain for policy. They see policymakers as, well, morons, to put it bluntly. But I don't think that's the case. I think there are a lot of legitimate reasons why policy is very slow and doesn't often go in the way that a tech expert might want it to. And that's the 90% frustration. And the reason I say it's only 90% is that the other 10% is really worth it.
We really need policy. And despite how frustrating it is, we need a lot of tech experts in policy.
I have to say, I really like Yann LeCun's perspectives on various things, including his view that LLMs are a quote-unquote off-ramp to superintelligence, meaning, in other words, that we need a lot more scientific breakthroughs, as well as his tamping down of the fears of super-advanced AI.
It's weird for me to be saying this, but I have to say: think of the children. I'm never asked this. And what I mean by that is that the role of AI in kids' lives, kids who are born today, for instance, is going to be so profound. It's something that technologists should be thinking about. Every parent should be thinking about.
Policymakers should be thinking about it, because it can be profoundly good or profoundly bad or anything in between. And both as a technologist and as a parent, I think about that a lot.
This has been really, really fun. I apologize for rambling occasionally, but I'm really looking forward to hearing it when it's out there.
One is, in a lot of cases where I had thought crypto or blockchain was going to be the solution, I realized that that was not the case. While there is potential for crypto to help the world's unbanked, the tech is not the real bottleneck there. And the other part of it was just a philosophical aspect of this community.
I do believe that many of our institutions are in need of reform, or maybe decentralization, whatever it is. And that includes academia, by the way; so many reforms are so badly needed. In an ideal world, we would have this hard but important conversation about how to fix our institutions. But instead, these students have been sold on blockchain and they want to replace
these institutions with a script. And that just didn't seem like the right approach to me. So both from a technical perspective, and from a philosophical perspective, I really soured on it. While there are harms around AI, I think it has been a net positive for society. I can't say the same thing about Bitcoin. Are we in an AI hype cycle right now? I think that's possible.
Generative AI companies specifically made some serious mistakes in the last year or two about how they went about things. What mistakes did they make, Arvind? So when ChatGPT was released, people found, you know, a thousand new applications for it that OpenAI might not have anticipated. And that was great.
But I think developers, AI developers, took the wrong lesson from this. They thought that AI is so powerful and so special that you can just put these models out there and people will figure out what to do with them. They didn't think about actually building products, making things that people want, finding product market fit, and all those things that are so basic in tech.
But somehow, AI companies deluded themselves into thinking that the normal rules don't apply here.
So if we look at what's happened historically, the way in which compute has improved model performance is with companies building bigger models. In my view, at least the biggest thing that changed between GPT-3.5 and GPT-4 was the size of the model. And it was also trained with more data, presumably, although they haven't made the details of that public and more compute and so forth.
So I think that's running out. We're not going to have too many more cycles, possibly zero more cycles of a model that's almost an order of magnitude bigger in terms of the number of parameters than what came before and thereby more powerful. And I think a reason for that is data becoming a bottleneck.
These models are already trained on essentially all of the data that companies can get their hands on. So while data is becoming a bottleneck, I think more compute still helps, but maybe not as much as it used to. And the reason for that is that perhaps ironically, more compute allows one to build smaller models with the same capability level.
And that's actually the trend we've been seeing over the last year or so. You know, the models today have gotten somewhat smaller and cheaper than when GPT-4 initially came out, but with the same capability level. So I think that's probably going to continue. Are we going to see a GPT-5 that's as big a leap over GPT-4 as GPT-4 was over GPT-3? I'm frankly skeptical.
Right. So there are a lot of sources that haven't been mined yet. But when we start to look at the volume of that data, how many tokens is that? I think the picture is a little bit different. 150 billion hours of video sounds really impressive.
But when you put that video through a speech recognizer and actually extract the text tokens out of it and deduplicate them and so forth, it's actually not that much. It's an order of magnitude smaller than what some of the largest models today have already been trained with.
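The conversion he describes can be sketched as simple arithmetic. Every constant below (words per minute of speech, tokens per word, and especially the fraction of hours that survive deduplication and actually contain usable speech) is an illustrative assumption, not a figure from the conversation, and the result swings by orders of magnitude depending on them.

```python
# Back-of-envelope: text tokens recoverable from a video corpus after
# speech recognition and deduplication. All constants are illustrative
# assumptions, not figures from the transcript.

def tokens_from_video_hours(hours: float,
                            words_per_minute: float = 150.0,
                            tokens_per_word: float = 1.3,
                            usable_fraction: float = 0.01) -> float:
    """usable_fraction: share of hours with unique, transcribable speech."""
    words = hours * 60 * words_per_minute * usable_fraction
    return words * tokens_per_word

# The headline number is dominated by usable_fraction, which is the
# genuinely unknown quantity in the argument.
print(f"{tokens_from_video_hours(150e9):.1e} tokens")
```

The structure of the calculation, not any particular output, is the point: naive hours-to-tokens math looks enormous, and the deduplication/usability factor is what shrinks it toward "not that much."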
Now training on video itself, instead of text extracted from the video, that could lead to some new capabilities, but not in the same fundamental way that we've had before, where you have the emergence of new capabilities, right? Models being able to do things that people just weren't anticipating.
So, the kind of shock that the AI community had back in the day when, I think it was GPT-2,
was trained primarily on English text, and they had actually tried to filter out text in other languages to keep it clean, but a tiny amount of text from other languages had gotten into it, and it turned out that that was enough for the model to pick up a reasonable level of competence for conversing in various other languages.
So these are the kinds of emergent capabilities that really spooked people and that have led to both a lot of hype and a lot of fears about what bigger and bigger models are going to be able to do. But I think that has pretty much run out, because we're training on all of the capabilities that humans have expressed, like translating between languages, and have already put out there in the form of text.
So if you make the data set a little bit more diverse with YouTube video, I don't think that's fundamentally going to change. Multimodal capabilities, yes, there's a lot of room there. But new, emergent text capabilities, I'm not sure.

What about synthetic data?
Yeah, let's talk about synthetic data. So there are two ways to look at this, right? One is the way in which synthetic data is being used today, which is not to increase the volume of training data, but to overcome limitations in the quality of the training data that we do have.
So for instance, if in a particular language there's too little data, you can try to augment that. Or you can have a model solve a bunch of mathematical equations and throw that into the training data. For the next training run, that's going to be part of the pre-training, and so the model will get better at doing that.
And the other way to look at synthetic data is, okay, you take 1 trillion tokens, you train a model on it, and then you output 10 trillion tokens, so you get to the next bigger model, and then you use that to output 100 trillion tokens. I'll bet that that's just not going to happen. That's just a snake eating its own tail, and...
What we've learned in the last two years is that the quality of data matters a lot more than the quantity of data. So if you're using synthetic data to try to augment the quantity, I think it's just coming at the expense of quality. You're not learning new things from the data. You're only learning things that are already there.
Yeah, I think that's really spot on. I think one way in which people's intuitions have been misguided by the rapid improvements in LLMs is that all of this has been in the paradigm of learning from data on the web that's already there. And once that runs out, you have to switch to new kinds of learning, the analog of riding a bike: that's just tacit knowledge.
It's not something that's been written down. So a lot of what happens in organizations is, I think, the cognitive equivalent of what happens in the physical skill of riding a bike.
And I think for models to learn a lot of these diverse kinds of tasks that they're not going to pick up from the web, you have to have the cycle of actually using the AI system in your organization and for it to learn from that back and forth experience instead of just passively ingesting.
It's got to be more than passive observation. You have to actually deploy AI to be able to get to certain types of learning. And I think that's going to be very slow. And I think a good analogy is self-driving cars, of which we had prototypes two or three decades ago.
But for these things to actually be deployed, you have to roll them out at slightly larger and larger scales while you collect data and make sure you get to the next nine of reliability, from four nines to five nines. So it's a very slow rollout process, a very slow feedback loop.
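The "nines" framing can be made concrete: each additional nine cuts the tolerated failure rate tenfold, which means roughly ten times as much failure-free operation is needed just to demonstrate it. A minimal sketch:

```python
# Each "nine" of reliability allows 10x fewer failures, so evidencing the
# next nine takes roughly 10x more failure-free trials (or miles, or tasks).
def nines_summary(nines):
    failure_rate = 10 ** -nines
    trials_needed = int(1 / failure_rate)  # order-of-magnitude evidence bar
    return failure_rate, trials_needed

for n in range(3, 6):
    rate, trials = nines_summary(n)
    print(f"{n} nines: failure rate {rate:.0e}, ~{trials:,} trials to evidence")
```

This is why the feedback loop is slow: going from four nines to five nines isn't 25% harder, it's a tenfold increase in the observation needed.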
And I think that's going to happen with a lot of AI deployment and organizations as well.
Yeah, thank you for asking that. That's not obvious at all. My view is that in a lot of cases, the adoption of these models is not bottlenecked by capability. If these models were actually deployed today to do all the tasks that they're capable of, it would truly be a striking economic transformation. The bottlenecks are things other than capability. And one of the big ones is cost.
And cost, of course, is roughly proportional to the size of the model. And that's putting a lot of downward pressure on model size.
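One way to see that proportionality: for a dense transformer, a forward pass costs on the order of 2 FLOPs per parameter per token. That's a standard rule of thumb rather than anything stated in the episode, but it shows why per-token serving cost tracks model size roughly linearly:

```python
# Rule of thumb for dense transformers: one forward pass costs roughly
# 2 * N FLOPs per token for an N-parameter model, so per-token compute
# (and hence serving cost) scales about linearly with model size.
def forward_flops_per_token(n_params):
    return 2 * n_params

for n_billion in (8, 70, 400):
    flops = forward_flops_per_token(n_billion * 1e9)
    print(f"{n_billion}B params: ~{flops:.1e} FLOPs/token")
```

So, all else equal, a model ten times smaller is about ten times cheaper to serve, which is the downward pressure on size being described.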
And once you get a model small enough that you can run it on device, that of course opens up a lot of new possibilities. In terms of privacy, people are much more comfortable with on-device models, especially if it's something that's going to be listening to their phone conversations or looking at their desktop screenshots, which are exactly the kinds of AI assistants that companies are building and pushing.
And just from the perspective of cost, you don't have to dedicate servers to run that model. So I think those are a lot of the reasons why companies are furiously working on making models smaller without a big hit in capability.
You're right. Cost is going down dramatically. In certain applications, cost is going to become much less of a barrier, but not across the board.
So there's this interesting concept called Jevons Paradox. This was first observed in the context of coal in England in the 19th century: when coal got cheaper to use, demand for coal went up, and so the amount invested into coal mining actually increased. And I predict that we're going to see the same thing with models. When models get cheaper, they're put into a lot more things.
And so the total amount that companies are spending on inference is actually going to increase. In an application like a chatbot, let's say, you know, it's text in, text out, no big deal. I think costs are going to come down. Even if someone is chatting with a chatbot all day, it's probably not going to get too expensive.
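The Jevons-style argument can be sketched with a constant-elasticity demand curve. The elasticity value here is an assumed number purely for illustration; the point is that whenever demand is elastic enough (elasticity above 1), a price cut raises total spend rather than lowering it:

```python
# Jevons-style effect with constant-elasticity demand: when elasticity > 1,
# cutting the price raises total spend (price * quantity demanded).
def total_spend(price, elasticity, base_price=1.0, base_demand=1.0):
    demand = base_demand * (price / base_price) ** (-elasticity)
    return price * demand

# Inference gets 10x cheaper; assumed demand elasticity of 1.5:
before = total_spend(1.0, 1.5)
after = total_spend(0.1, 1.5)
print(f"spend before: {before:.2f}, after 10x price cut: {after:.2f}")
```

Under these assumed numbers, a tenfold price cut roughly triples total spend, which is the mechanism by which cheaper models can drive aggregate inference bills up, not down.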