Andrej Karpathy

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, first of all, thank you for having me here.

7.26 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm excited to be here.

9.502 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the quote that you've just mentioned, it's the decade of agents, that's actually a reaction to an existing, pre-existing quote, I should say, where I think some of the labs, I'm not actually sure who said this, but they were alluding to this being the year of agents with respect to LLMs and how they were going to evolve.

11.405 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think...

26.442 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I was triggered by that because I feel like there's some over-predictions going on in the industry.

27.824 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And in my mind, this is really a lot more accurately described as the decade of agents.

32.13 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And we have some very early agents that are actually extremely impressive and that I use daily, you know, Cloud and Codex and so on.

36.998 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still feel like there's so much work to be done.

43.207 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think my reaction is like, we'll be working with these things for a decade.

45.891 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're going to get better and it's going to be wonderful.

49.857 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think I was just reacting to the timelines, I suppose, of the

53.082 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

of the implication.

56.427 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And what do you think will take a decade to accomplish?

58.329 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What are the bottlenecks?

60.532 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, actually make it work.

62.234 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in my mind, I mean, when you're talking about an agent, I guess, or what the labs have in mind and what maybe I have in mind as well, is it's, you should think of it almost like an employee or like an intern that you would hire to work with you.

64.257 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So for example, you work with some employees here.

73.489 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

When would you prefer to have an agent like Cloud or Codex do that work?

76.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like currently, of course they can't.

79.757 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What would it take for them to be able to do that?

81.339 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why don't you do it today?

83.181 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the reason you don't do it today is because they just don't work.

84.423 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, like, they don't have enough intelligence.

86.466 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're not multimodal enough.

88.869 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They can't do computer use and all this kind of stuff.

89.89 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they don't do a lot of the things that you've alluded to earlier.

92.093 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You know, they don't have continual learning.

94.957 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can't just tell them something and they'll remember it.

96.399 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they're just cognitively lacking and it's just not working.

98.542 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I just think that it will take about a decade to work through all of those issues.

101.526 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I guess this is where you get into like a bit of, I guess, my own intuition a little bit.

136.373 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And also just kind of doing a bit of an extrapolation with respect to my own experience in the field, right?

140.178 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I've been in AI for...

145.665 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

almost two decades.

147.788 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, it's going to be maybe 15 years or so.

148.63 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Not that long.

150.192 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You had Richard Sutton here who was around, of course, for much longer.

151.595 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do have about 15 years of experience of people making predictions of seeing how they actually turned out.

154.541 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And also, I was in the industry for a while and I was in research and I worked in the industry for a while.

159.53 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, I guess I kind of have just a general intuition that I have left from that.

163.537 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And...

168.066 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel like the problems are tractable.

169.388 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're surmountable.

172.213 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But they're still difficult.

173.395 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if I just average it out, it just kind of feels like a ticket, I guess, to me.

175.358 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I mean, that's a giant question because, of course, you're talking about 15 years of stuff that happened.

196.649 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, AI is actually like so wonderful because there have been a number of, I would say, seismic shifts that were like the entire field has sort of like suddenly looked a different way, right?

200.293 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I guess I've maybe lived through two or three of those.

208.283 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I still think there will continue to be some because they come with some kind of like almost surprising irregularity.

211.346 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, when my career began, of course, like when I started to work on deep learning, when I became interested in deep learning, this was just kind of like by chance of being right next to Jeff Hinton at University of Toronto.

216.352 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And Jeff Hinton, of course, is kind of like the godfather figure of AI.

225.567 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And he was training all these neural networks, and I thought it was incredible and interesting.

228.612 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But this was not like the main thing that everyone in AI was doing by far.

231.617 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This was a niche little subject on the side.

235.242 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's kind of maybe like the first like dramatic sort of seismic shift that came with the AlexNet and so on.

237.045 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would say like AlexNet sort of reoriented everyone and everyone started to train neural networks, but it was still like very like per task, per specific task.

243.098 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe I have an image classifier or I have a neural machine translator or something like that.

250.991 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And people became very slowly actually interested in basically kind of agents, I would say.

255.9 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And people started to think, okay, well, maybe we have a check mark next to the visual cortex or something like that.

260.227 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But what about the other parts of the brain?

264.812 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How can we get an actual like full agent or in full entity that can actually interact in the world?

266.114 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I would say the Atari sort of deep reinforcement learning shift in 2013 or so was part of that early effort of agents in my mind, because it was an attempt to try to get agents that not just perceive the world, but also take actions and interact and get rewards from environments.

270.679 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And at the time this was Atari games, right?

285.657 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like that was a misstep, actually.

288.02 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it was a misstep that actually even the early OpenAI that I was a part of, of course, kind of adopted.

291.183 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because at that time, the zeitgeist was reinforcement learning environments, games, game playing, beat games, get lots of different types of games.

296.409 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And OpenAI was doing a lot of that.

303.536 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that was maybe like another prominent part of, I would say, AI where maybe for two or three or four years, everyone was doing reinforcement learning on games.

305.538 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And basically, that was a little bit of a misstep.

315.048 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And what I was trying to do at OpenAI actually is like, I was always a little bit suspicious of games as being like this thing that would actually lead to AGI because in my mind, you want something like an accountant or like something that's actually interacting with the real world.

318.226 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I just didn't see how games kind of like add up to it.

329.06 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so my project at OpenAI, for example, was within the scope of the Universe project on an agent that was using keyboard and mouse to operate web pages.

331.623 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I really wanted to have something that like interacts with, you know, the actual digital world that can do knowledge work.

341.616 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it just so turns out that this was extremely early, way too early.

346.762 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So early that we shouldn't have been working on that, you know, because if you're just stumbling your way around and keyboard mashing and mouse clicking and trying to get rewards in these environments, your reward is too sparse and you just won't learn and you're going to burn a forest computing and you're never actually going to get something off the ground.

350.548 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so what you're missing is this power of representation in the neural network.

367.894 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

372.281 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so, for example, today, people are training those computer-using agents, but they're doing it on top of a large language model.

372.361 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so you actually have to get the language model first.

377.185 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You have to get the representations first.

378.566 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you have to do that by all the pre-training and all the LLM stuff.

380.508 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of feel like maybe, loosely speaking, it was like people keep maybe trying to get the full thing too early a few times, where people really try to go after agents too early, I would say.

383.35 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that was Atari and Universe, and even my own experience.

394.3 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you actually have to do some things first before you sort of get to those agents.

397.783 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And maybe now the agents are a lot more competent, but maybe we're still missing sort of some parts of that stack.

401.386 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I would say maybe those are like the three major buckets of what people were doing.

407.655 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Training neural nets per tasks, trying to the first round of agents, and then maybe the LLMs and actually seeking the representation power of the neural networks before you tack on everything else on top.

411.902 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think...

462.463 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, so Sutton was on your podcast, and I saw the podcast, and I had a write-up about that podcast almost that gets into a little bit of how I see things.

464.465 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like I'm very careful to make analogies to animals because they came about by a very different optimization process.

471.735 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Animals are evolved, and they actually come with a huge amount of hardware that's built in.

479.665 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And when, for example, my example in the post was the zebra.

483.55 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A zebra gets born, and a few minutes later, it's running around and following its mother.

487.497 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's an extremely complicated thing to do.

490.843 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's not reinforcement learning.

492.706 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's something that's baked in.

495.351 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And evolution obviously has some way of encoding the weights of our neural nets in ATCGs.

496.393 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I have no idea how that works, but it apparently works.

501.001 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of feel like brains just came from a very different process.

503.986 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I'm very hesitant to take inspiration from it because we're not actually running that process.

509.275 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in my post, I kind of said, we're not actually building animals.

514.104 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're building ghosts or spirits or whatever people want to call it.

517.37 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because...

521.317 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're not doing training by evolution.

522.599 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're doing training by basically imitation of humans and the data that they've put on the internet.

525.985 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so you end up with these like sort of ethereal spirit entities because they're fully digital and they're kind of like mimicking humans.

531.313 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's a different kind of intelligence.

536.923 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like if you imagine a space of intelligences, we're starting off at a different point almost.

538.305 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're not really building animals.

542.452 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think it's also possible to make them a bit more animal-like over time.

544.215 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think we should be doing that.

546.899 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I kind of feel like, sorry, just I guess one more point is, I do feel like Sutton basically has a very, like his framework is like we want to build animals.

548.441 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I actually think that would be wonderful.

556.354 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If we can get that to work, that would be amazing.

557.636 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If there was a single like algorithm that you can just, you know, run on the internet and it learns everything, that would be incredible.

559.399 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I almost suspect that I'm not actually sure that it exists.

566.65 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's certainly actually not what animals do.

570.176 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

because animals have this outer loop of evolution.

572.633 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And a lot of what looks like learning is actually a lot more maturation of the brain.

575.257 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think there's actually very little reinforcement learning for animals.

579.925 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think a lot of the reinforcement learning is actually more like motor tasks.

584.072 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not intelligence tasks.

587.198 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I actually kind of think humans don't actually really use RL, roughly speaking is what I would say.

588.841 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of the reinforcement learning in my perspective would be things that are a lot more like motor-like, like simple kind of like tasks, throwing a hoop, something like that.

595.813 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I don't think that humans use reinforcement learning for a lot of intelligence tasks like problem solving and so on.

605.089 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

610.538 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That doesn't mean we shouldn't do that for research, but I just feel like that's what animals do or don't.

611.18 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think so.

677.178 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would agree with you that there's some miraculous compression going on, because obviously the weights of the neural net are not stored in ATCGs.

677.618 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's some kind of a dramatic compression, and there's some kind of learning algorithms encoded that take over and do some of the learning online.

683.325 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I definitely agree with you on that.

690.033 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Basically, I would say I'm a lot more kind of like practically minded.

692.438 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't come at it from the perspective of like, let's build animals.

695.203 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I come from the perspective of like, let's build useful things.

697.788 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I have a hard hat on.

700.493 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I'm just observing that, look, we're not going to do evolution, because I don't know how to do that.

701.835 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But it does turn out we can build these ghost spirit-like entities by imitating internet documents.

705.963 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This works.

711.589 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's actually kind of like, it's a way to bring you up to something that has a lot of sort of built-in knowledge and intelligence in some way, similar to maybe what evolution has done.

712.43 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's why I kind of call pre-training this kind of like crappy evolution.

721.84 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's like the practically possible version with our technology and what we have available to us to get to a starting point where we can actually do things like reinforcement learning and so on.

725.003 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's subtle, and I think you're right to push back on it.

762.699 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But basically, the thing that pre-training is doing, so you're basically getting the next token predictor over the internet, and you're training that into a neural net.

765.103 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's doing two things actually that are kind of like unrelated.

772.518 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number one, it's picking up all this knowledge, as I call it.

775.284 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number two, it's actually becoming intelligent.

777.65 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

By observing the algorithmic patterns in the internet, it actually kind of like boots up all these like little circuits and algorithms inside the neural net to do things like in-context learning and all this kind of stuff.

780.777 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And actually, you don't actually need or want the knowledge.

789.376 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I actually think that's probably actually holding back the neural networks overall, because it's actually like getting them to rely on the knowledge a little too much sometimes.

792.485 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For example, I kind of feel like agents, one thing they're not very good at is going off the data manifold of what exists on the internet.

798.802 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If they had less knowledge or less memory, actually maybe they would be better.

805.019 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so what I think we have to do kind of going forward, and this would be part of the research paradigms, is actually think we need to start, we need to figure out ways to remove some of the knowledge and to keep what I call this cognitive core.

809.265 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's this like intelligent entity that is kind of stripped from knowledge but contains the algorithms and contains the magic, you know, of intelligence and problem solving and the strategies of it and all this kind of stuff.

820.402 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think I'm hesitant to say that in-context learning is not doing gradient descent because, I mean, it's not doing explicit gradient descent, but I still think that, so in-context learning, basically, it's pattern completion within a token window, right?

888.351 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it just turns out that there's a huge amount of patterns on the internet.

901.055 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so you're right, the model kind of like learns to complete the pattern, right?

903.159 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's inside the weights.

905.864 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The weights of the neural network are trying to discover patterns and complete the pattern.

907.667 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's some kind of an adaptation that happens inside the neural network, right?

911.554 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Which is kind of magical and just falls out from internet just because there's a lot of patterns.

915.26 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I will say that...

919.587 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There have been some papers that I thought were interesting that actually look at the mechanisms behind in-context learning.

921.17 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I do think it's possible that in-context learning actually runs a small gradient descent loop internally in the layers of the neural network.

925.518 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I recall one paper in particular where they were doing linear regression, actually, using in-context learning.

931.269 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So basically, your inputs into the neural network are XY pairs.

937.762 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

x, y, x, y, x, y that happened to be on the line.

942.179 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then you do x and you expect the y. And the neural network, when you train it in this way, actually does do linear regression.

945.063 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And normally when you would run linear regression, you have a small gradient descent optimizer that basically looks at x, y, looks at an error, calculates the gradient of the weights, and does the update a few times.

952.291 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It just turns out that when they looked at the weights of that in-context learning algorithm,

962.324 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

they actually found some analogies to gradient descent mechanics.

965.888 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

In fact, I think even the paper was stronger because they actually hard-coded the weights of a neural network to do gradient descent through attention and all the internals of the neural network.

970.74 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess that's just my only pushback is that who knows how in-context learning works, but I actually think that it's probably doing a little bit of some kind of funky gradient descent internally, and that I think that that's possible.

982.088 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I was only pushing back on you're saying it's not doing in-context learning.

992.749 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Who knows what it's doing, but it's probably maybe doing something similar to it, but we don't know.

996.477 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think I kind of agree.

1066.295 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, the way I usually put this is that anything that happens during the training of the neural network, the knowledge is only kind of like a hazy recollection of what happened in the training time.

1067.637 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's because the compression is dramatic.

1077.115 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're taking 15 trillion tokens and you're compressing it to just your final network of a few billion parameters.

1078.717 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So obviously it's a massive amount of compression going on.

1083.206 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of refer to it as like a hazy recollection of the internet documents.

1085.65 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Whereas anything that happens in the context window of the neural network, you're plugging all the tokens and it's building up all this KV cache representation, is very directly accessible to the neural net.

1089.555 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I compare the KV cache and the stuff that happens at test time to more like a working memory.

1097.325 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like all the stuff that's in the context window is very directly accessible to the neural net.

1102.973 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So there's always like these...

1107.519 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

almost surprising analogies between LLMs and humans.

1109.782 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I find them kind of surprising because we're not trying to build a human brain, of course, just directly.

1112.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're just finding that this works and we're doing it.

1116.807 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do think that...

1118.994 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Anything that's in the weights, it's kind of like a hazy recollection of what you read a year ago.

1120.733 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Anything that you give it as a context at test time is directly in the working memory.

1124.658 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think that's a very powerful analogy to think through things.

1129.905 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So when you, for example, go to an LLM and you ask it about some book and what happened in it, like Nick Lane's book or something like that, the LLM will often give you some stuff, which is roughly correct.

1132.588 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But if you give it the full chapter and ask it questions, you're going to get much better results because it's now loaded in the working memory of the model.

1141.4 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I basically agree with your very long way of saying that I kind of agree, and that's why.

1147.548 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I almost feel like just a lot of it still.

1159.985 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe one way to think about it.

1165.991 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know if this is the best way, but I almost kind of feel like, again, making these analogies, imperfect as they are.

1167.272 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We've stumbled by with the transformer neural network, which is extremely powerful, very general.

1173.638 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can train transformers on audio or video or text or whatever you want, and it just learns patterns, and they're very powerful, and it works really well.

1179.003 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That, to me, almost indicates that this is kind of like some piece of cortical tissue.

1186.63 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's something like that.

1190.576 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because the cortex is famously very plastic as well.

1191.478 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can rewire, you know, parts of brains.

1194.182 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there was the slightly gruesome experiments with rewiring, like, visual cortex to the auditory cortex, and this animal, like, learned fine, etc.

1197.407 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think that this is kind of like cortical tissue.

1205.88 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think when we're doing reasoning and planning inside the neural networks, so basically doing reasoning traces for thinking models, that's kind of like the prefrontal cortex.

1208.467 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think maybe those are like little check marks, but I still think there's many brain parts and nuclei that are not explored.

1218.862 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe, for example, there's a basal ganglia doing a bit of reinforcement learning when we fine tune the models on reinforcement learning.

1226.773 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But, you know, whereas like the hippocampus, not obvious what that would be.

1231.62 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

some parts are probably not important.

1234.825 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe the cerebellum is, like, not important to cognition, it's thought, so maybe we can skip some of it.

1236.528 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still think there's, for example, the amygdala, all the emotions and instincts.

1240.794 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's probably, like, a bunch of other nuclei in the brain that are very ancient that I don't think we've, like, really replicated.

1244.319 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know that we should be pursuing, you know, the building of an analog of human brain.

1250.469 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm, again, an engineer, mostly at heart.

1254.855 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But...

1257.019 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still feel like maybe another way to answer the question is you're not going to hire this thing as an intern.

1258.501 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's missing a lot of it's because it comes with a lot of these cognitive deficits that we all intuitively feel when we talk to the models.

1263.693 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's just like not fully there yet.

1269.207 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can look at it as like not all the brain parts are checked off yet.

1272.054 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know that I fully resonate with that because I feel like these models, when you boot them up and they have zero tokens in the window, they're always like restarting from scratch where they were.

1326.895 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I don't actually know in that worldview what it looks like because, again, maybe making some analogies to humans just because I think it's roughly concrete and kind of interesting to think through.

1334.727 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel like when I'm awake, I'm building up a context window of stuff that's happening during the day.

1345.643 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I feel like when I go to sleep, something magical happens where I don't actually think that that context window stays around.

1349.768 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think there's some process of distillation into weights of my brain.

1355.154 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this happens during sleep and all this kind of stuff.

1359.398 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We don't have an equivalent of that in large language models.

1361.301 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's, to me, more adjacent to when you talk about continual learning and so on as absent.

1364.764 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

These models don't really have this distillation phase of taking what happened, analyzing it, obsessively thinking through it, basically doing some kind of a synthetic data generation process and distilling it back into the weights, and maybe having a specific neural net per person.

1369.69 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe it's a LoRa, it's not a full...

1387.625 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, it's not a full-weight neural network.

1389.248 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's just some of the small sparse subset of the weights are changed.

1392.312 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But basically, we do want to create ways of creating these individuals that have very long contacts.

1396.697 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not only remaining in the contacts window because the contacts windows grow very, very long.

1402.204 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe we have some very elaborate sparse attention over it.

1406.489 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still think that humans obviously have some process for distilling some of that knowledge into the weights.

1409.333 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're missing it.

1414.339 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I do also think that humans have some kind of a very elaborate sparse attention scheme, which I think we're starting to see some early hints of.

1415.32 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So DeepSeek v3.2 just came out, and I saw that they have like a sparse attention as an example.

1424.496 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this is one way to have very, very long context windows.

1429.705 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I almost feel like we are redoing a lot of the...

1432.47 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

cognitive tricks that evolution came up with through a very different process.

1435.655 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But we're, I think, going to converge on a similar architecture cognitively.

1438.98 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, the way I like to think about it is, okay, let's translation invariance in time, right?

1450.135 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So 10 years ago, where were we?

1453.62 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

2015, we had convolutional neural networks primarily.

1455.483 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Residual networks just came out.

1459.547 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So remarkably similar, I guess, but quite a bit different still.

1461.929 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, Transformer was not around.

1464.591 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You know, all these sort of like more modern tweaks on the Transformer were not around.

1467.494 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe some of the things that we can bet on, I think, in 10 years by translational sort of equivariance is we're still training giant neural networks with forward, backward, pass, and update through gradient descent.

1473.039 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But maybe it looks a little bit different.

1485.47 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's just everything is much bigger.

1488.093 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Actually, recently, I also went back all the way to 1989, which was kind of a fun exercise for me a few years ago, because I was reproducing Jan LeCun's 1989 convolutional network, which was the first neural network I'm aware of trained via gradient descent, like modern neural network trained gradient descent on digit recognition.

1490.016 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I was just interested in, okay, how can I modernize this?

1508.639 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How much of this is algorithms?

1511.585 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How much of this is data?

1512.547 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How much of this progress is compute and systems?

1513.369 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I was able to very quickly like half the learning rate, just knowing by time travel by 33 years.

1516.175 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So if I time travel by algorithms to 33 years, I could adjust what Yann LeCun did in 1989, and I could basically half the learning, half the error.

1522.168 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But to get further gains, I had to add a lot more data.

1529.463 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I had to 10x the training set.

1533.007 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I had to actually add more computational optimizations.

1534.549 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I had to basically train for much longer with dropout and other regularization techniques.

1538.273 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's almost like all these things have to improve simultaneously.

1542.057 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So we're probably going to have a lot more data.

1545.28 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're probably going to have a lot better hardware.

1548.023 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're probably going to have a lot better kernels and software.

1549.565 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're probably going to have better algorithms.

1551.968 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And all of those, it's almost like no one of them is winning too much.

1553.449 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

All of them are surprisingly equal.

1557.013 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this has kind of been the trend for a while.

1559.416 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess to answer maybe your question, I expect differences algorithmically to what's happening today.

1561.78 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do also expect that some of the things that have stuck around for a very long time will probably still be there.

1568.693 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's probably still a giant neural network trained with gradient descent.

1573.281 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That would be my guess.

1575.665 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I guess what was shocking to me is everything needs to improve across the board.

1591.565 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Architecture, optimizer, loss function, and also has improved across the board forever.

1595.61 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of expect all those changes to be alive and well.

1600.035 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Building NanoChat?

1627.474 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So NanoChat is a kind of a repository I released.

1628.335 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Was it yesterday or the day before?

1630.698 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I can't remember.

1632.421 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We can see this lead generation that went into the... Well, it's just trying to be a...

1634.844 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's trying to be the simplest, complete repository that covers the whole pipeline end-to-end of building a ChatGPT clone.

1640.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so, you know, you have all of the steps, not just any individual step, which is a bunch of... I worked on all the individual steps sort of in the past and released small pieces of code that kind of show you how that's done in algorithmic sense in like simple code.

1647.603 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But this kind of handles all the entire pipeline.

1661.284 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think in terms of learning, it's not so much, I don't know that I actually found something that I learned from it necessarily.

1663.467 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I kind of already had in my mind as like how you build it.

1669.758 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this is just a process of mechanically building it and making it clean enough and so that people can actually learn from it and that they find it useful.

1672.182 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would probably say, so basically it's about 1,000 lines of code that takes you through the entire pipeline.

1691.791 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would probably put it on the right monitor, like if you have two monitors, you put it on the right.

1696.638 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you want to build it from scratch.

1701.084 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You build it from start.

1703.006 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're not allowed to copy-paste.

1704.468 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're allowed to reference, you're not allowed to copy-paste.

1705.79 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe that's how I would do it.

1708.013 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I also think the repository by itself, it is like a pretty large beast.

1709.956 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, when you write this code, you don't go from top to bottom.

1713.2 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You go from chunks and you grow the chunks.

1717.506 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that information is absent, like you wouldn't know where to start.

1719.789 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think it's not just a final repository that's needed.

1722.873 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's like the building of the repository, which is a complicated chunk growing process.

1725.577 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that part is not there yet.

1730.123 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would love to actually add that probably later this week or something in some way.

1732.028 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Either it's probably a video or something like that.

1735.658 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But maybe, roughly speaking, that's what I would try to do is build the stuff yourself, but don't allow yourself copy-paste.

1738.446 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

1745.906 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that there's two types of knowledge almost.

1746.307 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like there's the high-level surface knowledge.

1748.55 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But the thing is that when you actually build something from scratch, you're forced to come to terms with what you don't actually understand and you don't know that you don't understand it.

1750.592 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it always leads to a deeper understanding.

1757.8 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's like just the only way to build.

1759.842 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's like if I can't build it, I don't understand it.

1762.906 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Is that a Feynman quote, I believe, or something along those lines?

1765.228 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I 100%, I've always believed this very strongly because there's all these like micro things that are just not properly arranged and you don't really have the knowledge.

1768.312 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You just think you have the knowledge.

1776.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So don't write blog posts.

1777.376 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Don't do slides.

1778.579 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Don't do any of that.

1779.641 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like build a code, arrange it, get it to work.

1780.323 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's the only way to go.

1782.369 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Otherwise you're missing knowledge.

1783.131 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

1792.617 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the repository, I guess I built it over a period of a bit more than a month.

1793.739 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I would say there's like three major classes of how people interact with code right now.

1797.846 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Some people completely reject all of LLMs, and they are just writing by scratch.

1802.153 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think this is probably not the right thing to do anymore.

1806.26 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The intermediate part, which is where I am, is you still write a lot of things from scratch, but you use the autocomplete that's basically available now from these models.

1809.706 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So when you start writing out a little piece of it, it will autocomplete for you, and you can just tap through, and most of the time it's correct.

1818.476 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Sometimes it's not, and you edit it.

1824.143 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But you're still very much the architect of what you're writing.

1826.005 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then there's the vibe coding.

1829.71 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You know, hi, please implement this or that, you know, enter, and then let the model do it.

1832.293 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's the agents.

1836.978 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do feel like the agents work in very specific settings, and I would use them in specific settings.

1838.36 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But again, these are all tools available to you, and you have to learn what they're good at and what they're not good at and when to use them.

1844.568 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the agents are actually pretty good, for example, if you're doing boilerplate stuff.

1850.516 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Boilerplate code that's just copy-based stuff, they're very good at that.

1853.841 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're very good at stuff that occurs very often on the intranet.

1858.107 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

because there's lots of examples of it in the training sets of these models.

1861.512 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So there's features of things where the models will do very well.

1866.223 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would say NanoChat is not an example of this, because it's a fairly unique repository.

1869.892 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's not that much code, I think, in the way that I've structured it.

1874.563 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's not boilerplate code.

1878.893 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's like actually like intellectually intense code almost.

1880.475 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And everything has to be very precisely arranged.

1882.778 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the models are always trying to, they kept trying to, I mean, they have so many cognitive deficits, right?

1884.741 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So one example, they keep trying to, they keep misunderstanding the code because they have too much memory from all the typical ways of doing things on the internet that I just wasn't adopting.

1889.428 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the models, for example, I mean, I don't know if I want to get into the full details, but they keep thinking I'm writing normal code and I'm not.

1900.062 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe one example.

1908.323 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe one example is, so the way to synchronize, so we have eight GPUs that are all doing forward backwards.

1909.364 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The way to synchronize gradients between them is to use a distributed data parallel container of PyTorch, which automatically does all the, as you're doing the backward, it will start communicating and synchronizing gradients.

1914.89 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I didn't use DDP because I didn't want to use it because it's not necessary.

1924.08 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I threw it out.

1928.164 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I basically wrote my own synchronization routine that's inside the step of the optimizer.

1929.285 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so the models were trying to get me to use the DDP container, and they were very concerned about, okay, this gets way too technical, but I wasn't using that container because I don't need it, and I have a custom implementation of something like it.

1933.67 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, they couldn't get past that.

1947.216 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

and then um they kept trying to like mess up the style like they're way too over defensive they make all these try catch statements they keep trying to make a production code base and i have a bunch of assumptions in my code and it's okay and uh and it's just like i don't need all this extra stuff in there and so i just kind of feel like they're bloating the code base they're bloating the complexity they keep misunderstanding they're using deprecated apis a bunch of times so it's total mess um

1949.38 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

and it's just not that useful.

1974.828 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I can go in and I can clean it up, but it's not that useful.

1977.872 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I also feel like it's kind of annoying to have to, like, type out what I want in English because it's just too much typing.

1980.515 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, if I just navigate to the part of the code that I want and I go where I know the code has to appear and I start typing out the first three letters, autocomplete gets it and just gives you the code.

1985.261 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think it's... This is a very high-information bandwidth to specify what you want.

1994.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you point to the code where you want it and you type out the first few pieces, and the model will complete it.

1998.178 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess what I mean is...

2003.324 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think these models are good in certain parts of the stack.

2005.878 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I actually use the models a little bit in... There are two examples where I actually use the models that I think are illustrative.

2009.704 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

One was when I generated the report.

2016.234 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's actually more boilerplate-y.

2018.116 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I actually bytecoded partially some of that stuff.

2019.478 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That was fine.

2021.862 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because it's not like mission-critical stuff and it works fine.

2023.244 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then the other part is when I was rewriting the tokenizer in Rust...

2025.768 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm actually not as good at Rust because I'm fairly new to Rust.

2029.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I was doing, there's a bit of vibe coding going on when I was writing some of the Rust code.

2032.738 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I had Python implementation that I fully understand and I'm just making sure I'm making a more efficient version of it and I have tests.

2037.524 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I feel safer doing that stuff.

2043.131 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so basically they lower or like they increase accessibility to languages or paradigms that you might not be as familiar with.

2045.714 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think they're very helpful there as well.

2054.105 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2056.068 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because there's a ton of Rust code out there.

2056.148 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The models are actually pretty good at it.

2057.831 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I happen to not know that much about it.

2059.494 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the models are very useful there.

2060.956 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think you're getting at some of my, like why my timelines are a bit longer.

2106.305 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're right.

2110.014 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think, yeah, they're not very good at code that has never been written before.

2110.895 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe it's like one way to put it, which is like what we're trying to achieve when we're building these models.

2115.145 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's tough.

2141.401 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think they kind of know, but they don't fully know.

2142.223 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they don't know how to fully integrate it into the repo and your style and your code and your place and some of the custom things that you're doing and how it fits with all the assumptions of the repository and all this kind of stuff.

2144.508 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think they do have some knowledge, but...

2153.69 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

they haven't gotten to the place where they can actually integrate it, make sense of it, and so on.

2156.437 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that a lot of the stuff, by the way, continues to improve.

2162.226 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think currently probably state-of-the-art model that I go to is the GPT-5 Pro.

2164.45 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's a very, very powerful model.

2168.937 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So if I actually have 20 minutes, I will copy-paste my entire repo and I go to GPT-5 Pro, the Oracle, for like some questions.

2171.602 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And often it's not too bad and surprisingly good compared to what existed a year ago.

2177.311 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2181.618 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do think that overall the models are – they're not there.

2181.698 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like the industry, it's over – it's making too big of a jump.

2186.483 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's trying to pretend like this is amazing.

2193.39 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's not.

2196.334 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's slop.

2196.874 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think they're not coming to terms with it.

2197.875 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And maybe they're trying to fundraise or something like that.

2199.537 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm not sure what's going on.

2201.419 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But we're at this intermediate stage.

2202.26 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The models are amazing.

2204.843 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They still need a lot of work.

2205.664 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For now, autocomplete is my sweet spot.

2207.045 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But sometimes, for some types of code, I will go to a nullim agent.

2209.909 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe you guys discussed one other kind of thought that is like, I do feel like I have a hard time differentiating where AI begins and stops.

2245.132 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because I do see AI as fundamentally an extension of computing in some pretty fundamental way.

2251.962 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I feel like I see a continuum of this kind of like recursive self-improvement or like of speeding up programmers all the way from the beginning.

2256.83 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like even like I would say like code editors.

2264.081 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2267.506 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

syntax highlighting, syntax or like checking even of the types, like data type checking.

2268.723 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

All these kinds of tools that we've built for each other, even search engines, like why aren't search engines part of AI?

2275.333 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, I don't know, like ranking is kind of AI, right?

2281.081 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

At some point, Google was like, even early on, they were thinking of themselves as an AI company doing Google search engine, which I think is totally fair.

2284.626 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I kind of see it as a lot more of a continuum than I think other people do, and I don't, it's hard for me to draw the line.

2290.154 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like, okay, we're now getting a much better autocomplete.

2295.563 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And now we're also getting some agents which are kind of like these loopy things, but they kind of go off rails sometimes.

2298.368 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And what's going on is that the human is progressively doing a bit less and less of the low-level stuff.

2303.437 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For example, we're not writing the assembly code because we have compilers, right?

2309.347 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like compilers will take my highlight language in C and write the assembly code.

2312.292 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2315.117 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So we're abstracting ourselves very, very slowly.

2315.197 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's this what I call autonomy slider of like more and more stuff is automated of the stuff that can be automated at any point in time.

2317.84 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And we're doing a bit less and less and raising ourselves in the layer of abstraction over the automation.

2323.988 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, maybe the way I would put it is humans don't use reinforcement learning is maybe what I, as I've said it all.

2446.438 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think they do something different, which is, yeah, you experience.

2451.124 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So reinforcement learning is a lot worse than I think the average person thinks.

2453.967 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Reinforcement learning is terrible.

2459.413 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It just so happens that everything that we had before is much worse.

2461.958 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because previously, we were just imitating people, so it has all these issues.

2467.444 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in reinforcement learning, say you're working with, you're solving a math problem.

2471.208 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is very simple.

2474.992 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're given a math problem, and you're trying to find a solution.

2475.673 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Now, in reinforcement learning, you will try lots of things in parallel first.

2478.816 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you're given a problem, you try hundreds of things,

2484.803 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

different attempts.

2487.946 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And these attempts can be complex, right?

2488.808 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They can be like, oh, let me try this, let me try that, this didn't work, that didn't work, et cetera.

2490.15 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then maybe you get an answer.

2493.836 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And now you check the back of the book and you see, okay, the correct answer is this.

2495.279 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then you can see that, okay, this one, this one, and that one got the correct answer, but these other 97 of them didn't.

2499.286 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So literally what reinforcement learning does is it goes to the ones that worked really well, and every single thing you did along the way, every single token gets up-weighted of, like, do more of this.

2504.956 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The problem with that is, I mean, people will say that your estimator has high variance, but, I mean, it's just noisy.

2514.348 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's noisy.

2519.876 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So basically, it kind of almost assumes that every single little piece of the solution that you made that right at the right answer was the correct thing to do, which is not true.

2521.157 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, you may have gone down the wrong alleys

2528.507 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

until you write the right solution.

2530.69 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Every single one of those incorrect things you did, as long as you got to the correct solution, will be up-weighted as do more of this.

2532.212 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's terrible.

2537.219 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's noise.

2538.481 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You've done all this work only to find a single, at the end, you get a single number of like, oh, you did correct.

2539.443 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And based on that, you weigh that entire trajectory as like up-weight or down-weight.

2545.171 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so the way I like to put it is you're sucking supervision through a straw because you've done all this work that could be a minute to roll out and you're like sucking the bits of supervision of the final reward signal through a straw and you're like putting it, you're like, you're basically like, yeah, you're broadcasting that across the entire trajectory and using that to upweigh or downweigh that trajectory.

2549.337 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's crazy.

2569.4 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A human would never do this.

2570.581 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number one, a human would never do hundreds of rollouts.

2571.502 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number two, when a person sort of finds a solution, they will have a pretty complicated process of review of like, okay, I think these parts that I did well, these parts I did not do that well.

2573.705 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I should probably do this or that.

2583.582 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they think through things.

2585.485 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's nothing in current LLMs that does this.

2586.828 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's no equivalent of it.

2588.631 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do see papers popping out that are trying to do this because it's obvious to everyone in the field.

2590.734 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of see it as like, the first imitation learning actually, by the way, was extremely surprising and miraculous and amazing that we can fine-tune by imitation in humans.

2595.743 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that was incredible.

2603.631 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because in the beginning, all we had was base models.

2605.073 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Base models are autocomplete.

2606.875 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it wasn't obvious to me at the time, and I had to learn this, and the paper that blew my mind was InstructGPT.

2609.017 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because it pointed out that, hey, you can take the pre-trained model, which is autocomplete,

2615.824 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if you just fine-tune it on text that looks like conversations, the model will very rapidly adapt to become very conversational.

2619.588 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it keeps all the knowledge from pre-training.

2626.175 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this blew my mind because I didn't understand that this just like stylistically can adjust so quickly and become an assistant to a user through just a few loops of fine-tuning on that kind of data.

2628.318 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It was very miraculous to me that that worked.

2638.189 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So incredible.

2641.012 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that was like two years, three years of work.

2641.953 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And now came RL.

2644.428 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And RL allows you to do a bit better than just imitation learning, right?

2645.731 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because you can't have these reward functions and you can hill climb on the reward functions.

2649.158 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so some problems have just correct answers.

2653.808 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can hill climb on that without getting expert trajectories to imitate.

2655.853 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's amazing.

2659 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the model can also discover solutions that the human might never come up with.

2660.062 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So this is incredible.

2663.75 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And yet, it's still stupid.

2665.252 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think we need more.

2667.955 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I saw a paper from Google yesterday that tried to have this reflect and review page idea in mind.

2669.917 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What was the memory bank paper or something?

2676.565 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know.

2679.409 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I've actually seen a few papers along these lines.

2680.15 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I expect there to be some kind of a major update to how we do algorithms for LLMs coming in that realm.

2681.912 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think we need three or four or five more.

2688.8 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Something like that.

2693.085 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So process-based supervision just refers to the fact that we're not going to have a reward function only at the very end of after you've made 10 minutes of work, I'm not going to tell you you did well or not well.

2728.37 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm going to tell you at every single step of the way how well you're doing.

2735.98 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this is basically the reason we don't have that.

2738.683 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's tricky how you do that properly because you have partial solutions and you don't know how to assign credit.

2740.726 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So when you get the right answer, it's just an equality match to the answer.

2746.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Very simple to implement.

2750.577 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you're doing basically process supervision, how do you assign, in an automatable way, partial credit assignment?

2752.399 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not obvious how you do it.

2758.265 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Lots of labs, I think, are trying to do it with these LLM judges.

2759.446 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So basically, you get LLMs to try to do it.

2762.149 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you prompt an LLM, hey, look at a partial solution of a student.

2764.071 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How well do you think they're doing if the answer is this?

2766.894 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they try to tune the prompt.

2769.036 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The reason that I think this is kind of tricky is quite subtle.

2771.058 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's the fact that anytime you use an LLM to assign a reward, those LLMs are giant things with billions of parameters and they're gameable.

2774.001 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if you're reinforcement learning with respect to them, you will find adversarial examples for your LLM judges almost guaranteed.

2781.488 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can't do this for too long.

2787.394 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You do maybe 10 steps or 20 steps, maybe it will work, but you can't do 100 or 1,000 because it's not obvious.

2788.455 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because I understand it's not obvious, but basically the model will find little cracks,

2793.98 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

it will find all these spurious things in the nooks and crannies of the giant model and find a way to cheat it.

2800.066 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So one example that's prominently in my mind is, I think this was probably public, but basically, if you're using an element judge for a reward, so you just give it a solution from a student and ask it if the student will or not,

2806.398 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We were training with reinforcement learning against that reward function, and it worked really well, and then suddenly the reward became extremely large.

2818.78 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It was a massive jump, and it did perfect.

2827.107 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you're looking at it like, wow, this means the student is perfect in all these problems.

2829.029 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's fully solved math.

2833.253 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But actually what's happening is that when you look at the completions that you're getting from the model, they are complete nonsense.

2835.214 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They start out okay, and then they change to da-da-da-da-da-da-da.

2839.999 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's just like, oh, okay, let's take two plus three, and we do this and this, and then da-da-da-da-da-da-da-da.

2842.861 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you're looking at it and it's like, this is crazy.

2847.265 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How is it getting a reward of one or 100%?

2848.79 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you look at the LLM judge and it turns out that the, the, the, the, the is an adversarial examples for the model and it assigns 100% probability to it.

2851.058 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's just because this is an out-of-sample example to the LLM.

2858.639 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's never seen it during training, and you're in pure generalization land.

2861.824 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's never seen it during training, and in the pure generalization land, you can find these examples that break it.

2865.61 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Not even that.

2876.206 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Prompt injection is way too fancy.

2876.647 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're finding adversarial examples, as they're called.

2878.55 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

These are nonsensical solutions that are obviously wrong, but the model thinks they're amazing.

2880.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

2900.693 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think the labs are probably doing all that.

2901.254 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, okay, so the obvious thing is like the should not get 100% reward.

2902.958 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

okay, well, take the, the, the, the, put in the training set of the LLM judge and say, this is not 100%, this is 0%.

2906.125 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can do this.

2911.091 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But every time you do this, you get a new LLM and it still has adversarial examples.

2912.072 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's infinity adversarial examples.

2916.418 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think probably if you iterate this a few times, it'll probably be harder and harder to find adversarial examples.

2918 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I'm not 100% sure because this thing has a trillion parameters or whatnot.

2923.106 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I bet you the labs are trying.

2927.311 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually, I still think, I still think we need other ideas.

2930.455 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So like this idea of like a review solution and come up with synthetic examples such that when you train on them, you get better and like meta-learn it in some way.

2941.471 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think there's some papers that I'm starting to see pop out.

2951.667 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I only am at a stage of like reading abstracts because a lot of these papers, you know, they're just ideas.

2953.97 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Someone has to actually like make it work on a frontier LLM lab scale.

2958.377 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

in full generality.

2962.083 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because when you see these papers, they pop up and it's just like a little bit of noisy, you know?

2963.868 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's cool ideas, but I haven't actually seen anyone convincingly show that this is possible.

2967.657 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That said, the LLM labs are fairly closed, so who knows what they're doing now, but...

2973.011 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I do think that we're missing some aspects there.

3016.668 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So as an example, when you're reading a book,

3018.492 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I almost feel like currently when LLMs are reading a book, what that means is we stretch out the sequence of text and the model is predicting the next token and it's getting some knowledge from that.

3022.141 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's not really what humans do, right?

3030.716 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So when you're reading a book, I almost don't even feel like the book is like exposition I'm supposed to be attending to and training on.

3032.038 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The book is a set of prompts for me to do synthetic data generation.

3037.468 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

or for you to get into a book club and talk about it with your friends.

3041.615 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's by manipulating that information that you actually gain that knowledge.

3044.72 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think we have no equivalent of that, again, with LLMs.

3048.587 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They don't really do that, but I'd love to see during pre-training some kind of a stage that thinks through the material and tries to reconcile it with what it already knows and thinks through for some amount of time and gets that to work.

3051.672 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so there's no equivalence of any of this.

3063.232 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is all research.

3064.898 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's some subtle, very subtle that I think are very hard to understand reasons why it's not trivial.

3065.882 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So if I can just describe one.

3070.881 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why can't we just synthetically generate and train on it?

3073.171 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, because every synthetic example, like if I just give synthetic generation of the model thinking about a book, you look at it and you're like, this looks great.

3075.535 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why can't I train on it?

3082.405 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, you could try, but the model will actually get much worse if you continue trying.

3083.447 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's because all of the samples you get from models are silently collapsed.

3086.612 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're silently, this is not obvious if you look at any individual example of it, they occupy a very tiny manifold of the possible space of sort of thoughts about content.

3091.139 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So the LLMs, when they come off, they're what we call collapsed.

3099.952 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They have a collapsed data distribution.

3103.358 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you sample, one easy way to say it is go to ChatGPT and ask it, tell me a joke.

3104.921 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It only has like three jokes.

3110.089 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not giving you the whole breadth of possible jokes.

3111.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's giving you like, it knows like three jokes.

3114.237 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're silently collapsed.

3116.38 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So basically, you're not getting the richness and diversity and the entropy from these models as you would get from humans.

3117.642 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So humans are a lot more sort of noisier, but at least they're not biased.

3123.813 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're not in a statistical sense.

3127.179 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're not silently collapsed.

3129.162 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They maintain a huge amount of entropy.

3130.524 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So how do you get synthetic data generation to work despite the collapse and while maintaining the entropy is a research problem.

3132.247 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Say we have a chapter of a book and I ask an alum to think about it.

3154.002 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It will give you something that looks very reasonable.

3158.309 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But if I ask it 10 times, you'll notice that all of them are the same.

3159.971 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, yeah, yeah.

3172.811 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So any individual sample will look okay, but the distribution of it is quite terrible.

3173.432 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

3177.458 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's quite terrible in such a way that if you continue training on too much of your own stuff, you actually collapse.

3177.678 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I actually think that there's no like fundamental solutions to this possibly.

3182.505 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I also think humans collapse over time.

3185.749 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think this is, again, these analogies are surprisingly good, but humans collapse during the course of their lives.

3188.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is why children have completely, you know, they haven't overfit yet.

3193.4 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they will say stuff that will shock you because it's kind of, you can see where they're coming from, but it's just not the thing people say.

3197.466 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

3203.214 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And because they're not yet collapsed.

3203.294 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But we're collapsed, we end up revisiting the same thoughts, we end up saying more and more of the same stuff, and the learning rates go down, and the collapse continues to get worse, and then everything deteriorates.

3205.581 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's an interesting idea.

3236.034 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I do think that...

3236.815 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

When you're generating things in your head and then you're attending to it, you're kind of like training on your own samples.

3238.478 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're training on your synthetic data.

3242.904 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if you do it for too long, you go off rails and you collapse way too much.

3244.306 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you always have to like seek entropy in your life.

3247.87 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So talking to other people is a great source of entropy and things like that.

3252.636 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe the brain has also built some internal mechanisms for increasing the amount of entropy in that process.

3256.742 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But yeah, maybe that's an interesting idea.

3263.811 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think there's something very interesting about that.

3319.732 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, 100%.

3321.316 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that humans actually, they do kind of like have a lot more of an element compared to LLMs of like seeing the forest for the trees.

3322.338 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And we're not actually that good at memorization, which is actually a feature.

3329.393 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because we're not that good at memorization, we actually are kind of like forced to...

3334.425 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

to find the patterns in a marginal sense.

3338.193 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think LLNs, in comparison, are extremely good at memorization.

3343.177 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They will recite passages from all these training sources.

3346.42 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can give them completely nonsensical data, like you can hash some amount of text or something like that.

3350.024 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You get a completely random sequence.

3354.868 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you train on it, even just, I think, a single iteration or two, it can suddenly regurgitate the entire thing.

3356.289 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It will memorize it.

3360.773 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's no way a person can read a single sequence of random numbers and recite it to you.

3361.354 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's a feature, not a bug almost, because it forces you to like only learn the generalizable components.

3366.138 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Whereas LLMs are distracted by all the memory that they have of the pre-trained documents.

3372.405 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's probably very distracting to them in a certain sense.

3377.29 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's why when I talk about the cognitive core, I actually want to remove the memory, which is what we talked about.

3380.574 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'd love to have less memory so that they have to look things up.

3385.379 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they only maintain the algorithms for like thought and the idea of an experiment and all this cognitive glue of acting.

3389.003 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm not sure.

3403.86 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it's almost like a separate axis.

3407.164 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's almost like the models are way too good at memorization and somehow we should remove that.

3409.006 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think people are much worse, but it's a good thing.

3413.892 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I think that's a great question.

3432.196 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, you can imagine having a regularization for entropy and things like that.

3433.478 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess they just don't work as well empirically because right now, like, the models are collapsed.

3436.441 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I will say...

3441.326 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Most of the tasks that we want of them don't actually demand the diversity.

3443.527 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's probably the answer of what's going on.

3448.513 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's just that the frontier labs are trying to make the models useful.

3450.476 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of just feel like the diversity of the outputs is not so much.

3454.281 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number one, it's much harder to work with and evaluate and all this kind of stuff.

3458.026 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But maybe it's not what's actually capturing most of the value.

3460.429 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

3468.94 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Or like maybe if you're doing a lot of writing help from LLMs and stuff like that, I think it's probably bad because the models will give you these like silently all the same stuff, you know.

3469.581 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So they're not, they won't explore lots of different ways of answering a question, right?

3477.932 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I kind of feel like maybe this diversity is just not as big of a, yeah, maybe like, yeah, not as many applications needed so the models don't have it, but then it's actually a problem with synthetic generation time, et cetera.

3482.238 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So we're actually shooting ourselves in the foot by not allowing this entropy to maintain in the model.

3491.251 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think possibly the labs should try harder.

3495.456 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

3498.02 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know if it's super fundamental.

3505.677 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know if I intended to say that.

3508.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that...

3511.11 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I haven't done these experiments, but I do think that you could probably regularize the entropy to be higher.

3513.895 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you're encouraging the model to give you more and more solutions.

3518.504 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But you don't want it to start deviating too much from the training data.

3521.99 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to start making up its own language.

3524.575 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to start using words that are extremely rare.

3526.058 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's going to drift too much from the distribution.

3528.964 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think controlling the distribution is just like a tricky... It's just like someone just has to...

3531.889 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's probably not trivial in that sense.

3536.879 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's really interesting in the history of the field because at one point everything was very scaling-pilled in terms of like, oh, we're going to make much bigger models, trillions of parameter models.

3551.576 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And actually what the models have done in size is they've gone up and now they've actually kind of like

3560.205 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

actually even come down.

3565.685 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

State-of-the-art models are smaller.

3566.606 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And even then, I actually think they memorized way too much.

3568.408 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think I had a prediction a while back that I almost feel like we can get cognitive cores that are very good at even like a billion, billion parameters.

3571.731 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It should be all very like, like if you talk to a billion parameter model, I think in 20 years, you can actually have a very productive conversation, it thinks.

3579.198 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's a lot more like a human.

3586.985 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But if you ask it some factual question, it might have to look it up.

3589.328 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But it knows that it doesn't know and it might have to look it up and it will just do all the reasonable things.

3591.59 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

No, because I basically think that the training data is, so here's the issue.

3631.598 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The training data is the internet, which is really terrible.

3634.201 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So there's a huge amount of gains to be made because the internet is terrible.

3637.506 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like if you actually, and even the internet, when you and I think of the internet, you're thinking of like a Wall Street Journal or that's not what this is.

3640.129 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

When you're actually looking at a pre-training data set in the front of your lab and you look at a random internet document, it's total garbage.

3646.538 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like I don't even know how this works at all.

3651.865 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's some like stock ticker symbols.

3654.108 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a huge amount of slop and garbage from like all the corners of the internet.

3658.174 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not like your Wall Street Journal article that's extremely rare.

3661.699 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I almost feel like because the internet is so terrible, we actually have to sort of like build really big models to compress all that.

3665.605 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Most of that compression is memory work instead of like cognitive work.

3671.974 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But what we really want is the cognitive part to actually delete the memory.

3675.719 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then, so I guess what I'm saying is like we need

3678.563 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

intelligent models to help us refine even the pre-training set to just narrow it down to the cognitive components.

3681.768 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think you get away with a much smaller model because it's a much better data set and you could train it on it.

3687.417 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But probably it's not trained directly on it.

3692.164 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's probably distilled for a much better model still.

3693.586 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I just feel like distillation works extremely well.

3700.111 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So almost every small model, if you have a small model, it's almost certainly distilled.

3701.994 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, come on, right?

3713.572 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know.

3715.615 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

At some point, it should take at least a billion knobs to do something interesting.

3715.896 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're thinking it should be even smaller?

3721.184 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I almost feel like I'm already contrarian by talking about a billion-parameter cognitive core, and you're outdoing me.

3743.159 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think, yeah, maybe we could get a little bit smaller.

3748.625 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I still think that there should be enough.

3751.108 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, maybe it can be smaller.

3753.431 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that, practically speaking, you want the model to have some knowledge.

3755.052 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You don't want it to be looking up everything.

3757.956 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because then you can't think in your head.

3760.298 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're looking up way too much stuff all the time.

3761.64 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I do think it needs to be some basic curriculum needs to be there for knowledge.

3763.001 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But it doesn't have esoteric knowledge, you know?

3768.107 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I don't know that I have a super strong prediction.

3794.611 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that the labs are just being practical.

3797.758 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They have a flops budget and a cost budget.

3800.103 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it just turns out that pre-training is not where you want to put most of your flops or your cost.

3802.488 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's why the models have gotten smaller, because they are a bit smaller, the pre-training stage is smaller, et cetera, but they make it up in reinforcement learning and all this kind of stuff, mid-training and all this kind of stuff that follows.

3805.775 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So they're just being practical in terms of all the stages and how you get the most bang for the buck.

3814.895 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess like forecasting that trend, I think, is quite hard.

3819.229 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do still expect that there's so much longing for it.

3822.298 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's my basic expectation.

3824.545 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

3827.515 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I have a very wide distribution here.

3828.85 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Probably most part, yeah.

3851.269 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I expect the data sets to get much, much better because when you look at the average data sets, they're extremely terrible.

3852.07 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like so bad that I don't even know how anything works, to be honest.

3856.26 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like look at the average example in the training set.

3858.505 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like factual mistakes, errors, nonsensical things.

3862.093 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Somehow when you do it at scale, the noise washes away and you're left with some of the signal.

3866.543 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

um so data sets will improve a ton it's just everything gets better so um our hardware um our all the kernels um all the kernels for running the hardware and maximizing what you get with the hardware you know so nvidia is slowly tuning the actual hardware itself tensor course and so on all that needs to happen and will continue to happen uh all the kernels will get better and utilize the chip to the max extent all the algorithms will probably improve over optimization architecture and just all the modeling components of how everything is done and what the algorithms are that we're even training with

3870.953 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I do kind of expect like a just very just everything.

3899.911 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Nothing dominates.

3904.499 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everything plus 20%.

3905.68 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Right.

3907.303 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

3908.906 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is like roughly what I've seen.

3909.166 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I have two answers to that.

4026.052 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Number one, I'm almost tempted to, like, reject the question entirely because, again, like, I see this as an extension of computing.

4027.434 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Have we talked about, like, how to chart progress in computing or how do you chart progress in computing since 1970s or whatever?

4032.701 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What is the x-axis?

4037.928 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of feel like the whole question is kind of, like, funny from that perspective a little bit.

4039.249 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I will say, I guess, like, when people talk about AI and the original AGI and how we spoke about it when we – when OpenAI started –

4043.515 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

AGI was a system you can go to that can do any task that is economically valuable, any economically valuable task at human performance or better.

4050.284 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Okay.

4060.749 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that was the definition.

4061.33 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I was pretty happy with that at the time.

4062.152 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like I've stuck to that definition forever.

4063.555 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then people have made up all kinds of other definitions.

4066.342 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I feel like I like that definition.

4068.948 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Now, number one, the first concession that people make all the time is they just take out all the physical stuff because we're just talking about digital knowledge work.

4072.274 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel like that's a pretty major concession compared to the original definition, which was like any task a human can do.

4079.447 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I can lift things, et cetera.

4084.296 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like AI can't do that, obviously.

4085.539 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, okay, but we'll take it.

4087.202 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What fraction of the economy are we taking away by saying, oh, only knowledge work?

4088.945 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know the numbers.

4093.775 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel like it's about 10% to 20%, if I had to guess, is only knowledge work.

4094.737 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like someone could work from home and perform tasks, something like that.

4100.689 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

4104.677 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still think it's a really large market.

4104.757 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, yeah, what is the size of the economy and what is 10%, 20%?

4107.1 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, we're still talking about a few trillion dollars of, even in the U.S., of market share almost, or like work.

4110.304 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, still a very massive bucket.

4117.794 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, but I guess like going back to the definition, I guess what I would be looking for is to what extent is that definition true?

4119.817 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, are there jobs or lots of tasks?

4125.705 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If we think of tasks as, you know, not jobs, but tasks, kind of difficult definitions.

4129.049 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because the problem is like society will refactor based on the tasks that make up jobs compared to what's based on what's automatable or not.

4133.495 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But today, what jobs are replaceable by AI?

4140.686 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So a good example recently was Jeff Hinton's prediction that radiologists would not be a job anymore.

4143.59 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this turned out to be very wrong in a bunch of ways, right?

4149.559 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So radiologists are alive and well and growing, even though computer vision is really, really good at recognizing all the different things that they have to recognize in images.

4151.722 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's just messy, complicated things.

4159.073 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

job with a lot of surfaces and dealing with patients and all this kind of stuff in the context of it.

4161.136 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I don't actually know that by that definition, AI has made a huge amount of dent yet.

4166.305 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But some of the jobs maybe that I would be looking for have some features that I think make it very amenable to automation earlier than later.

4172.556 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

As an example, call center employees often come up, and I think rightly so, because call center employees are

4178.787 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

have a number of simplifying properties with respect to what's automatable today.

4184.137 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Their jobs are pretty simple.

4189.225 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a sequence of tasks and every task looks similar.

4191.088 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like you take a phone call with a person, it's 10 minutes of interaction or whatever it is, probably a bit longer.

4193.793 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

In my experience, a lot longer.

4198.28 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you complete some task in some scheme and you change some database entries around or something like that.

4199.742 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you keep repeating something over and over again, and that's your job.

4205.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So basically, you do want to bring in the task horizon, how long it takes to perform a task.

4209.355 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then you want to also remove context, like you're not dealing with different parts of services of companies or other customers.

4214.5 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's just the database, you and a person you're serving.

4220.505 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's more closed.

4223.188 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's more understandable.

4224.069 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's purely digital.

4225.73 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I would be looking for those things.

4227.151 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But even there, I'm not actually looking at full automation yet.

4228.573 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm looking for an autonomy slider.

4231.555 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I almost expect that we are not going to,

4233.217 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

instantly replace people.

4235.639 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We're going to be swapping in AIs that do 80% of the volume.

4237.14 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They delegate 20% of the volume to humans.

4240.704 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And humans are supervising teams of five AIs doing the call center work that's more rote.

4242.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I would be looking for new interfaces or new companies that provide some kind of a layer that allows you to manage some of these AIs that are not yet perfect.

4247.79 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I would expect that across the economy.

4258.68 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And a lot of jobs are a lot harder than call center employee.

4260.321 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I think that's an interesting question.

4337.97 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't think we're currently seeing that with radiology.

4341.736 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I don't have, like...

4343.819 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

In my understanding, but I think radiology is not a good example, basically.

4346.103 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know why Jeff Hinton picked on radiology because I think it's an extremely messy, complicated profession.

4349.45 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I would be a lot more interested in what's happening with call center employees today, for example, because I would expect a lot of the road stuff to be automatable today.

4356.526 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I don't have a first level access to it, but maybe I would be looking for trends of what's happening with the call center employees.

4363.822 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe some of the things I would also expect is maybe they are swapping in AI, but then I would still wait for a year or two because I would potentially expect them to pull back and actually rehire some of the people.

4370.035 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think there's an interesting point here because I do believe coding is like the perfect first thing for these LLMs and agents.

4454.467 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's because coding has always fundamentally worked around text.

4463.901 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's computer terminals and text, and everything is based around text.

4468.548 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And LLMs, the way they're trained on the internet, love text.

4471.872 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so they're perfect text processors, and there's all this data out there, and it's just a perfect fit.

4475.777 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And also we have a lot of infrastructure pre-built for handling code and text.

4480.863 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, for example, we have Visual Studio Code or, you know, your favorite IDE showing you code.

4484.988 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And an agent can plug into that.

4491.937 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So for example, if an agent has a diff where it made some change, we suddenly have all this code already that shows all the differences to a code base using a diff.

4494.3 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's almost like we've pre-built a lot of the infrastructure for code.

4502.268 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Now contrast that with some of the things that don't enjoy that at all.

4507.314 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So as an example, like there's people trying to build automation, not for coding, but for example, for slides.

4510.437 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like I saw a company doing slides.

4515.363 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's much, much harder.

4516.904 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the reason it's much, much harder is because slides are not text.

4518.226 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Slides are little graphics, and they're arranged spatially, and there's visual components to it.

4521.029 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And slides don't have this pre-built infrastructure.

4526.035 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, for example, if an agent is to make a different change to your slides, how does a thing show you the diff?

4530.28 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How do you see the diff?

4536.007 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's nothing that shows diffs for slides.

4536.728 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Someone has to build it.

4539.531 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's just some of these things are not amenable to AIs as they are, which is text processors.

4541.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And code, surprisingly, is.

4547.24 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I think that makes sense.

4618.669 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I would say, yeah, I'm not saying that anything text is trivial, right?

4620.791 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that code is like, it's pretty structured.

4626.598 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Text is maybe a lot more flowery and there's a lot more like entropy in text, I would say.

4629.822 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know how else to put it.

4636.509 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And also, I mean, code is hard.

4637.49 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so people sort of feel quite empowered by LLMs, even from like simple kind of knowledge.

4640.053 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I basically, I don't actually know that I have a very good answer.

4647.541 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, obviously like text makes it much, much easier maybe is maybe why I put it, but it doesn't mean that all text is trivial.

4650.704 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess I see it as like a progression of automation in society, right?

4666.61 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And again, like extrapolating the trend of computing.

4670.457 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I just feel like there will be a gradual automation of a lot of things, and superintelligence will be sort of like the extrapolation of that.

4672.821 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I do think we expect more and more autonomous entities over time that are doing a lot of the digital work, and then eventually even the physical work, probably some amount of time later.

4678.632 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But basically, I see it as just automation.

4687.427 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, but some of the things that people do is invent new things, which I would just put into the automation, if that makes sense.

4696.428 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, it is fundamentally automation, but I mean, it will be like extremely foreign.

4731.812 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think it will look really strange because like you mentioned, we can run all of this on a computer cluster, et cetera, and much faster and all this thing.

4735.34 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

4744.02 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, maybe some of the scenarios, for example, that I start to get, like, nervous about with respect to when the world looks like that is this kind of, like, gradual loss of control and understanding of what's happening.

4744.1 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think that's actually the most likely outcome, probably, is that there will be a gradual loss of understanding of... And we'll gradually layer all this stuff everywhere, and there'll be fewer and fewer people who understand it, and that there will be a sort of this, like, scenario of a gradual loss of control and understanding of what's happening.

4753.172 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That, to me, seems most likely outcome of how all of this stuff will go down.

4768.553 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I think that's fair.

4807.779 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's a good pushback.

4808.82 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think, like, I guess I expect loss of both power.

4809.581 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So we're really far into a territory of... I don't know what this looks like, but if I was to write sci-fi novels, they would look along the lines of not even a single entity or something like that, that just sort of takes over everything, but actually multiple competing entities that gradually become more and more autonomous, and some of them go rogue, and the others fight them off, and all this kind of stuff.

4820.642 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's like this hot pot of...

4842.465 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

completely autonomous activity that we've delegated to.

4844.928 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I kind of feel like it would have that flavor.

4849.294 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I basically expect there to be, I mean, a lot of these things, I mean, they will be tools to people and the people could, some of the population is like, they're acting on behalf of people or something like that.

4867.928 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe those people are in control, but maybe it's a loss of control overall for society in the sense of like outcomes we want or something like that, where you have entities acting on behalf of individuals that are still kind of roughly seen as out of control.

4877.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess what I mean is...

4922.428 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do, but it's business as usual because we're in an intelligence explosion already and have been for decades.

4924.713 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And when you look at GDP, it's basically the GDP curve that is an exponential weighted sum over so many aspects of the industry.

4930.798 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everything is gradually being automated, has been for hundreds of years.

4936.824 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Industrial revolution is automation and some of the physical components and the tool building and all this kind of stuff.

4940.507 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Compilers are early software automation, et cetera.

4944.671 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I kind of feel like we've been recursively self-improving and exploding for a long time.

4947.654 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe another way to see it is,

4953.119 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, Earth was a pretty, I mean, if you don't look at the biomechanics and so on, it was a pretty boring place, I think, and looked very similar if you just look from space.

4954.7 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And Earth is spinning, and then, like, we're in the middle of this, like, firecracker event.

4963.153 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Right.

4967.42 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But we're seeing it in slow motion.

4967.56 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I definitely feel like this has already happened for a very long time.

4969.063 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And, again, like, I don't see AI as, like, a distinct technology with respect to what has already been happening for a long time.

4973.95 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This was very interesting to me because I was trying to find AI in the GDP for a while.

4984.62 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I thought that GDP should go up.

4989.126 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But then I looked at some of the other technologies that I thought were very transformative, like maybe computers or mobile phones or etc.

4990.828 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can't find them in GDP.

4998.097 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

GDP is the same exponential.

4999.358 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's just that even, for example, the early iPhone didn't have the App Store and it didn't have a lot of the bells and whistles that the modern iPhone has.

5000.6 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so even though we think of 2008, was it, when iPhone came out as like some major seismic change, it's actually not.

5007.212 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everything is like so spread out and so slowly diffuses that everything ends up being averaged up into the same exponential.

5013.083 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's the exact same thing with computers.

5018.914 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can't find them in the GDP.

5020.256 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's like, oh, we have computers now.

5021.318 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's not what happened because it's such a slow progression.

5022.841 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And with AI, we're going to see the exact same thing.

5025.264 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's just more automation.

5026.766 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It allows us to write different kinds of programs that we couldn't write before.

5028.148 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But AI is still fundamentally a program.

5031.253 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's a new kind of computer and a new kind of computing system.

5033.536 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But it has all these problems.

5038.563 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to diffuse over time.

5039.745 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's still going to add up to the same exponential.

5041.627 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And we're still going to have an exponential that's going to get extremely vertical.

5043.63 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's going to be very foreign to live in that kind of an environment.

5047.936 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Basically, I guess what I'm saying is for a while, I tried to find AI or look for AI in like the GDP curve.

5101.36 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of convinced myself that this is false.

5106.087 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that even when people talk about recursive self-improvement and labs and stuff like that, I even don't, this is business as usual.

5108.131 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Of course, it's going to recursively self-improve and it's been recursively self-improving.

5113.279 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like LLMs allow the engineers to work much more efficiently to build the next round of LLM.

5117.125 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And a lot more of the components are being automated and tuned and et cetera.

5122.834 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So,

5125.778 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

all the engineers having access to Google search is sort of part of it.

5126.099 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

All the engineers having an ID, all of them having autocomplete or having cloth code, et cetera.

5130.767 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's all just part of the same speed up of the whole thing.

5135.114 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So it's just so smooth.

5137.758 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, my expectation is that it stays the same pattern.

5154.075 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, maybe a counterpoint.

5204.777 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, number one, I'm actually pretty willing to be convinced one way or another on this point.

5206.079 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I will say, for example, computing is labor.

5211.227 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Computing was labor.

5213.692 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Computers, like, a lot of jobs disappeared because computers are automating a bunch of digital information processing that you now don't need a human for.

5214.433 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so computers are labor.

5221.645 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that has played out.

5223.528 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

5226.493 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And, you know, self-driving as an example is also like computers doing labor.

5226.573 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, like, I guess that's already been playing out.

5230.497 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's still business as usual.

5232.62 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I kind of...

5254.804 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I see where it's coming from.

5255.885 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

At the same time, I do feel like people make this assumption of like, okay, we have God in the box and now it can do everything.

5258.047 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it just won't look like that.

5263.872 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to be able to do some of the things.

5265.753 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to fail at some other things.

5267.875 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to be gradually put into society and basically end up with the same pattern, is my prediction.

5268.976 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because this assumption of suddenly having a completely intelligent, fully flexible, fully general human in a box and we can dispense it at arbitrary problems in society, I don't think that we will have this

5273.58 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

like discrete change.

5285.45 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think we'll arrive at the same kind of gradual diffusion of this across the industry.

5287.532 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think I understand, but I still think that you're presupposing some discrete jump, some unlock that we're waiting to claim.

5374.077 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And suddenly we're going to have geniuses in data centers.

5382.091 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I still think you're presupposing some discrete jump that I think has basically no historical precedent that I can't find in any of the statistics and that I think probably won't happen.

5384.515 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm a little bit suspicious.

5400.619 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would have to look at it.

5401.941 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm a little bit suspicious, and I would have to take a look.

5402.942 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For example, maybe some of the logs are not very good from before the Industrial Revolution or something like that.

5405.345 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I'm a little bit suspicious of it, but yeah, maybe you're right.

5410.951 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't have strong opinions.

5414.135 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe you're saying that this was a singular event that was extremely magical, and you're saying that maybe there's going to be another event that's going to be just like that, extremely magical, it will break paradigm, and so on.

5415.877 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's still some overhang that's being unlocked.

5453.623 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like maybe there's a new energy source.

5455.467 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's some unlock, in this case, some kind of a cognitive capacity.

5456.89 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's an overhang of cognitive work to do.

5460.478 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's right.

5463.044 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you're expecting that overhang to be filled by this new technology when it crosses the threshold.

5463.585 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I mean, yeah, it's really hard to tell.

5498.607 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I understand that viewpoint.

5501.954 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't intuitively feel that viewpoint.

5503.518 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I love Nick Fling's books, by the way.

5600.152 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, yeah, I was just listening to his podcast on the way up here.

5601.493 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

With respect to intelligence and its evolution, I do think it came fairly, I mean, it's very, very recent, right?

5605.417 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I am surprised that it evolved.

5611.586 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

5613.75 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I find it fascinating to think about all the worlds out there.

5614.791 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, say, there's a thousand planets like Earth and what they look like.

5616.554 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think Nick Lane was here talking about some of the early parts, right?

5619.178 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, okay, he expects basically very similar life forms, roughly speaking, and bacteria-like things in most of them.

5621.321 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then there's a few breaks in there.

5627.911 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would expect that the evolution of intelligence intuitively feels to me like it should be a fairly rare event.

5630.255 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there have been animals for... I guess maybe you should base it on how long something has existed.

5635.645 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, for example, if bacteria have been around for 2 billion years and nothing happened, then going to your carrier is probably pretty hard because bacteria actually came up quite early in Earth's evolution or history.

5640.473 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I guess, how long have we had animals?

5652.835 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe a couple hundred million years, like multicellular animals that like run, run, crawl, et cetera, which is maybe 10% of Earth's lifespan or something like that.

5655.021 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe on that timescale, it's actually not too tricky.

5663.665 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still feel like

5667.576 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's still surprising to me, I think, intuitively that it developed.

5669.66 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I would maybe expect just a lot of like animal-like life forms doing animal-like things.

5672.126 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The fact that you can get something that creates culture and knowledge and accumulates it, it is surprising to me.

5676.277 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yes.

5731.928 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Basically, it's so hard to tell, right, with any of this stuff.

5732.429 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess you can base it a little bit on how long something has existed or how long it feels like something has been bottlenecked.

5734.531 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So Nicolain is very good about describing this, like, very apparent bottleneck in bacteria and archaea.

5739.576 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For two billion years, nothing happened.

5744.382 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, extreme diversity of chemical, of biochemistry, and yet nothing that grows to become animals.

5745.683 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Two billion years.

5752.811 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know that we've seen exactly that kind of an equivalent with animals and intelligence to your point, right?

5754.693 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I guess maybe we could also look at it with respect to how many times we think evolution or intelligence has like individually sprung up.

5760.186 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's a really good thing to investigate.

5767.147 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe one thought on that is, I almost feel like, well, there's the hominid intelligence.

5769.611 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's, I would say, like the bird intelligence, right?

5774.739 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like ravens, et cetera, are extremely clever.

5777.684 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But their brain parts are actually quite distinct, and we don't have that much existence.

5779.507 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So maybe that's a slight event of, there's a slight indication of maybe intelligence springing up a few times.

5784.936 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so in that case, you'd maybe expect it more frequently or something.

5790.384 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, and just stuff to work with.

5849.197 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I'm guessing it would be harder to, if I was a dolphin, I mean, how do you do, you can't have fire, for example, and stuff like that.

5850.199 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, probably like the universe of things you can do in water, like inside water, is probably lower than what you can do on land.

5856.77 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

just chemically.

5863.081 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, I do agree with this viewpoint of these niches and what's being incentivized.

5864.503 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still find it kind of miraculous that I don't, I would have maybe expected things to get stuck on like animals with bigger muscles, you know?

5869.31 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

5877.862 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like going through intelligence is actually a really fascinating breaking point.

5878.303 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, exactly.

5908.311 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You have to incentivize some kind of adaptability.

5908.752 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You actually want environments that are unpredictable.

5910.555 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So evolution can't bake your algorithms into your weights.

5913.459 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of animals are basically pre-baked in this sense.

5916.183 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so humans have to figure it out at test time when they get born.

5920.149 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so maybe you actually want these kinds of environments that actually change really rapidly or something like that where you can't foresee what will work well.

5922.852 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so you actually put all that intelligence, you create intelligence to figure it out at test time.

5931.825 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yes and no, because LLMs don't really have the equivalent of culture.

5987.062 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And maybe we're giving them way too much and incentivizing not to create it or something like that.

5990.968 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I guess like the mention of culture and of written record and of like passing down notes between each other, I don't think there's an equivalent of that with LLMs right now.

5994.513 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So LLMs don't really have culture right now.

6002.124 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's kind of like one of the, I think, impediments, I would say.

6004.708 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Can you give me some sense of what LLM culture might look like?

6007.953 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in the simplest case, it would be a giant scratchpad that the LLM can edit.

6011.738 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And as it's reading stuff or as it's helping out with work, it's editing the scratchpad for itself.

6015.623 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why can't an LLM write a book for the other LLMs?

6020.429 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That would be cool.

6023.233 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, why can't other LLMs read this LLM's book and be inspired by it or shocked by it or something like that?

6024.334 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's no equivalence for any of this stuff.

6030.442 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

6031.904 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think there's two powerful ideas in the realm of multi-agent that have both not been like really claimed or so on.

6042.197 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The first one I would say is culture and LLM is basically a growing repertoire of knowledge for their own purposes.

6048.445 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The second one looks a lot more like the powerful idea of self-play.

6055.974 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

in my mind, is extremely powerful.

6058.898 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So evolution actually has a lot of competition, basically, driving intelligence and evolution.

6060.539 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And in AlphaGo, more algorithmically, AlphaGo is playing against itself, and that's how it learns to get really good at Go.

6067.526 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's no equivalent of self-playing LLMs, but I would expect that to also exist, but no one has done it yet.

6075.213 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why can't an LLM, for example, create a bunch of problems that another LLM is learning to solve?

6080.417 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then the LLM is always trying to serve more and more difficult problems.

6085.102 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

stuff like that, you know?

6088.885 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So like, I think there's a bunch of ways to actually organize it.

6089.927 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think it's a realm of research.

6093.152 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think I haven't seen anything that convincingly like claims both of those, like multi-agent improvements.

6095.476 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still think we're mostly in the realm of a single individual agent.

6101.505 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think, I also think that will change.

6104.27 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And in the realm of culture also, I would bucket also organizations.

6106.674 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And we haven't seen anything like that convincingly either.

6111.141 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's why we're still early.

6114.246 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Somehow remarkably, again, some of these analogies work and they shouldn't, but somehow remarkably they do.

6125.437 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of the smaller models or the dumber, like the smaller models somehow remarkably resemble like a kindergarten student or then like a elementary school student or high school student, et cetera.

6129.441 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And somehow we still haven't like graduated enough where this stuff can take over.

6139.03 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like it's still mostly like my cloth coat or codex, they still kind of feel like this elementary grade student.

6142.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I know that they can take PhD quizzes, but they still cognitively feel like a kindergarten or an elementary school student.

6148.759 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Interesting.

6154.545 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I don't think they can create culture because they're still kids, you know, like they're savant kids.

6154.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They have perfect memory of all this stuff, et cetera.

6161.216 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they can convincingly create all kinds of slop that looks really good.

6164.641 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still think they don't really know what they're doing and they don't really have the cognition across all these little checkboxes that we still have to collect.

6168.267 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

6194.161 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I would say one thing I will almost instantly also push back on is this is not even near done.

6194.962 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in a bunch of ways that I'm going to get to.

6202.151 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that self-driving is very interesting because it's definitely like where I get a lot of my intuitions because I spent five years on it.

6203.973 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it has this entire history where actually the first demos of self-driving go all the way to the 1980s.

6210.821 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can see a demo from CMU in 1986.

6217.029 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's a truck that's driving itself on roads.

6219.672 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But OK, fast forward.

6222.856 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think when I was joining Tesla, I had a very early demo of Waymo.

6224.539 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it basically gave me a perfect drive in 2014 or something like that.

6229.167 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So perfect Waymo drive a decade ago.

6235.297 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It took us around Palo Alto and so on because I had a friend who worked there.

6237.942 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I thought it was like very close and then still took a long time.

6241.289 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I do think that for some kinds of tasks and jobs and so on, there's a very large demo to product gap where the demo is very easy, but the product is very hard.

6245.197 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it's especially the case in cases like self-driving where the cost of failure is too high, right?

6256.1 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Many industries, tasks, and jobs maybe don't have that property, but when you do have that property, that definitely increases the timelines.

6262.732 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that, for example, in software engineering, I do actually think that that property does exist.

6269.584 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think for a lot of vibe coding, it doesn't.

6274.072 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think if you're writing actual production grade code, I think that property should exist because any mistake actually leads to security vulnerability or something like that.

6276.236 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Millions and hundreds of millions of people's personal social security numbers, etc, get leaked or something like that.

6283.629 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that it is a case that in software, people should be careful.

6289.118 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Kind of like in self-driving.

6293.005 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like in self-driving, if things go wrong, you might get injury.

6295.588 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess there's worse outcomes.

6298.712 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I guess in software, I almost feel like it's almost unbounded how terrible some things could be.

6301.415 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I do think that they share that property.

6308.603 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think basically what takes the long amount of time and the way to think about it is that it's a march of nines, and every single nine is a constant amount of work.

6310.225 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So every single nine is the same amount of work.

6319.803 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So when you get a demo and something works 90% of the time, that's just the first nine.

6321.806 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then you need a second nine, a third nine, fourth nine, fifth nine.

6328.254 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And while I was at Tesla for, was it five years or so, I think we went through maybe three nines, two nines, I don't know what it is.

6330.336 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But like multiple nines of iteration, there's still more nines to go.

6335.563 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so that's why these things take so long.

6338.827 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's definitely formative for me, like seeing something that was a demo.

6342.432 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm very unimpressed by demos.

6346.317 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So whenever I see demos of anything, I'm extremely unimpressed by that.

6349.06 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It works better if you can, if it's a demo that someone cooked up and is just showing you its worst.

6352.985 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you can interact with it, it's a bit better.

6357.57 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But even then, you're not done.

6359.192 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You need actual product.

6360.154 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's going to face all these challenges when it comes in contact with reality and all these different pockets of behavior that need patching.

6361.235 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think we're going to see all that stuff play out.

6367.002 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a march of nines.

6369.287 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Each nine is constant.

6370.349 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Demos are encouraging.

6372.032 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Still a huge amount of work to do.

6373.295 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think it is a kind of a critical safety domain, unless you're doing vibe coding, which is all nice and fun and so on.

6374.377 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so that's why I think this also enforced my timelines from that perspective.

6382.533 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, it's a much harder problem.

6431.798 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, self-driving is just one of thousands of things that people do.

6432.76 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's almost like a single vertical, I suppose.

6436.164 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Whereas when we're talking about general software engineering, it's even more, there's more surface area.

6438.848 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, basically, I'm not 100% sure if I fully agree with that.

6490.105 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know how much we're getting for free, and I still think there's a lot of gaps in understanding in what we are getting.

6492.869 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, we're definitely getting more generalizable intelligence in a single entity, whereas self-driving is a very special-purpose task that requires, in some sense, building a special-purpose task is maybe even harder in a certain sense because it doesn't fall out from a more general thing that you're doing at scale, if that makes sense.

6498.397 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So...

6514.22 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still think that the analogy doesn't, I still don't know if it fully resonates because like the LLMs are still pretty fallible and I still think that they have a lot of gaps and that it still needs to be filled in.

6515.722 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I don't think that we're getting like magical generalization completely out of the box sort of in a certain sense.

6524.973 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the other aspect that I want to also actually return to when I was in the beginning was self-driving cars are nowhere near done still.

6530.88 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Mm-hmm.

6538.889 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So even though, so the deployments still are pretty minimal, right?

6539.009 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So even Waymo and so on has very few cars, and they're doing that, roughly speaking, because they're not economical, right?

6542.794 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because they've built something that lives in the future.

6548.241 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so they had to, like, pull back future, but they had to make it uneconomical.

6551.104 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So they have all these, like, you know, there's all these costs, not just marginal costs for those cars and their operation and maintenance, but also the capex of the entire thing.

6555.33 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So making it economical is still going to be a slog, I think, for them.

6564.662 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And

6568.086 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then also I think when you look at these cars and there's no one driving, I also think it's a little bit deceiving because there are actually very elaborate teleoperation centers of people actually kind of like in a loop with these cars.

6568.827 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I don't have the full extent of it, but I think...

6580.573 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

there's more human in the loop that you might expect.

6583.88 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's people somewhere out there basically beaming in from the sky.

6585.862 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I don't actually know that they're fully in the loop with the driving.

6589.987 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think some of the times they are, but they're certainly involved and there are people.

6592.83 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And in some sense, we haven't actually removed the person.

6595.733 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

We've like moved them to somewhere where you can't see them.

6597.515 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still think there will be some work, as you mentioned, going from environment to environment.

6600.218 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think like there's still challenges to make self-driving real.

6603.402 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do agree that it's definitely across the threshold where it kind of feels real, unless it's like really teleoperated.

6607.367 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

For example, Waymo can't go to all the different parts of the city.

6613.073 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

My suspicion is it's like parts of city where you don't get a good signal.

6616.999 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Anyway, so basically, I don't actually know anything about the stack.

6621.205 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I'm just making up stuff.

6624.289 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Sorry, I don't know anything about the specifics of Waymo.

6629.517 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel like I talk about them.

6631.18 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I actually, by the way, love Waymo, and I take it all the time.

6632.521 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I don't want to say, like, I just think people, again, are sometimes a little bit too naive about some of the progress, and I still think there's a huge amount of work.

6634.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think Tesla took, in my mind, a lot more scalable approach.

6642.436 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

6645.519 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think the team is doing extremely well and is going to and I I'm kind of like on the record for predicting how this thing will go, which is like when we had like early start because you can package up so many sensors.

6645.639 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do think Tesla is taking the more scalable strategy and is going to look a lot more like that.

6655.949 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think this will have to still play out and hasn't.

6660.414 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But basically, like I don't want to talk about self-driving or something that took a decade because it didn't take it didn't take it.

6663.257 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If that makes sense.

6669.503 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, the end is not near yet.

6676.579 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because when we're talking about self-driving, usually in my mind, it's self-driving at scale.

6678.082 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

People don't have to get a driver's license, etc.

6683.113 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think that's right.

6760.156 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think if you're sticking in the realm of bits, bits are like a million times easier than anything that touches the physical world.

6760.677 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I definitely grant that.

6767.465 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Bits are completely changeable, arbitrarily reshuffleable at a very rapid speed.

6769.527 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So you would expect a lot more faster adaptation also in the industry and so on.

6774.393 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then what was the first one?

6779.98 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think that's roughly right.

6785.009 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I also think that if we are talking about knowledge work at scale, there will be some latency requirements, practically speaking, because we're going to have to create a huge amount of compute and serve that.

6785.911 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think the last aspect that I very briefly want to also talk about is all the rest of it.

6796.472 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

just all the rest of it.

6802.664 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So what does society think about it?

6803.665 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What is the legal, how is it working legally?

6806.367 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How is it working insurance-wise?

6809.47 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Who's really, like, what is the, what are those layers of it and aspects of it?

6810.891 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What happens with, what is the equivalent of people putting a cone on a Waymo?

6815.736 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You know, there's going to be equivalents of all that.

6819.519 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I do think that, I almost feel like self-driving is a very nice analogy that you can borrow things from.

6822.061 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, what is the equivalent of a cone on a car?

6827.987 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What is the equivalent of a tele-operating worker who's, like, hidden away?

6829.648 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And almost like all the aspects of it.

6832.651 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Kind of like what happened with railroads and all this kind of stuff.

6858.648 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

With what, sorry?

6861.032 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Was it railroads?

6861.653 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Sorry.

6862.414 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

6863.355 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There is like historical precedent or was it with telecommunication industry, right?

6863.836 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like prepaving the internet that only came like a decade later, you know, and creating like a whole bubble in the telecommunications industry in the late 90s kind of thing.

6867.541 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

6876.012 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I don't know.

6877.594 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I understand I'm sounding very pessimistic here.

6879.115 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm only doing that.

6882.44 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm actually optimistic.

6883.541 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think this will work.

6884.422 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it's tractable.

6885.083 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm only sounding pessimistic because when I go on my Twitter timeline, I see all this stuff that makes no sense to me.

6886.144 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And

6892.012 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think there's a lot of reasons for why that exists.

6893.654 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think a lot of it is, I think, honestly, just fundraising.

6896.14 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's just incentive structures.

6899.006 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of it may be fundraising.

6900.73 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of it is just attention, you know, converting attention to money on the internet, you know, stuff like that.

6902.013 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think there's a lot of that going on, and I think I'm only reacting to that.

6909.55 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I'm still, like, overall very bullish on technology.

6914.798 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think we're going to work through all this stuff, and I think there's been a rapid amount of progress.

6917.302 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know that there's overbuilding.

6921.288 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think that we're going to be able to gobble up what, in my understanding, is being built.

6923.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because I do think that, for example, cloud code or OpenAI Codex and stuff like that, they didn't even exist a year ago, right?

6928.599 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Is that right?

6934.068 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it's roughly right.

6934.508 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is miraculous technology that didn't exist.

6935.69 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think there's going to be a huge amount of demand as we see the demand in ChashiPT already and so on.

6938.614 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So yeah, I don't actually know that there's overbuilding, but I guess I'm just reacting to some of the very fast timelines that people continue to say incorrectly.

6944.382 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I've heard many, many times over the course of my 15 years in AI where very reputable people keep getting this wrong all the time.

6953.816 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think I want us to be properly calibrated.

6961.947 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think some of this also, it does have like geopolitical ramifications and things like that when, like some of these questions.

6964.552 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think I don't want people to make mistakes on that sphere of things.

6970.764 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I do want us to be grounded in reality of what technology is and isn't.

6974.791 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess maybe, like, the way I would put it is...

6996.458 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I feel some amount of like determinism around the things that AI labs are doing.

6999.46 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I feel like I could help out there, but I don't know that I would like uniquely... I don't know that I would like uniquely improve it.

7003.946 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think like my personal big fear is that a lot of this stuff happens on the side of humanity and that humanity gets disempowered by it.

7012.939 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of like...

7020.11 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I care not just about all the Dyson spheres that we're going to build and that AI is going to build in a fully autonomous way.

7022.393 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I care about what happens to humans.

7027.041 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I want humans to be well off in this future.

7029.385 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I feel like that's where I can a lot more uniquely add value than like an incremental improvement in the frontier lab.

7031.909 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I guess I'm most afraid of something maybe like depicted in movies like WALL-E or Idiocracy or something like that, where humanity is sort of on the side of this stuff.

7037.819 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

and I want humans to be much, much better in this future.

7047.335 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I guess, to me, this is kind of like through education that you can actually achieve this.

7051.525 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Oh, yeah.

7058.622 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So Eureka is trying to build, I think maybe the easiest way I can describe it is we're trying to build the Starfleet Academy.

7059.023 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know if you've watched Star Trek.

7064.176 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I haven't, but yeah.

7066.459 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Okay, Starfleet Academy is this elite institution for frontier technology, building spaceships and graduating cadets to be the pilots of these spaceships and whatnot.

7067.54 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I just imagine an elite institution for technical knowledge and basically a kind of school that's very up-to-date and very like a premier institution.

7077.291 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7107.595 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

With respect to Eureka, I think one thing that is very fascinating to me about education is I do think education will pretty fundamentally change with AIs on the side.

7108.476 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think it has to be rewired and changed to some extent.

7116.266 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still think that we're pretty early.

7120.091 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think there's going to be a lot of people who are going to try to do the obvious things, which is like, oh, have an LLM and ask it questions and do all the basic things that you would do via prompting right now.

7121.833 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it's helpful, but it still feels to me a bit like slop.

7131.065 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'd like to do it properly, and I think the capability is not there for what I would want.

7134.089 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What I'd want is like an actual tutor experience.

7137.694 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe a prominent example in my mind is I was recently learning Korean, so language learning.

7142.78 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I went through a phase where I was learning Korean by myself on the internet.

7148.708 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I went through a phase where I was actually part of a small class in Korea, taking Korean with a bunch of other people, which was really funny.

7152.032 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But we had a teacher and like 10 people or so taking Korean.

7158.481 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I switched to a one-on-one tutor.

7161.365 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I guess what was fascinating to me is I think I had a really good tutor, but I mean, just thinking through like what this tutor was doing for me and how incredible that experience was and how high the bar is for like what I actually want to build eventually.

7164.008 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because I mean, she was extremely, so she instantly from a very short conversation understood like where I am as a student, what I know and don't know.

7179.908 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And she was able to like probe exactly like the kinds of questions or things to understand my world model.

7187.317 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

No LLM will do that for you 100% right now, not even close, right?

7192.544 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But a tutor will do that if they're good.

7196.029 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Once she understands, she actually like really served me all the things that I needed at my current sliver of capability.

7198.333 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I need to be always appropriately challenged.

7204.503 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I can't be faced with something too hard or too trivial.

7206.286 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And a tutor is really good at serving you just the right stuff.

7209.591 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so basically I felt like I was the only constraint to learning, like my own.

7212.355 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I was the only constraint.

7215.983 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I was always given the perfect information.

7217.005 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm the only constraint.

7218.508 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I felt good because I'm the only impediment that exists.

7220.071 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's not that I can't find knowledge or that it's not properly explained or et cetera.

7222.737 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like it's just my ability to memorize and so on.

7225.764 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this is what I want for people.

7228.089 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How do you automate that?

7230.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So a very good question about the current capability, you don't.

7231.829 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do think that with, and that's why I think it's not actually the right time to actually build this kind of an AI tutor.

7234.493 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I still think it's a useful product, and lots of people will build it.

7241.102 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still feel like the bar is so high, and the capability is not there.

7244.648 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I mean, even today, I would say ChachiBT is an extremely valuable educational product.

7251.117 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think for me, it was so fascinating to see how high the bar is.

7257.084 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And when I was with her, I almost felt like, there's no way I can build this.

7260.148 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Anyone who's had a really good tutor is like, how are you going to build this?

7265.795 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess I'm waiting for that capability.

7271.002 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that in a lot of ways in the industry, for example, I did some AI consulting for computer vision.

7273.144 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

A lot of my times, the value that I brought to the company was telling them not to use AI.

7278.691 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It wasn't like, I was the AI expert, and they described a problem, and I said, don't use AI.

7282.955 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This was my value add.

7287.299 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I feel like it's the same in education right now, where I kind of feel like, for what I have in mind, it's not yet the time, but the time will come.

7288.4 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But for now, I'm building something that looks maybe a bit more conventional, that has a physical and digital component and so on.

7295.506 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think there's obvious, it's obvious how this should look like in the future.

7301.892 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, so I'm building the first course, and I want to have a really, really good course.

7312.465 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

State-of-the-art, obvious state-of-the-art destination you go to learn AI, in this case, because that's just what I'm familiar with, so I think it's a really good first product to get to be really good.

7317.313 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so that's what I'm building, and NanoChat, which you briefly mentioned, is a capstone project of LLM101N, which is a class that I'm building.

7325.826 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So that's a really big piece of it, but now I have to build out a lot of the intermediates, and then I have to actually, like, hire a small team of, you know, TAs and so on, and actually, like, build the entire course.

7332.016 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And maybe one more thing that I would say is, like, many times when people think about education, they think about sort of, like, the more...

7342.798 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

what I would say is like kind of a softer component of like diffusing knowledge or like, but I actually have something very hard and technical in mind.

7348.891 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so in my mind, education is kind of like the very difficult technical like process of building ramps to knowledge.

7355.938 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So in my mind, NanoChat is a ramp to knowledge because it's a very simple, it's like the super simplified full stack thing.

7362.645 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If you give this artifact to someone and they like look through it, they're learning a ton of stuff.

7369.613 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7373.837 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so it's giving you a lot of what I call Eurekas per second, which is like understanding per second.

7373.917 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's what I want.

7379.684 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Lots of Eurekas per second.

7380.485 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so to me, this is a technical problem of how do we build these ramps to knowledge.

7382.086 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I always think of Eureka as almost like a, it's not like maybe that different, maybe through some of the frontier labs or some of the work that's going to be going on, because I want to figure out how to build these frontier, these ramps very efficiently so that people are never stuck.

7386.151 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And everything is always not too hard or not too trivial.

7400.107 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you have just the right material to actually progress.

7404.512 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, I think you always have to be calibrated to what the capability, what capability exists in the industry.

7442.547 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think a lot of people are going to pursue like, oh, just ask Chachi PT, et cetera.

7446.733 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think like right now, for example, if you go to Chachi PT and you say, oh, teach me AI, there's no way.

7450.898 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, it's going to give you some slop, right?

7455.504 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like when I, AI is never going to write nano chat right now, but nano chat is a really useful, I think, intermediate point.

7457.667 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I still, I'm collaborating with AI to create all this material.

7463.455 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So AI is still fundamentally very helpful.

7467.422 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Earlier on, I built a CS231 at Stanford, which was one of the earlier, actually, sorry, I think it was the first deep learning class at Stanford, which became very popular.

7470.067 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the difference in building out 231N and LLM 101N now is quite stark because I feel really empowered by the LLMs as they exist right now, but I'm very much in the loop.

7478.862 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So they're helping me build little materials.

7489.251 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I go much faster.

7490.893 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They're doing a lot of the boring stuff, et cetera.

7491.674 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I feel like I'm developing the course much faster and those LLM infused in it, but it's not yet at a place where I can creatively create the content.

7494.736 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm still there to do that.

7501.743 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think the trickiness is always calibrating yourself to what exists.

7503.124 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7520.609 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think it would change over time.

7520.73 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think right now it would be hiring faculty to help work hand-in-hand with AI and a team of people probably to build state-of-the-art courses.

7521.851 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7532.208 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I think over time it can, maybe some of the TAs can actually become AIs because some of the TAs like, okay, you just take all the course materials and then I think you could serve a very good like automated TA for the student when they have more basic questions or something like that, right?

7532.288 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I think you'll need faculty for the overall architecture of a course and making sure that it fits.

7545.941 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I kind of see a progression of how this will evolve and maybe at some future point, you know, I'm not even that useful and AI is doing most of the design much better than I could.

7551.906 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I still think that that's going to take some time to play out.

7559.373 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

No, I will hire faculty, I think, because there are domains in which I'm not an expert.

7582.812 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think that's the only way to offer the state-of-the-art experience for the student, ultimately.

7586.88 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, yeah, I do expect that I would hire faculty, but I will probably stick around in AI for some time.

7591.59 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do have something, I think, more conventional in mind for the current capability, I think, than what people would probably anticipate.

7598.063 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And when I'm building Starfleet Academy, I do probably imagine a physical institution and maybe a tier below that, a digital offering that is not the state-of-the-art experience you would get when someone comes in physically full-time and we work through material from start to end and make sure you understand it.

7604.176 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's the physical offering.

7621.735 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The digital offering is, yeah, a bunch of stuff on the internet and maybe some LLM assistant and it's a bit more gimmicky and a tier below, but at least it's accessible to like eight billion people.

7623.717 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7648.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7648.945 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think there's going to have to be a lot of not just education, but also re-education.

7649.345 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I would love to help out there because I think the jobs will probably change quite a bit.

7652.869 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so, for example, today, a lot of people are trying to upskill in AI specifically.

7657.815 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think it's a really good course to teach in this respect.

7661.358 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And yeah, I think the motivation-wise, before AGI, motivation is very simple to solve because people want to make money.

7664.081 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And this is how you make money in the industry today.

7672.611 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think post-AGI is a lot more interesting, possibly, because, yeah, if everything is automated and there's nothing to do for anyone, why would anyone go to a school, et cetera?

7674.713 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think, I guess, like, I often say that pre-AGI education is useful.

7685.134 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Post-AGI education is fun.

7690.285 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And in a similar way, as people, for example, people go to gym today, but we don't need their physical strength to manipulate heavy objects because we have machines that do that.

7692.77 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

They still go to gym.

7703.311 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Why do they go to gym?

7703.972 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, because it's fun, it's healthy, and you look hot when you have a six-pack.

7704.894 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know.

7709.623 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I guess like...

7710.685 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I guess what I'm saying is it's attractive for people to do that in a certain like very deep psychological evolutionary sense for humanity.

7711.687 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I kind of think that education will kind of play out in the same way.

7720.54 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like you'll go to school, like you go to gym.

7723.525 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think that right now, I think not that many people learn because learning is hard.

7725.408 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You bounce from material.

7731.717 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And some people overcome that barrier, but for most people, it's hard.

7733.06 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I do think that it's a technical problem to solve.

7736.286 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a technical problem to do what my tutor did for me when I was learning Korean.

7739.312 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it's tractable and buildable, and someone should build it.

7743.882 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think it's going to make learning anything trivial and desirable, and people will do it for fun because it's trivial.

7745.846 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

If I had a tutor like that for any arbitrary piece of, like, knowledge, I think it's going to be so much easier to learn anything.

7752.018 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And people will do it.

7757.511 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they'll do it for the same reasons they go to gym.

7758.433 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think this, so I do definitely feel like people will be, I do think like eventually it's a bit of a losing game, if that makes sense.

7784.293 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that it is in long term.

7791.961 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7794.464 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Long term, which I think is longer than I think maybe most people in the industry.

7794.624 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a losing game.

7797.968 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that people can go so far and that we barely scratch the surface of how much a person can go.

7799.169 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And that's just because people are bouncing off of material that's too easy or too hard.

7804.956 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I actually kind of feel that people will be able to go much further.

7808.46 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like anyone speaks five languages, because why not?

7812.966 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because it's so trivial.

7815.068 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Anyone knows, you know, all the basic curriculum of undergrad, et cetera.

7816.37 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I kind of feel like I am betting a little bit implicitly on some of the timelessness of human nature.

7857.573 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

7861.963 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think it will be desirable to do all these things.

7862.484 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think people will look up to it as they have for millennia.

7869.822 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think this will continue to be true.

7874.593 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And actually also maybe there's some evidence of that historically because if you look at, for example, aristocrats or you look at maybe ancient Greece or something like that, whenever you had little pocket environments that were post-AGI in a certain sense,

7876.457 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do feel like people have spent a lot of their time flourishing in a certain way, either physically or cognitively.

7886.14 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I think I feel okay about the prospects of that.

7892.367 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think if this is false and I'm wrong and we end up in like, you know, WALL-E or idiocracy future, then I think it's very, I don't even care if there's like Dyson spheres.

7896.091 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This is a terrible outcome.

7905.822 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like I actually really do care about humanity.

7908.545 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like,

7910.087 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everyone has to just be superhuman in a certain sense.

7910.828 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, maybe.

7943.313 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually think that... I think there will be a transitionary period where we are going to be able to be in the loop and advance things if we actually understand a lot of stuff.

7944.614 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do think that long-term, that probably goes away, right?

7952.984 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But maybe it's going to even become a sport.

7955.386 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like right now, you have powerlifters who go extreme on this direction.

7958.43 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So what is powerlifting in a cognitive era?

7962.054 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe it's people who are really trying to make Olympics out of knowing stuff.

7965.117 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like...

7969.342 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And if you have a perfect AI tutor, maybe you can get extremely far.

7970.363 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I almost feel like we're just barely... The geniuses of today are barely scratching the surface of what a human mind can do, I think.

7975.267 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'm similar for that matter.

7999.709 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I mean, a lot of people, for example, hate school and want to get out of it.

8000.73 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I really liked school.

8004.396 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I loved learning things, et cetera.

8006.359 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I wanted to stay in school.

8007.64 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I stayed all the way until PhD, and then they wouldn't let me stay longer, so I went to the industry.

8008.401 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I mean, basically, roughly speaking, I love learning, even for the sake of learning, but I also love learning because it's a form of empowerment and being useful and productive.

8012.908 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because it feels bad to bounce from material.

8054.254 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It feels bad.

8056.236 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You get negative reward from sinking an amount of time in something and this doesn't pan out.

8057.117 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Or like being completely bored because what you're getting is too easy or too hard.

8062.141 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think, yeah, I think when you actually do it properly, learning feels good.

8065.685 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think it's a technical problem to get there.

8070.41 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think for a while it's going to be AI plus human collab, and at some point maybe it's just AI.

8071.951 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think that's a pretty broad topic.

8100.74 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do feel like there's basically, I almost feel like there are 10, 20 tips and tricks that I kind of semi-consciously probably do.

8102.763 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I guess like on a high level, I always try to, I think a lot of this comes from my physics background.

8108.353 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I really, really did enjoy my physics background.

8115.845 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I have a whole rant when I think how everyone should learn physics in early school education, because I think early school education is not about

8117.708 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

crumbling knowledge or memory for tasks later in the industry.

8125.782 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's about booting up a brain.

8128.968 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I think physics uniquely boots up the brain the best because some of the things that they get you to do in your brain during physics is extremely valuable later.

8130.23 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The idea of building models and abstractions and understanding that there's a first order of approximation that describes most of the system, but then there's a second order, third order, first order terms that may or may not be present.

8138.184 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And the idea that you're observing like a very noisy system, but actually there's like these fundamental frequencies that you can abstract away.

8148.462 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like when a physicist walks into the class and they say, assume there's a spherical cow and dot, dot, dot.

8154.82 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And everyone laughs at that, but actually it's brilliant.

8160.195 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's brilliant thinking.

8162.261 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's very generalizable across the industry because, yeah, cows can be approximated as a sphere, I guess, in a bunch of ways.

8163.464 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's a really good book, for example, Scale.

8170.432 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's basically from a physicist talking about biology.

8172.735 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And maybe this is also a book I would recommend reading.

8175.758 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But you can actually get a lot of really interesting approximations and chart scaling laws of animals.

8177.38 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you can look at their heartbeats and things like that, and they actually line up with the size of the animal and things like that.

8182.907 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can talk about an animal as a volume, and you can actually derive a lot of...

8188.934 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can talk about the heat dissipation of that because your heat dissipation grows as the surface area, which is growing a square, but your heat creation or generation is growing as a cube.

8192.438 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I just feel like physicists have all the right cognitive tools to approach problem solving in the world.

8203.668 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I think because of that training, I always try to find the first order terms or the second order terms of everything.

8207.652 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

When I'm observing a system or a thing, I have a tangle of a web of ideas or knowledge in my world, in my mind.

8212.796 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I'm trying to find what is the thing that actually matters?

8218.421 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

What is the first order component?

8220.884 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How can I simplify it?

8222.425 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

How can I have a simple thing that actually shows that thing, right?

8223.506 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It shows an action.

8226.029 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I can tack on the other terms.

8227.451 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe an example from one of my repos that I think illustrates it well is called micrograd.

8229.333 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't know if you're familiar with this.

8234.66 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So micrograd is 100 lines of code that shows backpropagation.

8236.502 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can create neural networks out of simple operations like plus and times, et cetera, Lego blocks of neural networks.

8240.707 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you build up a computational graph, and you do a forward pass and a backward pass to get the gradients.

8245.933 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Now, this is at the heart of all neural network learning.

8250.218 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So MicroGrad is a 100 lines of pre-interpretable Python code, and it can do forward and backward arbitrary neural networks, but not efficiently.

8253.443 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So MicroGrad, these 100 lines of Python, are everything you need to understand how neural networks train.

8260.994 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everything else is just efficiency.

8265.4 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Everything else is efficiency.

8268.285 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And there's a huge amount of work to do efficiency.

8269.567 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You need your tensors, you lay them out, you stride them, you make sure your kernels are orchestrating memory movement correctly, et cetera.

8271.189 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's all just efficiency, roughly speaking.

8276.617 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But the core intellectual sort of piece of neural network training is micrograph.

8278.72 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's 100 lines.

8281.823 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You can easily understand it.

8282.424 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're chaining.

8283.925 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

It's a recursive application of chain rule to derive the gradient, which allows you to optimize any arbitrary differential function.

8284.606 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So I love finding these, like, you know, the smaller terms and serving them on a platter and discovering them.

8289.691 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I feel like education is, like, the most intellectually interesting thing because you have a tangle of understanding, and you're trying to lay it out in a way that creates a ramp

8298.5 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

where everything only depends on the thing before it.

8308.049 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I find that this, like, you know, untangling of knowledge is just so intellectually interesting as a cognitive task.

8310.092 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And so I love doing it personally, but I just have fascination with trying to lay things out in a certain way.

8316.582 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe that helps me.

8321.269 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, yeah.

8355.114 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah, you're presenting the pain before you present a solution.

8355.535 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And how clever is that?

8357.979 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you want to take the student through that progression.

8358.921 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So there's a lot of other small things like that that I think make it nice and engaging and interesting.

8360.884 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And always prompting the student.

8366.313 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

There's a lot of small things like that that I think are important and a lot of good educators will do.

8368.817 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, how would you solve this?

8373.825 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, I'm not going to present a solution before you're going to guess.

8375.468 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That would be wasteful.

8378.974 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's a little bit of a...

8380.236 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't want to swear, but, like, it's a dick move towards you to present you with the solution before I give you a shot to try to come up with it yourself.

8382.856 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

8404.334 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Well, you have a chance to try yourself, and you have an appreciation when I give you the solution.

8404.574 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And it maximizes the amount of knowledge per new fact added.

8409.564 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

That's right, yeah.

8412.73 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

well as the curse of knowledge and expertise yeah this is a real phenomenon and i actually suffered from it myself as much as i try to not not suffer from it but you take certain things for granted and you can't put yourself in the shoes of new of people who are just starting out and this is pervasive it happens to me as well one thing that i actually think is extremely helpful as an example someone was trying to show me a paper in biology recently and i just had instantly so many terrible questions so what i did was i used chat gpt to ask the questions with the with the paper in the context window

8425.516 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then it worked through some of the simple things.

8454.153 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then I actually shared the thread to the person who shared it, who actually like wrote that paper or like worked on that work.

8457.118 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I almost feel like it was like, like if they can see the dumb questions I had, it might help them explain better in the future or something like that.

8462.406 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Because, so for example, for my material, I would love if people shared their dumb conversations with Chachi PT about the stuff that I've created, because it really helps me put myself again in the shoes of someone who's starting out.

8469.938 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

To say the thing.

8529.204 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Yeah.

8530.006 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Actually, I saw that tweet.

8530.668 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I thought it was really good.

8531.47 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I shared it with a bunch of people, actually.

8532.352 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it was really good.

8533.776 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And I noticed this many, many times.

8534.818 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Maybe the most prominent example is I remember back in my PhD days doing research, et cetera.

8537.465 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You read someone's paper, right?

8542.615 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you work to understand what it's doing, et cetera.

8544.017 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And then you catch them.

8546.863 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're having beers at the conference later.

8547.584 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you ask them, so, like, this paper, like, so, what were you doing?

8549.648 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, what is the paper about?

8552.413 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they will just tell you these, like, three sentences that, like, perfectly capture the essence of that paper and totally give you the idea.

8553.435 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And you didn't have to read the paper.

8558.444 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And, like, it's only when you're sitting at the table with a beer or something like that and, like, oh, yeah, the paper is just, oh, you take this idea, you take that idea, and you try this experiment, and you try this thing.

8559.807 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And they have a way of just putting it conversationally.

8568.663 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And just, like, perfectly, like, why isn't that the abstract?

8571.368 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I don't actually know that I have unique tips and tricks, to be honest.

8601.016 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Basically, it's kind of a painful process.

8607.151 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But, you know, redraft one.

8610.399 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think one thing that has always helped me quite a bit is...

8612.203 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I had a small tweet about this, actually.

8618.212 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

So, like, learning things on demand is pretty nice.

8619.775 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Learning depth-wise.

8622.258 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I do feel like you need a bit of alternation of learning depth-wise on demand.

8623.38 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You're trying to achieve a certain project that you're going to get a reward from.

8626.345 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And learning breadth-wise, which is just, oh, let's do whatever one-on-one.

8629.21 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

And here's all the things you might need.

8632.335 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Which is a lot of school does a lot of breadth-wise learning.

8633.897 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, oh, trust me, you'll need this later.

8635.88 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

You know, that kind of stuff.

8637.463 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

Like, okay, I trust you.

8638.765 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I'll learn it because I guess I need it.

8640.327 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

But I love the kind of learning where you'll actually get a reward out of doing something and you're learning on demand.

8642.591 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

The other thing that I've found is extremely helpful is maybe this is an aspect where education is a bit more selfless because explaining things to people is a beautiful way to learn something more deeply.

8647.518 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

This happens to me all the time.

8658.954 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I think it probably happens to other people too because

8660.336 View full episode →

Dwarkesh Podcast

Andrej Karpathy — AGI is still a decade away

I realize if I don't really understand something, I can't explain it.