Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Yeah, it's finding how to fit it efficiently or something like this. The gradient descent is doing this. And in fact, so this sort of says that gradient descent, you know, it could just represent a dense neural network, but it sort of says that gradient descent is implicitly searching over the space of extremely sparse models that could be projected into this low dimensional space. Mm-hmm.
0
💬
0
Comments
Log in to comment.
There are no comments yet.