Lex Fridman Podcast
#434 – Aravind Srinivas: Perplexity CEO on Future of AI, Search & the Internet
Aravind Srinivas
I would say it's almost like the last answer that like nothing has changed since 2017, except maybe a few changes on what the non-linearities are and like how the square D scaling should be done. Like some of that has changed, but, and then people have tried mixture of experts having more parameters for the same flop and things like that. But the core transformer architecture has not changed.
0
💬
0
Comments
Log in to comment.
There are no comments yet.