Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
But Scaling Monosemanticity, I think, was significant evidence that even for very large models (we did it on Claude 3 Sonnet, which at that point was one of our production models), you know, even these models seem to be substantially explained, at least in part, by linear features, and doing dictionary learning on them works.
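The dictionary learning Amodei refers to is typically done with a sparse autoencoder trained on a model's internal activations. Below is a minimal sketch of that idea on synthetic data; all shapes, hyperparameters, and the plain-gradient-descent training loop are illustrative assumptions, not Anthropic's actual setup.

```python
import numpy as np

# Hypothetical sketch of dictionary learning on "activations" via a
# sparse autoencoder. Sizes and hyperparameters are made up for the demo.
rng = np.random.default_rng(0)
d_model, n_features, n_samples = 16, 64, 2048

# Synthetic activations: sparse linear combinations of ground-truth
# feature directions (standing in for a dataset of residual-stream vectors).
true_feats = rng.normal(size=(n_features, d_model))
true_feats /= np.linalg.norm(true_feats, axis=1, keepdims=True)
codes = rng.random((n_samples, n_features)) * (rng.random((n_samples, n_features)) < 0.05)
acts = codes @ true_feats

# Sparse autoencoder: encoder W_e, decoder W_d, trained to reconstruct the
# activations with an L1 sparsity penalty on the hidden code.
W_e = rng.normal(scale=0.1, size=(d_model, n_features))
W_d = rng.normal(scale=0.1, size=(n_features, d_model))
lr, l1 = 0.05, 1e-3

for step in range(300):
    h = np.maximum(acts @ W_e, 0.0)      # ReLU feature code
    err = h @ W_d - acts                 # reconstruction error
    grad_h = (err @ W_d.T + l1 * np.sign(h)) * (h > 0)  # ReLU-masked grad
    W_e -= lr * (acts.T @ grad_h) / n_samples
    W_d -= lr * (h.T @ err) / n_samples

h = np.maximum(acts @ W_e, 0.0)
sparsity = (h > 1e-6).mean()             # fraction of active features
mse = np.mean((h @ W_d - acts) ** 2)
print(f"mean active fraction {sparsity:.3f}, reconstruction MSE {mse:.4f}")
```

If the linear-features picture holds, training drives the reconstruction error down while keeping only a small fraction of the learned dictionary active on each input, which is the behavior the interpretability work looks for at scale.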