Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
That had a lot of nice breakthrough results. That's very kind of you to describe it that way. Yeah, I mean, this was our first real success year. using sparse autoencoders. So we took a one-layer model. And it turns out if you go and you, you know, do dictionary learning on it, you find all these really nice interpretable features.
0
💬
0
Comments
Log in to comment.
There are no comments yet.