Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Dario Amodei
We had some results came out about like sleeper agents and there was a more recent paper about, you know, can the models mislead attempts to, you know, sandbag their own abilities, right? Show them, you know, present themselves as being less capable than they are. And so I think with ASL 4, there's going to be an important component of using other things than just interacting with the models.
0
💬
0
Comments
Log in to comment.
There are no comments yet.