Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Lex Fridman
To me, one of the really interesting features, especially for AI safety, is deception and lying. And the possibility that these kinds of methods could detect lying in a model, especially gets smarter and smarter and smarter. Presumably that's a big threat of a super intelligent model that it can deceive the people operating it. as to its intentions or any of that kind of stuff.
0
💬
0
Comments
Log in to comment.
There are no comments yet.