Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
I mean, there's all kinds of other features about withholding information and not answering questions, features about power seeking and coups and stuff like that. So there's a lot of features that are kind of related to spooky things. And if you force them active, Claude will behave in ways that are not the kinds of behaviors you want.
0
💬
0
Comments
Log in to comment.
There are no comments yet.