Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Yeah, so I think we're in some ways in early days for that. We find quite a few features related to deception and lying. There's one feature where it fires for people lying and being deceptive and you force it active and Claude starts lying to you. So we have a deception feature.
0
💬
0
Comments
Log in to comment.
There are no comments yet.