Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Dario Amodei
Yes. So this was from two years ago. The basic idea is, so we describe what RLHF is. You have a model. And it, you know, spits out two, you know, like you just sample from it twice. It spits out two possible responses and you're like human, which response do you like better? Or another variant of it is rate this response on a scale of one to seven.
0
💬
0
Comments
Log in to comment.
There are no comments yet.