Lex Fridman Podcast
#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity
Lex Fridman
So it seems that the modern post-training recipe has a little bit of everything. So supervised fine-tuning, RLHF, the constitutional, yeah, it was RLAIF. Best acronym. It's again that naming thing. And then synthetic data, seems like a lot of synthetic data, or at least trying to figure out ways to have high quality synthetic data.
0
💬
0
Comments
Log in to comment.
There are no comments yet.