Lex Fridman Podcast

#452 – Dario Amodei: Anthropic CEO on Claude, AGI & the Future of AI & Humanity

6862.894 - 6888.977 Lex Fridman

So it seems that the modern post-training recipe has a little bit of everything. So supervised fine-tuning, RLHF, the constitutional, yeah, it was RLAIF. Best acronym. It's again that naming thing. And then synthetic data, seems like a lot of synthetic data, or at least trying to figure out ways to have high quality synthetic data.
