Menu
Sign In Add Podcast

Lex Fridman Podcast

#416 – Yann Lecun: Meta AI, Open Source, Limits of LLMs, AGI & the Future of AI

5803.377 - 5818.481 Lex Fridman

And we use RL in that case to adjust the world model or the critic. So you mentioned RLHF, reinforcement learning with human feedback. Why do you still hate reinforcement learning?

0
💬 0

Comments

There are no comments yet.

Log in to comment.