Lex Fridman Podcast
#416 – Yann Lecun: Meta AI, Open Source, Limits of LLMs, AGI & the Future of AI
Lex Fridman
And we use RL in that case to adjust the world model or the critic. So you mentioned RLHF, reinforcement learning with human feedback. Why do you still hate reinforcement learning?
0
💬
0
Comments
Log in to comment.
There are no comments yet.