Lex Fridman Podcast
#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters
Nathan Lambert
mathematically verifiable task, generate many traces of reasoning, right? And keep branching them out, keep branching them out. And then check at the end, hey, which one actually has the right answer? Most of them are wrong. Great. These are the few that are right. Maybe we use some sort of reward model outside of this to select even the best one to preference as well.
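The loop described here (sample many reasoning traces, check each final answer against a verifier, keep the few that are right, then optionally rank the survivors with a reward model) can be sketched in a few lines. This is a toy illustration under stated assumptions, not any lab's actual pipeline: `sample_traces` is a hypothetical stand-in for a language model that is usually wrong, the task is simple addition so the answer can be checked exactly, and `toy_reward` is a made-up preference score that favors shorter traces.

```python
import random
from dataclasses import dataclass


@dataclass
class Trace:
    steps: list   # intermediate reasoning steps (plain strings here)
    answer: int   # final answer the trace commits to


def sample_traces(a, b, n, rng):
    """Hypothetical stand-in for a model: samples n noisy traces for a + b."""
    traces = []
    for _ in range(n):
        noise = rng.choice([-2, -1, 0, 1, 2, 3])   # only noise == 0 is correct
        n_steps = rng.randint(1, 4)                # traces vary in length
        steps = [f"step {i + 1}" for i in range(n_steps)]
        traces.append(Trace(steps=steps, answer=a + b + noise))
    return traces


def verify(trace, a, b):
    """Verifier for the toy task: the final answer must equal a + b."""
    return trace.answer == a + b


def toy_reward(trace):
    """Hypothetical reward model: prefers shorter reasoning traces."""
    return -len(trace.steps)


def rejection_sample(a, b, n=64, seed=0):
    """Sample n traces, keep the verified ones, pick the highest-reward one."""
    rng = random.Random(seed)
    traces = sample_traces(a, b, n, rng)
    correct = [t for t in traces if verify(t, a, b)]
    best = max(correct, key=toy_reward) if correct else None
    return traces, correct, best


if __name__ == "__main__":
    traces, correct, best = rejection_sample(17, 25)
    print(f"sampled {len(traces)} traces, {len(correct)} verified correct")
    if best is not None:
        print(f"selected trace with {len(best.steps)} step(s), answer {best.answer}")
```

The verifier does the filtering and the reward model only breaks ties among traces that are already correct, which mirrors the order in the description above: correctness first, preference second.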