Lex Fridman Podcast
#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters
Dylan Patel
And what you do there is instead of a human data or instead of the model you're currently training, you take completions from a different, normally more powerful model. I think there's rumors that these big models that people are waiting for, these GPT-5s of the world, the CLOD-3 opuses of the world are used internally to do this distillation process.
0
💬
0
Comments
Log in to comment.
There are no comments yet.