Menu
Sign In Pricing Add Podcast

Lex Fridman Podcast

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

12997.344 - 13018.575 Dylan Patel

And what you do there is instead of a human data or instead of the model you're currently training, you take completions from a different, normally more powerful model. I think there's rumors that these big models that people are waiting for, these GPT-5s of the world, the CLOD-3 opuses of the world are used internally to do this distillation process.

0
💬 0

Comments

There are no comments yet.

Log in to comment.