Lex Fridman Podcast
#447 – Cursor Team: Future of Programming with AI
Sualeh Asif
But MLA is from this company called DeepSeek. It's quite an interesting algorithm. Maybe the key idea is sort of in both MQA and in other places, what you're doing is you're sort of reducing the number of KV heads. The advantage you get from that is
0
💬
0
Comments
Log in to comment.
There are no comments yet.