Lex Fridman Podcast
#447 – Cursor Team: Future of Programming with AI
Aman Sanger
Then when I'm, let's say I have sorted for the last n tokens, if I now want to compute the output token for the n plus one token, I don't need to pass those first n tokens through the entire model because I already have all those keys and values. And so you just need to do the forward pass through that last token.
0
💬
0
Comments
Log in to comment.
There are no comments yet.