Lex Fridman Podcast
#447 – Cursor Team: Future of Programming with AI
Aman Sanger
Yeah, I mean, so we can go over a lot of the strategies that we use. One interesting thing is cache warming. And so what you can do is if, as the user is typing, you can have, you're probably going to use some piece of context. And you can know that before the user's done typing. So, you know, as we discussed before, Reusing the KV cache results in lower latency, lower costs, cross requests.
0
💬
0
Comments
Log in to comment.
There are no comments yet.