Lex Fridman Podcast
#447 – Cursor Team: Future of Programming with AI
Aman Sanger
So if you try them on all these benchmarks, and on things that are in the distribution of the benchmarks they're evaluated on, they'll do really well. But when you push them a little bit outside of that, Sonnet is, I think, the one that does best at maintaining that same capability.