Lex Fridman Podcast

#447 – Cursor Team: Future of Programming with AI

2934.684 - 2954.71 Michael Truell

And so, for instance, one of the most popular agent benchmarks, SWE-bench, is really, really contaminated in the training data of these foundation models. And so if you ask these foundation models to do a SWE-bench problem, but you actually don't give them the context of a codebase, they can, like, hallucinate the right file paths, they can hallucinate the right function names.
