Lex Fridman Podcast
#447 – Cursor Team: Future of Programming with AI
Sualeh Asif
And then you can give a reward to the things that humans would like more and sort of punish the things that they won't like, and then train the model to output the suggestions that humans would like more. You have these RL loops that are very useful, that exploit these pass@k curves. Aman maybe can go into even more detail.
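[Editorial sketch, not from the speakers: a pass@k curve is commonly understood as the probability that at least one of k sampled suggestions is acceptable, plotted against k. The snippet below uses the standard combinatorial estimator for that quantity. The function name `pass_at_k` and the example counts are assumptions chosen for illustration, not anything the Cursor team describes.]

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n generated samples, c of which were acceptable.

    Returns the probability that at least one of k samples, drawn without
    replacement from the n generated ones, is acceptable.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k unacceptable samples exist, so any draw of k
        # must contain at least one acceptable sample.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical counts: 10 suggestions sampled, 2 judged acceptable.
curve = {k: pass_at_k(n=10, c=2, k=k) for k in (1, 5, 10)}
```

The gap between the k = 1 value and the larger-k values is what such an RL loop would try to close: if a good suggestion usually shows up somewhere among k samples, rewarding the preferred ones pushes the model toward producing it on the first try.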