Has anyone used leetcode for training data yet?
Has anyone used leetcode for training data yet?

Has anyone used leetcode for training data yet?

As a passing observer of the AI space it has always made sense to me that leetcode would be the best coding database out there. The quality and organization of the data seems perfect. Not old would there be more than enough problems, but each problem have multiple valid solutions, and those solutions would be organized by performance. I hear all the time that the stack and stack overflow are use for LLMs, but why not leetcode.

This recently paper shows the value of smaller models trained on higher quality data, so it would make sense that leetcode would be the best right?

https://youtu.be/7S68y6huEpU

submitted by /u/TrainquilOasis1423
[link] [comments]