MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering – OpenAI
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering – OpenAI