MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement

Jaehyun Nam; Jinsung Yoon; Jiefeng Chen; Jinwoo Shin; Sercan \"O. Ar{\i}k; Tomas Pfister

arXiv:2506.15692·cs.LG·August 29, 2025

MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement

Jaehyun Nam, Jinsung Yoon, Jiefeng Chen, Jinwoo Shin, Sercan \"O. Ar{\i}k, Tomas Pfister

PDF

Open Access 1 Video

TL;DR

MLE-STAR is a novel machine learning engineering agent that combines search-based retrieval and targeted refinement to improve model selection and component exploration, significantly enhancing performance on Kaggle competitions.

Contribution

It introduces a search-guided, iterative refinement approach for LLM-based MLE agents, enabling effective task-specific model selection and deep component exploration.

Findings

01

Achieves medals in 64% of Kaggle competitions on MLE-bench Lite.

02

Outperforms the best alternative methods significantly.

03

Introduces a novel ensembling strategy guided by MLE-STAR.

Abstract

Agents based on large language models (LLMs) for machine learning engineering (MLE) can automatically implement ML models via code generation. However, existing approaches to build such agents often rely heavily on inherent LLM knowledge and employ coarse exploration strategies that modify the entire code structure at once. This limits their ability to select effective task-specific models and perform deep exploration within specific components, such as experimenting extensively with feature engineering options. To overcome these, we propose MLE-STAR, a novel approach to build MLE agents. MLE-STAR first leverages external knowledge by using a search engine to retrieve effective models from the web, forming an initial solution, then iteratively refines it by exploring various strategies targeting specific ML components. This exploration is guided by ablation studies analyzing the impact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MLE-STAR: Machine Learning Engineering Agent via Search and Targeted Refinement· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning in Materials Science · Topic Modeling