SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning

Yizhou Chi; Yizhang Lin; Sirui Hong; Duyi Pan; Yaying Fei; Guanghao; Mei; Bangbang Liu; Tianqi Pang; Jacky Kwok; Ceyao Zhang; Bang Liu; Chenglin; Wu

arXiv:2410.17238·cs.AI·October 23, 2024·2 cites

SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning

Yizhou Chi, Yizhang Lin, Sirui Hong, Duyi Pan, Yaying Fei, Guanghao, Mei, Bangbang Liu, Tianqi Pang, Jacky Kwok, Ceyao Zhang, Bang Liu, Chenglin, Wu

PDF

Open Access 1 Repo 3 Reviews

TL;DR

SELA introduces a tree-search enhanced approach for LLM-based AutoML, using Monte Carlo Tree Search to improve pipeline exploration and optimize machine learning solutions across multiple datasets.

Contribution

The paper presents a novel agent system that employs Monte Carlo Tree Search to enhance the exploration and optimization in LLM-based AutoML pipelines.

Findings

01

SELA outperforms traditional AutoML methods with a 65-80% win rate.

02

Tree-based exploration improves pipeline diversity and quality.

03

Extensive evaluation across 20 datasets demonstrates effectiveness.

Abstract

Automated Machine Learning (AutoML) approaches encompass traditional methods that optimize fixed pipelines for model selection and ensembling, as well as newer LLM-based frameworks that autonomously build pipelines. While LLM-based agents have shown promise in automating machine learning tasks, they often generate low-diversity and suboptimal code, even after multiple iterations. To overcome these limitations, we introduce Tree-Search Enhanced LLM Agents (SELA), an innovative agent-based system that leverages Monte Carlo Tree Search (MCTS) to optimize the AutoML process. By representing pipeline configurations as trees, our framework enables agents to conduct experiments intelligently and iteratively refine their strategies, facilitating a more effective exploration of the machine learning solution space. This novel approach allows SELA to discover optimal pathways based on experimental…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 3Confidence 3

Strengths

- The paper conveys the main ideas and results clearly. - Handling the NP-hardness of the problem by incorporating LLM agents with a search procedure is intuitive and makes a lot of sense.

Weaknesses

My main concern with this paper is regarding the novelty of the proposed method. The authors note that prior work has already explored combining LLMs with tree search methods. This makes it unclear if the technical contribution of this paper is novel enough for ICLR. The primary contribution of applying LLMs with tree search to AutoML could be an interesting contribution had the paper shown that tree search cannot be trivially combined with LLMs to solve AutoML problems, and highlighted the key

Reviewer 02Rating 5Confidence 3

Strengths

The paper presents an interesting pipeline for AutoML based on LLM agents, where the functional attributes of each node layer are intuitively and innovatively defined.

Weaknesses

1. The paper lacks sufficient detail and resources for reproducibility. 2. The study involves multiple modules and details; however, the results in Table 2 appear to show only marginal improvements. 3. Additional issues are mentioned in the "Questions" section.

Reviewer 03Rating 1Confidence 5

Strengths

The ablation study showing the strenghts of different base LLMs is interesting.

Weaknesses

The work largely seems to be unfamiliar with AutoML tools and the broader literature. This is particularly evident with the work repeatedly claiming falsehoods about AutoML. For example, early on it is said that AutoML tools typically only focus on the model training aspect, while ignoring feature engineering stages and so on. This claim is easily refuted as many of the cited tools do include data preprocessing and feature preprocessing into account and generally tend to the whole machine learni

Code & Models

Repositories

geekan/metagpt
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Semantic Web and Ontologies · Data Stream Mining Techniques