LLM as a Tool, Not an Agent: Code-Mined Tree Transformations for Neural Architecture Search

Masakazu Yoshimura; Zitang Sun; Yuiko Sakuma; Junji Otsuka; Atsushi Irie; Takeshi Ohashi

arXiv:2604.16555·cs.LG·April 21, 2026

LLM as a Tool, Not an Agent: Code-Mined Tree Transformations for Neural Architecture Search

Masakazu Yoshimura, Zitang Sun, Yuiko Sakuma, Junji Otsuka, Atsushi Irie, Takeshi Ohashi

PDF

TL;DR

This paper introduces LLMasTool, a hierarchical tree-based NAS framework that combines reliable tree transformations with LLM assistance to enable open-ended neural architecture search beyond LLM biases.

Contribution

The paper presents a novel NAS approach that extracts reusable modules from source code, representing architectures as hierarchical trees for stable evolution guided by Bayesian modeling.

Findings

01

Improves NAS performance on CIFAR-10, CIFAR-100, and ImageNet16-120 datasets.

02

Uses a diversity-guided algorithm for efficient exploration.

03

Ensures executable architectures through tree transformations rather than code generation.

Abstract

Neural Architecture Search (NAS) aims to automatically discover high-performing deep neural network (DNN) architectures. However, conventional algorithm-driven NAS relies on carefully hand-crafted search spaces to ensure executability, which restricts open-ended exploration. Recent coding-based agentic approaches using large language models (LLMs) reduce manual design, but current LLMs struggle to reliably generate complex, valid architectures, and their proposals are often biased toward a narrow set of patterns observed in their training data. To bridge reliable algorithmic search with powerful LLM assistance, we propose LLMasTool, a hierarchical tree-based NAS framework for stable and open-ended model evolution. Our method automatically extracts reusable modules from arbitrary source code and represents full architectures as hierarchical trees, enabling evolution through reliable tree…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.