LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents

Taro Yano; Yoichi Ishibashi; Masafumi Oyamada

arXiv:2505.21963·cs.CL·May 29, 2025

LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents

Taro Yano, Yoichi Ishibashi, Masafumi Oyamada

PDF

Open Access 1 Video

TL;DR

LaMDAgent is an autonomous framework that uses LLM-based agents to automatically construct and optimize complete post-training pipelines for large language models, improving performance with minimal human input.

Contribution

It introduces LaMDAgent, the first framework to autonomously build and optimize full post-training pipelines using LLM agents, exploring diverse strategies and configurations.

Findings

01

Improves tool-use accuracy by 9.0 points.

02

Discovers effective post-training strategies often missed by humans.

03

Scaling data size enables cost-effective pipeline discovery.

Abstract

Large Language Models (LLMs) have demonstrated exceptional performance across a wide range of tasks. To further tailor LLMs to specific domains or applications, post-training techniques such as Supervised Fine-Tuning (SFT), Preference Learning, and model merging are commonly employed. While each of these methods has been extensively studied in isolation, the automated construction of complete post-training pipelines remains an underexplored area. Existing approaches typically rely on manual design or focus narrowly on optimizing individual components, such as data ordering or merging strategies. In this work, we introduce LaMDAgent (short for Language Model Developing Agent), a novel framework that autonomously constructs and optimizes full post-training pipelines through the use of LLM-based agents. LaMDAgent systematically explores diverse model generation techniques, datasets, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents· underline

Taxonomy

TopicsTopic Modeling · Machine Learning and Data Classification · Artificial Intelligence in Healthcare and Education

MethodsFocus