LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents
Taro Yano, Yoichi Ishibashi, Masafumi Oyamada

TL;DR
LaMDAgent is an autonomous framework that uses LLM-based agents to automatically construct and optimize complete post-training pipelines for large language models, improving performance with minimal human input.
Contribution
It introduces LaMDAgent, the first framework to autonomously build and optimize full post-training pipelines using LLM agents, exploring diverse strategies and configurations.
Findings
Improves tool-use accuracy by 9.0 points.
Discovers effective post-training strategies often missed by humans.
Scaling data size enables cost-effective pipeline discovery.
Abstract
Large Language Models (LLMs) have demonstrated exceptional performance across a wide range of tasks. To further tailor LLMs to specific domains or applications, post-training techniques such as Supervised Fine-Tuning (SFT), Preference Learning, and model merging are commonly employed. While each of these methods has been extensively studied in isolation, the automated construction of complete post-training pipelines remains an underexplored area. Existing approaches typically rely on manual design or focus narrowly on optimizing individual components, such as data ordering or merging strategies. In this work, we introduce LaMDAgent (short for Language Model Developing Agent), a novel framework that autonomously constructs and optimizes full post-training pipelines through the use of LLM-based agents. LaMDAgent systematically explores diverse model generation techniques, datasets, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Artificial Intelligence in Healthcare and Education
MethodsFocus
