LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback

Thai Hoang; Kung-Hsiang Huang; Shirley Kokane; Jianguo Zhang; Zuxin Liu; Ming Zhu; Jake Grigsby; Tian Lan; Michael S Ryoo; Chien-Sheng Wu; Shelby Heinecke; Huan Wang; Silvio Savarese; Caiming Xiong; Juan Carlos Niebles

arXiv:2506.02298·cs.CL·June 4, 2025

LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback

Thai Hoang, Kung-Hsiang Huang, Shirley Kokane, Jianguo Zhang, Zuxin Liu, Ming Zhu, Jake Grigsby, Tian Lan, Michael S Ryoo, Chien-Sheng Wu, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong, Juan Carlos Niebles

PDF

Open Access

TL;DR

LAM SIMULATOR is a framework that enhances data generation for training Large Action Models by enabling autonomous exploration and feedback-driven learning, leading to substantial performance improvements on benchmark tasks.

Contribution

We introduce LAM SIMULATOR, a novel online exploration framework that generates high-quality training data for LAMs with minimal human intervention.

Findings

01

Models trained with data from LAM SIMULATOR show up to 49.3% performance improvement.

02

The framework enables autonomous task exploration and multi-approach discovery.

03

Significant reduction in human effort for dataset creation.

Abstract

Large Action Models (LAMs) for AI Agents offer incredible potential but face challenges due to the need for high-quality training data, especially for multi-steps tasks that involve planning, executing tool calls, and responding to feedback. To address these issues, we present LAM SIMULATOR, a comprehensive framework designed for online exploration of agentic tasks with high-quality feedback. Our framework features a dynamic task query generator, an extensive collection of tools, and an interactive environment where Large Language Model (LLM) Agents can call tools and receive real-time feedback. This setup enables LLM Agents to explore and solve tasks autonomously, facilitating the discovery of multiple approaches to tackle any given task. The resulting action trajectory data are then used to create high-quality training datasets for LAMs. Our experiments on popular agentic benchmarks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Context-Aware Activity Recognition Systems · Human-Automation Interaction and Safety