Terminal-World: Scaling Terminal-Agent Environments via Agent Skills
Zihao Cheng, Hongru Wang, Zeming Liu, Xinyi Wang, Xiangrong Zhu, Yuhang Guo, Wei Lin, Jeff Z. Pan, and Yunhong Wang

TL;DR
Terminal-World introduces an automated pipeline that synthesizes diverse terminal environments and tasks using agent skills, significantly enhancing training data quality and agent performance in command-line tasks.
Contribution
It proposes a novel skill-based synthesis method for creating extensive terminal environments, enabling scalable training and improved agent capabilities.
Findings
Constructed 5,723 training environments using the pipeline.
Terminal-World models outperform baselines on 6 benchmarks.
Achieved state-of-the-art results with minimal training data.
Abstract
Terminal agents extend Large Language Models with the ability to execute tasks directly in command-line environments, but their progress is bottlenecked by the scarcity of high-quality training data. Existing approaches bootstrap from partial sources such as human-defined seeds or GitHub repositories to instantiate one component and then complete the rest, producing tasks confined to narrow seed distributions, environments misaligned with task semantics, and inefficient trajectories from unguided exploration. To address these limitations, we introduce Terminal-World, a fully automated pipeline that uses agent skills as the central synthesis primitive, which jointly encode what to accomplish, when to apply (preconditions and environment state), and how to execute, enabling task instructions, environments, and teacher trajectories to be co-derived. To further broaden the synthesis space,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
