TL;DR
TACO is a self-evolving, workflow-adaptive compression framework that enhances terminal agent performance and efficiency by intelligently filtering observations, reducing token costs, and maintaining task relevance across diverse benchmarks.
Contribution
It introduces a training-free, self-evolving compression method that automatically refines and reuses structured rules for terminal observations, improving long-horizon agent performance.
Findings
TACO improves accuracy by 1%-4% on TerminalBench.
It reduces token consumption while maintaining success rates.
TACO enhances task performance across multiple benchmarks.
Abstract
As terminal agents scale to long-horizon, multi-turn workflows, a key bottleneck is not merely limited context length, but the accumulation of noisy terminal observations in the interaction history. Retaining raw observations preserves useful environment feedback, but also leads to context saturation and high token cost; conversely, naive compression may discard task-critical signals needed for subsequent actions. Because terminal environments are highly heterogeneous across repositories, commands, and execution states, heuristic-based or fixed-prompt compression methods are difficult to generalize. We propose TACO, a plug-and-play, training-free, self-evolving Terminal Agent Compression framework for existing terminal agents. TACO automatically discovers, refines, and reuses structured compression rules from interaction trajectories, enabling workflow-adaptive filtering of low-value…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
