TL;DR
Agent-Omit introduces an adaptive omission framework for LLM agents, reducing redundant thoughts and observations to enhance efficiency without sacrificing effectiveness, validated across multiple benchmarks.
Contribution
The paper proposes a novel training framework and reinforcement learning approach enabling LLM agents to adaptively omit unnecessary information, improving efficiency and effectiveness.
Findings
Agent-Omit-8B achieves comparable performance to frontier LLM agents.
The method outperforms seven existing efficient LLM agent approaches.
The omission policy deviation is theoretically bounded by KL-divergence.
Abstract
Managing agent context (e.g., thought and observation) during multi-turn agent-environment interactions is an emerging strategy to improve agent efficiency. However, existing studies treat the entire interaction trajectories equally, overlooking the thought necessity and observation utility varies across turns. To this end, we first conduct quantitative investigations into how thought and observation affect agent effectiveness and efficiency. Based on our findings, we propose Agent-Omit, a unified training framework that empowers LLM agents to adaptively omit redundant thoughts and observations. Specifically, we first synthesize a small amount of cold-start data, including both single-turn and multi-turn omission scenarios, to fine-tune the agent for omission behaviors. Furthermore, we introduce an omit-aware agentic reinforcement learning approach, incorporating a dual sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
