TL;DR
LongSeeker introduces an adaptive context management paradigm for long-horizon search agents, enabling dynamic shaping of working memory to improve reasoning efficiency and reliability.
Contribution
It proposes Context-ReAct, a unified framework with atomic operations for elastic context orchestration, and develops LongSeeker, a fine-tuned agent demonstrating superior performance.
Findings
LongSeeker outperforms baseline agents on multiple benchmarks.
The framework reduces costs and hallucination risk in long-horizon reasoning.
The Compress operator is proven to be expressively complete.
Abstract
Long-horizon search agents must manage a rapidly growing working context as they reason, call tools, and observe information. Naively accumulating all intermediate content can overwhelm the agent, increasing costs and the risk of errors. We propose that effective context management should be adaptive: parts of the agent's trajectory are maintained at different levels of detail depending on their current relevance to the task. To operationalize this principle, we introduce Context-ReAct, a general agentic paradigm for elastic context orchestration that integrates reasoning, context management, and tool use in a unified loop. Context-ReAct provides five atomic operations: Skip, Compress, Rollback, Snippet and Delete, which allow the agent to dynamically reshape its working context, preserving important evidence, summarizing resolved information, discarding unhelpful branches, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
