TL;DR
ExpSeek introduces a step-level, self-triggered experience seeking method for web agents, improving their adaptability and performance by proactively estimating entropy thresholds for intervention during interactions.
Contribution
It proposes a novel step-level experience seeking approach that uses entropy as a self-triggering signal, enhancing web agent performance during dynamic interactions.
Findings
ExpSeek improves web agent performance by 9.3% and 7.5% on benchmark tasks.
Using entropy as a self-triggering signal is effective for experience intervention.
A small 4B experience model can significantly boost larger agent models.
Abstract
Experience intervention in web agents emerges as a promising technical paradigm, enhancing agent interaction capabilities by providing valuable insights from accumulated experiences. However, existing methods predominantly inject experience passively as global context before task execution, struggling to adapt to dynamically changing contextual observations during agent-environment interaction. We propose ExpSeek, which shifts experience toward step-level proactive seeking: (1) estimating step-level entropy thresholds to determine intervention timing using the model's intrinsic signals; (2) designing step-level tailored experience content. Experiments on Qwen3-8B and 32B models across four challenging web agent benchmarks demonstrate that ExpSeek achieves absolute improvements of 9.3% and 7.5%, respectively. Our experiments validate the feasibility and advantages of entropy as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
