Rethinking Deep Research from the Perspective of Web Content Distribution Matching
Zixuan Yu, Zhenheng Tang, Tongliang Liu, Chengqi Zhang, Xiaowen Chu, Bo Han

TL;DR
This paper introduces WeDas, a framework that enhances deep search agents by aligning web content distribution with agent goals, improving retrieval precision and reasoning accuracy through a novel scoring and probing mechanism.
Contribution
WeDas is a novel framework that incorporates web content distribution characteristics into search agents, enabling dynamic goal recalibration and improved retrieval alignment.
Findings
Improves sub-goal completion across benchmarks
Enhances reasoning accuracy in deep search agents
Bridges gap between reasoning and retrieval
Abstract
Despite the integration of search tools, Deep Search Agents often suffer from a misalignment between reasoning-driven queries and the underlying web indexing structures. Existing frameworks treat the search engine as a static utility, leading to queries that are either too coarse or too granular to retrieve precise evidence. We propose WeDas, a Web Content Distribution Aware framework that incorporates search-space structural characteristics into the agent's observation space. Central to our method is the Query-Result Alignment Score, a metric quantifying the compatibility between agent intent and retrieval outcomes. To overcome the intractability of indexing the dynamic web, we introduce a few-shot probing mechanism that iteratively estimates this score via limited query accesses, allowing the agent to dynamically recalibrate sub-goals based on the local content landscape. As a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Web Data Mining and Analysis · Topic Modeling
