Revisiting Text Ranking in Deep Research
Chuan Meng, Litu Ou, Sean MacAvaney, Jeff Dalton

TL;DR
This paper systematically analyzes text ranking methods in deep research, focusing on retrieval units, pipeline configurations, and query characteristics, to understand their effectiveness in open-web exploration tasks using LLM-based agents.
Contribution
It reproduces key findings and best practices for IR text ranking in deep research, highlighting the impact of retrieval units, pipeline setups, and query translation on performance.
Findings
Passage-level retrieval is more efficient with limited context.
Re-ranking significantly improves retrieval effectiveness.
Translating queries into natural language reduces mismatch issues.
Abstract
Deep research has emerged as an important task that aims to address hard queries through extensive open-web exploration. To tackle it, most prior work equips large language model (LLM)-based agents with opaque web search APIs, enabling agents to iteratively issue search queries, retrieve external evidence, and reason over it. Despite search's essential role in deep research, black-box web search APIs hinder systematic analysis of search components, leaving the behaviour of established text ranking methods in deep research largely unclear. To fill this gap, we reproduce a selection of key findings and best practices for IR text ranking methods in the deep research setting. In particular, we examine their effectiveness from three perspectives: (i) retrieval units (documents vs. passages), (ii) pipeline configurations (different retrievers, re-rankers, and re-ranking depths), and (iii)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- grill-lab/browsecomp-plus-passage-corpusdataset· 98 dl98 dl
- grill-lab/browsecomp-plus-indexesdataset· 973 dl973 dl
- grill-lab/browsecomp-plus-runsdataset· 22 dl22 dl
- grill-lab/browsecomp-plus-passage-corpus-pyserinidataset· 75 dl75 dl
- AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectifieddataset· 66 dl66 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Expert finding and Q&A systems
