Revisiting Text Ranking in Deep Research

Chuan Meng; Litu Ou; Sean MacAvaney; Jeff Dalton

arXiv:2602.21456·cs.IR·February 26, 2026

Revisiting Text Ranking in Deep Research

Chuan Meng, Litu Ou, Sean MacAvaney, Jeff Dalton

PDF

Open Access 5 Datasets

TL;DR

This paper systematically analyzes text ranking methods in deep research, focusing on retrieval units, pipeline configurations, and query characteristics, to understand their effectiveness in open-web exploration tasks using LLM-based agents.

Contribution

It reproduces key findings and best practices for IR text ranking in deep research, highlighting the impact of retrieval units, pipeline setups, and query translation on performance.

Findings

01

Passage-level retrieval is more efficient with limited context.

02

Re-ranking significantly improves retrieval effectiveness.

03

Translating queries into natural language reduces mismatch issues.

Abstract

Deep research has emerged as an important task that aims to address hard queries through extensive open-web exploration. To tackle it, most prior work equips large language model (LLM)-based agents with opaque web search APIs, enabling agents to iteratively issue search queries, retrieve external evidence, and reason over it. Despite search's essential role in deep research, black-box web search APIs hinder systematic analysis of search components, leaving the behaviour of established text ranking methods in deep research largely unclear. To fill this gap, we reproduce a selection of key findings and best practices for IR text ranking methods in the deep research setting. In particular, we examine their effectiveness from three perspectives: (i) retrieval units (documents vs. passages), (ii) pipeline configurations (different retrievers, re-rankers, and re-ranking depths), and (iii)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Expert finding and Q&A systems