Assessing Efficiency-Effectiveness Tradeoffs in Multi-Stage Retrieval Systems Without Using Relevance Judgments
Charles L. A. Clarke, J. Shane Culpepper, and Alistair Moffat

TL;DR
This paper introduces a relevance-judgment-free method to evaluate and optimize filtering stages in multi-stage retrieval systems, enabling performance tuning without reliance on relevance data.
Contribution
It proposes a novel quality score for filtering stages that correlates with effectiveness and does not require relevance judgments, facilitating system tuning across diverse queries.
Findings
The proposed quality score correlates with actual retrieval effectiveness.
The method can identify poorly performing queries without relevance judgments.
Various filtering approaches and parameters were systematically evaluated.
Abstract
Large-scale retrieval systems are often implemented as a cascading sequence of phases -- a first filtering step, in which a large set of candidate documents are extracted using a simple technique such as Boolean matching and/or static document scores; and then one or more ranking steps, in which the pool of documents retrieved by the filter is scored more precisely using dozens or perhaps hundreds of different features. The documents returned to the user are then taken from the head of the final ranked list. Here we examine methods for measuring the quality of filtering and preliminary ranking stages, and show how to use these measurements to tune the overall performance of the system. Standard top-weighted metrics used for overall system evaluation are not appropriate for assessing filtering stages, since the output is a set of documents, rather than an ordered sequence of documents.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Data Management and Algorithms · Advanced Database Systems and Queries
