Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking
Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin

TL;DR
This paper introduces a method for candidate set pruning in relevance ranking that guarantees error control with high probability, balancing speed and accuracy in information retrieval tasks.
Contribution
It proposes the concept of certified error control for candidate set pruning, providing theoretical guarantees on test error while improving reranking speed.
Findings
Achieves 37x speedup in reranking on MS MARCO Passage v1
Guarantees test error within user-specified thresholds with 90% coverage
Outperforms empirical baselines lacking error control guarantees
Abstract
In information retrieval (IR), candidate set pruning has been commonly used to speed up two-stage relevance ranking. However, such an approach lacks accurate error control and often trades accuracy off against computational efficiency in an empirical fashion, lacking theoretical guarantees. In this paper, we propose the concept of certified error control of candidate set pruning for relevance ranking, which means that the test error after pruning is guaranteed to be controlled under a user-specified threshold with high probability. Both in-domain and out-of-domain experiments show that our method successfully prunes the first-stage retrieved candidate sets to improve the second-stage reranking speed while satisfying the pre-specified accuracy constraints in both settings. For example, on MS MARCO Passage v1, our method yields an average candidate set size of 27 out of 1,000 which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Bayesian Modeling and Causal Inference · Data Management and Algorithms
MethodsPruning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
