Two-stage Risk Control with Application to Ranked Retrieval
Yunpeng Xu, Mufang Ying, Wenge Guo, Zhi Wei

TL;DR
This paper introduces two-stage risk control methods tailored for ranked retrieval systems, leveraging sequential problem structure to provide theoretical guarantees and demonstrate effectiveness on large-scale datasets.
Contribution
The paper develops novel two-stage risk control methods based on LTT and CRC frameworks, specifically designed for ranked retrieval tasks, with reduced computational complexity and theoretical guarantees.
Findings
Effective risk control demonstrated on large-scale datasets
Theoretical guarantees established for proposed methods
Tailored loss functions improve ranking performance
Abstract
Practical machine learning systems often operate in multiple sequential stages, as seen in ranking and recommendation systems, which typically include a retrieval phase followed by a ranking phase. Effectively assessing prediction uncertainty and ensuring effective risk control in such systems pose significant challenges due to their inherent complexity. To address these challenges, we developed two-stage risk control methods based on the recently proposed learn-then-test (LTT) and conformal risk control (CRC) frameworks. Unlike the methods in prior work that address multiple risks, our approach leverages the sequential nature of the problem, resulting in reduced computational burden. We provide theoretical guarantees for our proposed methods and design novel loss functions tailored for ranked retrieval tasks. The effectiveness of our approach is validated through experiments on two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Image and Video Retrieval Techniques · Algorithms and Data Compression
MethodsSparse Evolutionary Training
