Ranking with Confidence for Large Scale Comparison Data
Filipa Valdeira, Cl\'audia Soares

TL;DR
This paper introduces PD-Rank, a fast and accurate ranking algorithm from noisy pairwise comparison data that provides confidence measures and outperforms existing methods in speed and accuracy.
Contribution
We develop a novel ranking algorithm based on a generative noise model, using an efficient optimization approach that improves accuracy and scalability in large-scale comparison data.
Findings
PD-Rank achieves higher Kendall tau scores than competing methods.
It is faster by an order of magnitude compared to existing algorithms.
Performs well even with 10% incorrect comparisons in simulated data.
Abstract
In this work, we leverage a generative data model considering comparison noise to develop a fast, precise, and informative ranking algorithm from pairwise comparisons that produces a measure of confidence on each comparison. The problem of ranking a large number of items from noisy and sparse pairwise comparison data arises in diverse applications, like ranking players in online games, document retrieval or ranking human perceptions. Although different algorithms are available, we need fast, large-scale algorithms whose accuracy degrades gracefully when the number of comparisons is too small. Fitting our proposed model entails solving a non-convex optimization problem, which we tightly approximate by a sum of quasi-convex functions and a regularization term. Resorting to an iterative reweighted minimization and the Primal-Dual Hybrid Gradient method, we obtain PD-Rank, achieving a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Statistical Mechanics and Entropy · Machine Learning and Algorithms
