Online Ranking: Discrete Choice, Spearman Correlation and Other Feedback
Nir Ailon

TL;DR
This paper introduces an online ranking algorithm with improved regret bounds for adversarial feedback scenarios, optimizing performance for various ranking loss functions including Spearman correlation.
Contribution
The authors develop a new algorithm with tighter regret bounds and faster running time, also providing matching lower bounds and extending results to complex ranking losses.
Findings
Expected regret of $O(n^{3/2}\sqrt{Tk})$ over $T$ steps.
Improved running time from quadratic to $O(n\log n)$ per round.
Extension of bounds to Spearman correlation and other ranking losses.
Abstract
Given a set of objects, an online ranking system outputs at each time step a full ranking of the set, observes a feedback of some form and suffers a loss. We study the setting in which the (adversarial) feedback is an element in , and the loss is the position (0th, 1st, 2nd...) of the item in the outputted ranking. More generally, we study a setting in which the feedback is a subset of at most elements in , and the loss is the sum of the positions of those elements. We present an algorithm of expected regret over a time horizon of steps with respect to the best single ranking in hindsight. This improves previous algorithms and analyses either by a factor of either , a factor of or by improving running time from quadratic to per round. We also prove a matching lower bound. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Game Theory and Applications
