Ranking via Sinkhorn Propagation
Ryan Prescott Adams, Richard S. Zemel

TL;DR
This paper introduces Sinkhorn propagation, a novel method for learning ranking functions by optimizing rank-linear objectives through doubly-stochastic matrices and Sinkhorn normalization, enabling gradient-based training.
Contribution
It proposes a new differentiable approach for ranking using Sinkhorn normalization on doubly-stochastic matrices, bridging permutation space and gradient-based learning.
Findings
Effective ranking optimization demonstrated on information retrieval datasets.
Sinkhorn propagation enables gradient-based learning for permutation problems.
Method generalizes well across different ranking metrics.
Abstract
It is of increasing importance to develop learning methods for ranking. In contrast to many learning objectives, however, the ranking problem presents difficulties due to the fact that the space of permutations is not smooth. In this paper, we examine the class of rank-linear objective functions, which includes popular metrics such as precision and discounted cumulative gain. In particular, we observe that expectations of these gains are completely characterized by the marginals of the corresponding distribution over permutation matrices. Thus, the expectations of rank-linear objectives can always be described through locations in the Birkhoff polytope, i.e., doubly-stochastic matrices (DSMs). We propose a technique for learning DSM-based ranking functions using an iterative projection operator known as Sinkhorn normalization. Gradients of this operator can be computed via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Bayesian Modeling and Causal Inference · Recommender Systems and Techniques
