TL;DR
This paper introduces PL-Rank-3, a novel learning-to-rank algorithm that achieves unbiased gradient estimation with minimal computational complexity, enabling scalable ranking optimization without performance loss.
Contribution
The paper presents PL-Rank-3, a new algorithm that significantly reduces computational complexity in gradient estimation for learning-to-rank models, scalable to larger datasets.
Findings
Large reductions in optimization time
Unbiased gradient estimation maintained
Scalable to larger ranking datasets
Abstract
Plackett-Luce gradient estimation enables the optimization of stochastic ranking models within feasible time constraints through sampling techniques. Unfortunately, the computational complexity of existing methods does not scale well with the length of the rankings, i.e. the ranking cutoff, nor with the item collection size. In this paper, we introduce the novel PL-Rank-3 algorithm that performs unbiased gradient estimation with a computational complexity comparable to the best sorting algorithms. As a result, our novel learning-to-rank method is applicable in any scenario where standard sorting is feasible in reasonable time. Our experimental results indicate large gains in the time required for optimization, without any loss in performance. For the field, our contribution could potentially allow state-of-the-art learning-to-rank methods to be applied to much larger scales than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
