Efficient Exploration of Gradient Space for Online Learning to Rank
Huazheng Wang, Ramsey Langley, Sonwoo Kim, Eric McCord-Snook, Hongning, Wang

TL;DR
This paper introduces Null Space Gradient Descent, an efficient exploration method for online learning to rank that accelerates convergence by focusing exploration on promising gradient directions, improving early-stage ranking quality.
Contribution
The paper proposes a novel Null Space Gradient Descent algorithm that reduces exploration to the null space of poor gradients, enhancing learning speed and ranking performance in OL2R.
Findings
Faster convergence compared to state-of-the-art algorithms
Improved early-stage ranking quality
Effective in various public benchmark datasets
Abstract
Online learning to rank (OL2R) optimizes the utility of returned search results based on implicit feedback gathered directly from users. To improve the estimates, OL2R algorithms examine one or more exploratory gradient directions and update the current ranker if a proposed one is preferred by users via an interleaved test. In this paper, we accelerate the online learning process by efficient exploration in the gradient space. Our algorithm, named as Null Space Gradient Descent, reduces the exploration space to only the \emph{null space} of recent poorly performing gradients. This prevents the algorithm from repeatedly exploring directions that have been discouraged by the most recent interactions with users. To improve sensitivity of the resulting interleaved test, we selectively construct candidate rankers to maximize the chance that they can be differentiated by candidate ranking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
