Efficient Exploration of Gradient Space for Online Learning to Rank

Huazheng Wang; Ramsey Langley; Sonwoo Kim; Eric McCord-Snook; Hongning; Wang

arXiv:1805.07317·cs.IR·November 28, 2018

Efficient Exploration of Gradient Space for Online Learning to Rank

Huazheng Wang, Ramsey Langley, Sonwoo Kim, Eric McCord-Snook, Hongning, Wang

PDF

TL;DR

This paper introduces Null Space Gradient Descent, an efficient exploration method for online learning to rank that accelerates convergence by focusing exploration on promising gradient directions, improving early-stage ranking quality.

Contribution

The paper proposes a novel Null Space Gradient Descent algorithm that reduces exploration to the null space of poor gradients, enhancing learning speed and ranking performance in OL2R.

Findings

01

Faster convergence compared to state-of-the-art algorithms

02

Improved early-stage ranking quality

03

Effective in various public benchmark datasets

Abstract

Online learning to rank (OL2R) optimizes the utility of returned search results based on implicit feedback gathered directly from users. To improve the estimates, OL2R algorithms examine one or more exploratory gradient directions and update the current ranker if a proposed one is preferred by users via an interleaved test. In this paper, we accelerate the online learning process by efficient exploration in the gradient space. Our algorithm, named as Null Space Gradient Descent, reduces the exploration space to only the \emph{null space} of recent poorly performing gradients. This prevents the algorithm from repeatedly exploring directions that have been discouraged by the most recent interactions with users. To improve sensitivity of the resulting interleaved test, we selectively construct candidate rankers to maximize the chance that they can be differentiated by candidate ranking…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.