Online Learning of Optimally Diverse Rankings
Stefan Magureanu, Alexandre Proutiere, Marcus Isaksson, Boxun Zhang

TL;DR
This paper introduces LDR, an online learning algorithm that optimally balances diversity in rankings without prior knowledge of query topics, achieving near-optimal regret and outperforming existing methods.
Contribution
The paper proposes LDR, a novel algorithm for learning diverse rankings in an online setting with no prior topic information, and proves its optimal regret scaling.
Findings
LDR achieves regret of O((N-L) log T), which is order optimal.
LDR outperforms existing learning-to-rank algorithms in experiments.
Numerical results confirm LDR's effectiveness on artificial and real data.
Abstract
Search engines answer users' queries by listing relevant items (e.g. documents, songs, products, web pages, ...). These engines rely on algorithms that learn to rank items so as to present an ordered list maximizing the probability that it contains relevant item. The main challenge in the design of learning-to-rank algorithms stems from the fact that queries often have different meanings for different users. In absence of any contextual information about the query, one often has to adhere to the {\it diversity} principle, i.e., to return a list covering the various possible topics or meanings of the query. To formalize this learning-to-rank problem, we propose a natural model where (i) items are categorized into topics, (ii) users find items relevant only if they match the topic of their query, and (iii) the engine is not aware of the topic of an arriving query, nor of the frequency at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttentive Walk-Aggregating Graph Neural Network
