Learning to Rank from Relevance Judgments Distributions
Alberto Purpura, Gianmaria Silvello, Gian Antonio Susto

TL;DR
This paper introduces probabilistic loss functions for learning to rank models trained on relevance judgment distributions, demonstrating improved performance over traditional methods and strong baselines like LambdaMART.
Contribution
It proposes five new probabilistic loss functions and shows how training on relevance judgment distributions enhances LETOR model effectiveness.
Findings
Relevance judgment distributions improve model performance.
Training on sampled distributions can outperform traditional labels.
Models trained on distributions outperform LambdaMART on several datasets.
Abstract
Learning to Rank (LETOR) algorithms are usually trained on annotated corpora where a single relevance label is assigned to each available document-topic pair. Within the Cranfield framework, relevance labels result from merging either multiple expertly curated or crowdsourced human assessments. In this paper, we explore how to train LETOR models with relevance judgments distributions (either real or synthetically generated) assigned to document-topic pairs instead of single-valued relevance labels. We propose five new probabilistic loss functions to deal with the higher expressive power provided by relevance judgments distributions and show how they can be applied both to neural and GBM architectures. Moreover, we show how training a LETOR model on a sampled version of the relevance judgments from certain probability distributions can improve its performance when relying either on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
