Learning to Rank from Relevance Judgments Distributions

Alberto Purpura; Gianmaria Silvello; Gian Antonio Susto

arXiv:2202.06337 · cs.IR · February 15, 2022

TL;DR

This paper introduces probabilistic loss functions for learning-to-rank (LETOR) models trained on relevance judgment distributions rather than single relevance labels. It reports improved performance over training on traditional labels and over strong baselines such as LambdaMART on several datasets.

Contribution

The paper proposes five new probabilistic loss functions, applicable to both neural and GBM architectures, and shows that training on relevance judgment distributions improves the effectiveness of LETOR models.

Findings

01 Relevance judgment distributions improve model performance.

02 Training on sampled distributions can outperform traditional labels.

03 Models trained on distributions outperform LambdaMART on several datasets.

Abstract

Learning to Rank (LETOR) algorithms are usually trained on annotated corpora where a single relevance label is assigned to each available document-topic pair. Within the Cranfield framework, relevance labels result from merging either multiple expertly curated or crowdsourced human assessments. In this paper, we explore how to train LETOR models with relevance judgments distributions (either real or synthetically generated) assigned to document-topic pairs instead of single-valued relevance labels. We propose five new probabilistic loss functions to deal with the higher expressive power provided by relevance judgments distributions and show how they can be applied both to neural and GBM architectures. Moreover, we show how training a LETOR model on a sampled version of the relevance judgments from certain probability distributions can improve its performance when relying either on…
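To make the idea of training on a judgment distribution concrete, here is a minimal sketch in plain Python. It is not one of the paper's five loss functions, which the abstract does not specify. It assumes a KL-divergence loss between the empirical distribution of assessor labels and a model's softmax output over relevance grades. All function names, the three-grade scale, the example judgments, and the logits are hypothetical.

```python
import math
import random
from collections import Counter


def judgments_to_distribution(judgments, num_grades):
    """Turn raw assessor labels (ints in [0, num_grades)) into an empirical distribution."""
    counts = Counter(judgments)
    total = len(judgments)
    return [counts.get(grade, 0) / total for grade in range(num_grades)]


def majority_label(judgments):
    """Merge the judgments into a single label, as in the traditional setup."""
    return Counter(judgments).most_common(1)[0][0]


def softmax(logits):
    """Numerically stable softmax over a list of scores."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted); grades with zero target mass contribute nothing."""
    return sum(
        t * math.log(t / max(p, eps))
        for t, p in zip(target, predicted)
        if t > 0
    )


def sample_label(distribution, rng):
    """Draw one relevance grade from a judgment distribution."""
    grades = list(range(len(distribution)))
    return rng.choices(grades, weights=distribution, k=1)[0]


# Hypothetical data: five assessors grade one document-topic pair on a 0-2 scale.
judgments = [2, 2, 1, 2, 0]
target = judgments_to_distribution(judgments, num_grades=3)

# Hypothetical model output: one logit per relevance grade.
predicted = softmax([0.1, 0.5, 1.2])

loss = kl_divergence(target, predicted)
single_label = majority_label(judgments)
sampled = sample_label(target, random.Random(0))

print("target distribution:", target)
print("merged single label:", single_label)
print("KL loss:", loss)
print("sampled label:", sampled)
```

The merged label keeps only the majority grade, while the distribution also records that two of the five assessors disagreed; a distribution-aware loss can penalize a model for ignoring that disagreement. The `sample_label` helper only illustrates drawing training labels from a distribution; which distributions the paper samples from is not stated in the text above.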

Code & Models

Repositories

albpurpura/pltr (tf, official)

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning