PairDistill: Pairwise Relevance Distillation for Dense Retrieval

Chao-Wei Huang; Yun-Nung Chen

arXiv:2410.01383 · cs.IR · October 3, 2024

TL;DR

PairDistill introduces pairwise relevance distillation to improve dense retrieval models by leveraging fine-grained pairwise comparisons, leading to state-of-the-art results across multiple benchmarks.

Contribution

The paper proposes a novel pairwise relevance distillation method that enhances dense retrieval training by utilizing pairwise rerankers instead of pointwise ones.

Findings

01. Outperforms existing knowledge distillation methods in dense retrieval.
02. Achieves new state-of-the-art results on multiple benchmarks.
03. Demonstrates the effectiveness of pairwise comparisons in model training.

Abstract

Effective information retrieval (IR) from vast datasets relies on advanced techniques to extract relevant information in response to queries. Recent advancements in dense retrieval have showcased remarkable efficacy compared to traditional sparse retrieval methods. To further enhance retrieval performance, knowledge distillation techniques, often leveraging robust cross-encoder rerankers, have been extensively explored. However, existing approaches primarily distill knowledge from pointwise rerankers, which assign absolute relevance scores to documents, thus facing challenges related to inconsistent comparisons. This paper introduces Pairwise Relevance Distillation (PairDistill) to leverage pairwise reranking, offering fine-grained distinctions between similarly relevant documents to enrich the training of dense retrieval models. Our experiments demonstrate that PairDistill outperforms…
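The contrast between pointwise and pairwise supervision can be made concrete with a small sketch. The excerpt above does not give the loss, so the formulation here is an assumption: the student's preference for document i over document j is taken as the sigmoid of its score difference, and it is pulled toward the pairwise teacher's preference probability with a Bernoulli KL divergence. All function and parameter names (`student_pair_prob`, `binary_kl`, `pairwise_distill_loss`, `teacher_pair_probs`) are hypothetical and not taken from the authors' repository.

```python
import math


def student_pair_prob(score_i: float, score_j: float) -> float:
    """Student's probability that doc i outranks doc j.

    Assumed form: sigmoid of the score difference, which equals a
    softmax over the two retrieval scores.
    """
    return 1.0 / (1.0 + math.exp(-(score_i - score_j)))


def binary_kl(p: float, q: float, eps: float = 1e-12) -> float:
    """KL(p || q) between two Bernoulli distributions, clamped for stability."""
    p = min(max(p, eps), 1.0 - eps)
    q = min(max(q, eps), 1.0 - eps)
    return p * math.log(p / q) + (1.0 - p) * math.log((1.0 - p) / (1.0 - q))


def pairwise_distill_loss(student_scores, teacher_pair_probs):
    """Mean KL between teacher and student pairwise preferences.

    student_scores: list of student (dense retriever) scores, one per doc.
    teacher_pair_probs: dict mapping (i, j) to the pairwise teacher's
        probability that doc i is more relevant than doc j (hypothetical input).
    """
    losses = [
        binary_kl(p_teacher, student_pair_prob(student_scores[i], student_scores[j]))
        for (i, j), p_teacher in teacher_pair_probs.items()
    ]
    # An empty set of pairs contributes no loss.
    return sum(losses) / len(losses) if losses else 0.0


if __name__ == "__main__":
    # Two similarly relevant documents: the teacher prefers doc 0 with p = 0.9.
    teacher = {(0, 1): 0.9}
    print(pairwise_distill_loss([1.0, 1.0], teacher))  # student is indifferent
    print(pairwise_distill_loss([2.0, 0.0], teacher))  # student leans toward doc 0
```

In this sketch the loss shrinks as the student's score gap moves toward the teacher's preference, which is the kind of fine-grained signal between similarly relevant documents that the abstract attributes to pairwise reranking.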

Code & Models

Repositories

miulab/pairdistill (PyTorch, official)

Videos

PairDistill: Pairwise Relevance Distillation for Dense Retrieval (Underline)

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques

Methods: Knowledge Distillation