Online Distillation for Pseudo-Relevance Feedback

Sean MacAvaney; Xi Wang

arXiv:2306.09657 · cs.IR · June 19, 2023 · 1 citation

Open Access

TL;DR

This paper introduces an online distillation method for pseudo-relevance feedback, enabling efficient lexical models to replicate neural re-ranking and improve document retrieval performance.

Contribution

It presents a novel online distillation approach that allows query-specific models to be distilled from neural re-ranking results for enhanced retrieval.

Findings

01. Online distilled models can effectively mimic neural re-ranking.

02. Distilled models improve retrieval by identifying relevant documents that the first retrieval stage missed.

03. The approach outperforms traditional pseudo-relevance feedback and hybrid methods.

Abstract

Model distillation has emerged as a prominent technique to improve neural search models. To date, distillation has taken an offline approach, wherein a new neural model is trained to predict relevance scores between arbitrary queries and documents. In this paper, we explore a departure from this offline distillation strategy by investigating whether a model for a specific query can be effectively distilled from neural re-ranking results (i.e., distilling in an online setting). Indeed, we find that a lexical model distilled online can reasonably replicate the re-ranking of a neural model. More importantly, these models can be used as queries that execute efficiently on indexes. This second retrieval stage can enrich the pool of documents for re-ranking by identifying documents that were missed in the first retrieval stage. Empirically, we show that this approach performs favourably when…
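To make the idea in the abstract concrete, here is a minimal sketch of distilling a per-query lexical model from re-ranking scores. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify the loss, features, or optimiser, so this sketch assumes binary term-presence features and a simple squared-error fit. The function names (`distill_lexical_query`, `lexical_score`) and the toy teacher scores are hypothetical.

```python
from collections import defaultdict


def distill_lexical_query(docs, teacher_scores, epochs=200, lr=0.1, top_k=5):
    """Fit per-term weights so a bag-of-words score approximates teacher scores.

    docs: list of token lists (the candidates retrieved for ONE query).
    teacher_scores: one score per document, assumed to come from a neural
    re-ranker (hypothetical values in the example below).
    Returns up to top_k (term, weight) pairs with positive weight.
    """
    weights = defaultdict(float)
    doc_terms = [set(tokens) for tokens in docs]  # binary term presence (assumption)
    for _ in range(epochs):
        for terms, target in zip(doc_terms, teacher_scores):
            pred = sum(weights[t] for t in terms)
            err = pred - target
            # Normalised squared-error step: moves pred a fraction lr toward target.
            step = lr * err / max(len(terms), 1)
            for t in terms:
                weights[t] -= step
    ranked = sorted(weights.items(), key=lambda kv: kv[1], reverse=True)
    return [(t, w) for t, w in ranked[:top_k] if w > 0]


def lexical_score(query_weights, doc_tokens):
    """Score a document with the distilled weighted-term query."""
    terms = set(doc_tokens)
    return sum(w for t, w in query_weights if t in terms)


# Toy first-stage candidates and made-up teacher scores.
docs = [
    ["neural", "ranking", "model"],
    ["neural", "retrieval", "index"],
    ["cooking", "pasta", "recipe"],
    ["pasta", "sauce"],
]
teacher_scores = [1.0, 0.9, 0.0, 0.1]

query = distill_lexical_query(docs, teacher_scores)
```

The resulting weighted terms play the role of the "model used as a query": in a real system they would be issued against an inverted index for a second retrieval stage, and any new documents found would be added to the pool for re-ranking. Here `lexical_score` stands in for that index lookup.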

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Neural Networks and Applications · Explainable Artificial Intelligence (XAI)