Domain Adaptation for Dense Retrieval through Self-Supervision by Pseudo-Relevance Labeling
Minghan Li, Eric Gaussier

TL;DR
This paper introduces a self-supervised domain adaptation method for dense retrieval models using pseudo-relevance labels generated through BM25 and re-ranking, combined with knowledge distillation, to improve cross-domain generalization.
Contribution
It proposes a novel self-supervision approach with pseudo-relevance labeling and knowledge distillation to enhance dense retrieval in new domains, outperforming existing methods.
Findings
Pseudo-relevance labeling with T53B improves retrieval performance.
Combining pseudo-labels with knowledge distillation yields better results.
The approach outperforms state-of-the-art query generation methods.
Abstract
Although neural information retrieval has witnessed great improvements, recent works showed that the generalization ability of dense retrieval models on target domains with different distributions is limited, which contrasts with the results obtained with interaction-based models. To address this issue, researchers have resorted to adversarial learning and query generation approaches; both approaches nevertheless resulted in limited improvements. In this paper, we propose to use a self-supervision approach in which pseudo-relevance labels are automatically generated on the target domain. To do so, we first use the standard BM25 model on the target domain to obtain a first ranking of documents, and then use the interaction-based model T53B to re-rank top documents. We further combine this approach with knowledge distillation relying on an interaction-based teacher model trained on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling
MethodsKnowledge Distillation
