LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

Kai Zhang; Chongyang Tao; Tao Shen; Can Xu; Xiubo Geng; Binxing Jiao,; Daxin Jiang

arXiv:2208.13661·cs.CL·March 6, 2023·6 cites

LED: Lexicon-Enlightened Dense Retriever for Large-Scale Retrieval

Kai Zhang, Chongyang Tao, Tao Shen, Can Xu, Xiubo Geng, Binxing Jiao,, Daxin Jiang

PDF

Open Access 3 Repos

TL;DR

This paper introduces LED, a dense retrieval model enhanced with lexicon-aware representations, which aligns with a lexicon-aware teacher model through knowledge distillation, improving retrieval performance on benchmark datasets.

Contribution

The paper proposes a novel alignment method for dense retrievers using weakened knowledge distillation with lexicon-aware contrastive and rank regularization techniques.

Findings

01

LED outperforms baseline dense retrievers on three benchmarks.

02

The model surpasses its lexicon-aware teacher in retrieval effectiveness.

03

Combining LED with standard ranker distillation further improves results.

Abstract

Retrieval models based on dense representations in semantic space have become an indispensable branch for first-stage retrieval. These retrievers benefit from surging advances in representation learning towards compressive global sequence-level embeddings. However, they are prone to overlook local salient phrases and entity mentions in texts, which usually play pivot roles in first-stage retrieval. To mitigate this weakness, we propose to make a dense retriever align a well-performing lexicon-aware representation model. The alignment is achieved by weakened knowledge distillations to enlighten the retriever via two aspects -- 1) a lexicon-augmented contrastive objective to challenge the dense encoder and 2) a pair-wise rank-consistent regularization to make dense model's behavior incline to the other. We evaluate our model on three public benchmarks, which shows that with a comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsALIGN