A Multi-level Distillation based Dense Passage Retrieval Model

Haifeng Li; Mo Hai; Dong Tang

arXiv:2312.16821·cs.IR·December 29, 2023·1 cites

A Multi-level Distillation based Dense Passage Retrieval Model

Haifeng Li, Mo Hai, Dong Tang

PDF

Open Access

TL;DR

This paper introduces MD2PR, a dense passage retrieval model that uses multi-level distillation from a cross-encoder to a dual-encoder, improving retrieval accuracy while maintaining real-time efficiency.

Contribution

The paper proposes a novel multi-level distillation approach for dense passage retrieval, enhancing dual-encoder performance by transferring knowledge from cross-encoders at sentence and word levels.

Findings

01

MD2PR outperforms 11 baseline models in MRR and Recall.

02

The dynamic filtering method improves false negative identification.

03

Knowledge distillation enhances dual-encoder semantic understanding.

Abstract

Ranker and retriever are two important components in dense passage retrieval. The retriever typically adopts a dual-encoder model, where queries and documents are separately input into two pre-trained models, and the vectors generated by the models are used for similarity calculation. The ranker often uses a cross-encoder model, where the concatenated query-document pairs are input into a pre-trained model to obtain word similarities. However, the dual-encoder model lacks interaction between queries and documents due to its independent encoding, while the cross-encoder model requires substantial computational cost for attention calculation, making it difficult to obtain real-time retrieval results. In this paper, we propose a dense retrieval model called MD2PR based on multi-level distillation. In this model, we distill the knowledge learned from the cross-encoder to the dual-encoder at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Information Retrieval and Search Behavior · Expert finding and Q&A systems