A Multi-level Distillation based Dense Passage Retrieval Model
Haifeng Li, Mo Hai, Dong Tang

TL;DR
This paper introduces MD2PR, a dense passage retrieval model that uses multi-level distillation from a cross-encoder to a dual-encoder, improving retrieval accuracy while maintaining real-time efficiency.
Contribution
The paper proposes a novel multi-level distillation approach for dense passage retrieval, enhancing dual-encoder performance by transferring knowledge from cross-encoders at sentence and word levels.
Findings
MD2PR outperforms 11 baseline models in MRR and Recall.
The dynamic filtering method improves false negative identification.
Knowledge distillation enhances dual-encoder semantic understanding.
Abstract
Ranker and retriever are two important components in dense passage retrieval. The retriever typically adopts a dual-encoder model, where queries and documents are separately input into two pre-trained models, and the vectors generated by the models are used for similarity calculation. The ranker often uses a cross-encoder model, where the concatenated query-document pairs are input into a pre-trained model to obtain word similarities. However, the dual-encoder model lacks interaction between queries and documents due to its independent encoding, while the cross-encoder model requires substantial computational cost for attention calculation, making it difficult to obtain real-time retrieval results. In this paper, we propose a dense retrieval model called MD2PR based on multi-level distillation. In this model, we distill the knowledge learned from the cross-encoder to the dual-encoder at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Expert finding and Q&A systems
