Dense Passage Retrieval: Is it Retrieving?
Benjamin Reichman, Larry Heck

TL;DR
This paper investigates the inner workings of dense passage retrieval (DPR), revealing how knowledge is stored and accessed within the model, and identifies limitations related to pre-trained knowledge bounds, suggesting future improvements.
Contribution
It provides a mechanistic analysis of DPR training, uncovering how knowledge decentralization occurs and highlighting the bounds imposed by pre-trained model knowledge.
Findings
DPR decentralizes knowledge storage with multiple access pathways.
Pre-trained model knowledge limits retrieval capabilities.
Insights suggest new directions for enhancing dense retrieval.
Abstract
Dense passage retrieval (DPR) is the first step in the retrieval augmented generation (RAG) paradigm for improving the performance of large language models (LLM). DPR fine-tunes pre-trained networks to enhance the alignment of the embeddings between queries and relevant textual data. A deeper understanding of DPR fine-tuning will be required to fundamentally unlock the full potential of this approach. In this work, we explore DPR-trained models mechanistically by using a combination of probing, layer activation analysis, and model editing. Our experiments show that DPR training decentralizes how knowledge is stored in the network, creating multiple access pathways to the same information. We also uncover a limitation in this training style: the internal knowledge of the pre-trained model bounds what the retrieval model can retrieve. These findings suggest a few possible directions for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
