Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study
Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy, Lin, Guido Zuccon

TL;DR
This paper reproduces and analyzes the ANCE-PRF method for improving dense retrieval effectiveness through pseudo-relevance feedback, examining its reproducibility, hyper-parameter sensitivity, and generalizability across different dense retrievers.
Contribution
It provides a comprehensive reproducibility study of ANCE-PRF, extends empirical analysis on hyper-parameters, and explores its applicability with various dense retrievers.
Findings
Reproducibility of ANCE-PRF training and inference steps is confirmed.
Hyper-parameter settings significantly impact PRF effectiveness.
ANCE-PRF generalizes to other dense retrievers beyond ANCE.
Abstract
Pseudo-Relevance Feedback (PRF) utilises the relevance signals from the top-k passages from the first round of retrieval to perform a second round of retrieval aiming to improve search effectiveness. A recent research direction has been the study and development of PRF methods for deep language models based rankers, and in particular in the context of dense retrievers. Dense retrievers, compared to more complex neural rankers, provide a trade-off between effectiveness, which is often reduced compared to more complex neural rankers, and query latency, which also is reduced making the retrieval pipeline more efficient. The introduction of PRF methods for dense retrievers has been motivated as an attempt to further improve their effectiveness. In this paper, we reproduce and study a recent method for PRF with dense retrievers, called ANCE-PRF. This method concatenates the query text and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms
