Improving Query Representations for Dense Retrieval with Pseudo   Relevance Feedback: A Reproducibility Study

Hang Li; Shengyao Zhuang; Ahmed Mourad; Xueguang Ma; Jimmy; Lin; Guido Zuccon

arXiv:2112.06400·cs.IR·March 22, 2023·1 cites

Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study

Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy, Lin, Guido Zuccon

PDF

Open Access 1 Repo

TL;DR

This paper reproduces and analyzes the ANCE-PRF method for improving dense retrieval effectiveness through pseudo-relevance feedback, examining its reproducibility, hyper-parameter sensitivity, and generalizability across different dense retrievers.

Contribution

It provides a comprehensive reproducibility study of ANCE-PRF, extends empirical analysis on hyper-parameters, and explores its applicability with various dense retrievers.

Findings

01

Reproducibility of ANCE-PRF training and inference steps is confirmed.

02

Hyper-parameter settings significantly impact PRF effectiveness.

03

ANCE-PRF generalizes to other dense retrievers beyond ANCE.

Abstract

Pseudo-Relevance Feedback (PRF) utilises the relevance signals from the top-k passages from the first round of retrieval to perform a second round of retrieval aiming to improve search effectiveness. A recent research direction has been the study and development of PRF methods for deep language models based rankers, and in particular in the context of dense retrievers. Dense retrievers, compared to more complex neural rankers, provide a trade-off between effectiveness, which is often reduced compared to more complex neural rankers, and query latency, which also is reduced making the retrieval pipeline more efficient. The introduction of PRF methods for dense retrievers has been motivated as an attempt to further improve their effectiveness. In this paper, we reproduce and study a recent method for PRF with dense retrievers, called ANCE-PRF. This method concatenates the query text and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ielab/apr
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Machine Learning and Algorithms