PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense   Passage Retrieval

Ruiyang Ren; Shangwen Lv; Yingqi Qu; Jing Liu; Wayne Xin Zhao,; QiaoQiao She; Hua Wu; Haifeng Wang; Ji-Rong Wen

arXiv:2108.06027·cs.IR·April 25, 2023

PAIR: Leveraging Passage-Centric Similarity Relation for Improving Dense Passage Retrieval

Ruiyang Ren, Shangwen Lv, Yingqi Qu, Jing Liu, Wayne Xin Zhao,, QiaoQiao She, Hua Wu, Haifeng Wang, Ji-Rong Wen

PDF

1 Repo

TL;DR

The paper introduces PAIR, a novel dense passage retrieval method that leverages both query-centric and passage-centric similarity relations, significantly improving retrieval performance over previous models.

Contribution

It proposes a new approach that incorporates passage-centric similarity relations into dense retrieval, with formal formulations, pseudo-labeling via knowledge distillation, and a two-stage training process.

Findings

01

Outperforms state-of-the-art models on MSMARCO and Natural Questions datasets.

02

Effectively captures comprehensive similarity relations for better retrieval.

03

Demonstrates significant improvements in retrieval accuracy.

Abstract

Recently, dense passage retrieval has become a mainstream approach to finding relevant information in various natural language processing tasks. A number of studies have been devoted to improving the widely adopted dual-encoder architecture. However, most of the previous studies only consider query-centric similarity relation when learning the dual-encoder retriever. In order to capture more comprehensive similarity relations, we propose a novel approach that leverages both query-centric and PAssage-centric sImilarity Relations (called PAIR) for dense passage retrieval. To implement our approach, we make three major technical contributions by introducing formal formulations of the two kinds of similarity relations, generating high-quality pseudo labeled data via knowledge distillation, and designing an effective two-stage training procedure that incorporates passage-centric similarity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

paddlepaddle/rocketqa
paddleOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.