Momentum Posterior Regularization for Multi-hop Dense Retrieval
Zehua Xia, Yuyang Wu, Yiyun Xia, Cam-Tu Nguyen

TL;DR
This paper introduces MoPo, a novel method for multi-hop dense retrieval that distills posterior knowledge into prior retrieval using momentum updates, significantly improving multi-hop QA performance.
Contribution
MoPo proposes a new posterior regularization approach with momentum updates to enhance knowledge distillation in multi-hop dense retrieval.
Findings
MoPo outperforms existing baselines on HotpotQA and StrategyQA.
Effective knowledge distillation improves multi-hop QA accuracy.
Momentum-based training stabilizes the distillation process.
Abstract
Multi-hop question answering (QA) often requires sequential retrieval (multi-hop retrieval), where each hop retrieves missing knowledge based on information from previous hops. To facilitate more effective retrieval, we aim to distill knowledge from a posterior retrieval, which has access to posterior information like an answer, into a prior retrieval used during inference when such information is unavailable. Unfortunately, current methods for knowledge distillation in one-time retrieval are ineffective for multi-hop QA due to two issues: 1) Posterior information is often defined as the response (i.e. the answer), which may not clearly connect to the query without intermediate retrieval; and 2) The large knowledge gap between prior and posterior retrievals makes existing distillation methods unstable, even resulting in performance loss. As such, we propose MoPo (Momentum Posterior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
