Momentum Contrastive Pre-training for Question Answering

Minda Hu; Muzhi Li; Yasheng Wang; Irwin King

arXiv:2212.05762·cs.CL·October 17, 2023

Momentum Contrastive Pre-training for Question Answering

Minda Hu, Muzhi Li, Yasheng Wang, Irwin King

PDF

Open Access

TL;DR

This paper introduces MCROSS, a momentum contrastive pre-training method that aligns cloze-like and natural questions to improve extractive question answering models, showing significant gains on benchmark datasets.

Contribution

The paper proposes a novel contrastive pre-training framework that enhances transferability from cloze-like to natural questions in extractive QA.

Findings

01

Improved performance on three QA benchmarks.

02

Effective in both supervised and zero-shot settings.

03

Outperforms baseline pre-training methods.

Abstract

Existing pre-training methods for extractive Question Answering (QA) generate cloze-like queries different from natural questions in syntax structure, which could overfit pre-trained models to simple keyword matching. In order to address this problem, we propose a novel Momentum Contrastive pRe-training fOr queStion anSwering (MCROSS) method for extractive QA. Specifically, MCROSS introduces a momentum contrastive learning framework to align the answer probability between cloze-like and natural query-passage sample pairs. Hence, the pre-trained models can better transfer the knowledge learned in cloze-like samples to answering natural questions. Experimental results on three benchmarking QA datasets show that our method achieves noticeable improvement compared with all baselines in both supervised and zero-shot scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsALIGN · Contrastive Learning