Robust Domain Adaptation for Machine Reading Comprehension

Liang Jiang; Zhenyu Huang; Jia Liu; Zujie Wen; Xi Peng

arXiv:2209.11615·cs.LG·September 26, 2022

Robust Domain Adaptation for Machine Reading Comprehension

Liang Jiang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng

PDF

Open Access 1 Video

TL;DR

This paper introduces RMRC, a novel domain adaptation method for machine reading comprehension that leverages dialogue context to generate more accurate QA pairs, reducing noise and improving performance across domains.

Contribution

The paper proposes a new domain adaptation approach for MRC that uses dialogue information and a reinforced self-training process to mitigate noisy pseudo QA pairs caused by domain shifts.

Findings

01

RMRC significantly improves MRC accuracy on multiple datasets.

02

Dialogue-based QA pair construction reduces noise in domain adaptation.

03

The method demonstrates robustness against mismatched question-answer pairs.

Abstract

Most domain adaptation methods for machine reading comprehension (MRC) use a pre-trained question-answer (QA) construction model to generate pseudo QA pairs for MRC transfer. Such a process will inevitably introduce mismatched pairs (i.e., noisy correspondence) due to i) the unavailable QA pairs in target documents, and ii) the domain shift during applying the QA construction model to the target domain. Undoubtedly, the noisy correspondence will degenerate the performance of MRC, which however is neglected by existing works. To solve such an untouched problem, we propose to construct QA pairs by additionally using the dialogue related to the documents, as well as a new domain adaptation method for MRC. Specifically, we propose Robust Domain Adaptation for Machine Reading Comprehension (RMRC) method which consists of an answer extractor (AE), a question selector (QS), and an MRC model.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Robust Domain Adaptation for Machine Reading Comprehension· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsAutoencoders