Robust Domain Adaptation for Machine Reading Comprehension
Liang Jiang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng

TL;DR
This paper introduces RMRC, a novel domain adaptation method for machine reading comprehension that leverages dialogue context to generate more accurate QA pairs, reducing noise and improving performance across domains.
Contribution
The paper proposes a new domain adaptation approach for MRC that uses dialogue information and a reinforced self-training process to mitigate noisy pseudo QA pairs caused by domain shifts.
Findings
RMRC significantly improves MRC accuracy on multiple datasets.
Dialogue-based QA pair construction reduces noise in domain adaptation.
The method demonstrates robustness against mismatched question-answer pairs.
Abstract
Most domain adaptation methods for machine reading comprehension (MRC) use a pre-trained question-answer (QA) construction model to generate pseudo QA pairs for MRC transfer. Such a process will inevitably introduce mismatched pairs (i.e., noisy correspondence) due to i) the unavailable QA pairs in target documents, and ii) the domain shift during applying the QA construction model to the target domain. Undoubtedly, the noisy correspondence will degenerate the performance of MRC, which however is neglected by existing works. To solve such an untouched problem, we propose to construct QA pairs by additionally using the dialogue related to the documents, as well as a new domain adaptation method for MRC. Specifically, we propose Robust Domain Adaptation for Machine Reading Comprehension (RMRC) method which consists of an answer extractor (AE), a question selector (QS), and an MRC model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsAutoencoders
