Adversarial Domain Adaptation for Machine Reading Comprehension
Huazheng Wang, Zhe Gan, Xiaodong Liu, Jingjing Liu, Jianfeng Gao,, Hongning Wang

TL;DR
This paper introduces AdaMRC, an adversarial domain adaptation framework for machine reading comprehension that generates pseudo questions and learns domain-invariant representations, improving cross-domain performance without labeled target data.
Contribution
The paper proposes a novel adversarial domain adaptation method for MRC that generates pseudo questions and enforces domain-invariant features, applicable to various models and pre-trained language models.
Findings
Effective domain adaptation across different datasets
Compatibility with large-scale pre-trained models like BERT and ELMo
Extension to semi-supervised learning scenarios
Abstract
In this paper, we focus on unsupervised domain adaptation for Machine Reading Comprehension (MRC), where the source domain has a large amount of labeled data, while only unlabeled passages are available in the target domain. To this end, we propose an Adversarial Domain Adaptation framework (AdaMRC), where () pseudo questions are first generated for unlabeled passages in the target domain, and then () a domain classifier is incorporated into an MRC model to predict which domain a given passage-question pair comes from. The classifier and the passage-question encoder are jointly trained using adversarial learning to enforce domain-invariant representation learning. Comprehensive evaluations demonstrate that our approach () is generalizable to different MRC models and datasets, () can be combined with pre-trained large-scale language models (such as ELMo and BERT), and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Bidirectional LSTM · Softmax · ELMo
