Continual Domain Adaptation for Machine Reading Comprehension

Lixin Su; Jiafeng Guo; Ruqing Zhang; Yixing Fan; Yanyan Lan; Xueqi; Cheng

arXiv:2008.10874·cs.IR·August 26, 2020

Continual Domain Adaptation for Machine Reading Comprehension

Lixin Su, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Yanyan Lan, Xueqi, Cheng

PDF

TL;DR

This paper introduces the continual domain adaptation (CDA) task for machine reading comprehension, creates benchmark datasets, and proposes models to address domain shift challenges, with dynamic-architecture models performing best.

Contribution

First to study continual learning in MRC, introducing CDA task, creating benchmark datasets, and proposing effective models including dynamic-architecture approaches.

Findings

01

Dynamic-architecture models outperform regularization-based models.

02

Catastrophic forgetting is observed in MRC under CDA.

03

Benchmark datasets reveal domain adaptation challenges.

Abstract

Machine reading comprehension (MRC) has become a core component in a variety of natural language processing (NLP) applications such as question answering and dialogue systems. It becomes a practical challenge that an MRC model needs to learn in non-stationary environments, in which the underlying data distribution changes over time. A typical scenario is the domain drift, i.e. different domains of data come one after another, where the MRC model is required to adapt to the new domain while maintaining previously learned ability. To tackle such a challenge, in this work, we introduce the \textit{Continual Domain Adaptation} (CDA) task for MRC. So far as we know, this is the first study on the continual learning perspective of MRC. We build two benchmark datasets for the CDA task, by re-organizing existing MRC collections into different domains with respect to context type and question…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.