Continual Domain Adaptation for Machine Reading Comprehension
Lixin Su, Jiafeng Guo, Ruqing Zhang, Yixing Fan, Yanyan Lan, Xueqi, Cheng

TL;DR
This paper introduces the continual domain adaptation (CDA) task for machine reading comprehension, creates benchmark datasets, and proposes models to address domain shift challenges, with dynamic-architecture models performing best.
Contribution
First to study continual learning in MRC, introducing CDA task, creating benchmark datasets, and proposing effective models including dynamic-architecture approaches.
Findings
Dynamic-architecture models outperform regularization-based models.
Catastrophic forgetting is observed in MRC under CDA.
Benchmark datasets reveal domain adaptation challenges.
Abstract
Machine reading comprehension (MRC) has become a core component in a variety of natural language processing (NLP) applications such as question answering and dialogue systems. It becomes a practical challenge that an MRC model needs to learn in non-stationary environments, in which the underlying data distribution changes over time. A typical scenario is the domain drift, i.e. different domains of data come one after another, where the MRC model is required to adapt to the new domain while maintaining previously learned ability. To tackle such a challenge, in this work, we introduce the \textit{Continual Domain Adaptation} (CDA) task for MRC. So far as we know, this is the first study on the continual learning perspective of MRC. We build two benchmark datasets for the CDA task, by re-organizing existing MRC collections into different domains with respect to context type and question…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
