TL;DR
This paper introduces QADA, a self-supervised framework for adapting question answering models to new domains by augmenting data in hidden spaces and using contrastive learning to improve generalization across domains.
Contribution
QADA is the first to combine hidden space augmentation with attention-based contrastive adaptation for self-supervised QA domain adaptation.
Findings
QADA outperforms state-of-the-art baselines on multiple datasets.
The hidden space augmentation improves sample diversity.
Contrastive learning enhances domain generalization.
Abstract
Question answering (QA) has recently shown impressive results for answering questions from customized domains. Yet, a common challenge is to adapt QA models to an unseen target domain. In this paper, we propose a novel self-supervised framework called QADA for QA domain adaptation. QADA introduces a novel data augmentation pipeline used to augment training QA samples. Different from existing methods, we enrich the samples via hidden space augmentation. For questions, we introduce multi-hop synonyms and sample augmented token embeddings with Dirichlet distributions. For contexts, we develop an augmentation method which learns to drop context spans via a custom attentive sampling strategy. Additionally, contrastive learning is integrated in the proposed self-supervised adaptation framework QADA. Unlike existing approaches, we generate pseudo labels and propose to train the model via a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
