Semi-Supervised QA with Generative Domain-Adaptive Nets
Zhilin Yang, Junjie Hu, Ruslan Salakhutdinov, William W. Cohen

TL;DR
This paper introduces a semi-supervised question answering framework that leverages unlabeled text through generative models and domain adaptation techniques, significantly improving QA performance.
Contribution
It presents a novel generative domain-adaptive training framework for semi-supervised QA, combining question generation with reinforcement learning-based domain adaptation.
Findings
Substantial performance improvements using unlabeled data
Effective domain adaptation reduces data distribution mismatch
Generative models enhance question answering accuracy
Abstract
We study the problem of semi-supervised question answering----utilizing unlabeled text to boost the performance of question answering models. We propose a novel training framework, the Generative Domain-Adaptive Nets. In this framework, we train a generative model to generate questions based on the unlabeled text, and combine model-generated questions with human-generated questions for training question answering models. We develop novel domain adaptation algorithms, based on reinforcement learning, to alleviate the discrepancy between the model-generated data distribution and the human-generated data distribution. Experiments show that our proposed framework obtains substantial improvement from unlabeled text.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
