Improving Unsupervised Question Answering via Summarization-Informed Question Generation
Chenyang Lyu, Lifeng Shang, Yvette Graham, Jennifer Foster, Xin Jiang,, Qun Liu

TL;DR
This paper introduces an unsupervised question generation method leveraging summarization and linguistic heuristics to create synthetic QA pairs, significantly improving unsupervised QA performance across multiple datasets.
Contribution
The paper presents a novel unsupervised QG approach using summarization-informed heuristics to generate training data, reducing reliance on domain-specific datasets.
Findings
Outperforms previous unsupervised models on multiple datasets
Uses only 20k synthetic QA pairs for training
Demonstrates strong transferability across domains
Abstract
Question Generation (QG) is the task of generating a plausible question for a given <passage, answer> pair. Template-based QG uses linguistically-informed heuristics to transform declarative sentences into interrogatives, whereas supervised QG uses existing Question Answering (QA) datasets to train a system to generate a question given a passage and an answer. A disadvantage of the heuristic approach is that the generated questions are heavily tied to their declarative counterparts. A disadvantage of the supervised approach is that they are heavily tied to the domain/language of the QA dataset used as training data. In order to overcome these shortcomings, we propose an unsupervised QG method which uses questions generated heuristically from summaries as a source of training data for a QG system. We make use of freely available news summary data, transforming declarative summary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
