A Lightweight Method to Generate Unanswerable Questions in English
Vagrant Gautam, Miaoran Zhang, Dietrich Klakow

TL;DR
This paper introduces a simple, lightweight data augmentation method using antonym and entity swaps to generate unanswerable questions, improving QA model performance without complex automated generation techniques.
Contribution
The authors propose a novel, training-free approach for unanswerable question generation that outperforms previous methods and serves as a strong baseline for future research.
Findings
Improves F1 score by 1.6 points on SQuAD 2.0 with BERT-large
Achieves +9.3 F1 points on TydiQA-MinSpan with BERT-large
Generated questions have higher human-judged relatedness and readability
Abstract
If a question cannot be answered with the available information, robust systems for question answering (QA) should know _not_ to answer. One way to build QA models that do this is with additional training data comprised of unanswerable questions, created either by employing annotators or through automated methods for unanswerable question generation. To show that the model complexity of existing automated approaches is not justified, we examine a simpler data augmentation method for unanswerable question generation in English: performing antonym and entity swaps on answerable questions. Compared to the prior state-of-the-art, data generated with our training-free and lightweight strategy results in better models (+1.6 F1 points on SQuAD 2.0 data with BERT-large), and has higher human-judged relatedness and readability. We quantify the raw benefits of our approach compared to no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
