Retrieval-guided Counterfactual Generation for QA
Bhargavi Paranjape, Matthew Lamm, Ian Tenney

TL;DR
This paper introduces RGF, a retrieval-based method for generating diverse, fluent counterfactual questions to improve QA model robustness and performance, especially on out-of-domain and challenging datasets.
Contribution
We develop a Retrieve-Generate-Filter technique for automatic counterfactual data creation in QA, reducing human effort and enhancing model robustness.
Findings
RGF-generated data improves QA performance on out-of-domain datasets
Counterfactual augmentation enhances model robustness to local perturbations
Our method outperforms existing approaches in generating diverse, fluent counterfactuals
Abstract
Deep NLP models have been shown to learn spurious correlations, leaving them brittle to input perturbations. Recent work has shown that counterfactual or contrastive data -- i.e. minimally perturbed inputs -- can reveal these weaknesses, and that data augmentation using counterfactuals can help ameliorate them. Proposed techniques for generating counterfactuals rely on human annotations, perturbations based on simple heuristics, and meaning representation frameworks. We focus on the task of creating counterfactuals for question answering, which presents unique challenges related to world knowledge, semantic diversity, and answerability. To address these challenges, we develop a Retrieve-Generate-Filter(RGF) technique to create counterfactual evaluation and training data with minimal human supervision. Using an open-domain QA framework and question generation model trained on original…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques
MethodsCounterfactuals Explanations
