TL;DR
This paper introduces a synthetic adversarial data generation pipeline to enhance question answering model robustness, achieving significant improvements on adversarial and generalization benchmarks while reducing vulnerability to human-crafted attacks.
Contribution
It is the first to use synthetic data generation for question answering robustness, amplifying small datasets to improve model resilience against adversarial attacks.
Findings
Improved state-of-the-art on AdversarialQA by 3.7 F1 score.
Enhanced generalization on nine MRQA datasets.
Models are less fooled by human adversaries, with fooling rate reduced from 17.6% to 8.8%.
Abstract
Despite recent progress, state-of-the-art question answering models remain vulnerable to a variety of adversarial attacks. While dynamic adversarial data collection, in which a human annotator tries to write examples that fool a model-in-the-loop, can improve model robustness, this process is expensive which limits the scale of the collected data. In this work, we are the first to use synthetic adversarial data generation to make question answering models more robust to human adversaries. We develop a data generation pipeline that selects source passages, identifies candidate answers, generates questions, then finally filters or re-labels them to improve quality. Using this approach, we amplify a smaller human-written adversarial dataset to a much larger set of synthetic question-answer pairs. By incorporating our synthetic data, we improve the state-of-the-art on the AdversarialQA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
