Improving Question Answering Model Robustness with Synthetic Adversarial   Data Generation

Max Bartolo; Tristan Thrush; Robin Jia; Sebastian Riedel; Pontus; Stenetorp; Douwe Kiela

arXiv:2104.08678·cs.CL·March 16, 2022

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation

Max Bartolo, Tristan Thrush, Robin Jia, Sebastian Riedel, Pontus, Stenetorp, Douwe Kiela

PDF

3 Models

TL;DR

This paper introduces a synthetic adversarial data generation pipeline to enhance question answering model robustness, achieving significant improvements on adversarial and generalization benchmarks while reducing vulnerability to human-crafted attacks.

Contribution

It is the first to use synthetic data generation for question answering robustness, amplifying small datasets to improve model resilience against adversarial attacks.

Findings

01

Improved state-of-the-art on AdversarialQA by 3.7 F1 score.

02

Enhanced generalization on nine MRQA datasets.

03

Models are less fooled by human adversaries, with fooling rate reduced from 17.6% to 8.8%.

Abstract

Despite recent progress, state-of-the-art question answering models remain vulnerable to a variety of adversarial attacks. While dynamic adversarial data collection, in which a human annotator tries to write examples that fool a model-in-the-loop, can improve model robustness, this process is expensive which limits the scale of the collected data. In this work, we are the first to use synthetic adversarial data generation to make question answering models more robust to human adversaries. We develop a data generation pipeline that selects source passages, identifies candidate answers, generates questions, then finally filters or re-labels them to improve quality. Using this approach, we amplify a smaller human-written adversarial dataset to a much larger set of synthetic question-answer pairs. By incorporating our synthetic data, we improve the state-of-the-art on the AdversarialQA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.