Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question   Answering

Aditya Gupta; Jiacheng Xu; Shyam Upadhyay; Diyi Yang; Manaal Faruqui

arXiv:2106.04016·cs.CL·June 9, 2021

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

Aditya Gupta, Jiacheng Xu, Shyam Upadhyay, Diyi Yang, Manaal Faruqui

PDF

2 Repos 1 Datasets

TL;DR

Disfl-QA is a new dataset that introduces disfluencies into question answering to evaluate and improve model robustness in understanding naturally spoken language, highlighting the performance gap and potential solutions.

Contribution

We created Disfl-QA, the first large-scale dataset with disfluencies in QA, and demonstrated its impact on model performance and potential mitigation strategies.

Findings

01

State-of-the-art models' performance drops significantly on Disfl-QA.

02

Data augmentation partially recovers model performance.

03

Fine-tuning with gold data improves robustness.

Abstract

Disfluencies is an under-studied topic in NLP, even though it is ubiquitous in human conversation. This is largely due to the lack of datasets containing disfluencies. In this paper, we present a new challenge question answering dataset, Disfl-QA, a derivative of SQuAD, where humans introduce contextual disfluencies in previously fluent questions. Disfl-QA contains a variety of challenging disfluencies that require a more comprehensive understanding of the text than what was necessary in prior datasets. Experiments show that the performance of existing state-of-the-art question answering models degrades significantly when tested on Disfl-QA in a zero-shot setting.We show data augmentation methods partially recover the loss in performance and also demonstrate the efficacy of using gold data for fine-tuning. We argue that we need large-scale disfluency datasets in order for NLP models to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

google-research-datasets/disfl_qa
dataset· 66 dl
66 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.