The Impacts of Unanswerable Questions on the Robustness of Machine   Reading Comprehension Models

Son Quoc Tran; Phong Nguyen-Thuan Do; Uyen Le; Matt Kretchmar

arXiv:2302.00094·cs.AI·February 2, 2023

The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models

Son Quoc Tran, Phong Nguyen-Thuan Do, Uyen Le, Matt Kretchmar

PDF

Open Access

TL;DR

This paper investigates whether training on unanswerable questions in SQuAD 2.0 enhances the robustness of machine reading comprehension models against adversarial attacks, revealing hidden robustness and out-of-domain generalization.

Contribution

It demonstrates that training on SQuAD 2.0 introduces hidden robustness in models and presents a new adversarial attack to identify learned artifacts.

Findings

01

Models trained on SQuAD 2.0 show hidden robustness.

02

Robustness extends to out-of-domain datasets.

03

A new adversarial attack reveals artifacts learned from SQuAD 2.0.

Abstract

Pretrained language models have achieved super-human performances on many Machine Reading Comprehension (MRC) benchmarks. Nevertheless, their relative inability to defend against adversarial attacks has spurred skepticism about their natural language understanding. In this paper, we ask whether training with unanswerable questions in SQuAD 2.0 can help improve the robustness of MRC models against adversarial attacks. To explore that question, we fine-tune three state-of-the-art language models on either SQuAD 1.1 or SQuAD 2.0 and then evaluate their robustness under adversarial attacks. Our experiments reveal that current models fine-tuned on SQuAD 2.0 do not initially appear to be any more robust than ones fine-tuned on SQuAD 1.1, yet they reveal a measure of hidden robustness that can be leveraged to realize actual performance gains. Furthermore, we find that the robustness of models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning