Are you tough enough? Framework for Robustness Validation of Machine   Comprehension Systems

Barbara Rychalska; Dominika Basaj; Przemyslaw Biecek

arXiv:1812.02205·cs.CL·December 7, 2018·5 cites

Are you tough enough? Framework for Robustness Validation of Machine Comprehension Systems

Barbara Rychalska, Dominika Basaj, Przemyslaw Biecek

PDF

Open Access 1 Repo

TL;DR

This paper introduces a framework for evaluating the robustness of question answering models using explainability techniques and adversarial training, revealing fragility in current models and proposing improvements.

Contribution

It proposes a novel robustness validation framework utilizing model explainers and adversarial training for QA systems, along with a new dataset for robustness testing.

Findings

01

State-of-the-art models are fragile to input changes.

02

Adversarial training improves model robustness by up to 7%.

03

The new dataset enables robustness validation of QA models.

Abstract

Deep Learning NLP domain lacks procedures for the analysis of model robustness. In this paper we propose a framework which validates robustness of any Question Answering model through model explainers. We propose that a robust model should transgress the initial notion of semantic similarity induced by word embeddings to learn a more human-like understanding of meaning. We test this property by manipulating questions in two ways: swapping important question word for 1) its semantically correct synonym and 2) for word vector that is close in embedding space. We estimate importance of words in asked questions with Locally Interpretable Model Agnostic Explanations method (LIME). With these two steps we compare state-of-the-art Q&A models. We show that although accuracy of state-of-the-art models is high, they are very fragile to changes in the input. Moreover, we propose 2 adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MI2DataLab/nlp_interpretability_framework
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning