Connecting Attributions and QA Model Behavior on Realistic   Counterfactuals

Xi Ye; Rohan Nair; Greg Durrett

arXiv:2104.04515·cs.CL·September 15, 2021

Connecting Attributions and QA Model Behavior on Realistic Counterfactuals

Xi Ye, Rohan Nair, Greg Durrett

PDF

Open Access 1 Repo

TL;DR

This paper evaluates how well different attribution techniques explain reading comprehension models' behavior on realistic counterfactuals, finding pairwise attributions more effective than token-level methods.

Contribution

It introduces a framework for assessing attribution methods' alignment with counterfactual reasoning in reading comprehension tasks, proposing a modification to improve pairwise attribution performance.

Findings

01

Pairwise attributions outperform token-level attributions in RC.

02

A new modification to an existing pairwise attribution method improves results.

03

Attribution methods can be connected to model behavior on realistic counterfactuals.

Abstract

When a model attribution technique highlights a particular part of the input, a user might understand this highlight as making a statement about counterfactuals (Miller, 2019): if that part of the input were to change, the model's prediction might change as well. This paper investigates how well different attribution techniques align with this assumption on realistic counterfactuals in the case of reading comprehension (RC). RC is a particularly challenging test case, as token-level attributions that have been extensively studied in other NLP tasks such as sentiment analysis are less suitable to represent the reasoning that RC models perform. We construct counterfactual sets for three different RC settings, and through heuristics that can connect attribution methods' outputs to high-level model behavior, we can evaluate how useful different attribution methods and even different formats…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xiye17/EvalQAExpl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Software Engineering Research

MethodsCounterfactuals Explanations