Connecting Attributions and QA Model Behavior on Realistic Counterfactuals
Xi Ye, Rohan Nair, Greg Durrett

TL;DR
This paper evaluates how well different attribution techniques explain reading comprehension models' behavior on realistic counterfactuals, finding pairwise attributions more effective than token-level methods.
Contribution
It introduces a framework for assessing attribution methods' alignment with counterfactual reasoning in reading comprehension tasks, proposing a modification to improve pairwise attribution performance.
Findings
Pairwise attributions outperform token-level attributions in RC.
A new modification to an existing pairwise attribution method improves results.
Attribution methods can be connected to model behavior on realistic counterfactuals.
Abstract
When a model attribution technique highlights a particular part of the input, a user might understand this highlight as making a statement about counterfactuals (Miller, 2019): if that part of the input were to change, the model's prediction might change as well. This paper investigates how well different attribution techniques align with this assumption on realistic counterfactuals in the case of reading comprehension (RC). RC is a particularly challenging test case, as token-level attributions that have been extensively studied in other NLP tasks such as sentiment analysis are less suitable to represent the reasoning that RC models perform. We construct counterfactual sets for three different RC settings, and through heuristics that can connect attribution methods' outputs to high-level model behavior, we can evaluate how useful different attribution methods and even different formats…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Software Engineering Research
MethodsCounterfactuals Explanations
