Evaluating Deep Taylor Decomposition for Reliability Assessment in the Wild

Stephanie Brandl; Daniel Hershcovich; Anders Søgaard

arXiv:2206.02661·cs.CL·June 7, 2022

TL;DR

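This paper evaluates Deep Taylor Decomposition token attributions in a real-world setting, with professional journalists assessing the reliability of news sources. Used with a fine-tuned RoBERTa-Large model, the attributions led to faster and better decisions and a more critical attitude toward news sources.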

Contribution

The paper provides an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, carried out with professional journalists performing reliability assessments of news sources.

Findings

1. Faster and more accurate human decisions with the method
2. Increased critical attitude toward news sources
3. Positive qualitative feedback from journalists

Abstract

We argue that we need to evaluate model interpretability methods 'in the wild', i.e., in situations where professionals make critical decisions, and models can potentially assist them. We present an in-the-wild evaluation of token attribution based on Deep Taylor Decomposition, with professional journalists performing reliability assessments. We find that using this method in conjunction with RoBERTa-Large, fine-tuned on the Gossip Corpus, led to faster and better human decision-making, as well as a more critical attitude toward news sources among the journalists. We present a comparison of human and model rationales, as well as a qualitative analysis of the journalists' experiences with machine-in-the-loop decision making.

Code & Models

Repositories

coastalcph/reliability-wild (Official)

Taxonomy

Topics: Topic Modeling · Explainable Artificial Intelligence (XAI) · Natural Language Processing Techniques