Saliency Map Verbalization: Comparing Feature Importance Representations   from Model-free and Instruction-based Methods

Nils Feldhus; Leonhard Hennig; Maximilian Dustin Nasert; Christopher; Ebert; Robert Schwarzenberg; Sebastian M\"oller

arXiv:2210.07222·cs.CL·June 8, 2023·1 cites

Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods

Nils Feldhus, Leonhard Hennig, Maximilian Dustin Nasert, Christopher, Ebert, Robert Schwarzenberg, Sebastian M\"oller

PDF

Open Access 1 Repo

TL;DR

This paper explores translating neural saliency maps into natural language to improve interpretability, comparing novel verbalization methods with traditional visual explanations through human and automatic evaluations.

Contribution

It introduces and evaluates search-based and instruction-based verbalization methods for saliency maps, highlighting their strengths and limitations in interpretability.

Findings

01

Instruction-based verbalizations achieve highest human ratings.

02

Search-based verbalizations are faithful but less helpful.

03

Verbalizations improve comprehensibility over visual explanations.

Abstract

Saliency maps can explain a neural model's predictions by identifying important input features. They are difficult to interpret for laypeople, especially for instances with many features. In order to make them more accessible, we formalize the underexplored task of translating saliency maps into natural language and compare methods that address two key challenges of this approach -- what and how to verbalize. In both automatic and human evaluation setups, using token-level attributions from text classification tasks, we compare two novel methods (search-based and instruction-based verbalizations) against conventional feature importance representations (heatmap visualizations and extractive rationales), measuring simulatability, faithfulness, helpfulness and ease of understanding. Instructing GPT-3.5 to generate saliency map verbalizations yields plausible explanations which include…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dfki-nlp/smv
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Machine Learning in Materials Science

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Weight Decay · Cosine Annealing · Attention Dropout · Layer Normalization · Byte Pair Encoding