What You Read is What You Classify: Highlighting Attributions to Text and Text-Like Inputs
Daniel S. Berman, Brian Merritt, Stanley Ta, Dana Udwin, Amanda Ernlund, Jeremy Ratcliff, Vijay Narayan

TL;DR
This paper introduces a mask-based explainable AI method for token classifiers, especially in text and sequence data, which highlights relevant input segments and improves interpretability over existing techniques.
Contribution
It generalizes a mask-based explanation approach from images to token sequences, enabling human-readable explanations for models like transformers.
Findings
Masked segments are less relevant to classification than unmasked ones
The method produces human-readable explanations for token importance
Effective for nucleotide sequence classifiers
Abstract
At present, there are no easily understood explainable artificial intelligence (AI) methods for discrete token inputs, like text. Most explainable AI techniques do not extend well to token sequences, where both local and global features matter, because state-of-the-art models, like transformers, tend to focus on global connections. Therefore, existing explainable AI algorithms fail by (i) identifying disparate tokens of importance, or (ii) assigning a large number of tokens a low value of importance. This method for explainable AI for tokens-based classifiers generalizes a mask-based explainable AI algorithm for images. It starts with an Explainer neural network that is trained to create masks to hide information not relevant for classification. Then, the Hadamard product of the mask and the continuous values of the classifier's embedding layer is taken and passed through the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Explainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques
