What You Read is What You Classify: Highlighting Attributions to Text and Text-Like Inputs

Daniel S. Berman; Brian Merritt; Stanley Ta; Dana Udwin; Amanda Ernlund; Jeremy Ratcliff; Vijay Narayan

arXiv:2602.24149·cs.LG·March 20, 2026

What You Read is What You Classify: Highlighting Attributions to Text and Text-Like Inputs

Daniel S. Berman, Brian Merritt, Stanley Ta, Dana Udwin, Amanda Ernlund, Jeremy Ratcliff, Vijay Narayan

PDF

Open Access

TL;DR

This paper introduces a mask-based explainable AI method for token classifiers, especially in text and sequence data, which highlights relevant input segments and improves interpretability over existing techniques.

Contribution

It generalizes a mask-based explanation approach from images to token sequences, enabling human-readable explanations for models like transformers.

Findings

01

Masked segments are less relevant to classification than unmasked ones

02

The method produces human-readable explanations for token importance

03

Effective for nucleotide sequence classifiers

Abstract

At present, there are no easily understood explainable artificial intelligence (AI) methods for discrete token inputs, like text. Most explainable AI techniques do not extend well to token sequences, where both local and global features matter, because state-of-the-art models, like transformers, tend to focus on global connections. Therefore, existing explainable AI algorithms fail by (i) identifying disparate tokens of importance, or (ii) assigning a large number of tokens a low value of importance. This method for explainable AI for tokens-based classifiers generalizes a mask-based explainable AI algorithm for images. It starts with an Explainer neural network that is trained to create masks to hide information not relevant for classification. Then, the Hadamard product of the mask and the continuous values of the classifier's embedding layer is taken and passed through the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Explainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques