Task-Specific Embeddings for Ante-Hoc Explainable Text Classification
Kishaloy Halder, Josip Krapac, Alan Akbik, Anthony Brew, Matti Lyra

TL;DR
This paper introduces task-specific embeddings for text classification that enable interpretable, ante-hoc explanations and incremental learning without sacrificing accuracy, by replacing softmax classifiers with a distance-based approach.
Contribution
It proposes a novel training objective for learning class-specific embeddings that support explainability and incremental learning in text classification.
Findings
Embeddings allow direct explanation of classification decisions.
Qualitative analysis helps identify data quality issues.
Distances generalize to unseen classes, enabling incremental learning.
Abstract
Current state-of-the-art approaches to text classification typically leverage BERT-style Transformer models with a softmax classifier, jointly fine-tuned to predict class labels of a target task. In this paper, we instead propose an alternative training objective in which we learn task-specific embeddings of text: our proposed objective learns embeddings such that all texts that share the same target class label should be close together in the embedding space, while all others should be far apart. This allows us to replace the softmax classifier with a more interpretable k-nearest-neighbor classification approach. In a series of experiments, we show that this yields a number of interesting benefits: (1) The resulting order induced by distances in the embedding space can be used to directly explain classification decisions. (2) This facilitates qualitative inspection of the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
MethodsMulti-Head Attention · Attention Is All You Need · Adam · Layer Normalization · Absolute Position Encodings · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Linear Layer
