Multi-Head Explainer: A General Framework to Improve Explainability in CNNs and Transformers
Bohang Sun, Pietro Li\`o

TL;DR
The paper presents MHEX, a modular framework that improves the explainability and accuracy of CNNs and Transformers by integrating attention, supervision, and unified representations, with demonstrated success in medical imaging and text tasks.
Contribution
Introduces MHEX, a versatile framework that enhances model interpretability and performance with minimal modifications across CNNs and Transformers.
Findings
Improves classification accuracy on benchmark datasets.
Produces detailed and interpretable saliency maps.
Easily integrates into existing architectures.
Abstract
In this study, we introduce the Multi-Head Explainer (MHEX), a versatile and modular framework that enhances both the explainability and accuracy of Convolutional Neural Networks (CNNs) and Transformer-based models. MHEX consists of three core components: an Attention Gate that dynamically highlights task-relevant features, Deep Supervision that guides early layers to capture fine-grained details pertinent to the target class, and an Equivalent Matrix that unifies refined local and global representations to generate comprehensive saliency maps. Our approach demonstrates superior compatibility, enabling effortless integration into existing residual networks like ResNet and Transformer architectures such as BERT with minimal modifications. Extensive experiments on benchmark datasets in medical imaging and text classification show that MHEX not only improves classification accuracy but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
MethodsAttention Is All You Need · Byte Pair Encoding · Attention Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Average Pooling · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Linear Warmup With Linear Decay
