SIDE: Sparse Information Disentanglement for Explainable Artificial Intelligence
Viktar Dubovik, {\L}ukasz Struski, Jacek Tabor, Dawid Rymarczyk

TL;DR
SIDE introduces a sparse training and pruning scheme for prototypical neural networks, significantly reducing explanation complexity while maintaining accuracy, thereby enhancing interpretability in high-stakes computer vision applications.
Contribution
It proposes a novel sparsity-enforcing method for prototypical networks, improving explanation simplicity without sacrificing model performance.
Findings
Reduces explanation size by over 90%
Maintains comparable accuracy to existing methods
Enhances interpretability of prototype-based explanations
Abstract
Understanding the decisions made by deep neural networks is essential in high-stakes domains such as medical imaging and autonomous driving. Yet, these models often lack transparency, particularly in computer vision. Prototypical-parts-based neural networks have emerged as a promising solution by offering concept-level explanations. However, most are limited to fine-grained classification tasks, with few exceptions such as InfoDisent. InfoDisent extends prototypical models to large-scale datasets like ImageNet, but produces complex explanations. We introduce Sparse Information Disentanglement for Explainability (SIDE), a novel method that improves the interpretability of prototypical parts through a dedicated training and pruning scheme that enforces sparsity. Combined with sigmoid activations in place of softmax, this approach allows SIDE to associate each class with only a small set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
