Inherently Interpretable Multi-Label Classification Using Class-Specific Counterfactuals
Susu Sun, Stefano Woerner, Andreas Maier, Lisa M. Koch, Christian F., Baumgartner

TL;DR
Attri-Net is an inherently interpretable multi-label classification model for medical images that uses class-specific counterfactual attribution maps, providing trustworthy explanations while maintaining competitive accuracy.
Contribution
This paper introduces Attri-Net, a novel inherently interpretable multi-label classifier that generates class-specific explanations using counterfactuals, addressing limitations of post-hoc methods.
Findings
Attri-Net produces high-quality, clinically consistent explanations.
Attri-Net achieves classification performance comparable to state-of-the-art models.
The approach outperforms existing post-hoc explanation techniques in multi-label scenarios.
Abstract
Interpretability is essential for machine learning algorithms in high-stakes application fields such as medical image analysis. However, high-performing black-box neural networks do not provide explanations for their predictions, which can lead to mistrust and suboptimal human-ML collaboration. Post-hoc explanation techniques, which are widely used in practice, have been shown to suffer from severe conceptual problems. Furthermore, as we show in this paper, current explanation techniques do not perform adequately in the multi-label scenario, in which multiple medical findings may co-occur in a single image. We propose Attri-Net, an inherently interpretable model for multi-label classification. Attri-Net is a powerful classifier that provides transparent, trustworthy, and human-understandable explanations. The model first generates class-specific attribution maps based on counterfactuals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · Explainable Artificial Intelligence (XAI)
MethodsCounterfactuals Explanations · Logistic Regression
