Integrated feature analysis for deep learning interpretation and class activation maps
Yanli Li, Tahereh Hassanzadeh, Denis P. Shamonin, Monique Reijnierse,, Annette H.M. van der Helm-van Mil, Berend C. Stoel

TL;DR
This paper introduces an integrated feature analysis method that enhances deep learning interpretability by examining feature distributions and decompositions, providing deeper insights than traditional CAMs across diverse datasets.
Contribution
The proposed method combines feature distribution analysis and decomposition to improve interpretability and identify dataset/model issues, extending beyond existing CAM techniques.
Findings
High correlation (near 100%) between CAMs and model logits across datasets.
5%-25% of features can produce equally informative saliency maps.
Method validated on eight diverse datasets, demonstrating reliability.
Abstract
Understanding the decisions of deep learning (DL) models is essential for the acceptance of DL to risk-sensitive applications. Although methods, like class activation maps (CAMs), give a glimpse into the black box, they do miss some crucial information, thereby limiting its interpretability and merely providing the considered locations of objects. To provide more insight into the models and the influence of datasets, we propose an integrated feature analysis method, which consists of feature distribution analysis and feature decomposition, to look closer into the intermediate features extracted by DL models. This integrated feature analysis could provide information on overfitting, confounders, outliers in datasets, model redundancies and principal features extracted by the models, and provide distribution information to form a common intensity scale, which are missing in current CAM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsClass-activation map
