Fusion-CAM: Integrating Gradient and Region-Based Class Activation Maps for Robust Visual Explanations
Hajar Dekdegue, Moncef Garouani, Josiane Mothe, Jordan Bernigaud

TL;DR
Fusion-CAM combines gradient-based and region-based class activation maps through a novel fusion mechanism, producing more robust, detailed, and context-aware visual explanations for deep neural networks.
Contribution
The paper introduces Fusion-CAM, a new framework that unifies gradient and region-based CAM methods with an adaptive fusion mechanism for improved interpretability.
Findings
Fusion-CAM outperforms existing CAM methods in visualization quality.
The adaptive fusion enhances class coverage and explanation robustness.
Fusion-CAM provides more accurate and context-aware visual explanations.
Abstract
Interpreting the decision-making process of deep convolutional neural networks remains a central challenge in achieving trustworthy and transparent artificial intelligence. Explainable AI (XAI) techniques, particularly Class Activation Map (CAM) methods, are widely adopted to visualize the input regions influencing model predictions. Gradient-based approaches (e.g. Grad-CAM) provide highly discriminative, fine-grained details by computing gradients of class activations but often yield noisy and incomplete maps that emphasize only the most salient regions rather than the complete objects. Region-based approaches (e.g. Score-CAM) aggregate information over larger areas, capturing broader object coverage at the cost of over-smoothing and reduced sensitivity to subtle features. We introduce Fusion-CAM, a novel framework that bridges this explanatory gap by unifying both paradigms through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Advanced Neural Network Applications
