Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks
Qinglong Zhang, Lu Rao, Yubin Yang

TL;DR
Group-CAM is an efficient method for generating saliency maps for deep convolutional networks, using a split-transform-merge strategy that produces target-related explanations with minimal network queries.
Contribution
It introduces a novel Group score-weighted approach that improves saliency map quality and efficiency over existing methods.
Findings
Achieves better visual explanations than state-of-the-art methods.
Requires only dozens of network queries for effective saliency maps.
Demonstrates strong performance on ImageNet and COCO benchmarks.
Abstract
In this paper, we propose an efficient saliency map generation method, called Group score-weighted Class Activation Mapping (Group-CAM), which adopts the "split-transform-merge" strategy to generate saliency maps. Specifically, for an input image, the class activations are firstly split into groups. In each group, the sub-activations are summed and de-noised as an initial mask. After that, the initial masks are transformed with meaningful perturbations and then applied to preserve sub-pixels of the input (i.e., masked inputs), which are then fed into the network to calculate the confidence scores. Finally, the initial masks are weighted summed to form the final saliency map, where the weights are confidence scores produced by the masked inputs. Group-CAM is efficient yet effective, which only requires dozens of queries to the network while producing target-related saliency maps. As a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Cell Image Analysis Techniques · Anomaly Detection Techniques and Applications
