Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
Ziheng Zhang, Jianyang Gu, Arpita Chowdhury, Zheda Mai, David Carlyn,, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

TL;DR
Finer-CAM improves the localization of discriminative image regions by explicitly comparing target classes with similar classes to highlight unique details, maintaining efficiency and compatibility with existing CAM methods.
Contribution
It introduces a simple, efficient method that enhances CAM explanations by focusing on differences between classes, enabling finer detail localization without added complexity.
Findings
Finer-CAM outperforms baselines in identifying discriminative regions.
Masking top 5% activated pixels with Finer-CAM causes larger confidence drops.
Finer-CAM is compatible with various CAM methods and extendable to multi-modal models.
Abstract
Class activation map (CAM) has been widely used to highlight image regions that contribute to class predictions. Despite its simplicity and computational efficiency, CAM often struggles to identify discriminative regions that distinguish visually similar fine-grained classes. Prior efforts address this limitation by introducing more sophisticated explanation processes, but at the cost of extra complexity. In this paper, we propose Finer-CAM, a method that retains CAM's efficiency while achieving precise localization of discriminative regions. Our key insight is that the deficiency of CAM lies not in "how" it explains, but in "what" it explains. Specifically, previous methods attempt to identify all cues contributing to the target class's logit value, which inadvertently also activates regions predictive of visually similar classes. By explicitly comparing the target class with similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
MethodsClass-activation map
