Beyond Softmax: Dual-Branch Sigmoid Architecture for Accurate Class Activation Maps
Yoojin Oh, Junhyug Noh

TL;DR
This paper introduces a dual-branch sigmoid architecture that improves class activation maps by decoupling localization from classification, enhancing explanation fidelity without sacrificing recognition accuracy.
Contribution
A simple, architecture-agnostic dual-branch sigmoid head that enhances CAM explanations by preserving feature sign and magnitude, compatible with pretrained models.
Findings
Improved explanation fidelity on fine-grained and WSOL benchmarks.
Achieved consistent Top-1 Localization gains.
No drop in classification accuracy.
Abstract
Class Activation Mapping (CAM) and its extensions have become indispensable tools for visualizing the evidence behind deep network predictions. However, by relying on a final softmax classifier, these methods suffer from two fundamental distortions: additive logit shifts that arbitrarily bias importance scores, and sign collapse that conflates excitatory and inhibitory features. We propose a simple, architecture-agnostic dual-branch sigmoid head that decouples localization from classification. Given any pretrained model, we clone its classification head into a parallel branch ending in per-class sigmoid outputs, freeze the original softmax head, and fine-tune only the sigmoid branch with class-balanced binary supervision. At inference, softmax retains recognition accuracy, while class evidence maps are generated from the sigmoid branch -- preserving both magnitude and sign of feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
