DOEI: Dual Optimization of Embedding Information for Attention-Enhanced Class Activation Maps
Hongjie Zhu, Zeyu Zhang, Guansong Pang, Xu Wang, Shimin Wen, Yu Bai,, Daji Ergu, Ying Cai, Yang Zhao

TL;DR
DOEI introduces a novel dual optimization approach that enhances embedding representations in attention-based weakly supervised semantic segmentation, significantly improving CAM quality and segmentation accuracy.
Contribution
The paper proposes DOEI, a new method that reconstructs embedding representations with semantic-aware attention to better align activation responses with semantic information.
Findings
Improved CAM quality and segmentation accuracy on PASCAL VOC and MS COCO datasets.
DOEI enhances the propagation and decoupling of target features in high-level semantic space.
Significant performance gains demonstrate DOEI's effectiveness as a plug-and-play module.
Abstract
Weakly supervised semantic segmentation (WSSS) typically utilizes limited semantic annotations to obtain initial Class Activation Maps (CAMs). However, due to the inadequate coupling between class activation responses and semantic information in high-dimensional space, the CAM is prone to object co-occurrence or under-activation, resulting in inferior recognition accuracy. To tackle this issue, we propose DOEI, Dual Optimization of Embedding Information, a novel approach that reconstructs embedding representations through semantic-aware attention weight matrices to optimize the expression capability of embedding information. Specifically, DOEI amplifies tokens with high confidence and suppresses those with low confidence during the class-to-patch interaction. This alignment of activation responses with semantic information strengthens the propagation and decoupling of target features,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Online Learning and Analytics
MethodsSoftmax · Attention Is All You Need · Class-activation map
