Sharpen Focus: Learning with Attention Separability and Consistency
Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram, Singh, Bo Liu, Dimitris N. Metaxas

TL;DR
This paper introduces a new learning framework that enhances class-discriminative attention in CNNs by enforcing attention separability and consistency, leading to improved classification accuracy across multiple benchmarks.
Contribution
The paper proposes novel objectives for attention separability and cross-layer consistency, making discriminative attention an integral part of the training process.
Findings
Improved classification accuracy on CIFAR-100 (+3.33%)
Enhanced attention discriminability reducing visual confusion
Consistent performance gains across multiple datasets
Abstract
Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks. Despite good localization for an individual class of interest, these techniques produce attention maps with substantially overlapping responses among different classes, leading to the problem of visual confusion and the need for discriminative attention. In this paper, we address this problem by means of a new framework that makes class-discriminative attention a principled part of the learning process. Our key innovations include new learning objectives for attention separability and cross-layer consistency, which result in improved attention discriminability and reduced visual confusion. Extensive experiments on image classification benchmarks show the effectiveness of our approach in terms of improved classification accuracy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
