Sharpen Focus: Learning with Attention Separability and Consistency

Lezi Wang; Ziyan Wu; Srikrishna Karanam; Kuan-Chuan Peng; Rajat Vikram; Singh; Bo Liu; Dimitris N. Metaxas

arXiv:1811.07484·cs.CV·August 9, 2019·1 cites

Sharpen Focus: Learning with Attention Separability and Consistency

Lezi Wang, Ziyan Wu, Srikrishna Karanam, Kuan-Chuan Peng, Rajat Vikram, Singh, Bo Liu, Dimitris N. Metaxas

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new learning framework that enhances class-discriminative attention in CNNs by enforcing attention separability and consistency, leading to improved classification accuracy across multiple benchmarks.

Contribution

The paper proposes novel objectives for attention separability and cross-layer consistency, making discriminative attention an integral part of the training process.

Findings

01

Improved classification accuracy on CIFAR-100 (+3.33%)

02

Enhanced attention discriminability reducing visual confusion

03

Consistent performance gains across multiple datasets

Abstract

Recent developments in gradient-based attention modeling have seen attention maps emerge as a powerful tool for interpreting convolutional neural networks. Despite good localization for an individual class of interest, these techniques produce attention maps with substantially overlapping responses among different classes, leading to the problem of visual confusion and the need for discriminative attention. In this paper, we address this problem by means of a new framework that makes class-discriminative attention a principled part of the learning process. Our key innovations include new learning objectives for attention separability and cross-layer consistency, which result in improved attention discriminability and reduced visual confusion. Extensive experiments on image classification benchmarks show the effectiveness of our approach in terms of improved classification accuracy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MacroMayhem/SharpenFocus-Pytorch
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning