See Better Before Looking Closer: Weakly Supervised Data Augmentation   Network for Fine-Grained Visual Classification

Tao Hu; Honggang Qi; Qingming Huang; Yan Lu

arXiv:1901.09891·cs.CV·March 26, 2019·100 cites

See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

Tao Hu, Honggang Qi, Qingming Huang, Yan Lu

PDF

Open Access 4 Repos

TL;DR

This paper introduces WS-DAN, a weakly supervised data augmentation network that uses attention maps to enhance fine-grained visual classification by focusing on discriminative object parts, leading to improved accuracy.

Contribution

The paper proposes a novel weakly supervised data augmentation method that leverages attention maps for more effective training in fine-grained classification tasks.

Findings

01

WS-DAN surpasses state-of-the-art methods on fine-grained datasets.

02

Attention-guided augmentation improves discriminative feature extraction.

03

The approach enhances model focus on relevant object parts.

Abstract

Data augmentation is usually adopted to increase the amount of training data, prevent overfitting and improve the performance of deep models. However, in practice, random data augmentation, such as random image cropping, is low-efficiency and might introduce many uncontrolled background noises. In this paper, we propose Weakly Supervised Data Augmentation Network (WS-DAN) to explore the potential of data augmentation. Specifically, for each training image, we first generate attention maps to represent the object's discriminative parts by weakly supervised learning. Next, we augment the image guided by these attention maps, including attention cropping and attention dropping. The proposed WS-DAN improves the classification accuracy in two folds. In the first stage, images can be seen better since more discriminative parts' features will be extracted. In the second stage, attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques