Domain Adaptive Transfer Learning on Visual Attention Aware Data Augmentation for Fine-grained Visual Categorization
Ashiq Imran, Vassilis Athitsos

TL;DR
This paper introduces an attention-aware data augmentation technique combined with domain adaptive transfer learning to improve fine-grained visual categorization accuracy, achieving state-of-the-art results on multiple challenging datasets.
Contribution
It proposes a novel visual attention-based data augmentation method integrated with domain adaptive transfer learning for FGVC.
Findings
Significant accuracy improvements on six FGVC datasets.
Outperforms existing methods on multiple benchmarks.
Effective transfer learning using attention mechanisms.
Abstract
Fine-Grained Visual Categorization (FGVC) is a challenging topic in computer vision. It is a problem characterized by large intra-class differences and subtle inter-class differences. In this paper, we tackle this problem in a weakly supervised manner, where neural network models are getting fed with additional data using a data augmentation technique through a visual attention mechanism. We perform domain adaptive knowledge transfer via fine-tuning on our base network model. We perform our experiment on six challenging and commonly used FGVC datasets, and we show competitive improvement on accuracies by using attention-aware data augmentation techniques with features derived from deep learning model InceptionV3, pre-trained on large scale datasets. Our method outperforms competitor methods on multiple FGVC datasets and showed competitive results on other datasets. Experimental studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
