A Novel Plug-in Module for Fine-Grained Visual Classification
Po-Yung Chou, Cheng-Hung Lin, Wen-Chung Kao

TL;DR
This paper introduces a new plug-in module for neural networks that enhances fine-grained visual classification by automatically identifying discriminative regions, achieving state-of-the-art accuracy with end-to-end training.
Contribution
The novel plug-in module can be integrated into various backbone networks to improve fine-grained classification without multi-stage training.
Findings
Achieves 92.77% accuracy on CUB200-2011
Achieves 92.83% accuracy on NABirds
Outperforms existing state-of-the-art methods
Abstract
Visual classification can be divided into coarse-grained and fine-grained classification. Coarse-grained classification represents categories with a large degree of dissimilarity, such as the classification of cats and dogs, while fine-grained classification represents classifications with a large degree of similarity, such as cat species, bird species, and the makes or models of vehicles. Unlike coarse-grained visual classification, fine-grained visual classification often requires professional experts to label data, which makes data more expensive. To meet this challenge, many approaches propose to automatically find the most discriminative regions and use local features to provide more precise features. These approaches only require image-level annotations, thereby reducing the cost of annotation. However, most of these methods require two- or multi-stage architectures and cannot be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
