Fast Fine-grained Image Classification via Weakly Supervised Discriminative Localization
Xiangteng He, Yuxin Peng, Junjie Zhao

TL;DR
This paper introduces a fast, weakly supervised discriminative localization method for fine-grained image classification that improves speed and accuracy without relying on labor-intensive annotations.
Contribution
The proposed WSDL approach combines an end-to-end multi-region localization network with multi-level attention learning, eliminating the need for object annotations and boosting classification performance.
Findings
Achieves state-of-the-art results on two fine-grained datasets.
Significantly reduces localization and classification time.
Does not require object or part annotations for training.
Abstract
Fine-grained image classification is to recognize hundreds of subcategories in each basic-level category. Existing methods employ discriminative localization to find the key distinctions among subcategories. However, they generally have two limitations: (1) Discriminative localization relies on region proposal methods to hypothesize the locations of discriminative regions, which are time-consuming. (2) The training of discriminative localization depends on object or part annotations, which are heavily labor-consuming. It is highly challenging to address the two key limitations simultaneously, and existing methods only focus on one of them. Therefore, we propose a weakly supervised discriminative localization approach (WSDL) for fast fine-grained image classification to address the two limitations at the same time, and its main advantages are: (1) n-pathway end-to-end discriminative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
