A Novel Plug-in Module for Fine-Grained Visual Classification

Po-Yung Chou; Cheng-Hung Lin; Wen-Chung Kao

arXiv:2202.03822·cs.CV·February 9, 2022·36 cites

A Novel Plug-in Module for Fine-Grained Visual Classification

Po-Yung Chou, Cheng-Hung Lin, Wen-Chung Kao

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new plug-in module for neural networks that enhances fine-grained visual classification by automatically identifying discriminative regions, achieving state-of-the-art accuracy with end-to-end training.

Contribution

The novel plug-in module can be integrated into various backbone networks to improve fine-grained classification without multi-stage training.

Findings

01

Achieves 92.77% accuracy on CUB200-2011

02

Achieves 92.83% accuracy on NABirds

03

Outperforms existing state-of-the-art methods

Abstract

Visual classification can be divided into coarse-grained and fine-grained classification. Coarse-grained classification represents categories with a large degree of dissimilarity, such as the classification of cats and dogs, while fine-grained classification represents classifications with a large degree of similarity, such as cat species, bird species, and the makes or models of vehicles. Unlike coarse-grained visual classification, fine-grained visual classification often requires professional experts to label data, which makes data more expensive. To meet this challenge, many approaches propose to automatically find the most discriminative regions and use local features to provide more precise features. These approaches only require image-level annotations, thereby reducing the cost of annotation. However, most of these methods require two- or multi-stage architectures and cannot be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chou141253/fgvc-pim
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning