Fine-grained Visual Classification with High-temperature Refinement and Background Suppression
Po-Yung Chou, Yu-Yung Kao, Cheng-Hung Lin

TL;DR
The paper introduces HERBS, a novel network with high-temperature refinement and background suppression modules, which effectively enhances feature discrimination and suppresses background noise, achieving state-of-the-art results in fine-grained visual classification.
Contribution
HERBS is a new network architecture that combines multi-scale feature refinement and background suppression for improved fine-grained classification.
Findings
Achieves over 93% accuracy on CUB-200-2011 and NABirds datasets.
Effectively fuses multi-scale features and suppresses background noise.
Outperforms previous state-of-the-art methods.
Abstract
Fine-grained visual classification is a challenging task due to the high similarity between categories and distinct differences among data within one single category. To address the challenges, previous strategies have focused on localizing subtle discrepancies between categories and enhencing the discriminative features in them. However, the background also provides important information that can tell the model which features are unnecessary or even harmful for classification, and models that rely too heavily on subtle features may overlook global features and contextual information. In this paper, we propose a novel network called ``High-temperaturE Refinement and Background Suppression'' (HERBS), which consists of two modules, namely, the high-temperature refinement module and the background suppression module, for extracting discriminative features and suppressing background noise,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
Methods1-Dimensional Convolutional Neural Networks
