On Learning Discriminative Features from Synthesized Data for   Self-Supervised Fine-Grained Visual Recognition

Zihu Wang; Lingqiao Liu; Scott Ricardo Figueroa Weston; Samuel Tian,; Peng Li

arXiv:2407.14676·cs.CV·July 23, 2024

On Learning Discriminative Features from Synthesized Data for Self-Supervised Fine-Grained Visual Recognition

Zihu Wang, Lingqiao Liu, Scott Ricardo Figueroa Weston, Samuel Tian,, Peng Li

PDF

Open Access

TL;DR

This paper proposes a novel self-supervised learning strategy that synthesizes data to enhance the extraction of discriminative features, significantly improving fine-grained visual recognition performance.

Contribution

It introduces a data synthesis method that guides SSL models to focus on discriminative features by perturbing non-discriminative ones, advancing FGVR capabilities.

Findings

01

Improved FGVR accuracy across multiple datasets

02

Effective identification and perturbation of non-discriminative features

03

Enhanced model invariance to irrelevant feature variations

Abstract

Self-Supervised Learning (SSL) has become a prominent approach for acquiring visual representations across various tasks, yet its application in fine-grained visual recognition (FGVR) is challenged by the intricate task of distinguishing subtle differences between categories. To overcome this, we introduce an novel strategy that boosts SSL's ability to extract critical discriminative features vital for FGVR. This approach creates synthesized data pairs to guide the model to focus on discriminative features critical for FGVR during SSL. We start by identifying non-discriminative features using two main criteria: features with low variance that fail to effectively separate data and those deemed less important by Grad-CAM induced from the SSL loss. We then introduce perturbations to these non-discriminative features while preserving discriminative ones. A decoder is employed to reconstruct…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection

MethodsFocus