Coping with Change: Learning Invariant and Minimum Sufficient   Representations for Fine-Grained Visual Categorization

Shuo Ye; Shujian Yu; Wenjin Hou; Yu Wang; Xinge You

arXiv:2306.04893·cs.CV·October 11, 2023·1 cites

Coping with Change: Learning Invariant and Minimum Sufficient Representations for Fine-Grained Visual Categorization

Shuo Ye, Shujian Yu, Wenjin Hou, Yu Wang, Xinge You

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach combining invariant risk minimization and information bottleneck principles to learn invariant and minimal representations for fine-grained visual categorization, improving generalization to unseen data.

Contribution

It proposes a new information-theoretic method for FGVC that addresses distribution shifts and enhances feature robustness and generalization.

Findings

01

Consistent performance improvements on benchmark datasets.

02

Effective stabilization of training via matrix-based Rényi entropy.

03

First to address FGVC from a generalization perspective using IMS.

Abstract

Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species. Previous studies always implicitly assume that the training and test data have the same underlying distributions, and that features extracted by modern backbone architectures remain discriminative and generalize well to unseen test data. However, we empirically justify that these conditions are not always true on benchmark datasets. To this end, we combine the merits of invariant risk minimization (IRM) and information bottleneck (IB) principle to learn invariant and minimum sufficient (IMS) representations for FGVC, such that the overall model can always discover the most succinct and consistent fine-grained features. We apply the matrix-based R{\'e}nyi's $α$ -order entropy to simplify and stabilize the training of IB; we also design a ``soft" environment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SYe-hub/IMS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification

MethodsTest