Coping with Change: Learning Invariant and Minimum Sufficient Representations for Fine-Grained Visual Categorization
Shuo Ye, Shujian Yu, Wenjin Hou, Yu Wang, Xinge You

TL;DR
This paper introduces a novel approach combining invariant risk minimization and information bottleneck principles to learn invariant and minimal representations for fine-grained visual categorization, improving generalization to unseen data.
Contribution
It proposes a new information-theoretic method for FGVC that addresses distribution shifts and enhances feature robustness and generalization.
Findings
Consistent performance improvements on benchmark datasets.
Effective stabilization of training via matrix-based Rényi entropy.
First to address FGVC from a generalization perspective using IMS.
Abstract
Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species. Previous studies always implicitly assume that the training and test data have the same underlying distributions, and that features extracted by modern backbone architectures remain discriminative and generalize well to unseen test data. However, we empirically justify that these conditions are not always true on benchmark datasets. To this end, we combine the merits of invariant risk minimization (IRM) and information bottleneck (IB) principle to learn invariant and minimum sufficient (IMS) representations for FGVC, such that the overall model can always discover the most succinct and consistent fine-grained features. We apply the matrix-based R{\'e}nyi's -order entropy to simplify and stabilize the training of IB; we also design a ``soft" environment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification
MethodsTest
