Frequency-Enhanced Dual-Subspace Networks for Few-Shot Fine-Grained Image Classification
Meijia Wang, Guochao Wang, Haozhen Chu, Bin Yao, Weichuan Zhang, Yuan Wang, Junpo Yang

TL;DR
FEDSNet enhances few-shot fine-grained image classification by isolating structural features through frequency domain analysis and dual subspace modeling, reducing background noise influence.
Contribution
Introduces a novel frequency-enhanced dual-subspace network that explicitly separates structural and texture features for improved robustness in few-shot learning.
Findings
Achieves state-of-the-art results on four benchmark datasets.
Demonstrates robustness against background interference.
Balances high accuracy with computational efficiency.
Abstract
Few-shot fine-grained image classification aims to recognize subcategories with high visual similarity using only a limited number of annotated samples. Existing metric learning-based methods typically rely solely on spatial domain features. Confined to this single perspective, models inevitably suffer from inherent texture biases, entangling essential structural details with high-frequency background noise. Furthermore, lacking cross-view geometric constraints, single-view metrics tend to overfit this noise, resulting in structural instability under few-shot conditions. To address these issues, this paper proposes the Frequency-Enhanced Dual-Subspace Network (FEDSNet). Specifically, FEDSNet utilizes the Discrete Cosine Transform (DCT) and a low-pass filtering mechanism to explicitly isolate low-frequency global structural components from spatial features, thereby suppressing background…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
