TL;DR
This paper introduces ARF-SFR-Net, an adaptive network that dynamically determines receptive field sizes to improve feature reconstruction for few-shot fine-grained image classification, demonstrating superior performance.
Contribution
The novel ARF-SFR-Net adaptively selects receptive fields for spatial and frequency features, enhancing feature reconstruction in FSFGIC tasks.
Findings
ARF-SFR-Net outperforms state-of-the-art methods on multiple benchmarks.
The adaptive receptive field mechanism improves feature extraction quality.
End-to-end training from scratch is effectively supported.
Abstract
Feature reconstruction techniques are widely applied for few-shot fine-grained image classification (FSFGIC). Our research indicates that one of the main challenges facing existing feature-based FSFGIC methods is how to choose the size of the receptive field to extract feature descriptors (including spatial and frequency feature descriptors) from different category input images, thereby better performing the FSFGIC tasks. To address this, an adaptive receptive field-based spatial-frequency feature reconstruction network (ARF-SFR-Net) is proposed. The designed ARF-SFR-Net has the capability to adaptively determine receptive field sizes for obtaining spatial and frequency features, and effectively fuse them for reconstruction and FSFGIC tasks. The designed ARF-SFR-Net can be easily embedded into a given episodic training mechanism for end-to-end training from scratch. Extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
