Flatten Long-Range Loss Landscapes for Cross-Domain Few-Shot Learning
Yixiong Zou, Yicong Liu, Yiman Hu, Yuhua Li, Ruixuan Li

TL;DR
This paper proposes a novel normalization layer that flattens long-range minima in the loss landscape's representation space, significantly improving cross-domain few-shot learning performance across multiple datasets.
Contribution
It introduces a simple, lightweight normalization layer that achieves long-range flattening of the loss landscape, enhancing transferability and fine-tuning in CDFSL models.
Findings
Outperforms state-of-the-art methods on 8 datasets
Achieves up to 9% accuracy improvement on individual datasets
Effective for CNNs and ViTs
Abstract
Cross-domain few-shot learning (CDFSL) aims to acquire knowledge from limited training data in the target domain by leveraging prior knowledge transferred from source domains with abundant training samples. CDFSL faces challenges in transferring knowledge across dissimilar domains and fine-tuning models with limited training data. To address these challenges, we initially extend the analysis of loss landscapes from the parameter space to the representation space, which allows us to simultaneously interpret the transferring and fine-tuning difficulties of CDFSL models. We observe that sharp minima in the loss landscapes of the representation space result in representations that are hard to transfer and fine-tune. Moreover, existing flatness-based methods have limited generalization ability due to their short-range flatness. To enhance the transferability and facilitate fine-tuning, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
