Flatten Long-Range Loss Landscapes for Cross-Domain Few-Shot Learning

Yixiong Zou; Yicong Liu; Yiman Hu; Yuhua Li; Ruixuan Li

arXiv:2403.00567·cs.CV·April 22, 2024·2 cites

Flatten Long-Range Loss Landscapes for Cross-Domain Few-Shot Learning

Yixiong Zou, Yicong Liu, Yiman Hu, Yuhua Li, Ruixuan Li

PDF

Open Access 1 Repo

TL;DR

This paper proposes a novel normalization layer that flattens long-range minima in the loss landscape's representation space, significantly improving cross-domain few-shot learning performance across multiple datasets.

Contribution

It introduces a simple, lightweight normalization layer that achieves long-range flattening of the loss landscape, enhancing transferability and fine-tuning in CDFSL models.

Findings

01

Outperforms state-of-the-art methods on 8 datasets

02

Achieves up to 9% accuracy improvement on individual datasets

03

Effective for CNNs and ViTs

Abstract

Cross-domain few-shot learning (CDFSL) aims to acquire knowledge from limited training data in the target domain by leveraging prior knowledge transferred from source domains with abundant training samples. CDFSL faces challenges in transferring knowledge across dissimilar domains and fine-tuning models with limited training data. To address these challenges, we initially extend the analysis of loss landscapes from the parameter space to the representation space, which allows us to simultaneously interpret the transferring and fine-tuning difficulties of CDFSL models. We observe that sharp minima in the loss landscapes of the representation space result in representations that are hard to transfer and fine-tune. Moreover, existing flatness-based methods have limited generalization ability due to their short-range flatness. To enhance the transferability and facilitate fine-tuning, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zoilsen/flor
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeophysical Methods and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM