Random Registers for Cross-Domain Few-Shot Learning
Shuai Yi, Yixiong Zou, Yuhua Li, Ruixuan Li

TL;DR
This paper introduces the use of random registers to perturb attention in Vision Transformers, significantly improving cross-domain few-shot learning performance by promoting flatter minima and better transferability.
Contribution
It reveals that replacing learnable prompts with random registers enhances ViT transferability in CDFSL and proposes a simple method to perturb attention maps for improved results.
Findings
Random registers improve target-domain performance in CDFSL.
Perturbing attention with random registers leads to flatter loss landscapes.
The approach achieves state-of-the-art results on four benchmarks.
Abstract
Cross-domain few-shot learning (CDFSL) aims to transfer knowledge from a data-sufficient source domain to data-scarce target domains. Although Vision Transformer (ViT) has shown superior capability in many vision tasks, its transferability against huge domain gaps in CDFSL is still under-explored. In this paper, we find an intriguing phenomenon: during the source-domain training, prompt tuning, as a common way to train ViT, could be harmful for the generalization of ViT in target domains, but setting them to random noises (i.e., random registers) could consistently improve target-domain performance. We then delve into this phenomenon for an interpretation. We find that learnable prompts capture domain information during the training on the source dataset, which views irrelevant visual patterns as vital cues for recognition. This can be viewed as a kind of overfitting and increases the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM
