Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains
Qiankun Li, Feng He, Huabao Chen, Xin Ning, Kun Wang, Zengfu Wang

TL;DR
This paper introduces CLAdapter, a novel attention-based method that adapts large-scale pre-trained vision models to various data-limited scientific domains, achieving state-of-the-art results.
Contribution
The paper presents CLAdapter, a unified, attention-based adapter that effectively transfers knowledge from large-scale pre-trained models to diverse, data-scarce scientific tasks.
Findings
Achieves state-of-the-art performance on 10 diverse datasets.
Effectively adapts models across multiple architectures and data types.
Demonstrates significant improvements in data-limited scenarios.
Abstract
In the big data era, the computer vision field benefits from large-scale datasets such as LAION-2B, LAION-400M, and ImageNet-21K, Kinetics, on which popular models like the ViT and ConvNeXt series have been pre-trained, acquiring substantial knowledge. However, numerous downstream tasks in specialized and data-limited scientific domains continue to pose significant challenges. In this paper, we propose a novel Cluster Attention Adapter (CLAdapter), which refines and adapts the rich representations learned from large-scale data to various data-limited downstream tasks. Specifically, CLAdapter introduces attention mechanisms and cluster centers to personalize the enhancement of transformed features through distribution correlation and transformation matrices. This enables models fine-tuned with CLAdapter to learn distinct representations tailored to different feature sets, facilitating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis
