Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains

Qiankun Li; Feng He; Huabao Chen; Xin Ning; Kun Wang; Zengfu Wang

arXiv:2512.22664·cs.CV·December 30, 2025

Unleashing Foundation Vision Models: Adaptive Transfer for Diverse Data-Limited Scientific Domains

Qiankun Li, Feng He, Huabao Chen, Xin Ning, Kun Wang, Zengfu Wang

PDF

Open Access

TL;DR

This paper introduces CLAdapter, a novel attention-based method that adapts large-scale pre-trained vision models to various data-limited scientific domains, achieving state-of-the-art results.

Contribution

The paper presents CLAdapter, a unified, attention-based adapter that effectively transfers knowledge from large-scale pre-trained models to diverse, data-scarce scientific tasks.

Findings

01

Achieves state-of-the-art performance on 10 diverse datasets.

02

Effectively adapts models across multiple architectures and data types.

03

Demonstrates significant improvements in data-limited scenarios.

Abstract

In the big data era, the computer vision field benefits from large-scale datasets such as LAION-2B, LAION-400M, and ImageNet-21K, Kinetics, on which popular models like the ViT and ConvNeXt series have been pre-trained, acquiring substantial knowledge. However, numerous downstream tasks in specialized and data-limited scientific domains continue to pose significant challenges. In this paper, we propose a novel Cluster Attention Adapter (CLAdapter), which refines and adapts the rich representations learned from large-scale data to various data-limited downstream tasks. Specifically, CLAdapter introduces attention mechanisms and cluster centers to personalize the enhancement of transformed features through distribution correlation and transformation matrices. This enables models fine-tuned with CLAdapter to learn distinct representations tailored to different feature sets, facilitating…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis