BlackVIP: Black-Box Visual Prompting for Robust Transfer Learning
Changdae Oh, Hyeji Hwang, Hee-young Lee, YongTaek Lim, Geunyoung Jung,, Jiyoung Jung, Hosik Choi, Kyungwoo Song

TL;DR
BlackVIP introduces a novel black-box visual prompting method that enables robust transfer learning from large pre-trained models without requiring access to model parameters, making it practical for real-world applications with limited memory.
Contribution
It proposes a parameter-free, black-box visual prompting approach with a gradient estimation technique, expanding transfer learning capabilities to proprietary or API-based models.
Findings
Effective on 16 datasets, demonstrating robustness across diverse domains.
Requires minimal memory, suitable for real-world deployment.
Outperforms existing methods in black-box transfer learning scenarios.
Abstract
With the surge of large-scale pre-trained models (PTMs), fine-tuning these models to numerous downstream tasks becomes a crucial problem. Consequently, parameter efficient transfer learning (PETL) of large models has grasped huge attention. While recent PETL methods showcase impressive performance, they rely on optimistic assumptions: 1) the entire parameter set of a PTM is available, and 2) a sufficiently large memory capacity for the fine-tuning is equipped. However, in most real-world applications, PTMs are served as a black-box API or proprietary software without explicit parameter accessibility. Besides, it is hard to meet a large memory requirement for modern PTMs. In this work, we propose black-box visual prompting (BlackVIP), which efficiently adapts the PTMs without knowledge about model architectures and parameters. BlackVIP has two components; 1) Coordinator and 2)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Video Surveillance and Tracking Methods · Multimodal Machine Learning Applications
