Understanding Model Reprogramming for CLIP via Decoupling Visual Prompts
Chengyi Cai, Zesheng Ye, Lei Feng, Jianzhong Qi, Feng Liu

TL;DR
This paper proposes a decoupling-and-reweighting framework for visual reprogramming of CLIP, improving the adaptation to downstream tasks by grouping prompts and probabilistically reweighting their contributions, leading to better performance and interpretability.
Contribution
It introduces a novel decoupled visual prompts method with a reweighting mechanism, enhancing CLIP reprogramming by capturing diverse description aspects and providing interpretability.
Findings
DVP outperforms baseline methods on 11 datasets.
The PRM integration offers insights into prompt influence.
Theoretically reduces empirical risk bound.
Abstract
Model reprogramming adapts pretrained models to downstream tasks by modifying only the input and output spaces. Visual reprogramming (VR) is one instance for vision tasks that adds a trainable noise pattern (i.e., a visual prompt) to input images to facilitate downstream classification. The existing VR approaches for CLIP train a single visual prompt using all descriptions of different downstream classes. However, the limited learning capacity may result in (1) a failure to capture diverse aspects of the descriptions (e.g., shape, color, and texture), and (2) a possible bias toward less informative attributes that do not help distinguish between classes. In this paper, we introduce a decoupling-and-reweighting framework. Our decoupled visual prompts (DVP) are optimized using descriptions grouped by explicit causes (DVP-cse) or unsupervised clusters (DVP-cls). Then, we integrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbedded Systems Design Techniques · Teaching and Learning Programming
