Sample-specific Masks for Visual Reprogramming-based Prompting
Chengyi Cai, Zesheng Ye, Lei Feng, Jianzhong Qi, Feng Liu

TL;DR
This paper introduces sample-specific multi-channel masks (SMM) for visual reprogramming, enhancing generalization and reducing approximation error by generating individual masks per sample instead of using a shared mask.
Contribution
The paper proposes a novel SMM framework that uses a lightweight ConvNet to generate sample-specific masks, improving VR performance over state-of-the-art methods.
Findings
SMM reduces approximation error compared to shared masks.
SMM improves performance on ResNet and ViT models.
Sample-specific masks enhance VR's generalization ability.
Abstract
Visual reprogramming (VR) is a prompting technique that aims to re-purpose a pre-trained model (e.g., a classifier on ImageNet) to target tasks (e.g., medical data prediction) by learning a small-scale pattern added into input images instead of tuning considerable parameters within the model. The location of the pattern within input samples is usually determined by a pre-defined mask shared across all samples. In this paper, we show that the shared mask potentially limits VR's generalization and increases its approximation error due to the lack of sample-level adaptation. Motivated by this finding, we design a new framework for VR called sample-specific multi-channel masks (SMM). Specifically, SMM employs a lightweight ConvNet and patch-wise interpolation to generate sample-specific three-channel masks instead of a shared and pre-defined mask. Since we generate different masks for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors
MethodsConvolution · Kaiming Initialization · Max Pooling · Average Pooling · Global Average Pooling
