Examining the Threat Landscape: Foundation Models and Model Stealing
Ankita Raj, Deepankar Varma, Chetan Arora

TL;DR
This paper investigates how foundation models in computer vision are more vulnerable to model stealing attacks than traditional architectures, highlighting security risks in deploying such models in commercial settings.
Contribution
It provides empirical evidence that foundation models are more susceptible to model stealing, emphasizing the need for enhanced security measures in their deployment.
Findings
ViT-L/16 achieved 94.28% agreement in theft scenarios
ResNet-18 achieved 73.20% agreement in theft scenarios
Foundation models pose higher security risks for commercial APIs
Abstract
Foundation models (FMs) for computer vision learn rich and robust representations, enabling their adaptation to task/domain-specific deployments with little to no fine-tuning. However, we posit that the very same strength can make applications based on FMs vulnerable to model stealing attacks. Through empirical analysis, we reveal that models fine-tuned from FMs harbor heightened susceptibility to model stealing, compared to conventional vision architectures like ResNets. We hypothesize that this behavior is due to the comprehensive encoding of visual patterns and features learned by FMs during pre-training, which are accessible to both the attacker and the victim. We report that an attacker is able to obtain 94.28% agreement (matched predictions with victim) for a Vision Transformer based victim model (ViT-L/16) trained on CIFAR-10 dataset, compared to only 73.20% agreement for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTerrorism, Counterterrorism, and Political Violence · Information and Cyber Security
MethodsAttention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
