Examining the Threat Landscape: Foundation Models and Model Stealing

Ankita Raj; Deepankar Varma; Chetan Arora

arXiv:2502.18077·cs.CV·April 16, 2025

Examining the Threat Landscape: Foundation Models and Model Stealing

Ankita Raj, Deepankar Varma, Chetan Arora

PDF

Open Access

TL;DR

This paper investigates how foundation models in computer vision are more vulnerable to model stealing attacks than traditional architectures, highlighting security risks in deploying such models in commercial settings.

Contribution

It provides empirical evidence that foundation models are more susceptible to model stealing, emphasizing the need for enhanced security measures in their deployment.

Findings

01

ViT-L/16 achieved 94.28% agreement in theft scenarios

02

ResNet-18 achieved 73.20% agreement in theft scenarios

03

Foundation models pose higher security risks for commercial APIs

Abstract

Foundation models (FMs) for computer vision learn rich and robust representations, enabling their adaptation to task/domain-specific deployments with little to no fine-tuning. However, we posit that the very same strength can make applications based on FMs vulnerable to model stealing attacks. Through empirical analysis, we reveal that models fine-tuned from FMs harbor heightened susceptibility to model stealing, compared to conventional vision architectures like ResNets. We hypothesize that this behavior is due to the comprehensive encoding of visual patterns and features learned by FMs during pre-training, which are accessible to both the attacker and the victim. We report that an attacker is able to obtain 94.28% agreement (matched predictions with victim) for a Vision Transformer based victim model (ViT-L/16) trained on CIFAR-10 dataset, compared to only 73.20% agreement for a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTerrorism, Counterterrorism, and Political Violence · Information and Cyber Security

MethodsAttention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer