Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders
Zeyang Sha, Xinlei He, Ning Yu, Michael Backes, Yang Zhang

TL;DR
This paper reveals the vulnerability of self-supervised image encoders to model stealing attacks and introduces Cont-Steal, a contrastive-learning-based attack that effectively mimics encoder performance, raising concerns about intellectual property protection.
Contribution
The paper demonstrates the vulnerability of unsupervised image encoders to stealing attacks and proposes Cont-Steal, a novel contrastive-learning-based method for more effective model stealing.
Findings
Conventional attacks are more effective against encoders than classifiers.
Cont-Steal significantly improves stealing success across various settings.
Encoders are vulnerable to new contrastive-learning-based stealing methods.
Abstract
Self-supervised representation learning techniques have been developing rapidly to make full use of unlabeled images. They encode images into rich features that are oblivious to downstream tasks. Behind their revolutionary representation power, the requirements for dedicated model designs and a massive amount of computation resources expose image encoders to the risks of potential model stealing attacks - a cheap way to mimic the well-trained encoder performance while circumventing the demanding requirements. Yet conventional attacks only target supervised classifiers given their predicted labels and/or posteriors, which leaves the vulnerability of unsupervised encoders unexplored. In this paper, we first instantiate the conventional stealing attacks against encoders and demonstrate their severer vulnerability compared with downstream classifiers. To better leverage the rich…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
