Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image   Encoders

Zeyang Sha; Xinlei He; Ning Yu; Michael Backes; Yang Zhang

arXiv:2201.07513·cs.CR·March 28, 2023·1 cites

Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders

Zeyang Sha, Xinlei He, Ning Yu, Michael Backes, Yang Zhang

PDF

Open Access 1 Repo

TL;DR

This paper reveals the vulnerability of self-supervised image encoders to model stealing attacks and introduces Cont-Steal, a contrastive-learning-based attack that effectively mimics encoder performance, raising concerns about intellectual property protection.

Contribution

The paper demonstrates the vulnerability of unsupervised image encoders to stealing attacks and proposes Cont-Steal, a novel contrastive-learning-based method for more effective model stealing.

Findings

01

Conventional attacks are more effective against encoders than classifiers.

02

Cont-Steal significantly improves stealing success across various settings.

03

Encoders are vulnerable to new contrastive-learning-based stealing methods.

Abstract

Self-supervised representation learning techniques have been developing rapidly to make full use of unlabeled images. They encode images into rich features that are oblivious to downstream tasks. Behind their revolutionary representation power, the requirements for dedicated model designs and a massive amount of computation resources expose image encoders to the risks of potential model stealing attacks - a cheap way to mimic the well-trained encoder performance while circumventing the demanding requirements. Yet conventional attacks only target supervised classifiers given their predicted labels and/or posteriors, which leaves the vulnerability of unsupervised encoders unexplored. In this paper, we first instantiate the conventional stealing attacks against encoders and demonstrate their severer vulnerability compared with downstream classifiers. To better leverage the rich…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zeyangsha/cont-steal
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning