DualCF: Efficient Model Extraction Attack from Counterfactual   Explanations

Yongjie Wang; Hangwei Qian; Chunyan Miao

arXiv:2205.06504·cs.CR·May 16, 2022

DualCF: Efficient Model Extraction Attack from Counterfactual Explanations

Yongjie Wang, Hangwei Qian, Chunyan Miao

PDF

Open Access

TL;DR

This paper introduces DualCF, a novel querying strategy that efficiently extracts models from cloud-based MLaaS platforms by leveraging counterfactual explanations and their counterfactuals, reducing query complexity and improving fidelity.

Contribution

The paper proposes the DualCF strategy that addresses decision boundary shift issues by using pairs of counterfactual explanations and their counterfactuals for more efficient model extraction.

Findings

01

DualCF achieves high-fidelity model extraction with fewer queries.

02

Experimental results show DualCF outperforms existing strategies.

03

The approach is effective on both synthetic and real-world datasets.

Abstract

Cloud service providers have launched Machine-Learning-as-a-Service (MLaaS) platforms to allow users to access large-scale cloudbased models via APIs. In addition to prediction outputs, these APIs can also provide other information in a more human-understandable way, such as counterfactual explanations (CF). However, such extra information inevitably causes the cloud models to be more vulnerable to extraction attacks which aim to steal the internal functionality of models in the cloud. Due to the black-box nature of cloud models, however, a vast number of queries are inevitably required by existing attack strategies before the substitute model achieves high fidelity. In this paper, we propose a novel simple yet efficient querying strategy to greatly enhance the querying efficiency to steal a classification model. This is motivated by our observation that current querying strategies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Scientific Computing and Data Management

Methodstravel james