Simulating Unknown Target Models for Query-Efficient Black-box Attacks
Chen Ma, Li Chen, Jun-Hai Yong

TL;DR
This paper introduces a 'Simulator' model trained via meta-learning to mimic unknown target models, significantly reducing query complexity in black-box adversarial attacks across multiple datasets.
Contribution
The study proposes a generalized Simulator for black-box attacks, trained with meta-learning to efficiently mimic various target models, reducing query requirements.
Findings
Reduces query complexity by several orders of magnitude.
Effective across CIFAR-10, CIFAR-100, and TinyImageNet datasets.
Achieves accurate model simulation with limited feedback.
Abstract
Many adversarial attacks have been proposed to investigate the security issues of deep neural networks. In the black-box setting, current model stealing attacks train a substitute model to counterfeit the functionality of the target model. However, the training requires querying the target model. Consequently, the query complexity remains high, and such attacks can be defended easily. This study aims to train a generalized substitute model called "Simulator", which can mimic the functionality of any unknown target model. To this end, we build the training data with the form of multiple tasks by collecting query sequences generated during the attacks of various existing networks. The learning process uses a mean square error-based knowledge-distillation loss in the meta-learning to minimize the difference between the Simulator and the sampled networks. The meta-gradients of this loss are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
