DST: Dynamic Substitute Training for Data-free Black-box Attack
Wenxuan Wang, Xuelin Qian, Yanwei Fu, Xiangyang Xue

TL;DR
This paper introduces a dynamic substitute training method for data-free black-box attacks on neural networks, adaptively optimizing the substitute model structure and enhancing data quality to improve attack success rates.
Contribution
It proposes a novel dynamic structure learning strategy and a graph-based data quality constraint, advancing the effectiveness of data-free black-box adversarial attacks.
Findings
Outperforms state-of-the-art methods on multiple datasets
Achieves higher attack success rates
Demonstrates adaptability to various models and tasks
Abstract
With the wide applications of deep neural network models in various computer vision tasks, more and more works study the model vulnerability to adversarial examples. For data-free black box attack scenario, existing methods are inspired by the knowledge distillation, and thus usually train a substitute model to learn knowledge from the target model using generated data as input. However, the substitute model always has a static network structure, which limits the attack ability for various target models and tasks. In this paper, we propose a novel dynamic substitute training attack method to encourage substitute model to learn better and faster from the target model. Specifically, a dynamic substitute structure learning strategy is proposed to adaptively generate optimal substitute model structure via a dynamic gate according to different target models and tasks. Moreover, we introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
