AdaGAT: Adaptive Guidance Adversarial Training for the Robustness of Deep Neural Networks
Zhenyu Liu, Huizhi Liang, Xinrun Li, Vaclav Snasel, and Varun Ojha

TL;DR
AdaGAT introduces an adaptive guidance adversarial training method that dynamically adjusts the guide model to improve the robustness of deep neural networks against adversarial attacks.
Contribution
The paper proposes a novel AdaGAT method with dynamic guide model adjustment and dual loss functions for enhanced adversarial robustness transfer.
Findings
AdaGAT improves target model robustness across multiple datasets.
Dynamic guide adjustment outperforms static guidance methods.
The approach is effective against various adversarial attacks.
Abstract
Adversarial distillation (AD) is a knowledge distillation technique that facilitates the transfer of robustness from teacher deep neural network (DNN) models to lightweight target (student) DNN models, enabling the target models to perform better than only training the student model independently. Some previous works focus on using a small, learnable teacher (guide) model to improve the robustness of a student model. Since a learnable guide model starts learning from scratch, maintaining its optimal state for effective knowledge transfer during co-training is challenging. Therefore, we propose a novel Adaptive Guidance Adversarial Training (AdaGAT) method. Our method, AdaGAT, dynamically adjusts the training state of the guide model to install robustness to the target model. Specifically, we develop two separate loss functions as part of the AdaGAT method, allowing the guide model to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
