Guidance Through Surrogate: Towards a Generic Diagnostic Attack
Muzammal Naseer, Salman Khan, Fatih Porikli, Fahad Shahbaz Khan

TL;DR
This paper introduces G-PGA, a guided attack method that uses surrogate models to improve adversarial attack efficiency and diagnose gradient masking in defenses, advancing robustness evaluation techniques.
Contribution
The paper proposes G-PGA, a novel guided attack that enhances attack success and diagnostic capabilities against gradient masking defenses, without requiring extensive search or restarts.
Findings
G-PGA improves attack success rate and efficiency.
G-PGA effectively diagnoses gradient masking in defenses.
The method can be combined with ensemble attacks for better robustness evaluation.
Abstract
Adversarial training is an effective approach to make deep neural networks robust against adversarial attacks. Recently, different adversarial training defenses are proposed that not only maintain a high clean accuracy but also show significant robustness against popular and well studied adversarial attacks such as PGD. High adversarial robustness can also arise if an attack fails to find adversarial gradient directions, a phenomenon known as `gradient masking'. In this work, we analyse the effect of label smoothing on adversarial training as one of the potential causes of gradient masking. We then develop a guided mechanism to avoid local minima during attack optimization, leading to a novel attack dubbed Guided Projected Gradient Attack (G-PGA). Our attack approach is based on a `match and deceive' loss that finds optimal adversarial directions through guidance from a surrogate model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsLabel Smoothing · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
