Unifying Gradients to Improve Real-world Robustness for Deep Networks
Yingwen Wu, Sizhe Chen, Kun Fang, Xiaolin Huang

TL;DR
This paper introduces UniG, a method that unifies gradients across data to defend deep neural networks against score-based query attacks, significantly enhancing real-world robustness without sacrificing accuracy.
Contribution
UniG is a novel defense technique that unifies gradients across data samples to weaken attack directions, improving robustness against SQAs while maintaining accuracy.
Findings
UniG maintains 77.80% accuracy under 2500-query Square attack on CIFAR10.
UniG outperforms state-of-the-art adversarial training in robustness.
UniG achieves minimal output modification, preserving clean accuracy.
Abstract
The wide application of deep neural networks (DNNs) demands an increasing amount of attention to their real-world robustness, i.e., whether a DNN resists black-box adversarial attacks, among which score-based query attacks (SQAs) are most threatening since they can effectively hurt a victim network with the only access to model outputs. Defending against SQAs requires a slight but artful variation of outputs due to the service purpose for users, who share the same output information with SQAs. In this paper, we propose a real-world defense by Unifying Gradients (UniG) of different data so that SQAs could only probe a much weaker attack direction that is similar for different samples. Since such universal attack perturbations have been validated as less aggressive than the input-specific perturbations, UniG protects real-world DNNs by indicating attackers a twisted and less informative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Cardiac Arrest and Resuscitation · Anomaly Detection Techniques and Applications
Methodstravel james
