HYDRA: Pruning Adversarially Robust Neural Networks
Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana

TL;DR
HYDRA is a novel pruning method that jointly optimizes for robustness and efficiency in neural networks, achieving state-of-the-art accuracy on multiple datasets and robust training techniques.
Contribution
The paper introduces a robust training-aware pruning approach formulated as an empirical risk minimization problem, improving robustness and compression simultaneously.
Findings
Achieves state-of-the-art benign and robust accuracy.
Effective across multiple datasets and training techniques.
Identifies highly robust sub-networks within larger networks.
Abstract
In safety-critical but computationally resource-constrained applications, deep learning faces two key challenges: lack of robustness against adversarial attacks and large neural network size (often millions of parameters). While the research community has extensively explored the use of robust training and network pruning independently to address one of these challenges, only a few recent works have studied them jointly. However, these works inherit a heuristic pruning strategy that was developed for benign training, which performs poorly when integrated with robust training techniques, including adversarial training and verifiable robust training. To overcome this challenge, we propose to make pruning techniques aware of the robust training objective and let the training objective guide the search for which connections to prune. We realize this insight by formulating the pruning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
MethodsPruning · Stochastic Gradient Descent
