Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ   Trade-Off Between Accuracy and Robustness of DNNs

Souvik Kundu; Sairam Sundaresan; Sharath Nittur Sridhar; Shunlin Lu,; Han Tang; Peter A. Beerel

arXiv:2302.03523·cs.CV·February 8, 2023

Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs

Souvik Kundu, Sairam Sundaresan, Sharath Nittur Sridhar, Shunlin Lu,, Han Tang, Peter A. Beerel

PDF

Open Access

TL;DR

The paper introduces SMART, a sparse mixture approach enabling a single DNN to efficiently balance accuracy and robustness against adversarial attacks with reduced compute and parameter costs.

Contribution

SMART is a novel method that trains a model once with sparse expert paths for clean and adversarial images, enabling in-situ trade-offs with lower overhead.

Findings

01

Achieves up to 2.72x fewer non-zero parameters.

02

Maintains state-of-the-art accuracy-robustness trade-off.

03

Reduces compute overhead while preserving performance.

Abstract

Existing deep neural networks (DNNs) that achieve state-of-the-art (SOTA) performance on both clean and adversarially-perturbed images rely on either activation or weight conditioned convolution operations. However, such conditional learning costs additional multiply-accumulate (MAC) or addition operations, increasing inference memory and compute costs. To that end, we present a sparse mixture once for all adversarial training (SMART), that allows a model to train once and then in-situ trade-off between accuracy and robustness, that too at a reduced compute and parameter overhead. In particular, SMART develops two expert paths, for clean and adversarial images, respectively, that are then conditionally trained via respective dedicated sets of binary sparsity masks. Extensive evaluations on multiple image classification datasets across different models show SMART to have up to 2.72x…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications

MethodsConvolution