Adversarial Robustness of Supervised Sparse Coding
Jeremias Sulam, Ramchandran Muthukumar, Raman Arora

TL;DR
This paper analyzes the adversarial robustness of supervised sparse coding models, providing theoretical bounds and certificates for their stability against perturbations, and demonstrating practical certified accuracy on real data.
Contribution
It introduces a model combining sparse encoding with linear classification, offering new theoretical bounds and robustness certificates for adversarial robustness.
Findings
Bounded robust risk for models with mild encoder gap
Provided a certified robustness guarantee for end-to-end classification
Achieved certified accuracy on real datasets and compared with other methods
Abstract
Several recent results provide theoretical insights into the phenomena of adversarial examples. Existing results, however, are often limited due to a gap between the simplicity of the models studied and the complexity of those deployed in practice. In this work, we strike a better balance by considering a model that involves learning a representation while at the same time giving a precise generalization bound and a robustness certificate. We focus on the hypothesis class obtained by combining a sparsity-promoting encoder coupled with a linear classifier, and show an interesting interplay between the expressivity and stability of the (supervised) representation map and a notion of margin in the feature space. We bound the robust risk (to -bounded perturbations) of hypotheses parameterized by dictionaries that achieve a mild encoder gap on training data. Furthermore, we provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques · Wireless Communication Security Techniques
