Asymmetric Certified Robustness via Feature-Convex Neural Networks
Samuel Pfrommer, Brendon G. Anderson, Julien Piet, Somayeh Sojoudi

TL;DR
This paper introduces a novel feature-convex neural network architecture that combines ICNNs with Lipschitz feature maps to achieve certified adversarial robustness across multiple datasets, with improved efficiency and state-of-the-art radii.
Contribution
It proposes a new feature-convex neural network model that extends ICNNs for adversarial robustness, providing theoretical guarantees and practical state-of-the-art certified radii.
Findings
Achieves state-of-the-art certified $oldsymbol{ ext{l}_1}$-radii.
Attains substantial $oldsymbol{ ext{l}_2}$- and $oldsymbol{ ext{l}_ ext{infinity}}$-radii.
More computationally efficient than existing baselines.
Abstract
Recent works have introduced input-convex neural networks (ICNNs) as learning models with advantageous training, inference, and generalization properties linked to their convex structure. In this paper, we propose a novel feature-convex neural network architecture as the composition of an ICNN with a Lipschitz feature map in order to achieve adversarial robustness. We consider the asymmetric binary classification setting with one "sensitive" class, and for this class we prove deterministic, closed-form, and easily-computable certified robust radii for arbitrary -norms. We theoretically justify the use of these models by characterizing their decision region geometry, extending the universal approximation theorem for ICNN regression to the classification setting, and proving a lower bound on the probability that such models perfectly fit even unstructured uniformly distributed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning
