An Analysis of Robustness of Non-Lipschitz Networks
Maria-Florina Balcan, Avrim Blum, Dravyansh Sharma, Hongyang, Zhang

TL;DR
This paper investigates the robustness of non-Lipschitz neural networks against adversarial attacks, proposing a theoretical framework that explains their vulnerabilities and potential defenses through abstention and data-driven parameter tuning.
Contribution
It introduces a novel attack model based on low-dimensional subspace perturbations, analyzes the limitations of classifiers under this model, and offers robustness guarantees for abstention strategies.
Findings
Adversaries can be powerful within the proposed model, defeating classifiers without abstention.
Allowing abstention enables classifiers to maintain high accuracy against adversaries.
Empirical results show high robust accuracy with low abstention rates in contrastive learning.
Abstract
Despite significant advances, deep networks remain highly susceptible to adversarial attack. One fundamental challenge is that small input perturbations can often produce large movements in the network's final-layer feature space. In this paper, we define an attack model that abstracts this challenge, to help understand its intrinsic properties. In our model, the adversary may move data an arbitrary distance in feature space but only in random low-dimensional subspaces. We prove such adversaries can be quite powerful: defeating any algorithm that must classify any input it is given. However, by allowing the algorithm to abstain on unusual inputs, we show such adversaries can be overcome when classes are reasonably well-separated in feature space. We further provide strong theoretical guarantees for setting algorithm parameters to optimize over accuracy-abstention trade-offs using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
