An Analysis of Robustness of Non-Lipschitz Networks

Maria-Florina Balcan; Avrim Blum; Dravyansh Sharma; Hongyang; Zhang

arXiv:2010.06154·cs.LG·April 20, 2023·1 cites

An Analysis of Robustness of Non-Lipschitz Networks

Maria-Florina Balcan, Avrim Blum, Dravyansh Sharma, Hongyang, Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates the robustness of non-Lipschitz neural networks against adversarial attacks, proposing a theoretical framework that explains their vulnerabilities and potential defenses through abstention and data-driven parameter tuning.

Contribution

It introduces a novel attack model based on low-dimensional subspace perturbations, analyzes the limitations of classifiers under this model, and offers robustness guarantees for abstention strategies.

Findings

01

Adversaries can be powerful within the proposed model, defeating classifiers without abstention.

02

Allowing abstention enables classifiers to maintain high accuracy against adversaries.

03

Empirical results show high robust accuracy with low abstention rates in contrastive learning.

Abstract

Despite significant advances, deep networks remain highly susceptible to adversarial attack. One fundamental challenge is that small input perturbations can often produce large movements in the network's final-layer feature space. In this paper, we define an attack model that abstracts this challenge, to help understand its intrinsic properties. In our model, the adversary may move data an arbitrary distance in feature space but only in random low-dimensional subspaces. We prove such adversaries can be quite powerful: defeating any algorithm that must classify any input it is given. However, by allowing the algorithm to abstain on unusual inputs, we show such adversaries can be overcome when classes are reasonably well-separated in feature space. We further provide strong theoretical guarantees for setting algorithm parameters to optimize over accuracy-abstention trade-offs using…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dravyanshsharma/adversarial-contrastive
pytorchOfficial

Videos

An Analysis of Robustness of Non-Lipschitz Networks· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications