SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation

Wuxinlin Cheng; Chenhui Deng; Zhiqiang Zhao; Yaohui Cai; Zhiru Zhang,; Zhuo Feng

arXiv:2102.03716·cs.LG·June 15, 2021·1 cites

SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation

Wuxinlin Cheng, Chenhui Deng, Zhiqiang Zhao, Yaohui Cai, Zhiru Zhang,, Zhuo Feng

PDF

Open Access 2 Repos 1 Video

TL;DR

SPADE is a spectral method that evaluates and enhances the adversarial robustness of machine learning models by analyzing input/output data manifolds and identifying vulnerable samples, leading to improved training strategies.

Contribution

Introduces SPADE, a spectral approach utilizing graph theory and eigenvector analysis to evaluate and improve model robustness against adversarial attacks.

Findings

01

SPADE provides an upper bound on the Lipschitz constant for robustness evaluation.

02

Spectral embedding identifies highly vulnerable data samples.

03

Empirical results show improved robustness on MNIST and CIFAR-10 datasets.

Abstract

A black-box spectral method is introduced for evaluating the adversarial robustness of a given machine learning (ML) model. Our approach, named SPADE, exploits bijective distance mapping between the input/output graphs constructed for approximating the manifolds corresponding to the input/output data. By leveraging the generalized Courant-Fischer theorem, we propose a SPADE score for evaluating the adversarial robustness of a given model, which is proved to be an upper bound of the best Lipschitz constant under the manifold setting. To reveal the most non-robust data samples highly vulnerable to adversarial attacks, we develop a spectral graph embedding procedure leveraging dominant generalized eigenvectors. This embedding step allows assigning each data sample a robustness score that can be further harnessed for more effective adversarial training. Our experiments show the proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning

MethodsSpatially-Adaptive Normalization