Linking average- and worst-case perturbation robustness via class   selectivity and dimensionality

Matthew L. Leavitt; Ari Morcos

arXiv:2010.07693·cs.LG·March 31, 2021

Linking average- and worst-case perturbation robustness via class selectivity and dimensionality

Matthew L. Leavitt, Ari Morcos

PDF

Open Access

TL;DR

This paper investigates how class selectivity and representational dimensionality in neural networks influence robustness to both average-case and worst-case input perturbations, revealing a trade-off between robustness types.

Contribution

It demonstrates that lower class selectivity improves average-case robustness but reduces worst-case robustness, linking these effects to representational dimensionality and attack surface.

Findings

01

Lower class selectivity correlates with increased robustness to naturalistic perturbations.

02

Higher class selectivity enhances robustness against adversarial attacks.

03

Representational dimensionality is inversely related to class selectivity and impacts vulnerability.

Abstract

Representational sparsity is known to affect robustness to input perturbations in deep neural networks (DNNs), but less is known about how the semantic content of representations affects robustness. Class selectivity-the variability of a unit's responses across data classes or dimensions-is one way of quantifying the sparsity of semantic representations. Given recent evidence that class selectivity may not be necessary for, and in some cases can impair generalization, we investigate whether it also confers robustness (or vulnerability) to perturbations of input data. We found that networks regularized to have lower levels of class selectivity were more robust to average-case (naturalistic) perturbations, while networks with higher class selectivity are more vulnerable. In contrast, class selectivity increases robustness to multiple types of worst-case (i.e. white box adversarial)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Smart Grid Security and Resilience