Linking average- and worst-case perturbation robustness via class selectivity and dimensionality
Matthew L. Leavitt, Ari Morcos

TL;DR
This paper investigates how class selectivity and representational dimensionality in neural networks influence robustness to both average-case and worst-case input perturbations, revealing a trade-off between robustness types.
Contribution
It demonstrates that lower class selectivity improves average-case robustness but reduces worst-case robustness, linking these effects to representational dimensionality and attack surface.
Findings
Lower class selectivity correlates with increased robustness to naturalistic perturbations.
Higher class selectivity enhances robustness against adversarial attacks.
Representational dimensionality is inversely related to class selectivity and impacts vulnerability.
Abstract
Representational sparsity is known to affect robustness to input perturbations in deep neural networks (DNNs), but less is known about how the semantic content of representations affects robustness. Class selectivity-the variability of a unit's responses across data classes or dimensions-is one way of quantifying the sparsity of semantic representations. Given recent evidence that class selectivity may not be necessary for, and in some cases can impair generalization, we investigate whether it also confers robustness (or vulnerability) to perturbations of input data. We found that networks regularized to have lower levels of class selectivity were more robust to average-case (naturalistic) perturbations, while networks with higher class selectivity are more vulnerable. In contrast, class selectivity increases robustness to multiple types of worst-case (i.e. white box adversarial)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Smart Grid Security and Resilience
