Neural Representations Reveal Distinct Modes of Class Fitting in   Residual Convolutional Networks

Micha{\l} Jamro\.z; Marcin Kurdziel

arXiv:2212.00771·cs.LG·December 2, 2022

Neural Representations Reveal Distinct Modes of Class Fitting in Residual Convolutional Networks

Micha{\l} Jamro\.z, Marcin Kurdziel

PDF

Open Access 1 Repo

TL;DR

This paper uses probabilistic models to analyze how residual networks fit classes, revealing two distinct modes of class representation that relate to memorization and robustness, especially in deeper layers.

Contribution

It uncovers two different modes of class fitting in residual networks' deep layers using class-conditional density models, linking these modes to memorization and robustness.

Findings

01

Classes are fitted with two distinct distribution modes.

02

Deeper layers reveal these modes, not low-level features.

03

Representation structures correlate with memorization and robustness.

Abstract

We leverage probabilistic models of neural representations to investigate how residual networks fit classes. To this end, we estimate class-conditional density models for representations learned by deep ResNets. We then use these models to characterize distributions of representations across learned classes. Surprisingly, we find that classes in the investigated models are not fitted in an uniform way. On the contrary: we uncover two groups of classes that are fitted with markedly different distributions of representations. These distinct modes of class-fitting are evident only in the deeper layers of the investigated models, indicating that they are not related to low-level image features. We show that the uncovered structure in neural representations correlate with memorization of training examples and adversarial robustness. Finally, we compare class-conditional distributions of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mjamroz90/dnn-class-fitting
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Model Reduction and Neural Networks