Exploring Learned Representations of Neural Networks with Principal Component Analysis
Amit Harlev, Andrew Engel, Panos Stinis, Tony Chiang

TL;DR
This paper uses PCA to analyze layer-wise representations in ResNet-18, revealing that a small subset of principal components can explain classifier performance and providing insights into neural collapse phenomena.
Contribution
It introduces a PCA-based analysis revealing that few principal components suffice for high classification accuracy and offers surrogate models to interpret neural collapse in DNNs.
Findings
20% of variance suffices for high accuracy in some layers
First ~100 PCs determine classifier performance
Surrogate models help locate neural collapse onset
Abstract
Understanding feature representation for deep neural networks (DNNs) remains an open question within the general field of explainable AI. We use principal component analysis (PCA) to study the performance of a k-nearest neighbors classifier (k-NN), nearest class-centers classifier (NCC), and support vector machines on the learned layer-wise representations of a ResNet-18 trained on CIFAR-10. We show that in certain layers, as little as 20% of the intermediate feature-space variance is necessary for high-accuracy classification and that across all layers, the first ~100 PCs completely determine the performance of the k-NN and NCC classifiers. We relate our findings to neural collapse and provide partial evidence for the related phenomenon of intermediate neural collapse. Our preliminary work provides three distinct yet interpretable surrogate models for feature representation with an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Neural Networks and Applications
Methodsk-Nearest Neighbors
