On the importance of single directions for generalization
Ari S. Morcos, David G.T. Barrett, Neil C. Rabinowitz, Matthew, Botvinick

TL;DR
This paper shows that a neural network's reliance on single directions correlates with its generalization ability, and that techniques like batch normalization reduce this reliance, which is a better predictor of performance than class selectivity.
Contribution
It demonstrates that reliance on single directions predicts generalization across various training conditions and analyzes how regularization methods influence this reliance.
Findings
Reliance on single directions predicts generalization performance.
Batch normalization reduces reliance on single directions.
Class selectivity is a poor predictor of task importance.
Abstract
Despite their ability to memorize large datasets, deep neural networks often achieve good generalization performance. However, the differences between the learned solutions of networks which generalize and those which do not remain unclear. Additionally, the tuning properties of single directions (defined as the activation of a single unit or some linear combination of units in response to some input) have been highlighted, but their importance has not been evaluated. Here, we connect these lines of inquiry to demonstrate that a network's reliance on single directions is a good predictor of its generalization performance, across networks trained on datasets with different fractions of corrupted labels, across ensembles of networks trained on datasets with unmodified labels, across different hyperparameters, and over the course of training. While dropout only regularizes this quantity up…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Blind Source Separation Techniques · Visual perception and processing mechanisms
MethodsDropout
