Simplicity Bias in 1-Hidden Layer Neural Networks
Depen Morwani, Jatin Batra, Prateek Jain, Praneeth Netrapalli

TL;DR
This paper rigorously defines and demonstrates the presence of simplicity bias in one-hidden-layer neural networks, showing they rely on low-dimensional input projections and proposing an ensemble method to improve robustness.
Contribution
It provides a formal definition of simplicity bias, proves its occurrence in linearly separable data, empirically verifies it on real datasets, and introduces an ensemble approach to enhance robustness.
Findings
Neural networks depend on low-dimensional input projections.
Simplicity bias occurs even with complex features present.
Ensemble training on diverse features improves robustness.
Abstract
Recent works have demonstrated that neural networks exhibit extreme simplicity bias(SB). That is, they learn only the simplest features to solve a task at hand, even in the presence of other, more robust but more complex features. Due to the lack of a general and rigorous definition of features, these works showcase SB on semi-synthetic datasets such as Color-MNIST, MNIST-CIFAR where defining features is relatively easier. In this work, we rigorously define as well as thoroughly establish SB for one hidden layer neural networks. More concretely, (i) we define SB as the network essentially being a function of a low dimensional projection of the inputs (ii) theoretically, we show that when the data is linearly separable, the network primarily depends on only the linearly separable (-dimensional) subspace even in the presence of an arbitrarily large number of other, more complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications · Digital Imaging for Blood Diseases · Domain Adaptation and Few-Shot Learning
