What Do Neural Networks Learn When Trained With Random Labels?
Hartmut Maennel, Ibrahim Alabdulmohsin, Ilya Tolstikhin, Robert J. N. Baldock, Olivier Bousquet, Sylvain Gelly, and Daniel Keysers

TL;DR
This paper investigates what deep neural networks learn when trained on random labels. It shows that the principal components of the network parameters align with those of the data, and that this alignment can produce positive transfer to downstream tasks.
Contribution
It analytically demonstrates parameter-data alignment in networks trained with random labels and shows how this alignment facilitates transfer learning.
Findings
Networks trained on random labels exhibit parameter-data alignment.
Networks pre-trained with random labels train faster on downstream tasks than networks trained from scratch, even after accounting for simple effects such as weight scaling.
Alignment effects are observed across various architectures and datasets.
Abstract
We study deep neural networks (DNNs) trained on natural image data with entirely random labels. Despite its popularity in the literature, where it is often used to study memorization, generalization, and other phenomena, little is known about what DNNs learn in this setting. In this paper, we show analytically for convolutional and fully connected networks that an alignment between the principal components of network parameters and data takes place when training with random labels. We study this alignment effect by investigating neural networks pre-trained on randomly labelled image data and subsequently fine-tuned on disjoint datasets with random or real labels. We show how this alignment produces a positive transfer: networks pre-trained with random labels train faster downstream compared to training from scratch even after accounting for simple effects, such as weight scaling. We…
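The alignment described in the abstract can be illustrated with a small NumPy sketch. It is not the paper's method or metric: the function names `covariance` and `misalignment`, the synthetic Gaussian data, and the off-diagonal energy score are all assumptions made here for illustration. The idea is that if the weight covariance shares its eigenvectors with the data covariance, it becomes (nearly) diagonal when expressed in the data's eigenbasis.

```python
import numpy as np

def covariance(rows):
    """Sample covariance of the row vectors in `rows` (shape: n x d)."""
    centered = rows - rows.mean(axis=0, keepdims=True)
    return centered.T @ centered / rows.shape[0]

def misalignment(cov_a, cov_b):
    """Illustrative score (hypothetical, not the paper's definition).

    Expresses cov_b in the eigenbasis of cov_a and returns the fraction of
    its squared Frobenius norm that lies off the diagonal. A value near 0
    means the two matrices (nearly) share eigenvectors.
    """
    _, eigvecs = np.linalg.eigh(cov_a)
    rotated = eigvecs.T @ cov_b @ eigvecs
    off_diag = rotated - np.diag(np.diag(rotated))
    return float(np.sum(off_diag**2) / np.sum(rotated**2))

rng = np.random.default_rng(0)
d = 8

# Two random orthonormal bases: one shared with the data, one unrelated.
data_basis, _ = np.linalg.qr(rng.normal(size=(d, d)))
other_basis, _ = np.linalg.qr(rng.normal(size=(d, d)))
data_scales = np.linspace(3.0, 0.5, d)
weight_scales = np.linspace(2.0, 0.2, d)

# Synthetic stand-ins for inputs and first-layer weight vectors (assumed).
data = (rng.normal(size=(5000, d)) * data_scales) @ data_basis.T
aligned_w = (rng.normal(size=(2000, d)) * weight_scales) @ data_basis.T
unaligned_w = (rng.normal(size=(2000, d)) * weight_scales) @ other_basis.T

cov_x = covariance(data)
score_aligned = misalignment(cov_x, covariance(aligned_w))
score_unaligned = misalignment(cov_x, covariance(unaligned_w))
print(f"aligned: {score_aligned:.4f}  unaligned: {score_unaligned:.4f}")
```

In a real experiment, `data` would be replaced by flattened image patches and the weight rows by the first-layer filters of a trained network. The score only tests for shared eigenvectors; it says nothing about how the eigenvalues of the two covariances relate.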
Taxonomy
Topics: Machine Learning and Data Classification · Neural Networks and Applications · Machine Learning and Algorithms
