Critical Learning Periods in Deep Neural Networks
Alessandro Achille, Matteo Rovere, Stefano Soatto

TL;DR
This paper investigates critical learning periods in deep neural networks, revealing that early training phases are crucial for establishing effective connections, with implications for understanding learning dynamics and invariance in artificial systems.
Contribution
It introduces a novel analysis using Fisher Information to identify critical periods in deep learning and demonstrates the importance of early training in shaping network performance.
Findings
Early training phases rapidly increase Fisher Information in weights.
Post-initial phase, information decreases, indicating reduced plasticity.
Critical periods influence the network's ability to learn invariances.
Abstract
Similar to humans and animals, deep artificial neural networks exhibit critical periods during which a temporary stimulus deficit can impair the development of a skill. The extent of the impairment depends on the onset and length of the deficit window, as in animal models, and on the size of the neural network. Deficits that do not affect low-level statistics, such as vertical flipping of the images, have no lasting effect on performance and can be overcome with further training. To better understand this phenomenon, we use the Fisher Information of the weights to measure the effective connectivity between layers of a network during training. Counterintuitively, information rises rapidly in the early phases of training, and then decreases, preventing redistribution of information resources in a phenomenon we refer to as a loss of "Information Plasticity". Our analysis suggests that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Neural dynamics and brain function
