Reinitializing weights vs units for maintaining plasticity in neural networks

J. Fernando Hernandez-Garcia; Shibhansh Dohare; Jun Luo; Rich S. Sutton

arXiv:2508.00212·cs.NE·August 21, 2025

Reinitializing weights vs units for maintaining plasticity in neural networks

J. Fernando Hernandez-Garcia, Shibhansh Dohare, Jun Luo, Rich S. Sutton

PDF

Open Access

TL;DR

This paper compares reinitializing weights versus units to preserve neural network plasticity during continual learning, introducing a new selective weight reinitialization algorithm that outperforms previous methods in certain settings.

Contribution

The paper introduces a novel selective weight reinitialization algorithm and systematically compares it to existing reinitialization schemes in continual learning scenarios.

Findings

01

Reinitializing weights is more effective for small networks and those with layer normalization.

02

Reinitializing weights maintains plasticity across a broader range of settings.

03

Reinitializing units and weights are equally effective in large, normalization-free networks.

Abstract

Loss of plasticity is a phenomenon in which a neural network loses its ability to learn when trained for an extended time on non-stationary data. It is a crucial problem to overcome when designing systems that learn continually. An effective technique for preventing loss of plasticity is reinitializing parts of the network. In this paper, we compare two different reinitialization schemes: reinitializing units vs reinitializing weights. We propose a new algorithm, which we name \textit{selective weight reinitialization}, for reinitializing the least useful weights in a network. We compare our algorithm to continual backpropagation and ReDo, two previously proposed algorithms that reinitialize units in the network. Through our experiments in continual supervised learning problems, we identify two settings when reinitializing weights is more effective at maintaining plasticity than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Neural Networks and Applications