Neural Network Plasticity and Loss Sharpness

Max Koster; Jude Kukla

arXiv:2409.17300·cs.LG·September 27, 2024

Neural Network Plasticity and Loss Sharpness

Max Koster, Jude Kukla

PDF

Open Access

TL;DR

This paper investigates the relationship between neural network plasticity and loss landscape sharpness in continual learning, finding that sharpness regularization techniques do not effectively reduce plasticity loss in non-stationary environments.

Contribution

The study examines the effectiveness of sharpness regularization methods in mitigating plasticity loss during continual learning, revealing their limited impact.

Findings

01

Sharpness regularization techniques do not significantly reduce plasticity loss.

02

Plasticity loss is highly related to loss landscape sharpness in non-stationary RL.

03

Regularization methods aimed at smooth minima may not improve adaptation in continual learning.

Abstract

In recent years, continual learning, a prediction setting in which the problem environment may evolve over time, has become an increasingly popular research field due to the framework's gearing towards complex, non-stationary objectives. Learning such objectives requires plasticity, or the ability of a neural network to adapt its predictions to a different task. Recent findings indicate that plasticity loss on new tasks is highly related to loss landscape sharpness in non-stationary RL frameworks. We explore the usage of sharpness regularization techniques, which seek out smooth minima and have been touted for their generalization capabilities in vanilla prediction settings, in efforts to combat plasticity loss. Our findings indicate that such techniques have no significant effect on reducing plasticity loss.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications