Predicting Plasticity in Deep Continual Learning: A Theoretical Perspective

Jiuqi Wang; Jayanth Srinivasa; Claire Chen; Shuze Daniel Liu; Ali Payani; Shangtong Zhang

arXiv:2605.09044·cs.LG·May 12, 2026

Predicting Plasticity in Deep Continual Learning: A Theoretical Perspective

Jiuqi Wang, Jayanth Srinivasa, Claire Chen, Shuze Daniel Liu, Ali Payani, Shangtong Zhang

PDF

TL;DR

This paper investigates the concept of plasticity in deep continual learning, critiques existing diagnostics, and introduces a new metric called optimization readiness that better predicts a model's future trainability.

Contribution

It provides a theoretical analysis showing limitations of existing diagnostics and proposes a novel, theoretically grounded metric that empirically outperforms prior methods in predicting plasticity.

Findings

01

Existing diagnostics like representation rank can fail to predict trainability.

02

Optimization readiness correlates with one-step optimization gain.

03

Empirical results show optimization readiness ranks checkpoints more reliably.

Abstract

Deep continual learning requires models to adapt to new tasks without retraining from scratch. However, neural networks can lose their ability to adapt to new tasks after training on previous ones, a phenomenon known as loss of plasticity. There have been several explanations and diagnostics proposed for plasticity loss. Motivated by the philosophy that "all models are wrong, but some are useful", we ask: can existing diagnostics predict a neural network's plasticity? In this work, we take a practical view to interpret plasticity as trainability, i.e., a neural network's future optimization gain on a target task. We first take a theoretical approach, showing, by constructing a few counterexamples, that some widely adopted diagnostics of plasticity, including representation rank and neural tangent kernel rank, can fail to predict the loss of trainability in both regression and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.