Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks

Patrick Elisii; Lucas Beauchemin; Dawer Jamshed

arXiv:2605.06834·cs.LG·May 11, 2026

Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks

Patrick Elisii, Lucas Beauchemin, Dawer Jamshed

PDF

TL;DR

This paper introduces GXD, a new utility measure based on gradient attribution, to improve neuron reset interventions for maintaining plasticity in deep networks during continual learning.

Contribution

It proposes GXD, a theoretically motivated utility measure that better aligns with functional cost, enhancing the reliability of neuron reset strategies in continual learning.

Findings

01

GXD outperforms existing utility measures in reset interventions.

02

Utility measures aligned with functional cost improve plasticity restoration.

03

GXD provides a practical approach to robust continual learning.

Abstract

Continual learning research attempts to conserve two fundamental capabilities: new knowledge acquisition and the preservation of previously acquired knowledge. While knowledge in this case can be measured through performance over an implicit or explicit task space, model plasticity generally concerns adaptability as data distributions evolve. Though much of the literature has focused on catastrophic forgetting, deep networks can also suffer from loss of plasticity, becoming progressively harder to update under continued training. Recent research has identified multiple mechanisms underlying this phenomenon, including neuron saturation, parameter norm growth, and loss of useful curvature directions. Adaptive reset-based interventions, which selectively reinitialize low-utility network parameters, have emerged as practical solutions to restore trainability. Existing utility measures used…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.