Non-Equilibrium Stochastic Dynamics as a Unified Framework for Insight and Repetitive Learning: A Kramers Escape Approach to Continual Learning
Gunn Kim

TL;DR
This paper models continual learning as a non-equilibrium stochastic process, unifying insight and repetition through temperature-controlled transitions on an energy landscape, and explains stability-plasticity trade-offs.
Contribution
It introduces a physics-inspired framework using Langevin dynamics and Kramers escape theory to analyze and improve continual learning in neural networks.
Findings
EWC penalty is an energy barrier growing with tasks, causing exponential transition rate decay.
Insight events are transient temperature spikes enabling rapid state transitions.
Repetitive learning maintains a steady temperature for gradual, sustained transitions.
Abstract
Continual learning in artificial neural networks is fundamentally limited by the stability--plasticity dilemma: systems that retain prior knowledge tend to resist acquiring new knowledge, and vice versa. Existing approaches, most notably elastic weight consolidation~(EWC), address this empirically without a physical account of why plasticity eventually collapses as tasks accumulate. Separately, the distinction between sudden insight and gradual skill acquisition through repetitive practice has lacked a unified theoretical description. Here, we show that both problems admit a common resolution within non-equilibrium statistical physics. We model the state of a learning system as a particle evolving under Langevin dynamics on a double-well energy landscape, with the noise amplitude governed by a time-dependent effective temperature . The probability density obeys a Fokker--Planck…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
