Sufficient conditions for offline reactivation in recurrent neural networks
Nanda H. Krishna, Colin Bredenberg, Daniel Levenstein, Blake A. Richards, Guillaume Lajoie

TL;DR
This paper develops a mathematical framework to identify conditions under which neural networks can autonomously reactivate states associated with online behavior during offline periods, such as sleep.
Contribution
It introduces a theoretical model showing how task-optimized noisy recurrent networks naturally develop reactivation dynamics, supported by numerical experiments.
Findings
Neural networks can spontaneously revisit online states during offline periods.
Denoising dynamics emerge in networks optimized for environmental tracking.
Theoretical results are validated with neuroscience-inspired tasks.
Abstract
During periods of quiescence, such as sleep, neural activity in many brain circuits resembles that observed during periods of task engagement. However, the precise conditions under which task-optimized networks can autonomously reactivate the same network states responsible for online behavior is poorly understood. In this study, we develop a mathematical framework that outlines sufficient conditions for the emergence of neural reactivation in circuits that encode features of smoothly varying stimuli. We demonstrate mathematically that noisy recurrent networks optimized to track environmental state variables using change-based sensory information naturally develop denoising dynamics, which, in the absence of input, cause the network to revisit state configurations observed during periods of online activity. We validate our findings using numerical experiments on two canonical…
Peer Reviews
Decision·ICLR 2024 poster
— Understanding the neural mechanisms underlying reactions/replays is an important question. -- The paper seeks to analyze RNNs trained to perform a class of tasks theoretically, which is a somewhat rare excise in this literature. — The mathematical analysis of the loss function used in the training and the connection to the Langevin dynamics, while under somewhat strong assumptions, remain interesting.
1. A number of relevant studied were not cited and discussed. This include work using RNNs and attractor dynamics to model the replay and theta sequences in the hippocampus, e.g., Hopfield, John J. "Neurodynamics of mental exploration." Proceedings of the National Academy of Sciences 107.4 (2010): 1648-1653. Kang, Louis, and Michael R. DeWeese. "Replay as wavefronts and theta sequences as bump oscillations in a grid cell attractor network." Elife 8 (2019): e46351. Chu, Tianhao, et al. "Firing ra
- the paper is generally well written - the mathematical theory is well presented and interesting (as far as I'm aware it is novel, but I am not certain of this) - offline reactivation should be of interest both to machine learning and neuroscience researchers; the paper does make some theoretical and experimental headway into why/how it occurs
- perhaps this is not in the scope of the paper, but the authors do not provide any theoretical/numerical support for the functional benefit of offline reactivation. The authors mention other works which demonstrate that it may aid the formation of long-term memories, schema, planning. Do the theoretical/numerical results in this paper support these possibilities, or any one in particular? - Relatedly, a key feature of this study is that the RNNs considered are noisy during awake states. It's un
This paper does a nice job of outlining how the cost function can be deconstructed into two components, the denoising component and the signal integration component. The idea of replay being a result of signal integration in the presence of noise is an interesting one, and the authors present a simple mechanism whereby this can arise. This is a contribution to an important question in neuroscience. The theoretical approach, results, and the numerical experiments are well described and supporte
The paper is potentially confusing to some readers in that the presentation (and the numerical experiments if I have followed) regard training a network with a fixed nonlinearity and trainable weight parameters (i.e. a typical RNN) but the theoretical analysis considers not a fixed nonlinearity, but rather the space of all possible dynamics that could govern the system. I suspect this may misdirect some readers regarding the basic argument (possibly me as well). Training a regular RNN on the
Code & Models
Videos
Taxonomy
TopicsNeural dynamics and brain function · Functional Brain Connectivity Studies · EEG and Brain-Computer Interfaces
