State-Space NTK Collapse Near Bifurcations
James Hazelden, Eric Shea-Brown

TL;DR
This paper develops a local theory of gradient descent near bifurcations in neural networks, showing that bifurcations simplify learning dynamics by dominating the state-space NTK and funneling learning into critical directions.
Contribution
It introduces a procedure to decompose the state-space NTK near bifurcations, revealing a rank-one dominant channel that simplifies the understanding of local learning geometry.
Findings
Near bifurcations, the sNTK reduces to a rank-one operator.
Bifurcations cause a sharp collapse in sNTK effective rank.
Natural gradient methods mitigate learning instability at bifurcations.
Abstract
Rich feature learning in tasks that unfold over time often requires the model to pass through bifurcations, constituting qualitative changes in the underlying model dynamics. We develop a local theory of gradient descent near these transitions through the empirical state-space neural tangent kernel (sNTK). Our central finding is that bifurcations both dominate and simplify learning dynamics: near bifurcations, we can reduce sNTK to a rank-one operator corresponding to learning in a classical normal form system, providing an analytically tractable description of the local learning geometry, even for high-dimensional recurrent systems. Concretely, we give a procedure for decomposing sNTK into bifurcation-relevant and residual channels, showing that near commonly codimension-1 bifurcations the relevant channel is a rank-one operator that is highly amplified. This amplification causes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
