Reactivation: Empirical NTK Dynamics Under Task Shifts
Yuzhi Liu, Zixuan Chen, Zirui Zhang, Yufei Liu, Giulia Lanzillotta

TL;DR
This paper empirically investigates how Neural Tangent Kernel (NTK) dynamics evolve during continual learning with task shifts, revealing limitations of static-kernel assumptions and emphasizing the importance of NTK evolution in understanding neural network adaptation.
Contribution
It provides the first comprehensive empirical analysis of NTK dynamics under task shifts in continual learning, challenging existing static-kernel theoretical models.
Findings
NTK dynamics are significantly affected by task shifts in continual learning.
Static-kernel approximations often fail to capture neural network behavior during task transitions.
Continual learning offers a valuable framework for studying neural training dynamics.
Abstract
The Neural Tangent Kernel (NTK) offers a powerful tool to study the functional dynamics of neural networks. In the so-called lazy, or kernel regime, the NTK remains static during training and the network function is linear in the static neural tangents feature space. The evolution of the NTK during training is necessary for feature learning, a key driver of deep learning success. The study of the NTK dynamics has led to several critical discoveries in recent years, in generalization and scaling behaviours. However, this body of work has been limited to the single task setting, where the data distribution is assumed constant over time. In this work, we present a comprehensive empirical analysis of NTK dynamics in continual learning, where the data distribution shifts over time. Our findings highlight continual learning as a rich and underutilized testbed for probing the dynamics of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques · Neural Networks and Reservoir Computing
