Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks

Daniel Kunin; Giovanni Luca Marchetti; Feng Chen; Dhruva Karkada; James B. Simon; Michael R. DeWeese; Surya Ganguli; Nina Miolane

arXiv:2506.06489·cs.LG·December 25, 2025

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks

Daniel Kunin, Giovanni Luca Marchetti, Feng Chen, Dhruva Karkada, James B. Simon, Michael R. DeWeese, Surya Ganguli, Nina Miolane

PDF

1 Video

TL;DR

This paper introduces Alternating Gradient Flows (AGF), a new framework that models how two-layer neural networks learn features during training, capturing dynamics like neuron activation and feature acquisition.

Contribution

AGF provides a novel theoretical framework that approximates feature learning dynamics in two-layer networks, extending existing analyses and offering new insights into training behavior.

Findings

01

AGF matches experimental observations of loss plateaus and drops.

02

In linear networks, AGF converges to gradient flow as initialization vanishes.

03

In quadratic networks, AGF reveals Fourier features are learned in decreasing order.

Abstract

What features neural networks learn, and how, remains an open question. In this paper, we introduce Alternating Gradient Flows (AGF), an algorithmic framework that describes the dynamics of feature learning in two-layer networks trained from small initialization. Prior works have shown that gradient flow in this regime exhibits a staircase-like loss curve, alternating between plateaus where neurons slowly align to useful directions and sharp drops where neurons rapidly grow in norm. AGF approximates this behavior as an alternating two-step process: maximizing a utility function over dormant neurons and minimizing a cost function over active ones. AGF begins with all neurons dormant. At each iteration, a dormant neuron activates, triggering the acquisition of a feature and a drop in the loss. AGF quantifies the order, timing, and magnitude of these drops, matching experiments across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks· slideslive