How Graph Neural Networks Learn: Lessons from Training Dynamics
Chenxiao Yang, Qitian Wu, David Wipf, Ruoyu Sun, Junchi Yan

TL;DR
This paper investigates the training dynamics of graph neural networks, revealing how they implicitly leverage graph structure during learning, and proposes a simple, fast algorithm that rivals GNN performance.
Contribution
It introduces the concept of kernel-graph alignment, provides theoretical insights into GNN learning, and proposes a parameter-free, efficient update method.
Findings
Kernel-graph alignment explains GNN learning behavior.
The proposed sparse matrix update matches GNN accuracy.
The method is significantly faster than traditional GNN training.
Abstract
A long-standing goal in deep learning has been to characterize the learning behavior of black-box models in a more interpretable manner. For graph neural networks (GNNs), considerable advances have been made in formalizing what functions they can represent, but whether GNNs will learn desired functions during the optimization process remains less clear. To fill this gap, we study their training dynamics in function space. In particular, we find that the gradient descent optimization of GNNs implicitly leverages the graph structure to update the learned function, as can be quantified by a phenomenon which we call \emph{kernel-graph alignment}. We provide theoretical explanations for the emergence of this phenomenon in the overparameterized regime and empirically validate it on real-world GNNs. This finding offers new interpretable insights into when and why the learned GNN functions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Advanced Memory and Neural Computing
