The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
Hoang Pham, The-Anh Ta, Tom Jacobs, Rebekka Burkholz, Long Tran-Thanh

TL;DR
This paper introduces a novel theoretical framework using graphons to analyze the structure and training dynamics of sparse neural networks in the infinite-width limit, providing insights into why certain pruning methods yield more trainable models.
Contribution
It develops a graphon-based theoretical framework and a Graphon NTK to analyze sparse neural networks, linking connectivity patterns to trainability and convergence behaviors.
Findings
Graphon convergence characterizes pruning-induced connectivity patterns.
Spectral analysis of Graphon NTK correlates with training dynamics.
Framework explains differences in trainability among pruning methods.
Abstract
Sparse neural networks promise efficiency, yet training them effectively remains a fundamental challenge. Despite advances in pruning methods that create sparse architectures, understanding why some sparse structures are better trainable than others with the same level of sparsity remains poorly understood. Aiming to develop a systematic approach to this fundamental problem, we propose a novel theoretical framework based on the theory of graph limits, particularly graphons, that characterizes sparse neural networks in the infinite-width regime. Our key insight is that connectivity patterns of sparse neural networks induced by pruning methods converge to specific graphons as networks' width tends to infinity, which encodes implicit structural biases of different pruning methods. We postulate the Graphon Limit Hypothesis and provide empirical evidence to support it. Leveraging this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
