Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics
Chunheng Jiang, Tejaswini Pedapati, Pin-Yu Chen, Yizhou Sun, Jianxi, Gao

TL;DR
This paper introduces a neural capacitance metric derived from edge dynamics during training, enabling efficient early prediction of a neural network's generalization ability for downstream tasks, reducing the need for extensive training.
Contribution
The paper proposes a novel framework that models neural network training as edge dynamics and introduces a neural capacitance metric for early performance prediction.
Findings
Neural capacitance correlates strongly with final model performance.
The method outperforms existing early prediction techniques.
Applicable to various models and datasets with high efficiency.
Abstract
Efficient model selection for identifying a suitable pre-trained neural network to a downstream task is a fundamental yet challenging task in deep learning. Current practice requires expensive computational costs in model training for performance prediction. In this paper, we propose a novel framework for neural network selection by analyzing the governing dynamics over synaptic connections (edges) during training. Our framework is built on the fact that back-propagation during neural network training is equivalent to the dynamical evolution of synaptic connections. Therefore, a converged neural network is associated with an equilibrium state of a networked system composed of those edges. To this end, we construct a network mapping , converting a neural network to a directed line graph that is defined on those edges in . Next, we derive a neural capacitance metric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques
