Neural Capacitance: A New Perspective of Neural Network Selection via   Edge Dynamics

Chunheng Jiang; Tejaswini Pedapati; Pin-Yu Chen; Yizhou Sun; Jianxi; Gao

arXiv:2201.04194·cs.LG·January 19, 2022

Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics

Chunheng Jiang, Tejaswini Pedapati, Pin-Yu Chen, Yizhou Sun, Jianxi, Gao

PDF

Open Access

TL;DR

This paper introduces a neural capacitance metric derived from edge dynamics during training, enabling efficient early prediction of a neural network's generalization ability for downstream tasks, reducing the need for extensive training.

Contribution

The paper proposes a novel framework that models neural network training as edge dynamics and introduces a neural capacitance metric for early performance prediction.

Findings

01

Neural capacitance correlates strongly with final model performance.

02

The method outperforms existing early prediction techniques.

03

Applicable to various models and datasets with high efficiency.

Abstract

Efficient model selection for identifying a suitable pre-trained neural network to a downstream task is a fundamental yet challenging task in deep learning. Current practice requires expensive computational costs in model training for performance prediction. In this paper, we propose a novel framework for neural network selection by analyzing the governing dynamics over synaptic connections (edges) during training. Our framework is built on the fact that back-propagation during neural network training is equivalent to the dynamical evolution of synaptic connections. Therefore, a converged neural network is associated with an equilibrium state of a networked system composed of those edges. To this end, we construct a network mapping $ϕ$ , converting a neural network $G_{A}$ to a directed line graph $G_{B}$ that is defined on those edges in $G_{A}$ . Next, we derive a neural capacitance metric…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Memory and Neural Computing · Stochastic Gradient Optimization Techniques