GradSign: Model Performance Inference with Theoretical Insights
Zhihao Zhang, Zhihao Jia

TL;DR
GradSign introduces a theoretically grounded, simple metric for predicting neural network performance at initialization, improving efficiency and accuracy in neural architecture search across multiple benchmarks.
Contribution
The paper proposes GradSign, a new performance inference metric with theoretical guarantees, and demonstrates its effectiveness in enhancing NAS algorithms.
Findings
GradSign outperforms existing gradient-based methods in MPI accuracy.
Integrating GradSign into NAS algorithms improves discovered network performance.
GradSign generalizes well across diverse datasets and network architectures.
Abstract
A key challenge in neural architecture search (NAS) is quickly inferring the predictive performance of a broad spectrum of networks to discover statistically accurate and computationally efficient ones. We refer to this task as model performance inference (MPI). The current practice for efficient MPI is gradient-based methods that leverage the gradients of a network at initialization to infer its performance. However, existing gradient-based methods rely only on heuristic metrics and lack the necessary theoretical foundations to consolidate their designs. We propose GradSign, an accurate, simple, and flexible metric for model performance inference with theoretical insights. The key idea behind GradSign is a quantity {\Psi} to analyze the optimization landscape of different networks at the granularity of individual training samples. Theoretically, we show that both the network's training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
