GradSign: Model Performance Inference with Theoretical Insights

Zhihao Zhang; Zhihao Jia

arXiv:2110.08616·cs.LG·June 22, 2022·6 cites

GradSign: Model Performance Inference with Theoretical Insights

Zhihao Zhang, Zhihao Jia

PDF

Open Access 1 Repo 1 Video

TL;DR

GradSign introduces a theoretically grounded, simple metric for predicting neural network performance at initialization, improving efficiency and accuracy in neural architecture search across multiple benchmarks.

Contribution

The paper proposes GradSign, a new performance inference metric with theoretical guarantees, and demonstrates its effectiveness in enhancing NAS algorithms.

Findings

01

GradSign outperforms existing gradient-based methods in MPI accuracy.

02

Integrating GradSign into NAS algorithms improves discovered network performance.

03

GradSign generalizes well across diverse datasets and network architectures.

Abstract

A key challenge in neural architecture search (NAS) is quickly inferring the predictive performance of a broad spectrum of networks to discover statistically accurate and computationally efficient ones. We refer to this task as model performance inference (MPI). The current practice for efficient MPI is gradient-based methods that leverage the gradients of a network at initialization to infer its performance. However, existing gradient-based methods rely only on heuristic metrics and lack the necessary theoretical foundations to consolidate their designs. We propose GradSign, an accurate, simple, and flexible metric for model performance inference with theoretical insights. The key idea behind GradSign is a quantity {\Psi} to analyze the optimization landscape of different networks at the granularity of individual training samples. Theoretically, we show that both the network's training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cmu-catalyst/gradsign
pytorchOfficial

Videos

GradSign: Model Performance Inference with Theoretical Insights· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning