Predicting Neural Network Accuracy from Weights

Thomas Unterthiner; Daniel Keysers; Sylvain Gelly; Olivier Bousquet; Ilya Tolstikhin

arXiv:2002.11448 · stat.ML · April 12, 2021 · 38 cites

TL;DR

This paper shows experimentally that the accuracy of a trained neural network can be predicted from its weights alone, so networks can be ranked by performance without evaluating them on input data. The authors present this as a step toward better understanding network training and generalization.

Contribution

It introduces a formal setting for predicting neural network accuracy from weights and shows high prediction accuracy across different datasets and architectures.

Findings

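01 Predictors built on simple weight statistics rank networks by performance with an R2 score above 0.98

02 Predictors can rank networks trained on different, unobserved datasets and with different architectures

03 A collection of 120k convolutional neural networks trained on four different datasets is released for further research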

Abstract

We show experimentally that the accuracy of a trained neural network can be predicted surprisingly well by looking only at its weights, without evaluating it on input data. We motivate this task and introduce a formal setting for it. Even when using simple statistics of the weights, the predictors are able to rank neural networks by their performance with very high accuracy (R2 score more than 0.98). Furthermore, the predictors are able to rank networks trained on different, unobserved datasets and with different architectures. We release a collection of 120k convolutional neural networks trained on four different datasets to encourage further research in this area, with the goal of understanding network training and performance better.
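As a rough illustration of the idea in the abstract, the sketch below turns each network's weights into a small vector of simple per-layer statistics, fits a predictor on those vectors, and scores it with R2. Everything specific here is an assumption made for illustration: the function names (`weight_statistics`, `fit_linear_predictor`, `predict`, `r2_score`), the choice of mean, variance, and five percentiles as the statistics, and the plain least-squares fit standing in for whatever predictor the authors trained. It is not the paper's implementation.

```python
import numpy as np


def weight_statistics(layers):
    """Concatenate simple per-layer statistics into one feature vector.

    `layers` is a list of NumPy arrays, one per layer (a hypothetical
    input format chosen for this sketch). The statistics used here
    (mean, variance, five percentiles) are an illustrative assumption.
    """
    feats = []
    for w in layers:
        flat = np.asarray(w, dtype=np.float64).ravel()
        feats.extend([flat.mean(), flat.var()])
        feats.extend(np.percentile(flat, [0, 25, 50, 75, 100]))
    return np.array(feats)


def fit_linear_predictor(features, accuracies):
    """Least-squares fit of accuracy on weight statistics, with a bias term."""
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    coef, *_ = np.linalg.lstsq(X, accuracies, rcond=None)
    return coef


def predict(coef, features):
    """Apply a predictor returned by fit_linear_predictor."""
    X = np.hstack([features, np.ones((features.shape[0], 1))])
    return X @ coef


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_true = np.asarray(y_true, dtype=np.float64)
    y_pred = np.asarray(y_pred, dtype=np.float64)
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    return 1.0 - ss_res / ss_tot
```

In use, one would stack `weight_statistics(net)` for every network in a training collection into a feature matrix, call `fit_linear_predictor` with the networks' measured accuracies, and report `r2_score` on held-out networks. Per-layer statistics keep the feature vector the same length for any layer width, though not for a different number of layers.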

Code & Models

Repositories

mostafaelaraby/generalization-gap-features-tensorflow (tf)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Machine Learning and Data Classification · Advanced Neural Network Applications