Classifying the classifier: dissecting the weight space of neural   networks

Gabriel Eilertsen; Daniel J\"onsson; Timo Ropinski; Jonas Unger,; Anders Ynnerman

arXiv:2002.05688·cs.CV·February 14, 2020·6 cites

Classifying the classifier: dissecting the weight space of neural networks

Gabriel Eilertsen, Daniel J\"onsson, Timo Ropinski, Jonas Unger,, Anders Ynnerman

PDF

Open Access 1 Repo

TL;DR

This study explores the high-dimensional weight space of neural networks by training meta-classifiers to identify training setup properties from weight patterns, offering new insights into model interpretability and a large dataset for future research.

Contribution

It introduces a novel approach using deep meta-classifiers to analyze neural weight space and provides a large dataset for further exploration.

Findings

01

Meta-classifiers can identify training hyper-parameters from weight patterns.

02

Weight space contains discernible signatures of training variations.

03

The approach enhances explainability of neural network training processes.

Abstract

This paper presents an empirical study on the weights of neural networks, where we interpret each model as a point in a high-dimensional space -- the neural weight space. To explore the complex structure of this space, we sample from a diverse selection of training variations (dataset, optimization procedure, architecture, etc.) of neural network classifiers, and train a large number of models to represent the weight space. Then, we use a machine learning approach for analyzing and extracting information from this space. Most centrally, we train a number of novel deep meta-classifiers with the objective of classifying different properties of the training setup by identifying their footprints in the weight space. Thus, the meta-classifiers probe for patterns induced by hyper-parameters, so that we can quantify how much, where, and when these are encoded through the optimization process.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gabrieleilertsen/nws
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications