MLDS: A Dataset for Weight-Space Analysis of Neural Networks

John Clemens

arXiv:2104.10555 · cs.LG · April 22, 2021

TL;DR

This paper introduces MLDS, a dataset of thousands of trained neural networks designed to facilitate weight-space analysis, revealing insights into model relationships and data influence beyond traditional loss metrics.

Contribution

The paper presents MLDS, a large, controlled dataset of neural networks enabling direct weight-space analysis for better evaluation and understanding of neural models.

Findings

01. Models cluster in weight-space when trained on identical data

02. Small data changes cause significant divergence in weight-space

03. Weight-space analysis can complement or surpass loss-based evaluation
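
To make the first two findings concrete, here is a minimal sketch of one way to compare models in weight-space: flatten each model's weights into a single vector, then compare the mean Euclidean distance within a group of models to the mean distance between groups. This is an illustration only, not the paper's method. The function names are hypothetical, the choice of Euclidean distance is an assumption, and the "models" are synthetic vectors (a shared center plus Gaussian noise) standing in for trained networks rather than anything drawn from MLDS.

```python
import math
import random
from itertools import combinations


def flatten(layers):
    """Concatenate per-layer weight lists into one flat weight vector."""
    return [w for layer in layers for w in layer]


def euclidean(a, b):
    """Euclidean (L2) distance between two equal-length weight vectors."""
    if len(a) != len(b):
        raise ValueError("weight vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean_pairwise(vectors_a, vectors_b=None):
    """Mean distance within one group, or between two groups if both are given."""
    if vectors_b is None:
        pairs = list(combinations(vectors_a, 2))
    else:
        pairs = [(a, b) for a in vectors_a for b in vectors_b]
    return sum(euclidean(a, b) for a, b in pairs) / len(pairs)


def synthetic_models(center, n_models, noise, rng):
    """Hypothetical stand-in for trained models: a shared center plus noise."""
    return [[c + rng.gauss(0.0, noise) for c in center] for _ in range(n_models)]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
dim = 50                # assumed number of weights per synthetic model

# Two synthetic groups: group B's center is shifted away from group A's,
# mimicking models trained on slightly different data (an assumption).
center_a = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
center_b = [c + 0.5 for c in center_a]
group_a = synthetic_models(center_a, 5, 0.05, rng)
group_b = synthetic_models(center_b, 5, 0.05, rng)

within_a = mean_pairwise(group_a)
between = mean_pairwise(group_a, group_b)
print(f"mean distance within group A: {within_a:.3f}")
print(f"mean distance between A and B: {between:.3f}")
```

With real networks, `flatten` would be applied to each model's per-layer weights before calling `mean_pairwise`. A within-group distance that is clearly smaller than the between-group distance is the kind of clustering the findings describe. Whether Euclidean distance is the right metric for a given architecture is left open here.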

Abstract

Neural networks are powerful models that solve a variety of complex real-world problems. However, the stochastic nature of training and the large number of parameters in a typical neural model make them difficult to evaluate via inspection. Research shows this opacity can hide latent undesirable behavior, be it from poorly representative training data or via malicious intent to subvert the behavior of the network, and that this behavior is difficult to detect via traditional indirect evaluation criteria such as loss. Therefore, it is time to explore direct ways to evaluate a trained neural model via its structure and weights. In this paper we present MLDS, a new dataset consisting of thousands of trained neural networks with carefully controlled parameters and generated via a global volunteer-based distributed computing platform. This dataset enables new insights into both model-to-model…
