What's Hidden in a Randomly Weighted Neural Network?

Vivek Ramanujan; Mitchell Wortsman; Aniruddha Kembhavi; Ali Farhadi,; Mohammad Rastegari

arXiv:1911.13299·cs.CV·April 1, 2020

What's Hidden in a Randomly Weighted Neural Network?

Vivek Ramanujan, Mitchell Wortsman, Aniruddha Kembhavi, Ali Farhadi,, Mohammad Rastegari

PDF

4 Repos 1 Video

TL;DR

This paper reveals that randomly weighted neural networks contain untrained subnetworks capable of high performance, and provides an algorithm to find such subnetworks, challenging the traditional view of training as essential for neural network effectiveness.

Contribution

It introduces the concept of untrained subnetworks within randomly weighted neural networks and presents an algorithm to identify them, demonstrating their competitive performance.

Findings

01

Untrained subnetworks can match trained network performance.

02

Larger and deeper random networks contain better untrained subnetworks.

03

Algorithm effectively finds high-performing subnetworks in random networks.

Abstract

Training a neural network is synonymous with learning the values of the weights. By contrast, we demonstrate that randomly weighted neural networks contain subnetworks which achieve impressive performance without ever training the weight values. Hidden in a randomly weighted Wide ResNet-50 we show that there is a subnetwork (with random weights) that is smaller than, but matches the performance of a ResNet-34 trained on ImageNet. Not only do these "untrained subnetworks" exist, but we provide an algorithm to effectively find them. We empirically show that as randomly weighted neural networks with fixed weights grow wider and deeper, an "untrained subnetwork" approaches a network with learned weights in accuracy. Our code and pretrained models are available at https://github.com/allenai/hidden-networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

What’s Hidden in a Randomly Weighted Neural Network?· youtube

Taxonomy

MethodsAverage Pooling · Dropout · Weight Decay · SGD with Momentum · Cosine Annealing · Adam · Wide Residual Block · WideResNet · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia?