Training highly effective connectivities within neural networks with   randomly initialized, fixed weights

Cristian Ivan; Razvan Florian

arXiv:2006.16627·cs.LG·November 18, 2020·1 cites

Training highly effective connectivities within neural networks with randomly initialized, fixed weights

Cristian Ivan, Razvan Florian

PDF

Open Access 2 Repos

TL;DR

This paper introduces simple, hyperparameter-free methods for training neural network connectivities with fixed, randomly initialized weights, achieving high performance with minimal connection modifications and shedding light on network over-parameterization.

Contribution

The authors propose novel methods for training neural connectivities without weight updates, eliminating hyperparameter tuning and reducing the effective size of networks.

Findings

01

Achieve over 90% of standard accuracy by flipping less than 10% of connections.

02

Match or surpass performance of trained weights with fixed, random weights.

03

Effective training of connectivities even with constant or asymmetric weight distributions.

Abstract

We present some novel, straightforward methods for training the connection graph of a randomly initialized neural network without training the weights. These methods do not use hyperparameters defining cutoff thresholds and therefore remove the need for iteratively searching optimal values of such hyperparameters. We can achieve similar or higher performances than in the case of training all weights, with a similar computational cost as for standard training techniques. Besides switching connections on and off, we introduce a novel way of training a network by flipping the signs of the weights. If we try to minimize the number of changed connections, by changing less than 10% of the total it is already possible to reach more than 90% of the accuracy achieved by standard training. We obtain good results even with weights of constant magnitude or even when weights are drawn from highly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning

MethodsAverage Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Residual Connection · Global Average Pooling · Bottleneck Residual Block · Residual Block · Kaiming Initialization · Bitcoin Customer Service Number +1-833-534-1729