Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization

Siyuan Qiao; Zhe Lin; Jianming Zhang; Alan Yuille

arXiv:1812.00481·cs.CV·December 4, 2018·1 cites

Open Access 1 Repo

TL;DR

This paper introduces Neural Rejuvenation, an optimization technique that detects dead neurons during training and rejuvenates them through resource reallocation and reinitialization, improving performance without increasing resource usage.

Contribution

The paper proposes Neural Rejuvenation, a new method for detecting and revitalizing dead neurons to better utilize computational resources during training.

Findings

01. Significant performance improvements across various neural networks.
02. Effective detection and rejuvenation of dead neurons in real time.
03. Similar resource usage is maintained while accuracy improves.

Abstract

In this paper, we study the problem of improving the computational resource utilization of neural networks. Deep neural networks are usually over-parameterized for their tasks in order to achieve good performance, and are thus likely to have underutilized computational resources. This observation motivates many research topics, e.g., network pruning and architecture search. As models with higher computational costs (e.g., more parameters or more computations) usually perform better, we study the problem of improving the resource utilization of neural networks so that their potential can be more fully realized. To this end, we propose a novel optimization method named Neural Rejuvenation. As its name suggests, our method detects dead neurons and computes resource utilization in real time, rejuvenates dead neurons by resource reallocation and reinitialization, and trains them with…
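The detect, measure, and rejuvenate loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract excerpt does not say how a dead neuron is identified, so the sketch assumes each neuron has a learnable scale and treats a neuron as dead when that scale's magnitude falls below a fraction of the layer's largest scale. The function names (`find_dead`, `utilization`, `rejuvenate`), the `rel_threshold` and `init_scale` parameters, and the Gaussian reinitialization are all hypothetical, and the resource reallocation step is left out.

```python
import random

def find_dead(scales, rel_threshold=0.01):
    """Indices of neurons whose |scale| is below rel_threshold * max |scale|.

    Assumption: a per-neuron scale is the liveness signal; the source
    abstract does not specify the criterion.
    """
    peak = max(abs(s) for s in scales)
    return [i for i, s in enumerate(scales) if abs(s) < rel_threshold * peak]

def utilization(scales, rel_threshold=0.01):
    """Fraction of neurons in the layer currently counted as alive."""
    return 1.0 - len(find_dead(scales, rel_threshold)) / len(scales)

def rejuvenate(scales, weights, rel_threshold=0.01, init_scale=0.5, seed=0):
    """Reinitialize dead neurons in place and return their indices.

    Assumption: dead neurons get a fresh scale and small Gaussian weights;
    live neurons are left untouched.
    """
    rng = random.Random(seed)
    dead = find_dead(scales, rel_threshold)
    for i in dead:
        scales[i] = init_scale
        weights[i] = [rng.gauss(0.0, 0.1) for _ in weights[i]]
    return dead

# Toy layer with four neurons, two of which have collapsed scales.
scales = [0.9, 0.0005, 0.4, 0.0]
weights = [[0.1, 0.2], [0.0, 0.0], [0.3, -0.1], [0.0, 0.0]]

before = utilization(scales)
reset = rejuvenate(scales, weights)
after = utilization(scales)
print(before, reset, after)
```

In this toy layer, half of the neurons start out below the threshold. After the reset, every neuron counts as alive again while the layer keeps the same number of neurons, which is the sense in which utilization rises without added resources.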

Code & Models

Repositories

joe-siyuan-qiao/NeuralRejuvenation-CVPR19 (PyTorch)

Taxonomy

Topics: Advanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning