Minimal Effort Back Propagation for Convolutional Neural Networks

Bingzhen Wei; Xu Sun; Xuancheng Ren; Jingjing Xu

arXiv:1709.05804·cs.LG·September 27, 2017·20 cites

Minimal Effort Back Propagation for Convolutional Neural Networks

Bingzhen Wei, Xu Sun, Xuancheng Ren, Jingjing Xu

PDF

Open Access

TL;DR

This paper extends a gradient-sampling technique to CNNs, showing that passing back only 5% of gradients can achieve comparable or better performance, significantly reducing computational costs.

Contribution

The paper introduces a minimal effort backpropagation method for CNNs, demonstrating effective gradient reduction without sacrificing accuracy.

Findings

01

Passing back 5% of gradients maintains model performance

02

Gradient top-k selection induces sparse calculations

03

Significant computational savings in CNN backpropagation

Abstract

As traditional neural network consumes a significant amount of computing resources during back propagation, \citet{Sun2017mePropSB} propose a simple yet effective technique to alleviate this problem. In this technique, only a small subset of the full gradients are computed to update the model parameters. In this paper we extend this technique into the Convolutional Neural Network(CNN) to reduce calculation in back propagation, and the surprising results verify its validity in CNN: only 5\% of the gradients are passed back but the model still achieves the same effect as the traditional CNN, or even better. We also show that the top- $k$ selection of gradients leads to a sparse calculation in back propagation, which may bring significant computational benefits for high computational complexity of convolution operation in CNN.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition

MethodsConvolution