Low-Rank Approximations for Conditional Feedforward Computation in Deep   Neural Networks

Andrew Davis; Itamar Arel

arXiv:1312.4461·cs.LG·January 30, 2014·ICLR·49 cites

Low-Rank Approximations for Conditional Feedforward Computation in Deep Neural Networks

Andrew Davis, Itamar Arel

PDF

Open Access

TL;DR

This paper proposes a low-rank approximation method for conditional computation in deep neural networks, enabling faster inference by skipping units likely to be inactive, demonstrated on MNIST and SVHN datasets.

Contribution

It introduces a low-rank factorization approach to estimate node activation signs, facilitating efficient conditional computation in deep networks.

Findings

01

Sign estimation reduces unnecessary computations in ReLU networks.

02

The method maintains performance robustness despite approximation errors.

03

Sign estimation leads to significant speed gains in sparse neural networks.

Abstract

Scalability properties of deep neural networks raise key research questions, particularly as the problems considered become larger and more challenging. This paper expands on the idea of conditional computation introduced by Bengio, et. al., where the nodes of a deep network are augmented by a set of gating units that determine when a node should be calculated. By factorizing the weight matrix into a low-rank approximation, an estimation of the sign of the pre-nonlinearity activation can be efficiently obtained. For networks using rectified-linear hidden units, this implies that the computation of a hidden unit with an estimated negative pre-nonlinearity can be ommitted altogether, as its value will become zero when nonlinearity is applied. For sparse neural networks, this can result in considerable speed gains. Experimental results using the MNIST and SVHN data sets with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Neural Networks and Applications · Advanced Neural Network Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings