Scalable and Sustainable Deep Learning via Randomized Hashing

Ryan Spring; Anshumali Shrivastava

arXiv:1602.08194·stat.ML·December 6, 2016·1 cites

Scalable and Sustainable Deep Learning via Randomized Hashing

Ryan Spring, Anshumali Shrivastava

PDF

Open Access

TL;DR

This paper introduces a hashing-based method to significantly reduce the computational and energy costs of training and testing deep neural networks by selecting only the most active nodes, maintaining high accuracy.

Contribution

The paper presents a novel hashing technique that enables sparse, efficient deep learning with drastically fewer computations, suitable for parallel and energy-efficient environments.

Findings

01

Uses only 5% of multiplications of traditional models

02

Maintains within 1% accuracy of full models

03

Achieves near-linear speedup with more cores

Abstract

Current deep learning architectures are growing larger in order to learn from complex datasets. These architectures require giant matrix multiplication operations to train millions of parameters. Conversely, there is another growing trend to bring deep learning to low-power, embedded devices. The matrix operations, associated with both training and testing of deep networks, are very expensive from a computational and energy standpoint. We present a novel hashing based technique to drastically reduce the amount of computation needed to train and test deep networks. Our approach combines recent ideas from adaptive dropouts and randomized hashing for maximum inner product search to select the nodes with the highest activation efficiently. Our new algorithm for deep learning reduces the overall computational cost of forward and back-propagation by operating on significantly fewer (sparse)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques