WRPN: Wide Reduced-Precision Networks

Asit Mishra; Eriko Nurvitadhi; Jeffrey J Cook; Debbie Marr

arXiv:1709.01134·cs.CV·September 6, 2017·182 cites

WRPN: Wide Reduced-Precision Networks

Asit Mishra, Eriko Nurvitadhi, Jeffrey J Cook, Debbie Marr

PDF

Open Access

TL;DR

WRPN introduces a method to train neural networks with reduced-precision activations and increased filter maps, achieving comparable or better accuracy than full-precision models while improving computational efficiency.

Contribution

The paper proposes a novel scheme to use reduced-precision activations combined with wider networks, maintaining accuracy and enhancing efficiency in deep neural networks.

Findings

01

WRPN matches or surpasses baseline accuracy on ILSVRC-12.

02

The scheme reduces memory footprint and computational energy.

03

WRPN outperforms previous reduced-precision networks in accuracy and efficiency.

Abstract

For computer vision applications, prior works have shown the efficacy of reducing numeric precision of model parameters (network weights) in deep neural networks. Activation maps, however, occupy a large memory footprint during both the training and inference step when using mini-batches of inputs. One way to reduce this large memory footprint is to reduce the precision of activations. However, past works have shown that reducing the precision of activations hurts model accuracy. We study schemes to train networks from scratch using reduced-precision activations without hurting accuracy. We reduce the precision of activation maps (along with model parameters) and increase the number of filter maps in a layer, and find that this scheme matches or surpasses the accuracy of the baseline full-precision network. As a result, one can significantly improve the execution efficiency (e.g. reduce…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · IoT and Edge/Fog Computing · Anomaly Detection Techniques and Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings