Progressive Stochastic Binarization of Deep Networks

David Hartmann; Michael Wand

arXiv:1904.02205·cs.LG·April 5, 2019

Progressive Stochastic Binarization of Deep Networks

David Hartmann, Michael Wand

PDF

1 Repo

TL;DR

This paper introduces a progressive stochastic binarization method for deep networks that enables efficient, adaptive, and unbiased low-precision inference, achieving accuracy close to full-precision models with reduced computational costs.

Contribution

It presents a novel progressive stochastic binarization scheme allowing adaptive accuracy control and efficient inference, outperforming previous binarization methods in flexibility and cost-effectiveness.

Findings

01

Achieves near-original accuracy on ImageNet with low representational costs.

02

Reduces inference costs by up to 33% through adaptive sampling.

03

Compatible with pretrained networks, including pruned models.

Abstract

A plethora of recent research has focused on improving the memory footprint and inference speed of deep networks by reducing the complexity of (i) numerical representations (for example, by deterministic or stochastic quantization) and (ii) arithmetic operations (for example, by binarization of weights). We propose a stochastic binarization scheme for deep networks that allows for efficient inference on hardware by restricting itself to additions of small integers and fixed shifts. Unlike previous approaches, the underlying randomized approximation is progressive, thus permitting an adaptive control of the accuracy of each operation at run-time. In a low-precision setting, we match the accuracy of previous binarized approaches. Our representation is unbiased - it approaches continuous computation with increasing sample size. In a high-precision regime, the computational costs are…

Figures40

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

JGU-VC/progressive_stochastic_binarization
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings

Full text

Progressive Stochastic Binarization of Deep Networks

David Hartmann

Institute of Computer Science

Johannes Gutenberg-University of Mainz

Staudingerweg 9, 55128 Mainz, Germany

[email protected]

&Michael Wand

Institute of Computer Science

Johannes Gutenberg-University of Mainz

Staudingerweg 9, 55128 Mainz, Germany

[email protected]

\ExecuteMetaData

[sections/structure.tex]

Supplementary Material for:

Progressive Stochastic Binarization of Deep Networks

\ExecuteMetaData

[sections_supplementary/structure.tex]