Embedded Binarized Neural Networks

Bradley McDanel; Surat Teerapittayanon; H.T. Kung

arXiv:1709.02260·cs.CV·September 8, 2017·76 cites

Embedded Binarized Neural Networks

Bradley McDanel, Surat Teerapittayanon, H.T. Kung

PDF

Open Access 2 Repos

TL;DR

This paper introduces embedded Binarized Neural Networks (eBNNs), a memory-efficient approach enabling fast inference on tiny embedded devices by reordering computations and minimizing temporary storage.

Contribution

The paper presents a novel method to reduce memory usage in Binarized Neural Networks for embedded devices by reordering inference and using a single binary temporary, enabling efficient deployment.

Findings

01

eBNN achieves 95% accuracy on MNIST with only 15 KB memory.

02

eBNN reduces temporary storage by 32x compared to traditional BNNs.

03

Inference runtime is under 50 ms on a device with 15 KB memory.

Abstract

We study embedded Binarized Neural Networks (eBNNs) with the aim of allowing current binarized neural networks (BNNs) in the literature to perform feedforward inference efficiently on small embedded devices. We focus on minimizing the required memory footprint, given that these devices often have memory as small as tens of kilobytes (KB). Beyond minimizing the memory required to store weights, as in a BNN, we show that it is essential to minimize the memory used for temporaries which hold intermediate results between layers in feedforward inference. To accomplish this, eBNN reorders the computation of inference while preserving the original BNN structure, and uses just a single floating-point temporary for the entire neural network. All intermediate results from a layer are stored as binary values, as opposed to floating-points used in current BNN implementations, leading to a 32x…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Machine Learning and ELM