Enabling Binary Neural Network Training on the Edge
Erwei Wang, James J. Davis, Daniele Moro, Piotr Zielinski, Jia Jie, Lim, Claudionor Coelho, Satrajit Chatterjee, Peter Y. K. Cheung, George A., Constantinides

TL;DR
This paper introduces a memory-efficient method for training binary neural networks directly on edge devices, enabling on-device learning with minimal accuracy loss and significant memory and energy savings.
Contribution
The authors demonstrate that backward propagation in binary neural networks is robust to quantization and propose a low-cost training strategy that reduces memory usage by 3-5 times without accuracy loss.
Findings
Memory requirement reductions of 3-5× compared to standard methods
Achieved similar accuracy with significantly less memory on small models
Successfully trained binarized ResNet-18 on ImageNet with 3.78× memory reduction
Abstract
The ever-growing computational demands of increasingly complex machine learning models frequently necessitate the use of powerful cloud-based infrastructure for their training. Binary neural networks are known to be promising candidates for on-device inference due to their extreme compute and memory savings over higher-precision alternatives. However, their existing training methods require the concurrent storage of high-precision activations for all layers, generally making learning on memory-constrained devices infeasible. In this article, we demonstrate that the backward propagation operations needed for binary neural network training are strongly robust to quantization, thereby making on-the-edge learning with modern models a practical proposition. We introduce a low-cost binary neural network training strategy exhibiting sizable memory footprint reductions while inducing little to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · COVID-19 diagnosis using AI
