Two-Bit Networks for Deep Learning on Resource-Constrained Embedded   Devices

Wenjia Meng; Zonghua Gu; Ming Zhang; Zhaohui Wu

arXiv:1701.00485·cs.LG·January 5, 2017·29 cites

Two-Bit Networks for Deep Learning on Resource-Constrained Embedded Devices

Wenjia Meng, Zonghua Gu, Ming Zhang, Zhaohui Wu

PDF

Open Access

TL;DR

This paper introduces Two-Bit Networks (TBNs), a model compression technique for CNNs that enables efficient deployment on resource-limited embedded devices by using weights encoded with only two bits.

Contribution

The paper proposes a novel two-bit weight encoding scheme for CNNs, significantly reducing memory and computation requirements while maintaining accuracy.

Findings

01

Reduces memory usage substantially

02

Improves computational efficiency

03

Maintains competitive classification accuracy

Abstract

With the rapid proliferation of Internet of Things and intelligent edge devices, there is an increasing need for implementing machine learning algorithms, including deep learning, on resource-constrained mobile embedded devices with limited memory and computation power. Typical large Convolutional Neural Networks (CNNs) need large amounts of memory and computational power, and cannot be deployed on embedded devices efficiently. We present Two-Bit Networks (TBNs) for model compression of CNNs with edge weights constrained to (-2, -1, 1, 2), which can be encoded with two bits. Our approach can reduce the memory usage and improve computational efficiency significantly while achieving good performance in terms of classification accuracy, thus representing a reasonable tradeoff between model size and performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Machine Learning and ELM