GXNOR-Net: Training deep neural networks with ternary weights and   activations without full-precision memory under a unified discretization   framework

Lei Deng; Peng Jiao; Jing Pei; Zhenzhi Wu; Guoqi Li

arXiv:1705.09283·cs.LG·May 3, 2018·6 cites

GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework

Lei Deng, Peng Jiao, Jing Pei, Zhenzhi Wu, Guoqi Li

PDF

Open Access 1 Repo

TL;DR

GXNOR-Net introduces a unified framework for training deep neural networks with ternary weights and activations, eliminating the need for full-precision memory and enabling efficient, event-driven hardware implementations.

Contribution

The paper proposes a novel discrete training framework with a multi-step discretization and discrete state transition method, unifying binary and ternary networks without full-precision weights.

Findings

01

Achieves higher performance than state-of-the-art algorithms.

02

Enables flexible hardware adaptation through adjustable sparsity and discrete states.

03

Supports event-driven hardware design for mobile intelligence.

Abstract

There is a pressing need to build an architecture that could subsume these networks under a unified framework that achieves both higher performance and less overhead. To this end, two fundamental issues are yet to be addressed. The first one is how to implement the back propagation when neuronal activations are discrete. The second one is how to remove the full-precision hidden weights in the training phase to break the bottlenecks of memory/computation consumption. To address the first issue, we present a multi-step neuronal activation discretization method and a derivative approximation technique that enable the implementing the back propagation algorithm on discrete DNNs. While for the second issue, we propose a discrete state transition (DST) methodology to constrain the weights in a discrete space without saving the hidden weights. Through this way, we build a unified framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AcrossV/Gated-XNOR
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · EEG and Brain-Computer Interfaces