GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng, Peng Jiao, Jing Pei, Zhenzhi Wu, Guoqi Li

TL;DR
GXNOR-Net introduces a unified framework for training deep neural networks with ternary weights and activations, eliminating the need for full-precision memory and enabling efficient, event-driven hardware implementations.
Contribution
The paper proposes a novel discrete training framework with a multi-step discretization and discrete state transition method, unifying binary and ternary networks without full-precision weights.
Findings
Achieves higher performance than state-of-the-art algorithms.
Enables flexible hardware adaptation through adjustable sparsity and discrete states.
Supports event-driven hardware design for mobile intelligence.
Abstract
There is a pressing need to build an architecture that could subsume these networks under a unified framework that achieves both higher performance and less overhead. To this end, two fundamental issues are yet to be addressed. The first one is how to implement the back propagation when neuronal activations are discrete. The second one is how to remove the full-precision hidden weights in the training phase to break the bottlenecks of memory/computation consumption. To address the first issue, we present a multi-step neuronal activation discretization method and a derivative approximation technique that enable the implementing the back propagation algorithm on discrete DNNs. While for the second issue, we propose a discrete state transition (DST) methodology to constrain the weights in a discrete space without saving the hidden weights. Through this way, we build a unified framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · EEG and Brain-Computer Interfaces
