Neural Network Training on In-memory-computing Hardware with Radix-4   Gradients

Christopher Grimm; Naveen Verma

arXiv:2203.04821·eess.SY·July 7, 2022

Neural Network Training on In-memory-computing Hardware with Radix-4 Gradients

Christopher Grimm, Naveen Verma

PDF

TL;DR

This paper demonstrates that training deep neural networks on in-memory computing hardware with radix-4 quantization can significantly reduce energy consumption while maintaining training accuracy.

Contribution

It introduces a radix-4 quantization method for training on IMC hardware and shows how to mitigate analog noise effects, enabling efficient deep learning training.

Findings

01

Over 400x energy savings with full quantizer adaptability.

02

3x energy savings with two-level quantizer adaptability.

03

Feasibility of training neural networks on IMC hardware with quantization.

Abstract

Deep learning training involves a large number of operations, which are dominated by high dimensionality Matrix-Vector Multiplies (MVMs). This has motivated hardware accelerators to enhance compute efficiency, but where data movement and accessing are proving to be key bottlenecks. In-Memory Computing (IMC) is an approach with the potential to overcome this, whereby computations are performed in-place within dense 2-D memory. However, IMC fundamentally trades efficiency and throughput gains for dynamic-range limitations, raising distinct challenges for training, where compute precision requirements are seen to be substantially higher than for inferencing. This paper explores training on IMC hardware by leveraging two recent developments: (1) a training algorithm enabling aggressive quantization through a radix-4 number representation; (2) IMC leveraging compute based on precision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.