Neural Network Training on In-memory-computing Hardware with Radix-4 Gradients
Christopher Grimm, Naveen Verma

TL;DR
This paper demonstrates that training deep neural networks on in-memory computing hardware with radix-4 quantization can significantly reduce energy consumption while maintaining training accuracy.
Contribution
It introduces a radix-4 quantization method for training on IMC hardware and shows how to mitigate analog noise effects, enabling efficient deep learning training.
Findings
Over 400x energy savings with full quantizer adaptability.
3x energy savings with two-level quantizer adaptability.
Feasibility of training neural networks on IMC hardware with quantization.
Abstract
Deep learning training involves a large number of operations, which are dominated by high dimensionality Matrix-Vector Multiplies (MVMs). This has motivated hardware accelerators to enhance compute efficiency, but where data movement and accessing are proving to be key bottlenecks. In-Memory Computing (IMC) is an approach with the potential to overcome this, whereby computations are performed in-place within dense 2-D memory. However, IMC fundamentally trades efficiency and throughput gains for dynamic-range limitations, raising distinct challenges for training, where compute precision requirements are seen to be substantially higher than for inferencing. This paper explores training on IMC hardware by leveraging two recent developments: (1) a training algorithm enabling aggressive quantization through a radix-4 number representation; (2) IMC leveraging compute based on precision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
