Hybrid In-memory Computing Architecture for the Training of Deep Neural Networks
Vinay Joshi, Wangxin He, Jae-sun Seo, Bipin Rajendran

TL;DR
This paper introduces a hybrid in-memory computing architecture for deep neural network training that enhances memory efficiency, reduces inference model size, and maintains accuracy over time, using phase-change memory devices and low precision updates.
Contribution
The paper presents a novel hybrid in-memory computing architecture with a new weight representation and low precision update mechanism for efficient DNN training on hardware accelerators.
Findings
HIC outperforms FP32 baseline in accuracy and model size.
HIC reduces inference model size by about 50%.
PCM device drift has negligible impact on long-term inference accuracy.
Abstract
The cost involved in training deep neural networks (DNNs) on von-Neumann architectures has motivated the development of novel solutions for efficient DNN training accelerators. We propose a hybrid in-memory computing (HIC) architecture for the training of DNNs on hardware accelerators that results in memory-efficient inference and outperforms baseline software accuracy in benchmark tasks. We introduce a weight representation technique that exploits both binary and multi-level phase-change memory (PCM) devices, and this leads to a memory-efficient inference accelerator. Unlike previous in-memory computing-based implementations, we use a low precision weight update accumulator that results in more memory savings. We trained the ResNet-32 network to classify CIFAR-10 images using HIC. For a comparable model size, HIC-based training outperforms baseline network, trained in floating-point…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
