Hybrid In-memory Computing Architecture for the Training of Deep Neural   Networks

Vinay Joshi; Wangxin He; Jae-sun Seo; Bipin Rajendran

arXiv:2102.05271·cs.AR·February 11, 2021

Hybrid In-memory Computing Architecture for the Training of Deep Neural Networks

Vinay Joshi, Wangxin He, Jae-sun Seo, Bipin Rajendran

PDF

TL;DR

This paper introduces a hybrid in-memory computing architecture for deep neural network training that enhances memory efficiency, reduces inference model size, and maintains accuracy over time, using phase-change memory devices and low precision updates.

Contribution

The paper presents a novel hybrid in-memory computing architecture with a new weight representation and low precision update mechanism for efficient DNN training on hardware accelerators.

Findings

01

HIC outperforms FP32 baseline in accuracy and model size.

02

HIC reduces inference model size by about 50%.

03

PCM device drift has negligible impact on long-term inference accuracy.

Abstract

The cost involved in training deep neural networks (DNNs) on von-Neumann architectures has motivated the development of novel solutions for efficient DNN training accelerators. We propose a hybrid in-memory computing (HIC) architecture for the training of DNNs on hardware accelerators that results in memory-efficient inference and outperforms baseline software accuracy in benchmark tasks. We introduce a weight representation technique that exploits both binary and multi-level phase-change memory (PCM) devices, and this leads to a memory-efficient inference accelerator. Unlike previous in-memory computing-based implementations, we use a low precision weight update accumulator that results in more memory savings. We trained the ResNet-32 network to classify CIFAR-10 images using HIC. For a comparable model size, HIC-based training outperforms baseline network, trained in floating-point…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.