Accurate deep neural network inference using computational phase-change   memory

Vinay Joshi; Manuel Le Gallo; Simon Haefeli; Irem Boybat; S.R.; Nandakumar; Christophe Piveteau; Martino Dazzi; Bipin Rajendran; Abu; Sebastian; Evangelos Eleftheriou

arXiv:1906.03138·cs.ET·May 19, 2020

Accurate deep neural network inference using computational phase-change memory

Vinay Joshi, Manuel Le Gallo, Simon Haefeli, Irem Boybat, S.R., Nandakumar, Christophe Piveteau, Martino Dazzi, Bipin Rajendran, Abu, Sebastian, Evangelos Eleftheriou

PDF

TL;DR

This paper presents a training methodology for deep neural networks that ensures minimal accuracy loss when deploying on phase-change memory-based in-memory computing hardware, enabling energy-efficient inference.

Contribution

It introduces a novel training approach and compensation technique for PCM-based in-memory computing, achieving high accuracy retention on CIFAR-10 and ImageNet datasets.

Findings

01

Achieved 93.7% accuracy on CIFAR-10 after mapping to PCM

02

Attained 71.6% top-1 accuracy on ImageNet with PCM hardware

03

Maintained over 93.5% accuracy on CIFAR-10 over one day

Abstract

In-memory computing is a promising non-von Neumann approach for making energy-efficient deep learning inference hardware. Crossbar arrays of resistive memory devices can be used to encode the network weights and perform efficient analog matrix-vector multiplications without intermediate movements of data. However, due to device variability and noise, the network needs to be trained in a specific way so that transferring the digitally trained weights to the analog resistive memory devices will not result in significant loss of accuracy. Here, we introduce a methodology to train ResNet-type convolutional neural networks that results in no appreciable accuracy loss when transferring weights to in-memory computing hardware based on phase-change memory (PCM). We also propose a compensation technique that exploits the batch normalization parameters to improve the accuracy retention over time.…

Equations16

G_{ij}^{l} = G_{ij}^{l, +} - G_{ij}^{l, -},

G_{ij}^{l} = G_{ij}^{l, +} - G_{ij}^{l, -},

G_{ij}^{l} = W_{ij}^{l} \times \frac{G _{ma x}}{W _{ma x}^{l}} + δ G_{ij}^{l} = G_{T, ij}^{l} + δ G_{ij}^{l},

G_{ij}^{l} = W_{ij}^{l} \times \frac{G _{ma x}}{W _{ma x}^{l}} + δ G_{ij}^{l} = G_{T, ij}^{l} + δ G_{ij}^{l},

\frac{σ _{δ W_{t r}}^{l}}{W _{ma x}^{l}} \equiv η_{t r} = \frac{σ _{δ G}}{G _{ma x}},

\frac{σ _{δ W_{t r}}^{l}}{W _{ma x}^{l}} \equiv η_{t r} = \frac{σ _{δ G}}{G _{ma x}},

\overset{α}{^} = \frac{\sum _{m = 1}^{L} I _{m}}{V _{c a l} \sum _{n = 1}^{N} \sum _{m = 1}^{L} G _{mn} ( t _{0} )} .

\overset{α}{^} = \frac{\sum _{m = 1}^{L} I _{m}}{V _{c a l} \sum _{n = 1}^{N} \sum _{m = 1}^{L} G _{mn} ( t _{0} )} .

μ_{B}

μ_{B}

σ_{B}^{2}

μ

μ

σ^{2}

p = 0.01 5^{(1/ n)} .

p = 0.01 5^{(1/ n)} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Accurate deep neural network inference using computational phase-change memory

Vinay Joshi