A New MRAM-based Process In-Memory Accelerator for Efficient Neural   Network Training with Floating Point Precision

Hongjie Wang; Yang Zhao; Chaojian Li; Yue Wang; Yingyan Lin

arXiv:2003.01551·cs.DC·May 13, 2020·1 cites

A New MRAM-based Process In-Memory Accelerator for Efficient Neural Network Training with Floating Point Precision

Hongjie Wang, Yang Zhao, Chaojian Li, Yue Wang, Yingyan Lin

PDF

Open Access

TL;DR

This paper introduces a novel SOT-MRAM-based digital process in-memory accelerator that enables efficient neural network training with floating point precision, significantly reducing energy, latency, and area compared to existing solutions.

Contribution

It presents a new SOT-MRAM cell design supporting floating point computation, advancing PIM accelerators for neural network training.

Findings

01

Achieves 3.3× energy efficiency improvement

02

Reduces latency by 1.8×

03

Lowers area by 2.5×

Abstract

The excellent performance of modern deep neural networks (DNNs) comes at an often prohibitive training cost, limiting the rapid development of DNN innovations and raising various environmental concerns. To reduce the dominant data movement cost of training, process in-memory (PIM) has emerged as a promising solution as it alleviates the need to access DNN weights. However, state-of-the-art PIM DNN training accelerators employ either analog/mixed signal computing which has limited precision or digital computing based on a memory technology that supports limited logic functions and thus requires complicated procedure to realize floating point computation. In this paper, we propose a spin orbit torque magnetic random access memory (SOT-MRAM) based digital PIM accelerator that supports floating point precision. Specifically, this new accelerator features an innovative (1) SOT-MRAM cell, (2)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Neural Networks and Reservoir Computing