Evaluation of STT-MRAM as a Scratchpad for Training in ML Accelerators

Sourjya Roy; Cheng Wang; and Anand Raghunathan

arXiv:2308.02024·cs.AR·August 7, 2023·1 cites

Evaluation of STT-MRAM as a Scratchpad for Training in ML Accelerators

Sourjya Roy, Cheng Wang, and Anand Raghunathan

PDF

Open Access

TL;DR

This paper evaluates the potential of STT-MRAM as a high-density, low-leakage scratchpad memory for training deep neural networks in ML accelerators, focusing on energy efficiency and accuracy trade-offs.

Contribution

It presents a comprehensive device-to-system evaluation and proposes optimization strategies for STT-MRAM to improve energy efficiency in ML training accelerators.

Findings

01

STT-MRAM can achieve 15-22x energy savings compared to SRAM.

02

Optimized write operations reduce energy by over 2x with minimal accuracy loss.

03

Heterogeneous memory configurations enable effective training convergence.

Abstract

Progress in artificial intelligence and machine learning over the past decade has been driven by the ability to train larger deep neural networks (DNNs), leading to a compute demand that far exceeds the growth in hardware performance afforded by Moore's law. Training DNNs is an extremely memory-intensive process, requiring not just the model weights but also activations and gradients for an entire minibatch to be stored. The need to provide high-density and low-leakage on-chip memory motivates the exploration of emerging non-volatile memory for training accelerators. Spin-Transfer-Torque MRAM (STT-MRAM) offers several desirable properties for training accelerators, including 3-4x higher density than SRAM, significantly reduced leakage power, high endurance and reasonable access time. On the one hand, MRAM write operations require high write energy and latency due to the need to ensure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMagnetic properties of thin films · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing