Efficient Deep Learning Using Non-Volatile Memory Technology
Ahmet Inci, Mehmet Meric Isgenc, Diana Marculescu

TL;DR
This paper introduces DeepNVM++, a framework for analyzing NVM-based GPU caches, demonstrating significant energy and area savings over traditional SRAM in deep learning workloads.
Contribution
It presents a comprehensive modeling and analysis framework for NVM caches in GPUs, highlighting their advantages for deep learning applications.
Findings
STT-MRAM and SOT-MRAM reduce the energy-delay product (EDP) by up to 3.8x and 4.7x, respectively, relative to SRAM.
These NVM technologies achieve up to 2.4x area reduction compared to SRAM.
They enable larger cache capacities with significant energy efficiency improvements.
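For readers unfamiliar with the metric, the energy-delay product is energy multiplied by delay, and a reduction factor is the baseline EDP divided by the candidate EDP. The sketch below illustrates only that arithmetic. The function names and every numeric input are hypothetical placeholders. They are not values from the paper or from DeepNVM++.

```python
def edp(energy_j: float, delay_s: float) -> float:
    """Energy-delay product: energy (joules) times delay (seconds)."""
    return energy_j * delay_s


def reduction_factor(baseline_edp: float, candidate_edp: float) -> float:
    """How many times smaller the candidate EDP is than the baseline EDP."""
    return baseline_edp / candidate_edp


# Hypothetical inputs chosen only to show the arithmetic (NOT from the paper).
sram_edp = edp(energy_j=2.0, delay_s=1.0)   # baseline cache
nvm_edp = edp(energy_j=0.5, delay_s=1.25)   # candidate cache: less energy, slightly slower

factor = reduction_factor(sram_edp, nvm_edp)
print(f"EDP reduction: {factor:.1f}x")
```

Because EDP multiplies the two quantities, a candidate can be somewhat slower than the baseline and still come out ahead, provided its energy savings outweigh the added delay.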
Abstract
Embedded machine learning (ML) systems have now become the dominant platform for deploying ML serving tasks and are projected to become of equal importance for training ML models. With this comes the challenge of overall efficient deployment, in particular low power and high throughput implementations, under stringent memory constraints. In this context, non-volatile memory (NVM) technologies such as STT-MRAM and SOT-MRAM have significant advantages compared to conventional SRAM due to their non-volatility, higher cell density, and scalability features. While prior work has investigated several architectural implications of NVM for generic applications, in this work we present DeepNVM++, a comprehensive framework to characterize, model, and analyze NVM-based caches in GPU architectures for deep learning (DL) applications by combining technology-specific circuit-level models and the…
Taxonomy
Topics: Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing · Parallel Computing and Optimization Techniques
