Assessing the Use Cases of Persistent Memory in High-Performance Scientific Computing
Yehonatan Fridman, Yaniv Snir, Matan Rusanovsky, Kfir Zvi, Harel, Levin, Danny Hendler, Hagit Attiya, Gal Oren

TL;DR
This paper evaluates Intel's Optane DCPMM as a versatile solution for replacing storage and augmenting memory in HPC, enhancing performance and simplifying data management in scientific computing.
Contribution
It provides a comprehensive performance assessment of DCPMM in HPC, demonstrating its potential to replace storage devices and expand main memory with minimal code modifications.
Findings
DCPMM can replace traditional storage devices like HDDs and SSDs in HPC.
DCPMM enables larger main memory capacity for scientific applications.
Using DCPMM improves data locality and application performance.
Abstract
As the High Performance Computing world moves towards the Exa-Scale era, huge amounts of data should be analyzed, manipulated and stored. In the traditional storage/memory hierarchy, each compute node retains its data objects in its local volatile DRAM. Whenever the DRAM's capacity becomes insufficient for storing this data, the computation should either be distributed between several compute nodes, or some portion of these data objects must be stored in a non-volatile block device such as a hard disk drive or an SSD storage device. Optane DataCenter Persistent Memory Module (DCPMM), a new technology introduced by Intel, provides non-volatile memory that can be plugged into standard memory bus slots and therefore be accessed much faster than standard storage devices. In this work, we present and analyze the results of a comprehensive performance assessment of several ways in which DCPMM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
