Harnessing Integrated CPU-GPU System Memory for HPC: a first look into Grace Hopper
Gabin Schieffer, Jacob Wahlgren, Jie Ren, Jennifer Faj, Ivy Peng

TL;DR
This paper presents the first in-depth analysis of integrated CPU-GPU memory management on the Grace Hopper Superchip, showing how system-allocated memory can serve as a unified memory solution for HPC and offering practical optimization insights.
Contribution
It provides the first detailed study of system memory management on the Grace Hopper Superchip, including performance analysis and optimization strategies for unified memory.
Findings
System memory can benefit most use cases with minimal porting.
The integrated page table impacts GPU application performance.
Practical optimization strategies depend on access patterns.
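To make the "minimal porting" finding concrete, the sketch below runs the same kernel on a buffer from cudaMallocManaged (managed memory) and on a buffer from plain malloc (system memory). It is an illustration, not code from the paper: the kernel scale and the helpers run_scale and gpu_can_access_pageable_memory are hypothetical names chosen here. It assumes a GPU and driver that report cudaDevAttrPageableMemoryAccess, and it skips the system-memory path when they do not.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical kernel: multiply every element by a constant factor.
__global__ void scale(float *data, size_t n, float factor) {
    size_t i = blockIdx.x * static_cast<size_t>(blockDim.x) + threadIdx.x;
    if (i < n) data[i] *= factor;
}

// True when the GPU can dereference pointers returned by plain malloc().
bool gpu_can_access_pageable_memory(int device) {
    int supported = 0;
    cudaError_t err = cudaDeviceGetAttribute(
        &supported, cudaDevAttrPageableMemoryAccess, device);
    return err == cudaSuccess && supported != 0;
}

// Fill a buffer on the CPU, scale it on the GPU, verify it on the CPU.
// Only the allocation and release calls differ between the two variants.
bool run_scale(bool use_system_memory, size_t n) {
    float *data = nullptr;
    if (use_system_memory) {
        data = static_cast<float *>(malloc(n * sizeof(float)));  // system memory
        if (data == nullptr) return false;
    } else {
        if (cudaMallocManaged(&data, n * sizeof(float)) != cudaSuccess)  // managed memory
            return false;
    }

    for (size_t i = 0; i < n; ++i) data[i] = 1.0f;  // first touch on the CPU

    const unsigned int threads = 256;
    const unsigned int blocks = static_cast<unsigned int>((n + threads - 1) / threads);
    scale<<<blocks, threads>>>(data, n, 2.0f);
    bool ok = cudaGetLastError() == cudaSuccess &&
              cudaDeviceSynchronize() == cudaSuccess;

    for (size_t i = 0; ok && i < n; ++i) ok = (data[i] == 2.0f);

    if (use_system_memory) free(data); else cudaFree(data);
    return ok;
}

int main() {
    const size_t n = 1 << 20;
    printf("managed memory: %s\n", run_scale(false, n) ? "ok" : "failed");
    if (gpu_can_access_pageable_memory(0)) {
        printf("system memory: %s\n", run_scale(true, n) ? "ok" : "failed");
    } else {
        printf("system memory: skipped (GPU cannot access pageable memory)\n");
    }
    return 0;
}
```

The two paths share the kernel, the launch, and the verification loop, which is the sense in which porting between them is small. The sketch says nothing about their relative performance, which the findings above tie to the integrated page table and to the application's access pattern.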
Abstract
Memory management across discrete CPU and GPU physical memory is traditionally achieved through explicit GPU allocations and data copy or unified virtual memory. The Grace Hopper Superchip, for the first time, supports an integrated CPU-GPU system page table, hardware-level addressing of system allocated memory, and cache-coherent NVLink-C2C interconnect, bringing an alternative solution for enabling a Unified Memory system. In this work, we provide the first in-depth study of the system memory management on the Grace Hopper Superchip, in both in-memory and memory oversubscription scenarios. We provide a suite of six representative applications, including the Qiskit quantum computing simulator, using system memory and managed memory. Using our memory utilization profiler and hardware counters, we quantify and characterize the impact of the integrated CPU-GPU system page table on GPU…
Taxonomy
Topics: Parallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Cloud Computing and Resource Management
