On the Applicability of PEBS based Online Memory Access Tracking for Heterogeneous Memory Management at Scale
Aleix Roca Nonell, Balazs Gerofi, Leonardo Bautista-Gomez, Dominique, Martinet, Vicen\c{c} Beltran Querol, Yutaka Ishikawa

TL;DR
This paper investigates the feasibility of using Intel's PEBS feature for online memory access pattern profiling in large-scale heterogeneous memory management, demonstrating low overhead and effective pattern detection.
Contribution
It introduces a lightweight PEBS driver in IHK/McKernel, enabling scalable, low-overhead access pattern profiling for heterogeneous memory management at large scale.
Findings
Access patterns can be captured with up to 10% overhead at 128k cores.
PEBS-based profiling is effective for noise-sensitive HPC applications.
The approach is promising for managing heterogeneous memory systems.
Abstract
Operating systems have historically had to manage only a single type of memory device. The imminent availability of heterogeneous memory devices based on emerging memory technologies confronts the classic single memory model and opens a new spectrum of possibilities for memory management. Transparent data movement between different memory devices based on access patterns of applications is a desired feature to make optimal use of such devices and to hide the complexity of memory management to the end-user. However, capturing memory access patterns of an application at runtime comes at a cost, which is particularly challenging for large scale parallel applications that may be sensitive to system noise. In this work, we focus on the access pattern profiling phase prior to the actual memory relocation. We study the feasibility of using Intel's Processor Event-Based Sampling (PEBS)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
