Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors
Rohit Sehgal, Vishal Tanna, Vinicius Petrucci, Anil Godbole

TL;DR
This study demonstrates that using Micron's CXL memory expansion modules on Intel Xeon processors significantly increases memory bandwidth and improves performance for HPC and AI workloads, with notable speedups and bandwidth gains.
Contribution
First experimental analysis of CXL memory expansion impact on Intel Xeon 6 processors with real workload performance data.
Findings
CXL modules increase read-only bandwidth by 24%.
CXL modules boost mixed read/write bandwidth by up to 39%.
Performance speedups average 24% across workloads.
Abstract
High-Performance Computing (HPC) and Artificial Intelligence (AI) workloads typically demand substantial memory bandwidth and, to a degree, memory capacity. CXL memory expansion modules, also known as CXL "type-3" devices, enable enhancements in both memory capacity and bandwidth for server systems by utilizing the CXL protocol which runs over the PCIe interfaces of the processor. This paper discusses experimental findings on achieving increased memory bandwidth for HPC and AI workloads using Micron's CXL modules. This is the first study that presents real data experiments utilizing eight CXL E3.S (x8) Micron CZ122 devices on the Intel Xeon 6 processor 6900P (previously codenamed Granite Rapids AP) featuring 128 cores, alongside Micron DDR-5 memory operating at 6400 MT/s on each of the CPU's 12 DRAM channels. The eight CXL memories were set up as a unified NUMA configuration, employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques
