A Mess of Memory System Benchmarking, Simulation and Application Profiling
Pouya Esmaili-Dokht, Francesco Sgherzi, Valeria Soldera Girelli, Isaac, Boixaderas, Mariana Carmin, Alireza Monemi, Adria Armejach, Estanislao, Mercadal, German Llort, Petar Radojkovic, Miquel Moreto, Judit Gimenez,, Xavier Martorell, Eduard Ayguade, Jesus Labarta

TL;DR
The Mess framework offers comprehensive memory benchmarking, simulation, and profiling across diverse architectures and memory technologies, providing detailed insights into memory behavior and enabling rapid adoption of new memory systems.
Contribution
It introduces a unified framework for memory benchmarking, simulation, and application profiling that covers all major architectures and memory technologies, with open-source tools for broad adoption.
Findings
Provides detailed bandwidth-latency characterization of memory systems
Enables accurate and fast simulation of high-end memory technologies
Positions applications within the memory bandwidth-latency space for better analysis
Abstract
The Memory stress (Mess) framework provides a unified view of the memory system benchmarking, simulation and application profiling. The Mess benchmark provides a holistic and detailed memory system characterization. It is based on hundreds of measurements that are represented as a family of bandwidth--latency curves. The benchmark increases the coverage of all the previous tools and leads to new findings in the behavior of the actual and simulated memory systems. We deploy the Mess benchmark to characterize Intel, AMD, IBM, Fujitsu, Amazon and NVIDIA servers with DDR4, DDR5, HBM2 and HBM2E memory. The Mess memory simulator uses bandwidth--latency concept for the memory performance simulation. We integrate Mess with widely-used CPUs simulators enabling modeling of all high-end memory technologies. The Mess simulator is fast, easy to integrate and it closely matches the actual system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies
