Exploring Modern GPU Memory System Design Challenges through Accurate Modeling
Mahmoud Khairy, Jain Akshay, Tor Aamodt, Timothy G. Rogers

TL;DR
This paper enhances GPU simulator accuracy to better inform architecture design decisions, demonstrating significant reductions in error and improved correlation with real hardware, emphasizing the need for detailed modeling of modern GPU memory systems.
Contribution
The authors developed an enhanced GPU simulator with detailed memory system modeling, validated it against real hardware, and created a toolset for efficient correlation analysis.
Findings
Enhanced simulator reduces memory error by up to 66X
Execution time error decreases by 2.5X compared to real hardware
Simpler models underestimate the importance of advanced memory designs
Abstract
This paper explores the impact of simulator accuracy on architecture design decisions in the general-purpose graphics processing unit (GPGPU) space. We perform a detailed, quantitative analysis of the most popular publicly available GPU simulator, GPGPU-Sim, against our enhanced version of the simulator, updated to model the memory system of modern GPUs in more detail. Our enhanced GPU model is able to describe the NVIDIA Volta architecture in sufficient detail to reduce error in memory system even counters by as much as 66X. The reduced error in the memory system further reduces execution time error versus real hardware by 2.5X. To demonstrate the accuracy of our enhanced model against a real machine, we perform a counter-by-counter validation against an NVIDIA TITAN V Volta GPU, demonstrating the relative accuracy of the new simulator versus the publicly available model. We go on to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Interconnection Networks and Systems
