Dissecting the NVIDIA Volta GPU Architecture via Microbenchmarking
Zhe Jia, Marco Maggioni, Benjamin Staiger, Daniele P. Scarpazza

TL;DR
This paper provides a detailed microarchitectural analysis of NVIDIA's Volta GPU architecture using microbenchmarking, expanding understanding of recent GPU designs and comparing them with previous generations.
Contribution
It presents new microarchitectural insights into NVIDIA Volta GPUs obtained through microbenchmarking and disassembly, filling a knowledge gap left by limited manufacturer disclosures.
Findings
Detailed microarchitectural features of NVIDIA Volta
Quantitative comparison with Kepler, Maxwell, Pascal
Enhanced understanding of GPU performance characteristics
Abstract
Every year, novel NVIDIA GPU designs are introduced. This rapid architectural and technological progression, coupled with a reluctance by manufacturers to disclose low-level details, makes it difficult for even the most proficient GPU software designers to remain up-to-date with the technological advances at a microarchitectural level. To address this dearth of public, microarchitectural-level information on the novel NVIDIA GPUs, independent researchers have resorted to microbenchmarks-based dissection and discovery. This has led to a prolific line of publications that shed light on instruction encoding, and memory hierarchy's geometry and features at each level. Namely, research that describes the performance and behavior of the Kepler, Maxwell and Pascal architectures. In this technical report, we continue this line of research by presenting the microarchitectural details of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
