How Much Progress Has There Been in NVIDIA Datacenter GPUs?
Emanuele Del Sozzo, Martin Fleming, Kenneth Flamm, and Neil Thompson

TL;DR
This paper analyzes the technical progress of NVIDIA datacenter GPUs from the mid-2000s to present, quantifying improvements in performance, memory, cost, and power, and discusses implications of export controls on future performance gaps.
Contribution
It provides a comprehensive dataset and trend analysis of NVIDIA datacenter GPU features, including performance doubling times and the impact of export regulations.
Findings
FP16 and FP32 operations double approximately every 1.5-1.7 years.
Memory bandwidth and size grow slower, doubling every 3.3-3.5 years.
Export controls could significantly reduce the performance gap in future GPU development.
Abstract
Graphics Processing Units (GPUs) are the state-of-the-art architecture for essential tasks, ranging from rendering 2D/3D graphics to accelerating workloads in supercomputing centers and, of course, Artificial Intelligence (AI). As GPUs continue improving to satisfy ever-increasing performance demands, analyzing past and current progress becomes paramount in determining future constraints on scientific research. This is particularly compelling in the AI domain, where rapid technological advancements and fierce global competition have led the United States to recently implement export control regulations limiting international access to advanced AI chips. For this reason, this paper studies technical progress in NVIDIA datacenter GPUs released from the mid-2000s until today. Specifically, we compile a comprehensive dataset of datacenter NVIDIA GPUs comprising several features, ranging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Advanced Data Storage Technologies
