Multi-GPU Quantum Circuit Simulation and the Impact of Network Performance
W. Michael Brown, Anurag Ramesh, Thomas Lubinski, Thien Nguyen, David E. Bernal Neira

TL;DR
This paper evaluates how advancements in interconnect technology and GPU architecture influence the performance of multi-GPU quantum circuit simulations, highlighting the critical role of interconnects.
Contribution
It introduces MPI into quantum simulation benchmarks, reviews interconnect APIs, and benchmarks performance across various interconnects including NVIDIA Grace Blackwell NVL72.
Findings
GPU architecture improvements yield over 4.5X speedups.
Interconnect performance improvements lead to over 16X faster simulations.
Benchmarking on new interconnects demonstrates significant performance gains.
Abstract
As is intrinsic to the fundamental goal of quantum computing, classical simulation of quantum algorithms is notoriously demanding in resource requirements. Nonetheless, simulation is critical to the success of the field and a requirement for algorithm development and validation, as well as hardware design. GPU-acceleration has become standard practice for simulation, and due to the exponential scaling inherent in classical methods, multi-GPU simulation can be required to achieve representative system sizes. In this case, inter-GPU communications can bottleneck performance. In this work, we present the introduction of MPI into the QED-C Application-Oriented Benchmarks to facilitate benchmarking on HPC systems. We review the advances in interconnect technology and the APIs for multi-GPU communication. We benchmark using a variety of interconnect paths, including the recent NVIDIA Grace…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
