Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications
Ryien Hosseini, Filippo Simini, Venkatram Vishwanath

TL;DR
This paper benchmarks low-level GNN operations on NVIDIA A100 GPUs, revealing bottlenecks and inefficiencies, and provides insights to guide future hardware and software optimizations for scalable scientific GNN applications.
Contribution
It systematically profiles GNN operations in PyTorch Geometric, benchmarks them on NVIDIA GPUs, and analyzes bottlenecks to inform future hardware and software improvements.
Findings
Memory inefficiency often dominates runtime costs.
Native PyTorch operations are competitive with Geometric equivalents.
Most GNN operations lack optimization for data sparsity.
Abstract
As Graph Neural Networks (GNNs) increase in popularity for scientific machine learning, their training and inference efficiency is becoming increasingly critical. Additionally, the deep learning field as a whole is trending towards wider and deeper networks, and ever increasing data sizes, to the point where hard hardware bottlenecks are often encountered. Emerging specialty hardware platforms provide an exciting solution to this problem. In this paper, we systematically profile and select low-level operations pertinent to GNNs for scientific computing implemented in the Pytorch Geometric software framework. These are then rigorously benchmarked on NVIDIA A100 GPUs for several various combinations of input values, including tensor sparsity. We then analyze these results for each operation. At a high level, we conclude that on NVIDIA systems: (1) confounding bottlenecks such as memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Machine Learning in Materials Science
