Operation-Level Performance Benchmarking of Graph Neural Networks for   Scientific Applications

Ryien Hosseini; Filippo Simini; Venkatram Vishwanath

arXiv:2207.09955·cs.LG·July 21, 2022·1 cites

Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications

Ryien Hosseini, Filippo Simini, Venkatram Vishwanath

PDF

Open Access 1 Repo

TL;DR

This paper benchmarks low-level GNN operations on NVIDIA A100 GPUs, revealing bottlenecks and inefficiencies, and provides insights to guide future hardware and software optimizations for scalable scientific GNN applications.

Contribution

It systematically profiles GNN operations in PyTorch Geometric, benchmarks them on NVIDIA GPUs, and analyzes bottlenecks to inform future hardware and software improvements.

Findings

01

Memory inefficiency often dominates runtime costs.

02

Native PyTorch operations are competitive with Geometric equivalents.

03

Most GNN operations lack optimization for data sparsity.

Abstract

As Graph Neural Networks (GNNs) increase in popularity for scientific machine learning, their training and inference efficiency is becoming increasingly critical. Additionally, the deep learning field as a whole is trending towards wider and deeper networks, and ever increasing data sizes, to the point where hard hardware bottlenecks are often encountered. Emerging specialty hardware platforms provide an exciting solution to this problem. In this paper, we systematically profile and select low-level operations pertinent to GNNs for scientific computing implemented in the Pytorch Geometric software framework. These are then rigorously benchmarked on NVIDIA A100 GPUs for several various combinations of input values, including tensor sparsity. We then analyze these results for each operation. At a high level, we conclude that on NVIDIA systems: (1) confounding bottlenecks such as memory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ryienh/gnn-ops-benchmark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Ferroelectric and Negative Capacitance Devices · Machine Learning in Materials Science