Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses
Jeongmin Brian Park, Vikram Sharma Mailthody, Zaid Qureshi, Wen-mei Hwu

TL;DR
This paper introduces GIDS, a GPU-initiated direct storage access dataloader that significantly accelerates large-scale GNN training by enabling direct data fetching from storage, reducing bottlenecks and improving GPU utilization.
Contribution
The paper presents GIDS, a novel GPU-oriented dataloader that improves large-scale GNN training efficiency by directly accessing storage and optimizing data movement strategies.
Findings
GIDS accelerates GNN training by up to 392X on terabyte-scale datasets.
It reduces page-fault overhead and improves hardware resource utilization.
Enables efficient training of large graphs exceeding CPU memory capacity.
Abstract
Graph Neural Networks (GNNs) are emerging as a powerful tool for learning from graph-structured data and performing sophisticated inference tasks in various application domains. Although GNNs have been shown to be effective on modest-sized graphs, training them on large-scale graphs remains a significant challenge due to lack of efficient data access and data movement methods. Existing frameworks for training GNNs use CPUs for graph sampling and feature aggregation, while the training and updating of model weights are executed on GPUs. However, our in-depth profiling shows the CPUs cannot achieve the throughput required to saturate GNN model training throughput, causing gross under-utilization of expensive GPU resources. Furthermore, when the graph and its embeddings do not fit in the CPU memory, the overhead introduced by the operating system, say for handling page-faults, comes in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Advanced Neural Network Applications
