GriNNder: Breaking the Memory Capacity Wall in Full-Graph GNN Training with Storage Offloading
Jaeyong Song, Seongyeon Park, Hongsun Jang, Jaewon Jung, Hunseong Lim, Junguk Hong, Jinho Lee

TL;DR
GriNNder enables full-graph GNN training on limited-memory devices by leveraging storage offloading, structured management, and novel strategies, achieving significant speedups and large-scale training feasibility.
Contribution
It introduces a novel storage offloading framework with strategies tailored for full-graph GNN training on limited-memory hardware.
Findings
Achieves up to 9.78x speedup over state-of-the-art baselines.
Enables large-scale full-graph training on a single GPU.
Provides throughput comparable to distributed systems.
Abstract
Full-graph training of graph neural networks (GNNs) is widely used as it enables direct validation of algorithmic improvements by preserving complete neighborhood information. However, it typically requires multiple GPUs or servers, incurring substantial hardware and inter-device communication costs. While existing single-server methods reduce infrastructure requirements, they remain constrained by GPU and host memory capacity as graph sizes increase. To address this limitation, we introduce GriNNder, which is the first work to leverage storage devices to enable full-graph training even with limited memory. Because modern NVMe SSDs offer multi-terabyte capacities and bandwidths exceeding 10 GB/s, they provide an appealing option when memory resources are scarce. Yet, directly applying storage-based methods from other domains fails to address the unique access patterns and data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
