FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training
Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf,, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang

TL;DR
FreshGNN introduces a caching framework for GNN training that reduces memory access and accelerates training on large graphs while maintaining high accuracy by selectively reusing stable node embeddings.
Contribution
It proposes a novel historical cache policy for GNN training that balances re-computation and reuse to improve efficiency and accuracy.
Findings
Achieves up to 20.5x speedup on large datasets
Reduces memory access by 59%
Maintains less than 1% accuracy loss
Abstract
A key performance bottleneck when training graph neural network (GNN) models on large, real-world graphs is loading node features onto a GPU. Due to limited GPU memory, expensive data movement is necessary to facilitate the storage of these features on alternative devices with slower access (e.g. CPU memory). Moreover, the irregularity of graph structures contributes to poor data locality which further exacerbates the problem. Consequently, existing frameworks capable of efficiently training large GNN models usually incur a significant accuracy degradation because of the currently-available shortcuts involved. To address these limitations, we instead propose FreshGNN, a general-purpose GNN mini-batch training framework that leverages a historical cache for storing and reusing GNN node embeddings instead of re-computing them through fetching raw features at every iteration. Critical to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications
MethodsGraph Neural Network · Test · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
