FreshGNN: Reducing Memory Access via Stable Historical Embeddings for   Graph Neural Network Training

Kezhao Huang; Haitian Jiang; Minjie Wang; Guangxuan Xiao; David Wipf,; Xiang Song; Quan Gan; Zengfeng Huang; Jidong Zhai; Zheng Zhang

arXiv:2301.07482·cs.LG·March 26, 2024·1 cites

FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training

Kezhao Huang, Haitian Jiang, Minjie Wang, Guangxuan Xiao, David Wipf,, Xiang Song, Quan Gan, Zengfeng Huang, Jidong Zhai, Zheng Zhang

PDF

Open Access

TL;DR

FreshGNN introduces a caching framework for GNN training that reduces memory access and accelerates training on large graphs while maintaining high accuracy by selectively reusing stable node embeddings.

Contribution

It proposes a novel historical cache policy for GNN training that balances re-computation and reuse to improve efficiency and accuracy.

Findings

01

Achieves up to 20.5x speedup on large datasets

02

Reduces memory access by 59%

03

Maintains less than 1% accuracy loss

Abstract

A key performance bottleneck when training graph neural network (GNN) models on large, real-world graphs is loading node features onto a GPU. Due to limited GPU memory, expensive data movement is necessary to facilitate the storage of these features on alternative devices with slower access (e.g. CPU memory). Moreover, the irregularity of graph structures contributes to poor data locality which further exacerbates the problem. Consequently, existing frameworks capable of efficiently training large GNN models usually incur a significant accuracy degradation because of the currently-available shortcuts involved. To address these limitations, we instead propose FreshGNN, a general-purpose GNN mini-batch training framework that leverages a historical cache for storing and reusing GNN node embeddings instead of re-computing them through fetching raw features at every iteration. Critical to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications

MethodsGraph Neural Network · Test · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings