AIRES: Accelerating Out-of-Core GCNs via Algorithm-System Co-Design
Shakya Jayakody, Youpeng Zhao, Jun Wang

TL;DR
AIRES is a co-designed algorithm-system approach that accelerates out-of-core GCN computations by optimizing data alignment and memory transfer, significantly reducing latency and improving throughput.
Contribution
AIRES introduces a novel tiling algorithm and a dynamic scheduling strategy that effectively address data alignment and I/O bottlenecks in out-of-core GCNs.
Findings
Achieves up to 1.8x lower latency in benchmarks
Reduces I/O latency and improves throughput
Outperforms state-of-the-art methods significantly
Abstract
Graph convolutional networks (GCNs) are fundamental in various scientific applications, ranging from biomedical protein-protein interactions (PPI) to large-scale recommendation systems. An essential component for modeling graph structures in GCNs is sparse general matrix-matrix multiplication (SpGEMM). As the size of graph data continues to scale up, SpGEMMs are often conducted in an out-of-core fashion due to limited GPU memory space in resource-constrained systems. Albeit recent efforts that aim to alleviate the memory constraints of out-of-core SpGEMM through either GPU feature caching, hybrid CPU-GPU memory layout, or performing the computation in sparse format, current systems suffer from both high I/O latency and GPU under-utilization issues. In this paper, we first identify the problems of existing systems, where sparse format data alignment and memory allocation are the main…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
