AIRES: Accelerating Out-of-Core GCNs via Algorithm-System Co-Design

Shakya Jayakody; Youpeng Zhao; Jun Wang

arXiv:2507.02006·cs.LG·January 7, 2026

AIRES: Accelerating Out-of-Core GCNs via Algorithm-System Co-Design

Shakya Jayakody, Youpeng Zhao, Jun Wang

PDF

TL;DR

AIRES is a co-designed algorithm-system approach that accelerates out-of-core GCN computations by optimizing data alignment and memory transfer, significantly reducing latency and improving throughput.

Contribution

AIRES introduces a novel tiling algorithm and a dynamic scheduling strategy that effectively address data alignment and I/O bottlenecks in out-of-core GCNs.

Findings

01

Achieves up to 1.8x lower latency in benchmarks

02

Reduces I/O latency and improves throughput

03

Outperforms state-of-the-art methods significantly

Abstract

Graph convolutional networks (GCNs) are fundamental in various scientific applications, ranging from biomedical protein-protein interactions (PPI) to large-scale recommendation systems. An essential component for modeling graph structures in GCNs is sparse general matrix-matrix multiplication (SpGEMM). As the size of graph data continues to scale up, SpGEMMs are often conducted in an out-of-core fashion due to limited GPU memory space in resource-constrained systems. Albeit recent efforts that aim to alleviate the memory constraints of out-of-core SpGEMM through either GPU feature caching, hybrid CPU-GPU memory layout, or performing the computation in sparse format, current systems suffer from both high I/O latency and GPU under-utilization issues. In this paper, we first identify the problems of existing systems, where sparse format data alignment and memory allocation are the main…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.