Multi-node Acceleration for Large-scale GCNs
Gongjian Sun, Mingyu Yan, Duo Wang, Han Li, Wenming Li, Xiaochun Ye, Dongrui Fan, Yuan Xie

TL;DR
This paper introduces MultiGCN, a multi-node acceleration system for large-scale GCNs that reduces communication overhead and improves speed, energy efficiency, and scalability compared to existing solutions.
Contribution
It proposes the first multi-node GCN acceleration system that leverages network latency tolerance to optimize communication and memory access, enabling scalable large-scale GCN processing.
Findings
Achieves 4-12x speedup over baseline MultiAccSys
Reduces network transmissions by 32% and off-chip memory accesses by 73%
Outperforms state-of-the-art multi-GPU solutions by 2.5-8x
Abstract
Limited by memory capacity and compute power, single-node graph convolutional neural network (GCN) accelerators cannot complete the execution of GCNs within a reasonable amount of time, due to the explosive size of graphs nowadays. Thus, large-scale GCNs call for a multi-node acceleration system (MultiAccSys), like TPU-Pod for large-scale neural networks. In this work, we aim to scale up single-node GCN accelerators to accelerate GCNs on large-scale graphs. We first identify the communication pattern and challenges of multi-node acceleration for GCNs on large-scale graphs. We observe that (1) coarse-grained communication patterns exist in the execution of GCNs in MultiAccSys, which introduce a massive amount of redundant network transmissions and off-chip memory accesses; (2) overall, the acceleration of GCNs in MultiAccSys is bandwidth-bound and latency-tolerant. Guided by these two…
Taxonomy
Topics: Advanced Graph Neural Networks · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing
