Multi-node Acceleration for Large-scale GCNs

Gongjian Sun; Mingyu Yan; Duo Wang; Han Li; Wenming Li; Xiaochun Ye,; Dongrui Fan; Yuan Xie

arXiv:2207.07258·cs.AR·September 5, 2022

Multi-node Acceleration for Large-scale GCNs

Gongjian Sun, Mingyu Yan, Duo Wang, Han Li, Wenming Li, Xiaochun Ye,, Dongrui Fan, Yuan Xie

PDF

Open Access

TL;DR

This paper introduces MultiGCN, a multi-node acceleration system for large-scale GCNs that reduces communication overhead and improves speed, energy efficiency, and scalability compared to existing solutions.

Contribution

It proposes the first multi-node GCN acceleration system that leverages network latency tolerance to optimize communication and memory access, enabling scalable large-scale GCN processing.

Findings

01

Achieves 4-12x speedup over baseline MultiAccSys

02

Reduces network transmissions by 32% and off-chip memory accesses by 73%

03

Outperforms state-of-the-art multi-GPU solutions by 2.5-8x

Abstract

Limited by the memory capacity and compute power, singe-node graph convolutional neural network (GCN) accelerators cannot complete the execution of GCNs within a reasonable amount of time, due to the explosive size of graphs nowadays. Thus, large-scale GCNs call for a multi-node acceleration system (MultiAccSys) like TPU-Pod for large-scale neural networks. In this work, we aim to scale up single-node GCN accelerators to accelerate GCNs on large-scale graphs. We first identify the communication pattern and challenges of multi-node acceleration for GCNs on large-scale graphs. We observe that (1) coarse-grained communication patterns exist in the execution of GCNs in MultiAccSys, which introduces massive amount of redundant network transmissions and off-chip memory accesses; (2) overall, the acceleration of GCNs in MultiAccSys is bandwidth-bound and latency-tolerant. Guided by these two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Ferroelectric and Negative Capacitance Devices · Advanced Memory and Neural Computing