Large Graph Convolutional Network Training with GPU-Oriented Data   Communication Architecture

Seung Won Min; Kun Wu; Sitao Huang; Mert Hidayeto\u{g}lu; Jinjun; Xiong; Eiman Ebrahimi; Deming Chen; Wen-mei Hwu

arXiv:2103.03330·cs.LG·August 17, 2021

Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture

Seung Won Min, Kun Wu, Sitao Huang, Mert Hidayeto\u{g}lu, Jinjun, Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei Hwu

PDF

1 Repo

TL;DR

This paper introduces a GPU-oriented data communication architecture for GCN training that reduces host memory bandwidth bottlenecks by enabling direct GPU access to sparse features, significantly improving training speed on large graphs.

Contribution

The paper proposes a novel GPU-centric data access method for GCN training that minimizes CPU involvement and enhances data transfer efficiency, enabling faster training on large-scale graphs.

Findings

01

Achieves 65-92% speedup over traditional methods.

02

Enables training on graphs with up to 111 million nodes.

03

Matches all-in-GPU-memory performance for some datasets.

Abstract

Graph Convolutional Networks (GCNs) are increasingly adopted in large-scale graph-based recommender systems. Training GCN requires the minibatch generator traversing graphs and sampling the sparsely located neighboring nodes to obtain their features. Since real-world graphs often exceed the capacity of GPU memory, current GCN training systems keep the feature table in host memory and rely on the CPU to collect sparse features before sending them to the GPUs. This approach, however, puts tremendous pressure on host memory bandwidth and the CPU. This is because the CPU needs to (1) read sparse features from memory, (2) write features into memory as a dense format, and (3) transfer the features from memory to the GPUs. In this work, we propose a novel GPU-oriented data communication approach for GCN training, where GPU threads directly access sparse features in host memory through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

K-Wu/pytorch-direct_dgl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGraph Convolutional Network