TL;DR
This paper introduces novel parallelization techniques for graph convolutional networks that significantly improve scalability and efficiency on large graphs while maintaining high accuracy.
Contribution
The paper presents new parallel sampling and feature propagation strategies for GCNs, achieving near-linear speedup and reduced communication without sacrificing accuracy.
Findings
Achieves 64x speedup in sampling on 40-core systems
Reduces communication and improves cache utilization in feature propagation
Outperforms state-of-the-art methods in scalability, efficiency, and accuracy
Abstract
The Graph Convolutional Network (GCN) model and its variants are powerful graph embedding tools for facilitating classification and clustering on graphs. However, a major challenge is to reduce the complexity of layered GCNs and make them parallelizable and scalable on very large graphs -- state-of the art techniques are unable to achieve scalability without losing accuracy and efficiency. In this paper, we propose novel parallelization techniques for graph sampling-based GCNs that achieve superior scalable performance on very large graphs without compromising accuracy. Specifically, our GCN guarantees work-efficient training and produces order of magnitude savings in computation and communication. To scale GCN training on tightly-coupled shared memory systems, we develop parallelization strategies for the key steps in training: For the graph sampling step, we exploit parallelism within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Convolutional Network
