Tango: rethinking quantization for graph neural network training on GPUs

Shiyang Chen; Da Zheng; Caiwen Ding; Chengying Huan; Yuede Ji; Hang; Liu

arXiv:2308.00890·cs.LG·September 4, 2023

Tango: rethinking quantization for graph neural network training on GPUs

Shiyang Chen, Da Zheng, Caiwen Ding, Chengying Huan, Yuede Ji, Hang, Liu

PDF

Open Access

TL;DR

Tango presents a new approach to quantized GNN training on GPUs that maintains accuracy and accelerates training by introducing efficient quantization rules, optimized primitives, and system integration, outperforming existing methods.

Contribution

The paper introduces novel quantization rules, optimized primitives, and system integration techniques to improve accuracy and speed in quantized GNN training on GPUs.

Findings

01

Tango achieves faster training times than state-of-the-art methods.

02

Tango maintains high accuracy in quantized GNN training.

03

System integration with DGL enhances practical applicability.

Abstract

Graph Neural Networks (GNNs) are becoming increasingly popular due to their superior performance in critical graph-related tasks. While quantization is widely used to accelerate GNN computation, quantized training faces unprecedented challenges. Current quantized GNN training systems often have longer training times than their full-precision counterparts for two reasons: (i) addressing the accuracy challenge leads to excessive overhead, and (ii) the optimization potential exposed by quantization is not adequately leveraged. This paper introduces Tango which re-thinks quantization challenges and opportunities for graph neural network training on GPUs with three contributions: Firstly, we introduce efficient rules to maintain accuracy during quantized GNN training. Secondly, we design and implement quantization-aware primitives and inter-primitive optimizations that can speed up GNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Recommender Systems and Techniques · Advanced Neural Network Applications

MethodsLib · Graph Neural Network · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings