Tango: rethinking quantization for graph neural network training on GPUs
Shiyang Chen, Da Zheng, Caiwen Ding, Chengying Huan, Yuede Ji, Hang, Liu

TL;DR
Tango presents a new approach to quantized GNN training on GPUs that maintains accuracy and accelerates training by introducing efficient quantization rules, optimized primitives, and system integration, outperforming existing methods.
Contribution
The paper introduces novel quantization rules, optimized primitives, and system integration techniques to improve accuracy and speed in quantized GNN training on GPUs.
Findings
Tango achieves faster training times than state-of-the-art methods.
Tango maintains high accuracy in quantized GNN training.
System integration with DGL enhances practical applicability.
Abstract
Graph Neural Networks (GNNs) are becoming increasingly popular due to their superior performance in critical graph-related tasks. While quantization is widely used to accelerate GNN computation, quantized training faces unprecedented challenges. Current quantized GNN training systems often have longer training times than their full-precision counterparts for two reasons: (i) addressing the accuracy challenge leads to excessive overhead, and (ii) the optimization potential exposed by quantization is not adequately leveraged. This paper introduces Tango which re-thinks quantization challenges and opportunities for graph neural network training on GPUs with three contributions: Firstly, we introduce efficient rules to maintain accuracy during quantized GNN training. Secondly, we design and implement quantization-aware primitives and inter-primitive optimizations that can speed up GNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Recommender Systems and Techniques · Advanced Neural Network Applications
MethodsLib · Graph Neural Network · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
