$\rm A^2Q$: Aggregation-Aware Quantization for Graph Neural Networks
Zeyu Zhu, Fanrong Li, Zitao Mo, Qinghao Hu, Gang Li, Zejian Liu,, Xiaoyao Liang, Jian Cheng

TL;DR
This paper introduces A^2Q, a novel aggregation-aware mixed-precision quantization method for GNNs that automatically assigns bitwidths to nodes, significantly reducing model size and inference latency with minimal accuracy loss.
Contribution
The paper proposes A^2Q, which exploits GNN topology for adaptive quantization, including a local gradient training method and a nearest neighbor strategy for unseen graphs.
Findings
Achieves up to 18.6x compression with negligible accuracy loss.
Outperforms state-of-the-art quantization methods by up to 11.4% in accuracy.
Provides up to 2x speedup on dedicated hardware.
Abstract
As graph data size increases, the vast latency and memory consumption during inference pose a significant challenge to the real-world deployment of Graph Neural Networks (GNNs). While quantization is a powerful approach to reducing GNNs complexity, most previous works on GNNs quantization fail to exploit the unique characteristics of GNNs, suffering from severe accuracy degradation. Through an in-depth analysis of the topology of GNNs, we observe that the topology of the graph leads to significant differences between nodes, and most of the nodes in a graph appear to have a small aggregation value. Motivated by this, in this paper, we propose the Aggregation-Aware mixed-precision Quantization () for GNNs, where an appropriate bitwidth is automatically learned and assigned to each node in the graph. To mitigate the vanishing gradient problem caused by sparse connections between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Memory and Neural Computing · Brain Tumor Detection and Classification
Methodsfail
