TL;DR
Bi-GCN introduces a binarized graph neural network that significantly reduces memory usage and accelerates inference while maintaining comparable performance to full-precision models, enabling efficient processing of large attributed graphs.
Contribution
This paper proposes the first binarized GNN, Bi-GCN, which reduces memory and computation costs by binarizing parameters and features, and introduces a new training method for effective learning.
Findings
Memory consumption reduced by ~30x.
Inference speed increased by ~47x.
Performance comparable to full-precision GNNs.
Abstract
Graph Neural Networks (GNNs) have achieved tremendous success in graph representation learning. Unfortunately, current GNNs usually rely on loading the entire attributed graph into network for processing. This implicit assumption may not be satisfied with limited memory resources, especially when the attributed graph is large. In this paper, we pioneer to propose a Binary Graph Convolutional Network (Bi-GCN), which binarizes both the network parameters and input node features. Besides, the original matrix multiplications are revised to binary operations for accelerations. According to the theoretical analysis, our Bi-GCN can reduce the memory consumption by an average of ~30x for both the network parameters and input data, and accelerate the inference speed by an average of ~47x, on the citation networks. Meanwhile, we also design a new gradient approximation based back-propagation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
