Low-latency Mini-batch GNN Inference on CPU-FPGA Heterogeneous Platform

Bingyi Zhang; Hanqing Zeng; Viktor Prasanna

arXiv:2206.08536 · cs.DC · January 5, 2023 · 1 citation

TL;DR

This paper maps mini-batch GNN inference onto CPU-FPGA platforms using a novel adaptive hardware accelerator, reducing latency by up to 50.8x compared to CPU-only implementations.

Contribution

The paper introduces a flexible FPGA-based GNN accelerator with adaptive kernels and task scheduling, enabling efficient inference for various GNN models on heterogeneous platforms.

Findings

1. Achieves up to 50.8x latency reduction compared to CPU-only implementations.
2. Supports multiple GNN models, including GCN, GraphSAGE, and GAT.
3. Demonstrates effective hiding of data communication overhead.

Abstract

Mini-batch inference of Graph Neural Networks (GNNs) is a key problem in many real-world applications. Recently, a GNN design principle of model depth-receptive field decoupling has been proposed to address the well-known issue of neighborhood explosion. Decoupled GNN models achieve higher accuracy than original models and demonstrate excellent scalability for mini-batch inference. We map Decoupled GNNs onto CPU-FPGA heterogeneous platforms to achieve low-latency mini-batch inference. On the FPGA platform, we design a novel GNN hardware accelerator with an adaptive datapath, denoted Adaptive Computation Kernel (ACK), that can execute various computation kernels of GNNs with low latency: (1) for dense computation kernels expressed as matrix multiplication, ACK works as a systolic array with fully localized connections, (2) for sparse computation kernels, ACK follows the scatter-gather…
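The two kernel classes named in the abstract can be illustrated in software. The following is a minimal pure-Python sketch, not the ACK hardware design: it assumes sum aggregation over a weighted edge list for the sparse kernel and a plain matrix multiplication for the dense kernel. The function names (`dense_kernel`, `sparse_kernel`, `gcn_like_layer`) and the `(src, dst, weight)` edge layout are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Edge = Tuple[int, int, float]  # (source vertex, destination vertex, edge weight); assumed layout


def dense_kernel(h: Matrix, w: Matrix) -> Matrix:
    """Dense kernel: feature transformation H @ W as a plain matrix multiply."""
    rows, inner, cols = len(h), len(w), len(w[0])
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for k in range(inner):
            for j in range(cols):
                out[i][j] += h[i][k] * w[k][j]
    return out


def sparse_kernel(edges: List[Edge], h: Matrix, num_nodes: int) -> Matrix:
    """Sparse kernel: scatter-gather sum aggregation over an edge list."""
    dim = len(h[0])
    out = [[0.0] * dim for _ in range(num_nodes)]
    for src, dst, weight in edges:
        # Scatter: the source vertex emits a weighted copy of its feature vector.
        message = [weight * x for x in h[src]]
        # Gather: the destination vertex accumulates the incoming message.
        for j in range(dim):
            out[dst][j] += message[j]
    return out


def gcn_like_layer(edges: List[Edge], h: Matrix, w: Matrix, num_nodes: int) -> Matrix:
    """Aggregate neighbor features (sparse), then transform them (dense)."""
    return dense_kernel(sparse_kernel(edges, h, num_nodes), w)


if __name__ == "__main__":
    features = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    edge_list = [(0, 1, 1.0), (2, 1, 0.5), (1, 0, 1.0)]
    weights = [[1.0, 1.0], [0.0, 2.0]]
    print(gcn_like_layer(edge_list, features, weights, num_nodes=3))
```

The sparse kernel does work proportional to the number of edges, while the dense kernel does work proportional to the matrix dimensions. This difference in access pattern is why a single datapath that can serve both, as the abstract describes, is attractive for a mini-batch workload that alternates between the two.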

Taxonomy

Topics: Advanced Graph Neural Networks · Machine Learning in Materials Science · Graph Theory and Algorithms