HP-GNN: Generating High Throughput GNN Training Implementation on CPU-FPGA Heterogeneous Platform
Yi-Chien Lin, Bingyi Zhang, Viktor Prasanna

TL;DR
HP-GNN is an automated framework that generates high-throughput GNN training implementations on CPU-FPGA platforms, significantly improving training speed and reducing development effort for researchers and developers.
Contribution
It introduces a novel framework that automates hardware mapping for GNN training on CPU-FPGA platforms, combining data optimization, hardware templates, and design space exploration.
Findings
Achieves up to 55.67x speedup over CPU-only platforms.
Attains up to 4.45x speedup compared to existing GNN implementations.
Demonstrates effectiveness on multiple GNN models and training algorithms.
Abstract
Graph Neural Networks (GNNs) have shown great success in many applications such as recommendation systems, molecular property prediction, traffic prediction, etc. Recently, CPU-FPGA heterogeneous platforms have been used to accelerate many applications by exploiting customizable data path and abundant user-controllable on-chip memory resources of FPGAs. Yet, accelerating and deploying GNN training on such platforms requires not only expertise in hardware design but also substantial development efforts. We propose HP-GNN, a novel framework that generates high throughput GNN training implementations on a given CPU-FPGA platform that can benefit both application developers and machine learning researchers. HP-GNN takes GNN training algorithms, GNN models as the inputs, and automatically performs hardware mapping onto the target CPU-FPGA platform. HP-GNN consists of: (1) data layout and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
