Exploiting On-chip Heterogeneity of Versal Architecture for GNN   Inference Acceleration

Paul Chen; Pavan Manjunath; Sasindu Wijeratne; Bingyi Zhang; Viktor; Prasanna

arXiv:2308.02749·cs.AR·August 8, 2023

Exploiting On-chip Heterogeneity of Versal Architecture for GNN Inference Acceleration

Paul Chen, Pavan Manjunath, Sasindu Wijeratne, Bingyi Zhang, Viktor, Prasanna

PDF

Open Access

TL;DR

This paper presents a novel approach to accelerate GNN inference by leveraging the heterogeneous capabilities of AMD Versal ACAP architecture, combining custom hardware modules and dynamic task mapping to exploit data sparsity.

Contribution

It introduces a runtime kernel mapping strategy and custom hardware modules that utilize the heterogeneous architecture for efficient GNN inference acceleration.

Findings

01

Achieves up to 162.42x speedup over state-of-the-art methods.

02

Demonstrates significant performance improvements on various models and datasets.

03

Provides a flexible approach for dynamic sparsity exploitation in GNN inference.

Abstract

Graph Neural Networks (GNNs) have revolutionized many Machine Learning (ML) applications, such as social network analysis, bioinformatics, etc. GNN inference can be accelerated by exploiting data sparsity in the input graph, vertex features, and intermediate data in GNN computations. For dynamic sparsity exploitation, we leverage the heterogeneous computing capabilities of AMD Versal ACAP architecture to accelerate GNN inference. We develop a custom hardware module that executes the sparse primitives of the computation kernel on the Programmable Logic (PL) and efficiently computes the dense primitives using the AI Engine (AIE). To exploit data sparsity during inference, we devise a runtime kernel mapping strategy that dynamically assigns computation tasks to the PL and AIE based on data sparsity. Our implementation on the VCK5000 ACAP platform leads to superior performance compared with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Advanced Memory and Neural Computing