A Data-Driven Approach to Dataflow-Aware Online Scheduling for Graph Neural Network Inference
Pol Puigdemont, Enrico Russo, Axel Wassington, Abhijit Das, Sergi, Abadal, Maurizio Palesi

TL;DR
This paper presents a data-driven framework that predicts optimal dataflows for GNN inference, enabling an online scheduler to significantly improve performance across diverse graph types.
Contribution
It introduces a novel latency prediction model for GNN dataflows and an online scheduling algorithm that adapts to graph diversity, enhancing accelerator efficiency.
Findings
Predicts dataflow latency with up to 91.28% accuracy
Achieves up to 3.17x speedup in mean completion time
Reduces execution time variability across graph types
Abstract
Graph Neural Networks (GNNs) have shown significant promise in various domains, such as recommendation systems, bioinformatics, and network analysis. However, the irregularity of graph data poses unique challenges for efficient computation, leading to the development of specialized GNN accelerator architectures that surpass traditional CPU and GPU performance. Despite this, the structural diversity of input graphs results in varying performance across different GNN accelerators, depending on their dataflows. This variability in performance due to differing dataflows and graph properties remains largely unexplored, limiting the adaptability of GNN accelerators. To address this, we propose a data-driven framework for dataflow-aware latency prediction in GNN inference. Our approach involves training regressors to predict the latency of executing specific graphs on particular dataflows,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
