PerfSAGE: Generalized Inference Performance Predictor for Arbitrary Deep Learning Models on Edge Devices
Yuji Chai, Devashree Tripathy, Chuteng Zhou, Dibakar Gope, Igor, Fedorov, Ramon Matas, David Brooks, Gu-Yeon Wei, Paul Whatmough

TL;DR
PerfSAGE is a novel graph neural network model that accurately predicts inference latency, energy, and memory footprint for any deep neural network on various edge hardware platforms, facilitating efficient DNN deployment.
Contribution
This work introduces PerfSAGE, a GNN-based predictor capable of generalizing to arbitrary DNN graphs and outperforming prior models in accuracy and versatility.
Findings
Achieves <5% MAPE across multiple hardware targets
Outperforms previous GNN-based predictors in accuracy
Predicts performance on arbitrary DNN graphs without modifications
Abstract
The ability to accurately predict deep neural network (DNN) inference performance metrics, such as latency, power, and memory footprint, for an arbitrary DNN on a target hardware platform is essential to the design of DNN based models. This ability is critical for the (manual or automatic) design, optimization, and deployment of practical DNNs for a specific hardware deployment platform. Unfortunately, these metrics are slow to evaluate using simulators (where available) and typically require measurement on the target hardware. This work describes PerfSAGE, a novel graph neural network (GNN) that predicts inference latency, energy, and memory footprint on an arbitrary DNN TFlite graph (TFL, 2017). In contrast, previously published performance predictors can only predict latency and are restricted to pre-defined construction rules or search spaces. This paper also describes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Neural Network Applications · Ferroelectric and Negative Capacitance Devices
MethodsGraph Neural Network
