GCoDE: Efficient Device-Edge Co-Inference for GNNs via Architecture-Mapping Co-Search
Ao Zhou, Jianlei Yang, Tong Qiao, Yingjie Qi, Zhi Yang, Weisheng Zhao, Chunming Hu

TL;DR
GCoDE is an automatic framework that optimizes GNN architecture and deployment for device-edge co-inference, significantly improving speed and energy efficiency on edge devices.
Contribution
It introduces the first unified architecture-mapping co-design framework for GNNs on edge devices, incorporating energy prediction and system-aware optimization.
Findings
Achieves up to 44.9x speedup over existing methods.
Reduces energy consumption by up to 98.2%.
Effectively balances accuracy and efficiency in GNN deployment.
Abstract
Graph Neural Networks (GNNs) have emerged as the state-of-the-art graph learning method. However, achieving efficient GNN inference on edge devices poses significant challenges, limiting their application in real-world edge scenarios. This is due to the high computational cost of GNNs and limited hardware resources on edge devices, which prevent GNN inference from meeting real-time and energy requirements. As an emerging paradigm, device-edge co-inference shows potential for improving inference efficiency and reducing energy consumption on edge devices. Despite its potential, research on GNN device-edge co-inference remains scarce, and our findings show that traditional model partitioning methods are ineffective for GNNs. To address this, we propose GCoDE, the first automatic framework for GNN architecture-mapping Co-design and deployment on Device-Edge hierarchies. By abstracting the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBig Data and Digital Economy · Advanced Graph Neural Networks · Advanced Neural Network Applications
