Hybrid Orchestration of Edge AI and Microservices via Graph-based Self-Imitation Learning
Chen Yang, Jin Zheng, Yang Zhuolin, Lai Pan, Zhang Xiao, Hu Menglan, Yin Haiyan

TL;DR
This paper introduces SIL-GPO, a graph-based self-imitation reinforcement learning framework that optimizes the complex orchestration of AI and microservices at the edge, significantly reducing latency and improving resource use.
Contribution
It presents a novel RL approach combining graph attention networks and self-imitation learning to effectively handle the joint deployment and routing decisions in edge microservice systems.
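This summary does not state the exact loss SIL-GPO uses. As a minimal sketch only, the commonly used self-imitation objective can be written as follows: the agent imitates a stored action only when the observed return exceeds its current value estimate. The function name `sil_loss`, its parameters, and the `value_coef` default are hypothetical choices for illustration and are not taken from the paper.

```python
import math


def sil_loss(log_probs, values, returns, value_coef=0.01):
    """Illustrative self-imitation loss over a batch of replayed transitions.

    log_probs: log pi(a|s) of each stored action under the current policy
    values:    current value estimates V(s) for each stored state
    returns:   observed discounted returns R for each stored transition
    NOTE: an assumed, generic formulation, not the paper's confirmed loss.
    """
    if not returns:
        raise ValueError("batch must be non-empty")
    policy_loss = 0.0
    value_loss = 0.0
    for logp, v, r in zip(log_probs, values, returns):
        # Clipped advantage: only past outcomes better than expected count.
        adv = max(r - v, 0.0)
        policy_loss += -logp * adv      # push up probability of good past actions
        value_loss += 0.5 * adv ** 2    # pull the value estimate toward the return
    return (policy_loss + value_coef * value_loss) / len(returns)


if __name__ == "__main__":
    # One transition that beat its value estimate, one that did not.
    print(sil_loss([math.log(0.5), math.log(0.9)], [1.0, 3.0], [3.0, 1.0]))
```

The clipping means transitions with returns below the value estimate contribute nothing, so the replayed batch only reinforces decisions that turned out better than expected.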
Findings
SIL-GPO outperforms existing heuristics and deep RL methods in reducing latency.
The framework improves resource utilization in edge AI workloads.
Experimental results validate the scalability and effectiveness of SIL-GPO.
Abstract
Modern edge AI applications increasingly rely on microservice architectures that integrate both AI services and conventional microservices into complex request chains with stringent latency requirements. Effectively orchestrating these heterogeneous services is crucial for ensuring low-latency performance, yet remains challenging due to their diverse resource demands and strong operational interdependencies under resource-constrained edge environments. In particular, frequent interactions between services tightly couple deployment and routing decisions, yet existing approaches optimize them in isolation, leading to fundamentally inadequate system performance. In this paper, we propose SIL-GPO, a reinforcement learning framework that optimizes hybrid orchestration for edge AI microservice systems. SIL-GPO formulates the orchestration problem as a sequential decision-making task and…
Taxonomy
Topics: Software System Performance and Reliability · Software-Defined Networks and 5G · IoT and Edge/Fog Computing
