Dora: QoE-Aware Hybrid Parallelism for Distributed Edge AI
Jianli Jin, Ziyang Lin, Qianli Dong, Yi Chen, Jayanth Srinivasa, Myungjin Lee, Zhaowei Tan, Fan Lai

TL;DR
Dora is a framework that optimizes distributed edge AI for better user experience by balancing model performance, network contention, and energy efficiency through adaptive, QoE-aware parallelism strategies.
Contribution
Dora introduces a novel QoE-aware hybrid parallelism framework that jointly optimizes model partitioning, network scheduling, and runtime adaptation for edge AI.
Findings
Achieves 1.1-6.3x faster execution in edge deployments.
Reduces energy consumption by 21-82%.
Maintains QoE under runtime dynamics.
Abstract
With the proliferation of edge AI applications, satisfying user quality of experience (QoE) requirements, such as model inference latency, has become a first class objective, as these models operate in resource constrained settings and directly interact with users. Yet, modern AI models routinely exceed the resource capacity of individual devices, necessitating distributed execution across heterogeneous devices over variable and contention prone networks. Existing planners for hybrid (e.g., data and pipeline) parallelism largely optimize for throughput or device utilization, overlooking QoE, leading to severe resource inefficiency (e.g., unnecessary energy drain) or QoE violations under runtime dynamics. We present Dora, a framework for QoE aware hybrid parallelism in distributed edge AI training and inference. Dora jointly optimizes heterogeneous computation, contention prone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Age of Information Optimization · Privacy-Preserving Technologies in Data
