A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence
Xin Yuan, Ning Li, Quan Chen, Wenchao Xu, Zhaoxin Zhang, Song Guo

TL;DR
This paper proposes a QoE-aware resource allocation algorithm for split inference in edge intelligence, balancing delay, QoE, and resource use to enhance performance over existing methods.
Contribution
It introduces the ERA algorithm that optimizes model split and resource allocation considering QoE, delay, and resource consumption, with a novel loop iteration gradient descent approach.
Findings
ERA outperforms previous methods in experimental evaluations.
The proposed algorithm effectively balances QoE, delay, and resource consumption.
Convergence and complexity properties of ERA are thoroughly analyzed.
Abstract
Even the AI has been widely used and significantly changed our life, deploying the large AI models on resource limited edge devices directly is not appropriate. Thus, the model split inference is proposed to improve the performance of edge intelligence, in which the AI model is divided into different sub models and the resource-intensive sub model is offloaded to edge server wirelessly for reducing resource requirements and inference latency. However, the previous works mainly concentrate on improving and optimizing the system QoS, ignore the effect of QoE which is another critical item for the users except for QoS. Even the QoE has been widely learned in EC, considering the differences between task offloading in EC and split inference in EI, and the specific issues in QoE which are still not addressed in EC and EI, these algorithms cannot work effectively in edge split inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Brain Tumor Detection and Classification · Blind Source Separation Techniques
