CDIO: Cross-Domain Inference Optimization with Resource Preference Prediction for Edge-Cloud Collaboration
Zheming Yang, Wen Ji, Qi Guo, Dieli Hu, Chang Zhao, Xiaowei Li,, Xuanlei Zhao, Yi Zhao, Chaoyu Gong, and Yang You

TL;DR
CDIO is a framework that optimizes resource allocation for edge-cloud video inference tasks by predicting resource preferences, significantly reducing resource consumption and energy use while maintaining accuracy.
Contribution
This paper introduces a novel cross-domain inference optimization framework that predicts resource preferences and guides resource allocation for improved efficiency in edge-cloud systems.
Findings
Achieves 20%-40% reduction in computing and bandwidth consumption.
Reduces energy consumption by over 40%.
Effectively meets accuracy and delay requirements.
Abstract
Currently, massive video tasks are processed by edge-cloud collaboration. However, the diversity of task requirements and the dynamics of resources pose great challenges to efficient inference, resulting in many wasted resources. In this paper, we present CDIO, a cross-domain inference optimization framework designed for edge-cloud collaboration. For diverse input tasks, CDIO can predict resource preference types by analyzing spatial complexity and processing requirements of the task. Subsequently, a cross-domain collaborative optimization algorithm is employed to guide resource allocation in the edge-cloud system. By ensuring that each task is matched with the ideal servers, the edge-cloud system can achieve higher efficiency inference. The evaluation results on public datasets demonstrate that CDIO can effectively meet the accuracy and delay requirements for task processing. Compared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCloud Computing and Resource Management · IoT and Edge/Fog Computing · Big Data and Business Intelligence
