Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission
Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao

TL;DR
This paper introduces ENACHI, a hierarchical online scheduling framework for split DNN inference that optimizes accuracy, energy, and latency by jointly managing task-level and packet-level decisions with adaptive transmission.
Contribution
ENACHI is a novel hierarchical optimization framework that jointly considers task and packet-level scheduling for energy-efficient split inference with adaptive transmission techniques.
Findings
Achieves 43.12% higher accuracy compared to benchmarks.
Reduces energy consumption by 62.13% under strict deadlines.
Maintains stable energy use in multi-user scenarios.
Abstract
Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption. Current scheduling exhibits two drawbacks: a granularity mismatch between coarse, task-level decisions and fine-grained, packet-level channel dynamics, and insufficient awareness of per-task complexity. Consequently, scheduling solely at the task level leads to inefficient resource utilization. This paper proposes a novel ENergy-ACcuracy Hierarchical optimization framework for split Inference, named ENACHI, that jointly optimizes task- and packet-level scheduling to maximize accuracy under energy and delay constraints. A two-tier Lyapunov-based framework is developed for ENACHI, with a progressive transmission technique further integrated to enhance adaptivity. At the task level, an outer drift-plus-penalty loop makes online decisions for DNN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Age of Information Optimization · IoT and Edge/Fog Computing
