Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission

Zengzipeng Tang; Yuxuan Sun; Wei Chen; Jianwen Ding; Bo Ai; Yulin Shao

arXiv:2601.08135·cs.NI·January 14, 2026

Hierarchical Online-Scheduling for Energy-Efficient Split Inference with Progressive Transmission

Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai, Yulin Shao

PDF

Open Access

TL;DR

This paper introduces ENACHI, a hierarchical online scheduling framework for split DNN inference that optimizes accuracy, energy, and latency by jointly managing task-level and packet-level decisions with adaptive transmission.

Contribution

ENACHI is a novel hierarchical optimization framework that jointly considers task and packet-level scheduling for energy-efficient split inference with adaptive transmission techniques.

Findings

01

Achieves 43.12% higher accuracy compared to benchmarks.

02

Reduces energy consumption by 62.13% under strict deadlines.

03

Maintains stable energy use in multi-user scenarios.

Abstract

Device-edge collaborative inference with Deep Neural Networks (DNNs) faces fundamental trade-offs among accuracy, latency and energy consumption. Current scheduling exhibits two drawbacks: a granularity mismatch between coarse, task-level decisions and fine-grained, packet-level channel dynamics, and insufficient awareness of per-task complexity. Consequently, scheduling solely at the task level leads to inefficient resource utilization. This paper proposes a novel ENergy-ACcuracy Hierarchical optimization framework for split Inference, named ENACHI, that jointly optimizes task- and packet-level scheduling to maximize accuracy under energy and delay constraints. A two-tier Lyapunov-based framework is developed for ENACHI, with a progressive transmission technique further integrated to enhance adaptivity. At the task level, an outer drift-plus-penalty loop makes online decisions for DNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Age of Information Optimization · IoT and Edge/Fog Computing