Learning the Optimal Path and DNN Partition for Collaborative Edge Inference
Yin Huang, Letian Zhang, Jie Xu

TL;DR
This paper develops a novel learning framework for optimizing DNN partitioning and path selection in collaborative edge inference, addressing unknown network parameters and security concerns, with a new bandit algorithm demonstrating superior performance.
Contribution
It introduces a new adversarial bandit formulation with switching costs for learning optimal DNN partitioning and path selection under incomplete network information.
Findings
The structural insights reduce the decision space for DNN layer assignment.
The B-EXPUCB algorithm achieves sublinear regret in the learning task.
Simulations show B-EXPUCB outperforms existing algorithms in accuracy and efficiency.
Abstract
Recent advancements in Deep Neural Networks (DNNs) have catalyzed the development of numerous intelligent mobile applications and services. However, they also introduce significant computational challenges for resource-constrained mobile devices. To address this, collaborative edge inference has been proposed. This method involves partitioning a DNN inference task into several subtasks and distributing these across multiple network nodes. Despite its potential, most current approaches presume known network parameters -- like node processing speeds and link transmission rates -- or rely on a fixed sequence of nodes for processing the DNN subtasks. In this paper, we tackle a more complex scenario where network parameters are unknown and must be learned, and multiple network paths are available for distributing inference tasks. Specifically, we explore the learning problem of selecting the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Advanced Clustering Algorithms Research
