Where to Split? A Pareto-Front Analysis of DNN Partitioning for Edge Inference
Adiba Masud, Nicholas Foley, Pragathi Durga Rajarajan, and Palden Lama

TL;DR
This paper introduces ParetoPipe, a framework for multi-objective DNN partitioning that balances latency and throughput on edge devices, supported by benchmarking and Pareto front analysis under network variability.
Contribution
It presents a multi-objective optimization approach for DNN partitioning, along with an open-source framework and benchmarking on heterogeneous edge hardware.
Findings
Identified Pareto-optimal partitioning points for latency and throughput trade-offs.
Benchmarking results on Raspberry Pis and GPU edge server demonstrate the framework's effectiveness.
Network variability significantly impacts optimal partitioning strategies.
Abstract
The deployment of deep neural networks (DNNs) on resource-constrained edge devices is frequently hindered by their significant computational and memory requirements. While partitioning and distributing a DNN across multiple devices is a well-established strategy to mitigate this challenge, prior research has largely focused on single-objective optimization, such as minimizing latency or maximizing throughput. This paper challenges that view by reframing DNN partitioning as a multi-objective optimization problem. We argue that in real-world scenarios, a complex trade-off between latency and throughput exists, which is further complicated by network variability. To address this, we introduce ParetoPipe, an open-source framework that leverages Pareto front analysis to systematically identify optimal partitioning strategies that balance these competing objectives. Our contributions are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · IoT and Edge/Fog Computing · Big Data and Digital Economy
