Hybrid-Parallel: Achieving High Performance and Energy Efficient Distributed Inference on Robots
Zekai Sun, Xiuxian Guan, Junming Wang, Haoze Song, Yuhao Qing,, Tianxiang Shen, Dong Huang, Fangming Liu, Heming Cui

TL;DR
Hybrid-Parallel is a novel distributed inference system designed for robots that significantly reduces inference time and energy consumption by fine-grained operator parallelism and overlapping computation with transmission.
Contribution
It introduces a fine-grained operator-level parallelism approach for distributed DNN inference on robotic IoT, improving latency and energy efficiency.
Findings
Reduces inference time by up to 41.1%
Decreases energy consumption per inference by up to 35.3%
Outperforms state-of-the-art baselines in robotic inference tasks
Abstract
The rapid advancements in machine learning techniques have led to significant achievements in various real-world robotic tasks. These tasks heavily rely on fast and energy-efficient inference of deep neural network (DNN) models when deployed on robots. To enhance inference performance, distributed inference has emerged as a promising approach, parallelizing inference across multiple powerful GPU devices in modern data centers using techniques such as data parallelism, tensor parallelism, and pipeline parallelism. However, when deployed on real-world robots, existing parallel methods fail to provide low inference latency and meet the energy requirements due to the limited bandwidth of robotic IoT. We present Hybrid-Parallel, a high-performance distributed inference system optimized for robotic IoT. Hybrid-Parallel employs a fine-grained approach to parallelize inference at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Advanced Neural Network Applications · Modular Robots and Swarm Intelligence
