JointDNN: An Efficient Training and Inference Engine for Intelligent   Mobile Cloud Computing Services

Amir Erfan Eshratifar; Mohammad Saeed Abrishami; Massoud Pedram

arXiv:1801.08618·cs.DC·February 6, 2020

JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services

Amir Erfan Eshratifar, Mohammad Saeed Abrishami, Massoud Pedram

PDF

TL;DR

JointDNN is a novel adaptive engine that efficiently partitions DNN computations between mobile devices and the cloud, significantly reducing latency and energy consumption for mobile AI applications.

Contribution

It introduces an optimization framework for collaborative DNN processing that adapts to device and cloud constraints, improving efficiency over existing methods.

Findings

01

Up to 18x reduction in latency.

02

Up to 32x reduction in mobile energy consumption.

03

Effective partitioning of DNN layers between mobile and cloud.

Abstract

Deep learning models are being deployed in many mobile intelligent applications. End-side services, such as intelligent personal assistants, autonomous cars, and smart home services often employ either simple local models on the mobile or complex remote models on the cloud. However, recent studies have shown that partitioning the DNN computations between the mobile and cloud can increase the latency and energy efficiencies. In this paper, we propose an efficient, adaptive, and practical engine, JointDNN, for collaborative computation between a mobile device and cloud for DNNs in both inference and training phase. JointDNN not only provides an energy and performance efficient method of querying DNNs for the mobile side but also benefits the cloud server by reducing the amount of its workload and communications compared to the cloud-only approach. Given the DNN architecture, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.