Enabling Deep Learning on Edge Devices

Zhongnan Qu

arXiv:2210.03204·cs.LG·October 10, 2022

Enabling Deep Learning on Edge Devices

Zhongnan Qu

PDF

TL;DR

This paper explores methods to adapt and optimize deep neural networks for deployment on resource-constrained edge devices, addressing challenges in model size, resource consumption, and performance across various edge intelligence scenarios.

Contribution

It introduces tailored methodologies for reducing DNN redundancy and enabling deep learning in four distinct edge scenarios, enhancing deployment feasibility.

Findings

01

Reduced DNN model sizes suitable for edge devices

02

Improved resource efficiency without significant accuracy loss

03

Frameworks for inference, adaptation, learning, and edge-server systems

Abstract

Deep neural networks (DNNs) have succeeded in many different perception tasks, e.g., computer vision, natural language processing, reinforcement learning, etc. The high-performed DNNs heavily rely on intensive resource consumption. For example, training a DNN requires high dynamic memory, a large-scale dataset, and a large number of computations (a long training time); even inference with a DNN also demands a large amount of static storage, computations (a long inference time), and energy. Therefore, state-of-the-art DNNs are often deployed on a cloud server with a large number of super-computers, a high-bandwidth communication bus, a shared storage infrastructure, and a high power supplement. Recently, some new emerging intelligent applications, e.g., AR/VR, mobile assistants, Internet of Things, require us to deploy DNNs on resource-constrained edge devices. Compare to a cloud…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.