Enabling All In-Edge Deep Learning: A Literature Review
Praveen Joshi, Mohammed Hasanuzzaman, Chandra Thapa, Haithem Afli, and, Ted Scully

TL;DR
This literature review explores the all in-edge deep learning paradigm, focusing on architectures, enabling technologies, model adaptation, performance metrics, and future research challenges for deploying DL solely on edge servers.
Contribution
It provides a comprehensive overview of all in-edge deep learning, including architectures, enabling techniques, adaptation methods, and evaluation metrics, highlighting open challenges in the field.
Findings
All in-edge architectures include centralized, decentralized, and distributed models.
Enabling technologies like model parallelism and split learning facilitate edge DL deployment.
Key performance metrics for all in-edge DL include latency, communication cost, and resource utilization.
Abstract
In recent years, deep learning (DL) models have demonstrated remarkable achievements on non-trivial tasks such as speech recognition and natural language understanding. One of the significant contributors to its success is the proliferation of end devices that acted as a catalyst to provide data for data-hungry DL models. However, computing DL training and inference is the main challenge. Usually, central cloud servers are used for the computation, but it opens up other significant challenges, such as high latency, increased communication costs, and privacy concerns. To mitigate these drawbacks, considerable efforts have been made to push the processing of DL models to edge servers. Moreover, the confluence point of DL and edge has given rise to edge intelligence (EI). This survey paper focuses primarily on the fifth level of EI, called all in-edge level, where DL training and inference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Age of Information Optimization · Advanced Neural Network Applications
