Collaborative Inference for AI-Empowered IoT Devices
Nir Shlezinger, Ivan V. Bajic

TL;DR
This paper reviews strategies for enabling AI inference on IoT edge devices through collaboration, balancing accuracy, latency, privacy, and connectivity, and discusses future research directions.
Contribution
It systematically compares existing collaborative inference methods for IoT devices, highlighting their characteristics and identifying future research challenges.
Findings
Multiple inference schemes are suitable for different mobility and connectivity scenarios.
Collaborative inference can reduce latency and privacy concerns compared to cloud-only methods.
Trade-offs exist between inference accuracy, communication cost, and privacy in different strategies.
Abstract
Artificial intelligence (AI) technologies, and particularly deep learning systems, are traditionally the domain of large-scale cloud servers, which have access to high computational and energy resources. Nonetheless, in Internet-of-Things (IoT) networks, the interface with the real-world is carried out using edge devices that are limited in hardware and can communicate. The conventional approach to provide AI processing to data collected by edge devices involves sending samples to the cloud, at the cost of latency, communication, connectivity, and privacy concerns. Consequently, recent years have witnessed a growing interest in enabling AI-aided inference on edge devices by leveraging their communication capabilities to establish collaborative inference. This article reviews candidate strategies for facilitating the transition of AI to IoT devices via collaboration. We identify the need…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Privacy-Preserving Technologies in Data · Age of Information Optimization
