Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States
Jierui Zhang, Jianhao Huang, Kaibin Huang

TL;DR
This paper develops a theoretical framework for integrated communication and computation in edge AI, proposing a channel-adaptive algorithm that optimizes inference throughput by adjusting model complexity and feature compression based on channel states.
Contribution
It introduces a tractable accuracy model for end-to-end inference and designs a channel-adaptive AI algorithm that jointly adapts feature compression and model complexity to maximize throughput.
Findings
The accuracy model effectively captures the impact of channel distortion and model complexity.
The proposed algorithm outperforms fixed-complexity methods in inference throughput.
Experimental results validate the efficiency of the adaptive approach.
Abstract
\emph{Integrated communication and computation} (IC) has emerged as a new paradigm for enabling efficient edge inference in sixth-generation (6G) networks. However, the design of IC technologies is hindered by the lack of a tractable theoretical framework for characterizing \emph{end-to-end} (E2E) inference performance. The metric is highly complicated as it needs to account for both channel distortion and artificial intelligence (AI) model architecture and computational complexity. In this work, we address this challenge by developing a tractable analytical model for E2E inference accuracy and leveraging it to design a \emph{channel-adaptive AI} algorithm that maximizes inference throughput, referred to as the edge processing rate (EPR), under latency and accuracy constraints. Specifically, we consider an edge inference system in which a server deploys a backbone model with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Wireless Communication Technologies · Advanced MIMO Systems Optimization · IoT Networks and Protocols
