SemanticNN: Compressive and Error-Resilient Semantic Offloading for Extremely Weak Devices
Jiaming Huang, Yi Gao, Fuchang Pan, Renjie Li, and Wei Dong

TL;DR
SemanticNN is a novel semantic codec designed for weak IoT devices that enables error-resilient, efficient offloading of AI inference tasks by tolerating bit errors and adapting to dynamic network conditions, significantly reducing data transmission.
Contribution
It introduces a semantic-level error-resilient codec with adaptive decoding, a new training strategy, and asymmetry compensation to improve offloading efficiency on resource-constrained devices.
Findings
Reduces feature transmission volume by up to 344.83x.
Maintains high inference accuracy under varying error rates.
Demonstrates effectiveness on STM32 with multiple models and datasets.
Abstract
With the rapid growth of the Internet of Things (IoT), integrating artificial intelligence (AI) on extremely weak embedded devices has garnered significant attention, enabling improved real-time performance and enhanced data privacy. However, the resource limitations of such devices and unreliable network conditions necessitate error-resilient device-edge collaboration systems. Traditional approaches focus on bit-level transmission correctness, which can be inefficient under dynamic channel conditions. In contrast, we propose SemanticNN, a semantic codec that tolerates bit-level errors in pursuit of semantic-level correctness, enabling compressive and resilient collaborative inference offloading under strict computational and communication constraints. It incorporates a Bit Error Rate (BER)-aware decoder that adapts to dynamic channel conditions and a Soft Quantization (SQ)-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIoT and Edge/Fog Computing · Advanced Neural Network Applications · Age of Information Optimization
