Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving
Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo,, Changchuan Yin

TL;DR
This paper introduces VIS-SemCom, a semantic communication system for autonomous driving that emphasizes important objects in image segmentation to improve safety and efficiency, utilizing a Swin Transformer-based codec and importance-aware strategies.
Contribution
The paper presents a novel importance-aware semantic communication system with a Swin Transformer-based codec and multi-scale semantic extraction tailored for autonomous driving.
Findings
Achieves nearly 6 dB coding gain with 60% mIoU
Reduces data transmission by up to 70% with 60% mIoU
Improves important object segmentation IoU by 4%
Abstract
This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, pedestrians and obstacles) is more crucial than that of other objects to guarantee driving safety. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image segmentation features of important objects are transmitted to reduce transmission redundancy. First, to accurately extract image semantics, we develop a semantic codec based on Swin Transformer architecture, which expands the perceptual field thus improving the segmentation accuracy. Next, we propose a multi-scale semantic extraction scheme via assigning the number of Swin Transformer blocks for diverse resolution features, thus highlighting the important objects' accuracy. Furthermore, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Dropout · Linear Layer · Byte Pair Encoding · Softmax · Adam · Residual Connection
