Importance-Aware Image Segmentation-based Semantic Communication for   Autonomous Driving

Jie Lv; Haonan Tong; Qiang Pan; Zhilong Zhang; Xinxin He; Tao Luo,; Changchuan Yin

arXiv:2401.10153·cs.NI·January 19, 2024·2 cites

Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo,, Changchuan Yin

PDF

Open Access

TL;DR

This paper introduces VIS-SemCom, a semantic communication system for autonomous driving that emphasizes important objects in image segmentation to improve safety and efficiency, utilizing a Swin Transformer-based codec and importance-aware strategies.

Contribution

The paper presents a novel importance-aware semantic communication system with a Swin Transformer-based codec and multi-scale semantic extraction tailored for autonomous driving.

Findings

01

Achieves nearly 6 dB coding gain with 60% mIoU

02

Reduces data transmission by up to 70% with 60% mIoU

03

Improves important object segmentation IoU by 4%

Abstract

This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, pedestrians and obstacles) is more crucial than that of other objects to guarantee driving safety. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image segmentation features of important objects are transmitted to reduce transmission redundancy. First, to accurately extract image semantics, we develop a semantic codec based on Swin Transformer architecture, which expands the perceptual field thus improving the segmentation accuracy. Next, we propose a multi-scale semantic extraction scheme via assigning the number of Swin Transformer blocks for diverse resolution features, thus highlighting the important objects' accuracy. Furthermore, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Brain Tumor Detection and Classification

MethodsMulti-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Dropout · Linear Layer · Byte Pair Encoding · Softmax · Adam · Residual Connection