Task-Oriented Multi-User Semantic Communications
Huiqiang Xie, Zhijin Qin, Xiaoming Tao, Khaled B. Letaief

TL;DR
This paper develops Transformer-based multi-user semantic communication systems for single-modal and multimodal data, demonstrating improved robustness, efficiency, and task performance over traditional methods.
Contribution
It introduces a unified Transformer framework for multi-user semantic communications across different tasks and modalities, including novel models for image retrieval, translation, and VQA.
Findings
Models outperform traditional methods in robustness and efficiency.
DeepSC-IR and DeepSC-MT effectively optimize semantic embedding and meaning recovery.
DeepSC-VQA successfully fuses multimodal data for visual question answering.
Abstract
While semantic communications have shown the potential in the case of single-modal single-users, its applications to the multi-user scenario remain limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We will adopt three intelligent tasks, including, image retrieval, machine translation, and visual question answering (VQA) as the transmission goal of semantic communication systems. We will then propose a Transformer based unique framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we will propose two Transformer based models, named, DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Signal Modulation Classification · Advanced Image and Video Retrieval Techniques · Advanced Data and IoT Technologies
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Position-Wise Feed-Forward Layer · Adam · Residual Connection · Layer Normalization · Absolute Position Encodings · Dropout
