Synchronous Multi-modal Semantic Communication System with Packet-level   Coding

Yun Tian; Jingkai Ying; Zhijin Qin; Ye Jin; Xiaoming Tao

arXiv:2408.04535·eess.IV·August 13, 2024

Synchronous Multi-modal Semantic Communication System with Packet-level Coding

Yun Tian, Jingkai Ying, Zhijin Qin, Ye Jin, Xiaoming Tao

PDF

Open Access

TL;DR

This paper introduces a synchronous multimodal semantic communication system with packet-level coding, improving synchronization, reducing overhead, and maintaining high quality in transmitting video and speech over lossy networks.

Contribution

It proposes a novel SyncSC system with packet-level FEC and a BERT-based text packet loss concealment method, enhancing multimodal synchronization and robustness.

Findings

01

Achieves high-quality synchronous transmission over lossy channels.

02

Reduces transmission overhead compared to traditional methods.

03

Maintains semantic and temporal synchronization effectively.

Abstract

Although the semantic communication with joint semantic-channel coding design has shown promising performance in transmitting data of different modalities over physical layer channels, the synchronization and packet-level forward error correction of multimodal semantics have not been well studied. Due to the independent design of semantic encoders, synchronizing multimodal features in both the semantic and time domains is a challenging problem. In this paper, we take the facial video and speech transmission as an example and propose a Synchronous Multimodal Semantic Communication System (SyncSC) with Packet-Level Coding. To achieve semantic and time synchronization, 3D Morphable Mode (3DMM) coefficients and text are transmitted as semantics, and we propose a semantic codec that achieves similar quality of reconstruction and synchronization with lower bandwidth, compared to traditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCognitive Computing and Networks