Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing

Bin Chen; Wenbo Yu; Qinshan Zhang; Tianqu Zhuang; Hao Wu; Yong Jiang; Shu-Tao Xia

arXiv:2411.15702·cs.IT·March 30, 2026

Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing

Bin Chen, Wenbo Yu, Qinshan Zhang, Tianqu Zhuang, Hao Wu, Yong Jiang, Shu-Tao Xia

PDF

TL;DR

Editable-DeepSC introduces a cross-modal semantic communication framework for facial editing that preserves semantic information, improves editing quality, and reduces bandwidth under noisy conditions.

Contribution

It proposes a novel joint editing-channel coding scheme with inversion-based semantic coding and SNR-aware channel adaptation for facial editing tasks.

Findings

01

Achieves superior editing quality compared to traditional methods.

02

Reduces transmission bandwidth significantly, even at high resolutions.

03

Maintains performance under high noise and out-of-distribution scenarios.

Abstract

Interactive computer vision (CV) plays a crucial role in various real-world applications, whose performance is highly dependent on communication networks. Nonetheless, the data-oriented characteristics of conventional communications often do not align with the special needs of interactive CV tasks. To alleviate this issue, the recently emerged semantic communications only transmit task-related semantic information and exhibit a promising landscape to address this problem. However, the communication challenges associated with Semantic Facial Editing, one of the most important interactive CV applications on social media, still remain largely unexplored. In this paper, we fill this gap by proposing Editable-DeepSC, a novel cross-modal semantic communication approach for facial editing. Firstly, we theoretically discuss different transmission schemes that separately handle communications…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.