Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing
Bin Chen, Wenbo Yu, Qinshan Zhang, Tianqu Zhuang, Hao Wu, Yong Jiang, Shu-Tao Xia

TL;DR
Editable-DeepSC introduces a cross-modal semantic communication framework for facial editing that preserves semantic information, improves editing quality, and reduces bandwidth under noisy conditions.
Contribution
It proposes a novel joint editing-channel coding scheme with inversion-based semantic coding and SNR-aware channel adaptation for facial editing tasks.
Findings
Achieves superior editing quality compared to traditional methods.
Reduces transmission bandwidth significantly, even at high resolutions.
Maintains performance under high noise and out-of-distribution scenarios.
Abstract
Interactive computer vision (CV) plays a crucial role in various real-world applications, whose performance is highly dependent on communication networks. Nonetheless, the data-oriented characteristics of conventional communications often do not align with the special needs of interactive CV tasks. To alleviate this issue, the recently emerged semantic communications only transmit task-related semantic information and exhibit a promising landscape to address this problem. However, the communication challenges associated with Semantic Facial Editing, one of the most important interactive CV applications on social media, still remain largely unexplored. In this paper, we fill this gap by proposing Editable-DeepSC, a novel cross-modal semantic communication approach for facial editing. Firstly, we theoretically discuss different transmission schemes that separately handle communications…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
