Semantic Communications for Digital Signals via Carrier Images
Zhigang Yan, Dong Li

TL;DR
This paper introduces a novel semantic communication method that encodes digital signals into images using mask locations, leveraging pre-trained Masked Autoencoders to efficiently transmit digital data alongside images with minimal overhead.
Contribution
It proposes a new SemCom framework that encodes digital signals as masked regions on images and utilizes MAE for joint image and digital signal transmission, reducing communication costs.
Findings
Maintains reliable digital signal transmission at high mask ratios.
Reduces transmission overhead by encoding mask tokens as sparse matrices.
Uses pre-trained MAE for efficient encoding and decoding.
Abstract
Most of current semantic communication (SemCom) frameworks focus on the image transmission, which, however, do not address the problem on how to deliver digital signals without any semantic features. This paper proposes a novel SemCom approach to transmit digital signals by using the image as the carrier signal. Specifically, the proposed approach encodes the digital signal as a binary stream and maps it to mask locations on an image. This allows binary data to be visually represented, enabling the use of existing model, pre-trained Masked Autoencoders (MAE), which are optimized for masked image reconstruction, as the SemCom encoder and decoder. Since MAE can both process and recover masked images, this approach allows for the joint transmission of digital signals and images without incurring significant communication overheads. In addition, considering the mask tokens transmission…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems · Cognitive Computing and Networks · Neural Networks and Applications
MethodsMasked autoencoder · Focus
