Reinforcement Learning-powered Semantic Communication via Semantic Similarity
Kun Lu, Rongpeng Li, Xianfu Chen, Zhifeng Zhao, Honggang Zhang

TL;DR
This paper proposes SemanticRL, a reinforcement learning-based semantic communication system that preserves meaning rather than bit accuracy, addressing semantic gaps with a self-critic RL approach for stable, efficient learning and improved semantic noise handling.
Contribution
It introduces a novel RL-based semantic communication framework that learns from semantic similarity, overcoming non-differentiability and channel noise challenges, with demonstrated effectiveness on real datasets.
Findings
Outperforms traditional methods in semantic meaning preservation
Handles semantic noise more effectively than existing approaches
Shows strong generalization in real-life and image transmission scenarios
Abstract
We introduce a new semantic communication mechanism - SemanticRL, whose key idea is to preserve the semantic information instead of strictly securing the bit-level precision. Unlike previous methods that mainly concentrate on the network or structure design, we revisit the learning process and point out the semantic blindness of commonly used objective functions. To address this semantic gap, we introduce a schematic shift that learns from semantic similarity, instead of relying on conventional paired bit-level supervisions like cross entropy and bit error rate. However, developing such a semantic communication system is indeed a nontrivial task considering the non-differentiability of most semantic metrics as well as the instability from noisy channels. To further resolve these issues, we put forward a self-critic reinforcement learning (RL) solution which allows an efficient and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Signal Modulation Classification · Neural Networks and Reservoir Computing · Adversarial Robustness in Machine Learning
