TL;DR
This paper proposes a deep learning-based joint source-channel coding method for natural language, improving semantic preservation and reducing word error rates over traditional separate coding approaches in finite-length scenarios.
Contribution
It introduces a novel deep learning framework that embeds sentences into a semantic space and performs joint coding, outperforming traditional separate source and channel coding methods.
Findings
Lower word error rates achieved with deep learning encoder-decoder
Semantic preservation of sentences demonstrated
Outperforms traditional separate coding in finite-length scenarios
Abstract
We consider the problem of joint source and channel coding of structured data such as natural language over a noisy channel. The typical approach to this problem in both theory and practice involves performing source coding to first compress the text and then channel coding to add robustness for the transmission across the channel. This approach is optimal in terms of minimizing end-to-end distortion with arbitrarily large block lengths of both the source and channel codes when transmission is over discrete memoryless channels. However, the optimality of this approach is no longer ensured for documents of finite length and limitations on the length of the encoding. We will show in this scenario that we can achieve lower word error rates by developing a deep learning based encoder and decoder. While the approach of separate source and channel coding would minimize bit error rates, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
