DDPG-E2E: A Novel Policy Gradient Approach for End-to-End Communication   Systems

Bolun Zhang; Nguyen Van Huynh; Dinh Thai Hoang; Diep N. Nguyen; and; Quoc-Viet Pham

arXiv:2404.06257·cs.NI·October 29, 2024·1 cites

DDPG-E2E: A Novel Policy Gradient Approach for End-to-End Communication Systems

Bolun Zhang, Nguyen Van Huynh, Dinh Thai Hoang, Diep N. Nguyen, and, Quoc-Viet Pham

PDF

Open Access

TL;DR

This paper introduces DDPG-E2E, a reinforcement learning-based end-to-end communication system that trains transceivers without prior channel knowledge, improving performance for large block lengths through a reward mechanism and CNN architecture.

Contribution

The paper proposes a novel DDPG-based E2E learning approach that eliminates the need for prior channel information and enhances performance with large block lengths.

Findings

01

Significant reduction in block error rate compared to existing methods

02

Effective training without prior channel knowledge

03

Improved performance with large block lengths

Abstract

The End-to-end (E2E) learning-based approach has great potential to reshape the existing communication systems by replacing the transceivers with deep neural networks. To this end, the E2E learning approach needs to assume the availability of prior channel information to mathematically formulate a differentiable channel layer for the backpropagation (BP) of the error gradients, thereby jointly optimizing the transmitter and the receiver. However, accurate and instantaneous channel state information is hardly obtained in practical wireless communication scenarios. Moreover, the existing E2E learning-based solutions exhibit limited performance in data transmissions with large block lengths. In this article, these practical issues are addressed by our proposed deep deterministic policy gradient-based E2E communication system. In particular, the proposed solution utilizes a reward feedback…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Grid Security and Resilience