Avoiding normalization uncertainties in deep learning architectures for   end-to-end communication

Simon Bos; Evgenii Vinogradov; Sofie Pollin

arXiv:2107.07868·eess.SP·July 19, 2021·MeditCom

Avoiding normalization uncertainties in deep learning architectures for end-to-end communication

Simon Bos, Evgenii Vinogradov, Sofie Pollin

PDF

Open Access

TL;DR

This paper identifies normalization errors in deep learning-based communication systems and proposes a modified architecture that improves performance by better satisfying power normalization constraints, especially with small batch sizes.

Contribution

The paper introduces a simple architectural modification that reduces normalization errors in end-to-end deep learning communication models, enhancing their performance.

Findings

01

Modified architecture reduces normalization errors.

02

Improved performance with small batch sizes.

03

Enhanced adherence to power constraints.

Abstract

Recently, deep learning is considered to optimize the end-to-end performance of digital communication systems. The promise of learning a digital communication scheme from data is attractive, since this makes the scheme adaptable and precisely tunable to many scenarios and channel models. In this paper, we analyse a widely used neural network architecture and show that the training of the end-to-end architecture suffers from normalization errors introduced by an average power constraint. To solve this issue, we propose a modified architecture: shifting the batch slicing after the normalization layer. This approach meets the normalization constraints better, especially in the case of small batch sizes. Finally, we experimentally demonstrate that our modified architecture leads to significantly improved performance of trained models, even for large batch sizes where normalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Wireless Signal Modulation Classification · Blind Source Separation Techniques