Avoiding normalization uncertainties in deep learning architectures for end-to-end communication
Simon Bos, Evgenii Vinogradov, Sofie Pollin

TL;DR
This paper identifies normalization errors in deep learning-based communication systems and proposes a modified architecture that improves performance by better satisfying power normalization constraints, especially with small batch sizes.
Contribution
The paper introduces a simple architectural modification that reduces normalization errors in end-to-end deep learning communication models, enhancing their performance.
Findings
Modified architecture reduces normalization errors.
Improved performance with small batch sizes.
Enhanced adherence to power constraints.
Abstract
Recently, deep learning is considered to optimize the end-to-end performance of digital communication systems. The promise of learning a digital communication scheme from data is attractive, since this makes the scheme adaptable and precisely tunable to many scenarios and channel models. In this paper, we analyse a widely used neural network architecture and show that the training of the end-to-end architecture suffers from normalization errors introduced by an average power constraint. To solve this issue, we propose a modified architecture: shifting the batch slicing after the normalization layer. This approach meets the normalization constraints better, especially in the case of small batch sizes. Finally, we experimentally demonstrate that our modified architecture leads to significantly improved performance of trained models, even for large batch sizes where normalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Wireless Signal Modulation Classification · Blind Source Separation Techniques
