Deep Neural Network-aided Soft-Demapping in Optical Coherent Systems: Regression versus Classification
Pedro J. Freire, Jaroslaw E. Prilepsky, Yevhenii Osadchuk, Sergei K., Turitsyn, Vahid Aref

TL;DR
This paper compares neural network-based classification and regression approaches for soft-demapping in nonlinear optical communication channels, revealing regression often outperforms classification in practical scenarios due to training challenges.
Contribution
It provides the first detailed analysis of the drawbacks of classification versus regression in neural network models for optical channel equalization and soft-demapping.
Findings
Regression-based models achieve higher Q-factors.
Cross-entropy loss can cause training difficulties.
Regression methods are more robust in practical nonlinear scenarios.
Abstract
We examine here what type of predictive modelling, classification, or regression, using neural networks (NN), fits better the task of soft-demapping based post-processing in coherent optical communications, where the transmission channel is nonlinear and dispersive. For the first time, we present possible drawbacks in using each type of predictive task in a machine learning context, considering the nonlinear coherent optical channel equalization/soft-demapping problem. We study two types of equalizers based on the feed-forward and recurrent NNs, for several transmission scenarios, in linear and nonlinear regimes of the optical channel. We point out that even though from the information theory perspective the cross-entropy loss (classification) is the most suitable option for our problem, the NN models based on the cross-entropy loss function can severely suffer from learning problems.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical Network Technologies · Neural Networks and Reservoir Computing · Spectroscopy Techniques in Biomedical and Chemical Research
