Performance versus Complexity Study of Neural Network Equalizers in Coherent Optical Systems
Pedro J. Freire, Yevhenii Osadchuk, Bernhard Spinnler, Antonio Napoli,, Wolfgang Schairer, Nelson Costa, Jaroslaw E. Prilepsky, Sergei K. Turitsyn

TL;DR
This study compares the performance and computational complexity of various neural network architectures for nonlinear channel equalization in coherent optical systems, highlighting the CNN+biLSTM as the most effective in high-complexity scenarios.
Contribution
It introduces a comprehensive complexity-performance comparison of neural network equalizers, including novel combinations like CNN+biLSTM, with a detailed complexity derivation applicable to diverse systems.
Findings
CNN+biLSTM achieves the highest Q-factor improvement of 2.9 dB.
Three-layer perceptron performs best under low complexity constraints.
Complexity analysis is generalizable to various physical and engineering systems.
Abstract
We present the results of the comparative analysis of the performance versus complexity for several types of artificial neural networks (NNs) used for nonlinear channel equalization in coherent optical communication systems. The comparison has been carried out using an experimental set-up with transmission dominated by the Kerr nonlinearity and component imperfections. For the first time, we investigate the application to the channel equalization of the convolution layer (CNN) in combination with a bidirectional long short-term memory (biLSTM) layer and the design combining CNN with a multi-layer perceptron. Their performance is compared with the one delivered by the previously proposed NN equalizer models: one biLSTM layer, three-dense-layer perceptron, and the echo state network. Importantly, all architectures have been initially optimized by a Bayesian optimizer. We present the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Bidirectional LSTM · Convolution
