Reinforcement-Learning based routing for packet-optical networks with hybrid telemetry
A. L. Garc\'ia Navarro, Nataliia Koneva, Alfonso S\'anchez-Maci\'an,, Jos\'e Alberto Hern\'andez, \'Oscar Gonz\'alez de Dios, J. M. Rivas-Moscoso

TL;DR
This paper introduces a reinforcement learning approach for dynamic routing in packet-optical networks, utilizing physical and link layer measurements to adapt to changing network conditions and optimize performance.
Contribution
It presents a novel RL-based routing algorithm that dynamically adapts to network changes using real-time telemetry data and provides an open-source implementation.
Findings
The RL algorithm effectively adapts to network condition changes.
It improves routing decisions based on real-time measurements.
The approach is validated through simulations or experiments.
Abstract
This article provides a methodology and open-source implementation of Reinforcement Learning algorithms for finding optimal routes in a packet-optical network scenario. The algorithm uses measurements provided by the physical layer (pre-FEC bit error rate and propagation delay) and the link layer (link load) to configure a set of latency-based rewards and penalties based on such measurements. Then, the algorithm executes Q-learning based on this set of rewards for finding the optimal routing strategies. It is further shown that the algorithm dynamically adapts to changing network conditions by re-calculating optimal policies upon either link load changes or link degradation as measured by pre-FEC BER.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Network Technologies · Optical Network Technologies · Advanced Photonic Communication Systems
MethodsSparse Evolutionary Training · Q-Learning
