Graph Transformers and Stabilized Reinforcement Learning for Large-Scale Dynamic Routing Modulation and Spectrum Allocation in Elastic Optical Networks

Michael Doherty; Alejandra Beghelli; Laura Toni

arXiv:2605.02075·cs.NI·May 19, 2026

Graph Transformers and Stabilized Reinforcement Learning for Large-Scale Dynamic Routing Modulation and Spectrum Allocation in Elastic Optical Networks

Michael Doherty, Alejandra Beghelli, Laura Toni

PDF

1 Repo

TL;DR

This paper introduces a novel stable reinforcement learning approach using graph transformers for large-scale dynamic routing and spectrum allocation in elastic optical networks, outperforming existing methods.

Contribution

It is the first to successfully train a transformer model with reinforcement learning for this task, demonstrating scalability and improved performance on real network topologies.

Findings

01

Supports up to 13% more traffic load than benchmarks.

02

Able to handle networks with up to 143 nodes and 362 links.

03

Achieves stable RL training with novel techniques and GPU acceleration.

Abstract

Reinforcement learning (RL) has been widely applied to dynamic routing, modulation and spectrum assignment (RMSA) in optical networks, yet no prior work has trained a transformer model for this task. We attribute this to the high data and compute requirements of transformers and potential training instabilities with RL. We address this gap by combining recent advances from the machine learning literature (rotary positional encodings for graph-structured data, off-policy invalid action masking, and valid mass regularization) with GPU-accelerated simulation to achieve, for the first time, stable RL training of a transformer for dynamic RMSA. We demonstrate, through systematic benchmarking against previous RL methods and heuristic algorithms, that ours is the first RL method to exceed all benchmarks, increasing the supportable traffic load by up to 13%. To demonstrate the scalability of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

micdoh/XLRON
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.