TL;DR
This paper introduces a novel stable reinforcement learning approach using graph transformers for large-scale dynamic routing and spectrum allocation in elastic optical networks, outperforming existing methods.
Contribution
It is the first to successfully train a transformer model with reinforcement learning for this task, demonstrating scalability and improved performance on real network topologies.
Findings
Supports up to 13% more traffic load than benchmarks.
Able to handle networks with up to 143 nodes and 362 links.
Achieves stable RL training with novel techniques and GPU acceleration.
Abstract
Reinforcement learning (RL) has been widely applied to dynamic routing, modulation and spectrum assignment (RMSA) in optical networks, yet no prior work has trained a transformer model for this task. We attribute this to the high data and compute requirements of transformers and potential training instabilities with RL. We address this gap by combining recent advances from the machine learning literature (rotary positional encodings for graph-structured data, off-policy invalid action masking, and valid mass regularization) with GPU-accelerated simulation to achieve, for the first time, stable RL training of a transformer for dynamic RMSA. We demonstrate, through systematic benchmarking against previous RL methods and heuristic algorithms, that ours is the first RL method to exceed all benchmarks, increasing the supportable traffic load by up to 13%. To demonstrate the scalability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
