Generalization in Reinforcement Learning for Radio Access Networks
Burak Demirel, Yu Wang, Cristian Tatino, Pablo Soldati

TL;DR
This paper introduces a generalization-focused reinforcement learning framework for Radio Access Networks that enhances performance across diverse and dynamic 5G scenarios by robust state reconstruction, domain randomization, and distributed training.
Contribution
It presents a novel RL approach that improves generalization in RAN control through graph-based state encoding, domain randomization, and scalable distributed training architecture.
Findings
Achieves ~10% throughput improvement over baseline in 5G benchmarks.
Attains >20% spectral efficiency gains under high mobility conditions.
Models outperform MLP baselines with 30% higher throughput in multi-cell deployments.
Abstract
Modern RAN operate in highly dynamic and heterogeneous environments, where hand-tuned, rule-based RRM algorithms often underperform. While RL can surpass such heuristics in constrained settings, the diversity of deployments and unpredictable radio conditions introduce major generalization challenges. Data-driven policies frequently overfit to training conditions, degrading performance in unseen scenarios. To address this, we propose a generalization-centered RL framework for RAN control that: (i) robustly reconstructs dynamically varying states from partial and noisy observations, while encoding static and semi-static information, such as radio nodes, cell attributes, and their topology, through graph representations; (ii) applies domain randomization to broaden the training distribution; and (iii) distributes data generation across multiple actors while centralizing training in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced MIMO Systems Optimization · Wireless Networks and Protocols
