Generalization in Reinforcement Learning for Radio Access Networks

Burak Demirel; Yu Wang; Cristian Tatino; Pablo Soldati

arXiv:2507.06602·cs.LG·January 29, 2026

Open Access

TL;DR

This paper introduces a generalization-focused reinforcement learning framework for Radio Access Networks that improves performance across diverse and dynamic 5G scenarios through robust state reconstruction, domain randomization, and distributed training.

Contribution

The paper presents a novel RL approach that improves generalization in RAN control through graph-based state encoding, domain randomization, and a scalable distributed training architecture.
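
To make the idea of graph-based state encoding concrete, here is a minimal sketch in which each cell's static features are combined with the mean of its neighbors' features. This is a deliberately simplified stand-in (plain mean aggregation), not the architecture used in the paper; the function name `aggregate_neighbors`, the cell identifiers, and the feature layout are all hypothetical.

```python
from typing import Dict, List


def aggregate_neighbors(
    features: Dict[str, List[float]],
    neighbors: Dict[str, List[str]],
) -> Dict[str, List[float]]:
    """One round of mean aggregation over a cell topology graph.

    Each cell's embedding is its own feature vector concatenated with the
    element-wise mean of its neighbors' feature vectors (zeros if isolated).
    """
    embeddings: Dict[str, List[float]] = {}
    for cell, own in features.items():
        nbrs = neighbors.get(cell, [])
        if nbrs:
            mean = [
                sum(features[n][i] for n in nbrs) / len(nbrs)
                for i in range(len(own))
            ]
        else:
            mean = [0.0] * len(own)
        embeddings[cell] = own + mean
    return embeddings


# Hypothetical static attributes per cell: [bandwidth_mhz, antenna_ports].
cell_features = {
    "cell_a": [20.0, 64.0],
    "cell_b": [10.0, 4.0],
    "cell_c": [40.0, 32.0],
}
# Hypothetical topology: cell_a neighbors both other cells.
cell_neighbors = {
    "cell_a": ["cell_b", "cell_c"],
    "cell_b": ["cell_a"],
    "cell_c": ["cell_a"],
}

embeddings = aggregate_neighbors(cell_features, cell_neighbors)
```

Because the aggregation depends only on each cell's neighborhood, the same function applies unchanged to deployments with a different number of cells, which is the property that makes graph encodings attractive for generalization.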

Findings

01. Achieves ~10% throughput improvement over the baseline in 5G benchmarks.

02. Attains >20% spectral efficiency gains under high-mobility conditions.

03. Graph-based models outperform MLP baselines with 30% higher throughput in multi-cell deployments.

Abstract

Modern radio access networks (RANs) operate in highly dynamic and heterogeneous environments, where hand-tuned, rule-based radio resource management (RRM) algorithms often underperform. While reinforcement learning (RL) can surpass such heuristics in constrained settings, the diversity of deployments and unpredictable radio conditions introduce major generalization challenges. Data-driven policies frequently overfit to training conditions, degrading performance in unseen scenarios. To address this, we propose a generalization-centered RL framework for RAN control that: (i) robustly reconstructs dynamically varying states from partial and noisy observations, while encoding static and semi-static information, such as radio nodes, cell attributes, and their topology, through graph representations; (ii) applies domain randomization to broaden the training distribution; and (iii) distributes data generation across multiple actors while centralizing training in a…
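
Item (ii) of the abstract, domain randomization, can be illustrated with a short sketch: before each training episode, the simulator settings are redrawn from broad ranges so the policy does not overfit to one deployment. The parameter names, ranges, and the `ScenarioConfig` class below are assumptions made for illustration and are not taken from the paper.

```python
import random
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ScenarioConfig:
    """Hypothetical per-episode simulator settings (illustrative names)."""
    num_users: int
    user_speed_kmh: float
    carrier_ghz: float
    traffic: str


# Assumed sampling ranges; chosen for illustration only.
NUM_USERS_RANGE = (1, 50)
SPEED_RANGE_KMH = (0.0, 120.0)
CARRIERS_GHZ = (0.7, 2.1, 3.5)
TRAFFIC_TYPES = ("full_buffer", "embb", "mixed")


def sample_scenario(rng: random.Random) -> ScenarioConfig:
    """Draw one randomized scenario for the next training episode."""
    return ScenarioConfig(
        num_users=rng.randint(*NUM_USERS_RANGE),      # inclusive bounds
        user_speed_kmh=rng.uniform(*SPEED_RANGE_KMH),
        carrier_ghz=rng.choice(CARRIERS_GHZ),
        traffic=rng.choice(TRAFFIC_TYPES),
    )


def sample_training_scenarios(num_episodes: int, seed: int = 0) -> List[ScenarioConfig]:
    """Generate a reproducible list of randomized scenarios, one per episode."""
    rng = random.Random(seed)
    return [sample_scenario(rng) for _ in range(num_episodes)]
```

In a distributed setup such as the one described in item (iii), each actor could call `sample_scenario` with its own seeded generator, so that the actors jointly cover the training distribution while remaining reproducible.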

Taxonomy

Topics: Software-Defined Networks and 5G · Advanced MIMO Systems Optimization · Wireless Networks and Protocols