Asynchronous MultiAgent Reinforcement Learning for 5G Routing under Side Constraints

Sebastian Racedo; Brigitte Jaumard; Oscar Delgado; Meysam Masoudi

arXiv:2602.00035·cs.NI·February 3, 2026

Asynchronous MultiAgent Reinforcement Learning for 5G Routing under Side Constraints

Sebastian Racedo, Brigitte Jaumard, Oscar Delgado, Meysam Masoudi

PDF

Open Access

TL;DR

This paper introduces an asynchronous multi-agent reinforcement learning framework for 5G routing that improves scalability and robustness by enabling independent, service-specific agents to coordinate in real-time network environments.

Contribution

The paper proposes a novel AMARL framework with independent PPO agents for each service, enhancing scalability and feasibility in 5G routing under side constraints.

Findings

01

Achieves similar Grade of Service and latency as baseline

02

Reduces training time significantly

03

Improves robustness to demand shifts

Abstract

Networks in the current 5G and beyond systems increasingly carry heterogeneous traffic with diverse quality-of-service constraints, making real-time routing decisions both complex and time-critical. A common approach, such as a heuristic with human intervention or training a single centralized RL policy or synchronizing updates across multiple learners, struggles with scalability and straggler effects. We address this by proposing an asynchronous multi-agent reinforcement learning (AMARL) framework in which independent PPO agents, one per service, plan routes in parallel and commit resource deltas to a shared global resource environment. This coordination by state preserves feasibility across services and enables specialization for service-specific objectives. We evaluate the method on an O-RAN like network simulation using nearly real-time traffic data from the city of Montreal. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware-Defined Networks and 5G · Advanced MIMO Systems Optimization · Vehicular Ad Hoc Networks (VANETs)