Asynchronous MultiAgent Reinforcement Learning for 5G Routing under Side Constraints
Sebastian Racedo, Brigitte Jaumard, Oscar Delgado, Meysam Masoudi

TL;DR
This paper introduces an asynchronous multi-agent reinforcement learning framework for 5G routing that improves scalability and robustness by enabling independent, service-specific agents to coordinate in real-time network environments.
Contribution
The paper proposes a novel AMARL framework with independent PPO agents for each service, enhancing scalability and feasibility in 5G routing under side constraints.
Findings
Achieves similar Grade of Service and latency as baseline
Reduces training time significantly
Improves robustness to demand shifts
Abstract
Networks in the current 5G and beyond systems increasingly carry heterogeneous traffic with diverse quality-of-service constraints, making real-time routing decisions both complex and time-critical. A common approach, such as a heuristic with human intervention or training a single centralized RL policy or synchronizing updates across multiple learners, struggles with scalability and straggler effects. We address this by proposing an asynchronous multi-agent reinforcement learning (AMARL) framework in which independent PPO agents, one per service, plan routes in parallel and commit resource deltas to a shared global resource environment. This coordination by state preserves feasibility across services and enables specialization for service-specific objectives. We evaluate the method on an O-RAN like network simulation using nearly real-time traffic data from the city of Montreal. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced MIMO Systems Optimization · Vehicular Ad Hoc Networks (VANETs)
