Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing
Kavan Fatehi, Mostafa Rahmani Ghourtani, Amir Sonee, Poonam Yadav, Alessandra M Russo, Hamed Ahmadi, Radu Calinescu

TL;DR
This paper introduces AE-MAPPO, an interpretable multi-agent reinforcement learning framework with attention mechanisms that effectively resolves latency spikes in 6G RAN slicing, ensuring SLA compliance and real-time decision-making.
Contribution
The paper presents AE-MAPPO, a novel attention-enhanced multi-agent RL approach that provides faithful explanations and improves latency spike resolution in 6G RAN slicing.
Findings
Resolves latency spikes in 18ms
Restores latency to 0.98ms with 99.9999% reliability
Reduces troubleshooting time by 93%
Abstract
Sixth-generation (6G) radio access networks (RANs) must enforce strict service-level agreements (SLAs) for heterogeneous slices, yet sudden latency spikes remain difficult to diagnose and resolve with conventional deep reinforcement learning (DRL) or explainable RL (XRL). We propose \emph{Attention-Enhanced Multi-Agent Proximal Policy Optimization (AE-MAPPO)}, which integrates six specialized attention mechanisms into multi-agent slice control and surfaces them as zero-cost, faithful explanations. The framework operates across O-RAN timescales with a three-phase strategy: predictive, reactive, and inter-slice optimization. A URLLC case study shows AE-MAPPO resolves a latency spike in ms, restores latency to ms with reliability, and reduces troubleshooting time by while maintaining eMBB and mMTC continuity. These results confirm AE-MAPPO's ability to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware-Defined Networks and 5G · Advanced MIMO Systems Optimization · Cognitive Radio Networks and Spectrum Sensing
