Hierarchical Reinforcement Learning for Next Generation of Multi-AP Coordinated Spatial Reuse

Ziru Chen; Salvatore Talarico; Qing Xia; Xihan Peng; Xing Hao; Lin X. Cai

arXiv:2603.20647·cs.NI·March 24, 2026·GLOBECOM

Hierarchical Reinforcement Learning for Next Generation of Multi-AP Coordinated Spatial Reuse

Ziru Chen, Salvatore Talarico, Qing Xia, Xihan Peng, Xing Hao, Lin X. Cai

PDF

Open Access

TL;DR

This paper introduces a hierarchical reinforcement learning framework using a two-layer Multi-Armed Bandit algorithm to optimize coordinated spatial reuse in multi-AP Wi-Fi networks, improving throughput and fairness.

Contribution

It proposes a novel two-layer MAB-based approach for multi-AP coordination, reducing overhead and complexity compared to classical methods.

Findings

01

Enhanced sum-throughput in simulations.

02

Improved fairness among network nodes.

03

Robust performance across various network scenarios.

Abstract

In next generation of Wi-Fi networks Multiple Access Point Coordination (MAPC) is poised to significantly enhance the network performance by enabling a set of Access Points (APs) to coordinate with each other through advanced coordinating schemes so that to reduce inter-AP contention and congestion. This paper focuses on defining a framework to facilitate the coordination across multi-APs when these employ Coordinated Spatial Reuse (C-SR). In this case, the coordinating APs may need to reciprocally adjust their scheduling strategy, power control and link adaptation to meet specific Quality of Service (QoS) requirements, which by using classical approaches leads to high overhead due to negotiations needed across APs, and requires complex solutions in order to properly optimize the network across all the parameters in play. In this matter, a two layer Multi-Armed Bandit (MAB) algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWireless Networks and Protocols · Advanced MIMO Systems Optimization · Cognitive Radio Networks and Spectrum Sensing