Hierarchical Reinforcement Learning for Next Generation of Multi-AP Coordinated Spatial Reuse
Ziru Chen, Salvatore Talarico, Qing Xia, Xihan Peng, Xing Hao, Lin X. Cai

TL;DR
This paper introduces a hierarchical reinforcement learning framework using a two-layer Multi-Armed Bandit algorithm to optimize coordinated spatial reuse in multi-AP Wi-Fi networks, improving throughput and fairness.
Contribution
It proposes a novel two-layer MAB-based approach for multi-AP coordination, reducing overhead and complexity compared to classical methods.
Findings
Enhanced sum-throughput in simulations.
Improved fairness among network nodes.
Robust performance across various network scenarios.
Abstract
In next generation of Wi-Fi networks Multiple Access Point Coordination (MAPC) is poised to significantly enhance the network performance by enabling a set of Access Points (APs) to coordinate with each other through advanced coordinating schemes so that to reduce inter-AP contention and congestion. This paper focuses on defining a framework to facilitate the coordination across multi-APs when these employ Coordinated Spatial Reuse (C-SR). In this case, the coordinating APs may need to reciprocally adjust their scheduling strategy, power control and link adaptation to meet specific Quality of Service (QoS) requirements, which by using classical approaches leads to high overhead due to negotiations needed across APs, and requires complex solutions in order to properly optimize the network across all the parameters in play. In this matter, a two layer Multi-Armed Bandit (MAB) algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWireless Networks and Protocols · Advanced MIMO Systems Optimization · Cognitive Radio Networks and Spectrum Sensing
