Multi-Armed Bandits for Decentralized AP selection in Enterprise WLANs

Marc Carrascosa; Boris Bellalta

arXiv:2001.00392·cs.NI·May 29, 2020

Multi-Armed Bandits for Decentralized AP selection in Enterprise WLANs

Marc Carrascosa, Boris Bellalta

PDF

TL;DR

This paper introduces a decentralized reinforcement learning method using Multi-Armed Bandits for AP selection in dense WiFi networks, improving load balancing and resource utilization.

Contribution

It proposes a novel Opportunistic epsilon-greedy with Stickiness approach for decentralized AP selection, enhancing convergence speed and network efficiency.

Findings

01

Faster convergence to optimal APs with the proposed method.

02

Improved network resource utilization and load balancing.

03

Effective in non-stationary environments with station arrivals.

Abstract

WiFi densification leads to the existence of multiple overlapping coverage areas, which allows user stations (STAs) to choose between different Access Points (APs). The standard WiFi association method makes the STAs select the AP with the strongest signal, which in many cases leads to underutilization of some APs while overcrowding others. To mitigate this situation, Reinforcement Learning techniques such as Multi-Armed Bandits can be used to dynamically learn the optimal mapping between APs and STAs, and so redistribute the STAs among the available APs accordingly. This is an especially challenging problem since the network response observed by a given STA depends on the behavior of the others, and so it is very difficult to predict without a global view of the network. In this paper, we focus on solving this problem in a decentralized way, where STAs independently explore the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.