# Optimizing Throughput Performance in Distributed MIMO Wi-Fi Networks   using Deep Reinforcement Learning

**Authors:** Neelakantan Nurani Krishnan, Eric Torkildson, Narayan Mandayam, and Dipankar Raychaudhuri, Enrico-Henrik Rantala, Klaus Doppler

arXiv: 1812.06885 · 2024-10-30

## TL;DR

This paper demonstrates that deep reinforcement learning can effectively optimize resource management in distributed MIMO Wi-Fi networks, significantly improving user throughput and fairness under dynamic conditions.

## Contribution

It introduces a novel DRL framework for joint channel assignment and clustering in D-MIMO Wi-Fi networks, addressing NP-hard problems with learned policies.

## Key findings

- Achieves 20% throughput improvement over heuristics
- Effective in dynamic network environments
- Balances throughput and fairness objectives

## Abstract

This paper explores the feasibility of leveraging concepts from deep reinforcement learning (DRL) to enable dynamic resource management in Wi-Fi networks implementing distributed multi-user MIMO (D-MIMO). D-MIMO is a technique by which a set of wireless access points are synchronized and grouped together to jointly serve multiple users simultaneously. This paper addresses two dynamic resource management problems pertaining to D-MIMO Wi-Fi networks: (i) channel assignment of D-MIMO groups, and (ii) deciding how to cluster access points to form D-MIMO groups, in order to maximize user throughput performance. These problems are known to be NP-Hard and only heuristic solutions exist in literature. We construct a DRL framework through which a learning agent interacts with a D-MIMO Wi-Fi network, learns about the network environment, and is successful in converging to policies which address the aforementioned problems. Through extensive simulations and on-line training based on D-MIMO Wi-Fi networks, this paper demonstrates the efficacy of DRL in achieving an improvement of 20% in user throughput performance compared to heuristic solutions, particularly when network conditions are dynamic. This work also showcases the effectiveness of DRL in meeting multiple network objectives simultaneously, for instance, maximizing throughput of users as well as fairness of throughput among them.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1812.06885/full.md

## Figures

31 figures with captions in the complete paper: https://tomesphere.com/paper/1812.06885/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1812.06885/full.md

---
Source: https://tomesphere.com/paper/1812.06885