Handoff Design in User-Centric Cell-Free Massive MIMO Networks Using DRL
Hussein A. Ammar, Raviraj Adve, Shahram Shahbazpanahi, Gary Boudreau, and Israfil Bahceci

TL;DR
This paper introduces a DRL-based handoff management system for user-centric cell-free massive MIMO networks, optimizing connection updates for mobile users to reduce overhead and improve efficiency.
Contribution
It proposes a novel DRL approach with continuous actions and a reward function balancing rate and overhead, enhancing handoff decision-making in UC-mMIMO networks.
Findings
DRL approach outperforms discrete methods in scalability.
Handoff policy minimizes unnecessary handovers by learning optimal timing.
Real-time operation with response time under 0.4 ms.
Abstract
In the user-centric cell-free massive MIMO (UC-mMIMO) network scheme, user mobility necessitates updating the set of serving access points to maintain the user-centric clustering. Such updates are typically performed through handoff (HO) operations; however, frequent HOs lead to overheads associated with the allocation and release of resources. This paper presents a deep reinforcement learning (DRL)-based solution to predict and manage these connections for mobile users. Our solution employs the Soft Actor-Critic algorithm, with continuous action space representation, to train a deep neural network to serve as the HO policy. We present a novel proposition for a reward function that integrates a HO penalty in order to balance the attainable rate and the associated overhead related to HOs. We develop two variants of our system; the first one uses mobility direction-assisted (DA)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
