Queue-Aware Dynamic Clustering and Power Allocation for Network MIMO Systems via Distributive Stochastic Learning
Ying Cui, Qingqing Huang, Vincent K.N.Lau

TL;DR
This paper introduces a two-timescale, distributed learning approach for dynamic clustering and power allocation in network MIMO systems, optimizing delay performance by leveraging queue and channel state information.
Contribution
It develops a novel distributive online learning algorithm for CPOMDP-based control, with convergence guarantees, and formulates power allocation as a QSI-aware interference game.
Findings
The proposed algorithm converges almost surely.
The power allocation scheme achieves Nash Equilibrium.
Delay performance is optimized through the two-timescale control.
Abstract
In this paper, we propose a two-timescale delay-optimal dynamic clustering and power allocation design for downlink network MIMO systems. The dynamic clustering control is adaptive to the global queue state information (GQSI) only and computed at the base station controller (BSC) over a longer time scale. On the other hand, the power allocations of all the BSs in one cluster are adaptive to both intra-cluster channel state information (CCSI) and intra-cluster queue state information (CQSI), and computed at the cluster manager (CM) over a shorter time scale. We show that the two-timescale delay-optimal control can be formulated as an infinite-horizon average cost Constrained Partially Observed Markov Decision Process (CPOMDP). By exploiting the special problem structure, we shall derive an equivalent Bellman equation in terms of Pattern Selection Q-factor to solve the CPOMDP. To address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
