Deep Learning Power Allocation in Massive MIMO

Luca Sanguinetti; Alessio Zappone; Merouane Debbah

arXiv:1812.03640·eess.SP·June 4, 2019

Deep Learning Power Allocation in Massive MIMO

Luca Sanguinetti, Alessio Zappone, Merouane Debbah

PDF

TL;DR

This paper proposes a deep learning approach to efficiently perform power allocation in Massive MIMO networks, achieving near-optimal performance with reduced computational complexity by mapping user positions to power policies.

Contribution

It introduces a neural network model that predicts power allocation in Massive MIMO, eliminating the need for statistical averaging and traditional optimization methods.

Findings

01

Achieves near-optimal power allocation performance.

02

Reduces computational complexity compared to traditional methods.

03

Does not require statistical averaging in the optimization process.

Abstract

This work advocates the use of deep learning to perform max-min and max-prod power allocation in the downlink of Massive MIMO networks. More precisely, a deep neural network is trained to learn the map between the positions of user equipments (UEs) and the optimal power allocation policies, and then used to predict the power allocation profiles for a new set of UEs' positions. The use of deep learning significantly improves the complexity-performance trade-off of power allocation, compared to traditional optimization-oriented methods. Particularly, the proposed approach does not require the computation of any statistical average, which would be instead necessary by using standard methods, and is able to guarantee near-optimal performance.

Tables4

Table 1. TABLE I: Massive MIMO network.

Cell area (with wrap-around)	$1$ km $\times 1$ km
Bandwidth	$20$ MHz
Number of cells	$L = 4$
Number of UEs per cell	$K = 5$
UL noise power	$- 94$ dBm
UL transmit power	$20$ dBm
Samples per coherence block	$τ_{c} = 200$
Pilot reuse factor	$1$

Table 2. TABLE II: Layout of the neural network. The trainable parameters are 6 , 373 6 373 6,373 .

	Size	Parameters	Activation function
Input	40	–	–
Layer 1 (Dense)	64	2624	elu
Layer 2 (Dense)	32	2080	elu
Layer 3 (Dense)	32	1056	elu
Layer 4 (Dense)	32	528	elu
Layer 5 (Dense)	5	85	elu
Layer 6 (Dense)	6	36	linear

Table 3. TABLE III: Layout for a given cell with L = 4 𝐿 4 L=4 and K = 5 𝐾 5 K=5 . Trainable params: 202,373

	Size	Parameters	Activation function
Input	40	–	–
Layer 1 (Dense)	512	20992	elu
Layer 2 (Dense)	256	131328	elu
Layer 3 (Dense)	128	32896	elu
Layer 4 (Dense)	128	16512	elu
Layer 5 (Dense)	5	645	elu
Layer 6 (Dense)	6	36	linear

Table 4. TABLE IV: Layout for a given cell with L = 4 𝐿 4 L=4 and K = 5 𝐾 5 K=5 . Trainable params: 509,829

	Size	Parameters	Activation function
Input	40	–	–
Layer 1 (LSTM)	256	204128	tanh
Layer 2 (LSTM)	128	197120	tanh
Layer 3 (Dense)	64	8256	relu
Layer 4 (Dense)	5	325	relu

Equations30

h_{l i}^{j} \sim N_{C} (0_{M}, R_{l i}^{j})

h_{l i}^{j} \sim N_{C} (0_{M}, R_{l i}^{j})

β_{l i}^{j} = Υ - 10 α lo g_{10} (\frac{d _{l i}^{j}}{1 km}) dB

β_{l i}^{j} = Υ - 10 α lo g_{10} (\frac{d _{l i}^{j}}{1 km}) dB

\displaystyle\hat{\mathbf{h}}_{li}^{j}=\mathbf{R}_{li}^{j}\mathbf{Q}_{li}^{-1}\bigg{(}\sum_{l^{\prime}=1}^{L}\mathbf{h}_{l^{\prime}i}^{j}+\frac{1}{\tau_{p}}\frac{\sigma^{2}}{\rho}\mathbf{n}_{li}\bigg{)}\!\sim\!\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{\Phi}_{li}^{j}\right)

\displaystyle\hat{\mathbf{h}}_{li}^{j}=\mathbf{R}_{li}^{j}\mathbf{Q}_{li}^{-1}\bigg{(}\sum_{l^{\prime}=1}^{L}\mathbf{h}_{l^{\prime}i}^{j}+\frac{1}{\tau_{p}}\frac{\sigma^{2}}{\rho}\mathbf{n}_{li}\bigg{)}\!\sim\!\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{\Phi}_{li}^{j}\right)

SE_{j k}^{dl} = \frac{τ _{d}}{τ _{c}} lo g_{2} (1 + γ_{j k}^{dl}) [bit/s/Hz]

SE_{j k}^{dl} = \frac{τ _{d}}{τ _{c}} lo g_{2} (1 + γ_{j k}^{dl}) [bit/s/Hz]

\displaystyle\!\!\!\gamma^{\mathrm{dl}}_{jk}=\frac{\rho_{jk}|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}}{\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\rho_{li}\mathbb{E}\{|\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}-\rho_{jk}|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}+{\sigma^{2}}}

\displaystyle\!\!\!\gamma^{\mathrm{dl}}_{jk}=\frac{\rho_{jk}|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}}{\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\rho_{li}\mathbb{E}\{|\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}-\rho_{jk}|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}+{\sigma^{2}}}

w_{j k} = \frac{v _{j k}}{∥ v _{j k} ∥}

w_{j k} = \frac{v _{j k}}{∥ v _{j k} ∥}

v_{j k}^{MR} = \hat{h}_{j k}^{j}

v_{j k}^{MR} = \hat{h}_{j k}^{j}

\mathbf{v}_{jk}^{\rm M-MMSE}=\Bigg{(}\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\hat{\mathbf{h}}_{li}^{j}{(\hat{\mathbf{h}}_{li}^{j})}^{\mbox{\tiny$\mathrm{H}$}}+\mathbf{Z}_{j}\Bigg{)}^{\!-1}\!\!\!\hat{\mathbf{h}}_{jk}^{j}

\mathbf{v}_{jk}^{\rm M-MMSE}=\Bigg{(}\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\hat{\mathbf{h}}_{li}^{j}{(\hat{\mathbf{h}}_{li}^{j})}^{\mbox{\tiny$\mathrm{H}$}}+\mathbf{Z}_{j}\Bigg{)}^{\!-1}\!\!\!\hat{\mathbf{h}}_{jk}^{j}

{\mathsf{SE}}^{\mathrm{dl}}_{jk}=\frac{\tau_{d}}{\tau_{c}}\log_{2}\Bigg{(}1+\frac{\rho_{jk}a_{jk}}{\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\rho_{li}b_{lijk}+\sigma^{2}}\Bigg{)}\quad\forall j,k

{\mathsf{SE}}^{\mathrm{dl}}_{jk}=\frac{\tau_{d}}{\tau_{c}}\log_{2}\Bigg{(}1+\frac{\rho_{jk}a_{jk}}{\sum\limits_{l=1}^{L}\sum\limits_{i=1}^{K}\rho_{li}b_{lijk}+\sigma^{2}}\Bigg{)}\quad\forall j,k

\displaystyle a_{jk}=|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}

\displaystyle a_{jk}=|\mathbb{E}\{\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}

\displaystyle\!\!b_{lijk}=\begin{cases}\mathbb{E}\{|\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}&\!\!\!(l,i)\neq(j,k)\\ \mathbb{E}\{|\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}-|\mathbb{E}\{\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}&\!\!\!(l,i)=(j,k)\end{cases}

\displaystyle\!\!b_{lijk}=\begin{cases}\mathbb{E}\{|\mathbf{w}_{jk}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}&\!\!\!(l,i)\neq(j,k)\\ \mathbb{E}\{|\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{l}|^{2}\}-|\mathbb{E}\{\mathbf{w}_{li}^{\mbox{\tiny$\mathrm{H}$}}\mathbf{h}_{jk}^{j}\}|^{2}&\!\!\!(l,i)=(j,k)\end{cases}

{ρ_{j k} : \forall j, k} max

{ρ_{j k} : \forall j, k} max

k = 1 \sum K ρ_{j k} \leq P_{m a x}^{dl}, j = 1, \dots, L

{ρ_{j k} : \forall j, k} max

{ρ_{j k} : \forall j, k} max

k = 1 \sum K ρ_{j k} \leq P_{m a x}^{dl}, j = 1, \dots, L

W, b min \frac{1}{N _{T}} n = 1 \sum N_{T} ℓ (\hat{ρ}_{j} (n), ρ_{j}^{⋆} (n))

W, b min \frac{1}{N _{T}} n = 1 \sum N_{T} ℓ (\hat{ρ}_{j} (n), ρ_{j}^{⋆} (n))

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Deep Learning Power Allocation in Massive MIMO

Luca Sanguinetti23, Alessio Zappone3, Merouane Debbah34

The work of L. Sanguinetti work was supported by the University of Pisa under the PRA 2018-2019 Research Project CONCEPT and by the H2020-ERC PoC-CacheMire project (grant 727682). The research of A. Zappone was supported by the H2020 MSCA IF BESMART, grant 749336. The work of M. Debbah was partly supported by the H2020 MSCA IF BESMART, grant 749336, and by the H2020-ERC PoC-CacheMire project, grant 727682.

2Dipartimento di Ingegneria dell’Informazione, University of Pisa, Pisa, Italy

3Large Networks and System Group (LANEAS), CentraleSupélec, Université Paris-Saclay, Gif-sur-Yvette, France

4Mathematical and Algorithmic Sciences Lab, Huawei Technologies, France.

Abstract

This work advocates the use of deep learning to perform max-min and max-prod power allocation in the downlink of Massive MIMO networks. More precisely, a deep neural network is trained to learn the map between the positions of user equipments (UEs) and the optimal power allocation policies, and then used to predict the power allocation profiles for a new set of UEs’ positions. The use of deep learning significantly improves the complexity-performance trade-off of power allocation, compared to traditional optimization-oriented methods. Particularly, the proposed approach does not require the computation of any statistical average, which would be instead necessary by using standard methods, and is able to guarantee near-optimal performance.

I Introduction

Massive MIMO refers to a wireless network technology where the base stations (BSs) are equipped with a very large number $M$ of antennas to serve a multitude of user equipments (UEs) by spatial multiplexing [1, 2]. Exciting developments have occurred in the recent year. In industry, the technology has been integrated into the 5G New Radio standard [3]. In academia, the long-standing pilot contamination issue, which was believed to impose fundamental limitations [1], has finally been resolved [4]. More precisely, [4] showed that with optimal minimum mean squared error (MMSE) combining/precoding and a tiny amount of spatial channel correlation, the capacity increases without bound in uplink (UL) and downlink (DL) as the number of antennas increases.

In this work, we propose to use deep learning for solving the max-min and max-prod power allocation problems in the DL of Massive MIMO networks. We are inspired by the recent explosion of successful applications of machine learning techniques [5] that demonstrate the ability of deep neural networks to learn rich patterns and to approximate arbitrary function mappings [5, 6]. Particularly, we aim to demonstrate that the positions of the UEs (which can be easily obtained via global positioning system) can be effectively used by a neural network to obtain near-optimum performance. This allows to reduce substantially the complexity of power allocation (since simple matrix-vector operations are required) and thus makes it possible to perform power allocation in real-time, i.e. following the variations of UEs’ positions. In addition to this, training such a neural network is fairly convenient since training samples are easily obtainable by running off-the-shelf optimization algorithms.

Deep learning for radio resource allocation in wireless networks has been also considered in [7], where the WMMSE algorithm for sum-rate maximization has been emulated by a fully-connected feedforward neural network, and in [8], where a convolutional neural network is used for user-cell association.

II Massive MIMO network

We consider the DL of a Massive MIMO network with $L$ cells, each comprising a BS with $M$ antennas and $K$ UEs [9]. We denote by $\mathbf{h}_{li}^{j}\in\mathbb{C}^{M}$ the channel between UE $i$ in cell $l$ and BS $j$ and assume that

[TABLE]

where $\mathbf{R}_{li}^{j}\in\mathbb{C}^{M\times M}$ is the spatial correlation matrix, known at the BS. The normalized trace $\beta_{li}^{j}={1}/{M}\mathrm{tr}(\mathbf{R}_{li}^{j})$ accounts for the average channel gain from an antenna at BS $j$ to UE $i$ in cell $l$ and is modelled as (in dB)

[TABLE]

where $\Upsilon=-148$ dB determines the median channel gain at a reference distance of 1 km, and $\alpha=3.76$ is the pathloss coefficient. Also, $d_{li}^{j}$ is the distance of UE $i$ in cell $l$ from BS $j$ , given by $d_{li}^{j}=\|{\bf{x}}_{li}^{j}\|$ with ${\bf{x}}_{li}^{j}\in\mathbb{R}^{2}$ being the UE location in the Euclidean space. Note that shadowing should also be considered in (2). However, this is usually modeled by a log-normal distribution, resulting into a channel model that is not spatially consistent. In other words, two UEs at almost the same location would not experience the same channel. To overcome this issue, one should resort to channel models based on ray tracing or recorded measurements.

II-A Channel Estimation

Pilot-based channel training is utilized to estimate the channel vectors at BS $j$ . We assume that the BS and UEs are perfectly synchronized and operate according to a time-division duplex (TDD) protocol wherein the DL data transmission phase is preceded in the UL by a training phase for channel estimation. There are $\tau_{p}=K$ pilots (i.e., pilot reuse factor of $1$ ) and UE $i$ in each cell uses the same pilot. Using a total UL pilot power of $\rho^{\rm{tr}}$ per UE and standard MMSE estimation techniques [9], BS $j$ obtains the estimate of $\mathbf{h}_{li}^{j}$ as

[TABLE]

where $\mathbf{n}_{li}\sim\mathcal{N}_{\mathbb{C}}(\mathbf{0},\mathbf{I}_{M})$ is noise, $\mathbf{Q}_{li}=\sum_{l^{\prime}=1}^{L}\mathbf{R}_{l^{\prime}i}^{j}+\frac{1}{\rho^{\rm{tr}}}\mathbf{I}_{M}$ , and $\mathbf{\Phi}_{jli}=\mathbf{R}_{li}^{j}\mathbf{Q}_{li}^{-1}\mathbf{R}_{li}^{j}$ . The estimation error $\tilde{\mathbf{h}}_{li}^{j}=\mathbf{h}_{li}^{j}-\hat{\mathbf{h}}_{li}^{j}\sim\mathcal{N}_{\mathbb{C}}(\mathbf{0},\mathbf{R}_{li}^{j}-\mathbf{\Phi}_{li}^{j})$ is independent of $\hat{\mathbf{h}}_{li}^{j}$ .

II-B Downlink Spectral Efficiency

The BS in cell $l$ transmits the DL signal $\mathbf{x}_{l}=\sum_{i=1}^{K}\mathbf{w}_{li}\varsigma_{li}$ where $\varsigma_{li}\sim\mathcal{N}_{\mathbb{C}}(0,\rho_{li})$ is the DL data signal intended for UE $i$ in cell $l$ , assigned to a precoding vector $\mathbf{w}_{li}\in\mathbb{C}^{M}$ that determines the spatial directivity of the transmission and satisfies $\|\mathbf{w}_{li}\|^{2}=1$ so that $\rho_{li}$ represents the transmit power.

An achievable DL SE can be computed in Massive MIMO by using the following hardening bound [10].

Theorem 1.

The DL ergodic channel capacity of UE $k$ in cell $j$ is lower bounded by

[TABLE]

with

[TABLE]

where the expectations are computed with respect to the channel realizations. The pre-log factor $\frac{\tau_{d}}{\tau_{c}}$ accounts for the fraction of samples per coherence block used for DL data.

Notice that the above lower bound is achieved when the UE treats the mean of its precoded channel as the true one. This is a reasonable assumption for channels that exhibit channel hardening [9, Sec. 2.5], but a certain loss occurs for channels with little or no hardening. An alternative approach (not considered in this work) consists in estimating the precoded channels either explicitly as in [11] or implicitly as in [12].

II-C Precoder Design

Unlike in the UL [9, Sec. 4.1], finding the optimal precoders is a challenging task since the DL SE in (5) depends on the precoding vectors $\{\mathbf{w}_{li}\}$ of all UEs in the entire network. Motivated by the UL-DL duality [9, Sec. 4.3], a common heuristic approach is to select $\mathbf{w}_{jk}$ as

[TABLE]

where $\mathbf{v}_{jk}$ denotes the combining vector used to detect the UL signal transmitted by UE $k$ in cell $j$ . In this work, we assume that $\mathbf{v}_{jk}$ is designed according to MR combining [10]

[TABLE]

and M-MMSE combining [4, 13]

[TABLE]

where $\mathbf{Z}_{j}=\sum\nolimits_{l=1}^{L}\sum\nolimits_{i=1}^{K}(\mathbf{R}_{li}^{j}-\mathbf{\Phi}_{li}^{j})+\frac{\sigma_{\rm{ul}}^{2}}{\rho_{\rm{ul}}}\mathbf{I}_{M}$ . This choice is motivated by the fact that M-MMSE is optimal but has high computational complexity. On the other hand, MR is suboptimal (not only for finite values of $M$ but also as $M\to\infty$ [4]) but has the lowest complexity among the receive combining schemes.

III Power Allocation

The DL SE of UE $k$ in cell $j$ can be rewritten as

[TABLE]

where

[TABLE]

and

[TABLE]

are the average channel gains and average interference gains, respectively. The average is computed with respect to the small-scale fading realizations so that the DL SE is only a function of the large scale fading statistics and the choice of precoding. This is a unique feature of Massive MIMO that largely simplifies the power allocation problem compared to single-antenna systems [9, Sec. 7.1].

Among the different power allocation policies, two prominent examples are the max-min fairness and max product SINR strategies, which can be mathematically formalized as follows:

[TABLE]

and the max product SINR, given by

[TABLE]

where $P_{\max}^{\mathrm{dl}}$ denotes the maximum DL transmit power. Irrespective of the strategy, the following Monte Carlo methodology is needed to compute the optimal powers [9].

Macroscopic propagation effects

(a)

Randomly drop UEs in positions ${\bf{x}}_{li}^{j}$ 2. (b)

Compute large-scale fading coefficients $\beta_{lk}^{j}$ 3. (c)

Compute channel correlation matrices $\mathbf{R}_{lk}^{j}$ 2. 2.

Microscopic propagation effects

(a)

Generate random estimated channel vectors $\hat{\mathbf{h}}_{lk}^{j}$ by using MMSE estimator 3. 3.

SE computation

(a)

Compute precoding vectors $\mathbf{w}_{jk}$ with MR or M-MMSE precoding 2. (b)

Average over estimated channels to obtain $\{a_{jk}\}$ and $\{b_{lijk}\}$ . 4. 4.

Allocate the power by solving (12) or (13).

The solution to (12) can be obtained through a bisection approach in which a sequence of convex problems is solved, while (13) can be solved by geometric programming. Thus, both (12) and (13) require a polynomial or quasi-polynomial complexity to be solved. However, even a polynomial complexity can be too much when the solution must be obtained in real-time; that is, fast enough to be deployed in the system before the UEs’ positions change and the power allocation problem needs to be solved again.

IV Deep Learning based Power Allocation

A central goal of this work is to demonstrate that geographical location information of UEs is already sufficient as a proxy for computing the optimal powers at any given cell. This is in contrast to the traditional optimization approaches for solving (12) and (13) that require knowledge of $\{a_{jk}\}$ and $\{b_{lijk}\}$ in (10) and (11). We advocate using UEs’ positions because they already capture the main feature of propagation channels and interference in the network. Therefore, for any given cell $j$ the problem is to learn the unknown map between the solution ${{\boldsymbol{\rho}}}_{j}^{\star}=[\rho_{j1}^{\star},\ldots,\rho_{jK}^{\star}]\in\mathbb{R}^{K}$ to (12) or (13) and the $2KL$ geographical UE positions ${\bf x}=\{{\bf{x}}_{li}^{j};\forall j,l,i\}\in\mathbb{R}^{2KL}$ . This is achieved by leveraging the known property of NNs that are universal function approximators [6, 5]. Particularly, we employ a feedforward neural network with fully-connected layers, and consisting of a $2KL$ -dimensional input layer, $N$ hidden layers, and a $K+1$ -dimensional output layer yielding an estimate ${\hat{\boldsymbol{\rho}}}_{j}=[\hat{\rho}_{j1},\ldots,\hat{\rho}_{jK}]$ of the optimal power allocation vector ${{\boldsymbol{\rho}}}_{j}^{\star}$ . Observe that the output layer has size $K+1$ instead of $K$ , since we also make the NN learn $\sum_{k=1}^{K}\rho_{jk}^{\star}$ so as to satisfy the power constraint and increase the estimation accuracy.

The problem reduces to train the weights $\bf W$ and bias terms $\bf b$ of the NN so that the input-output map of the NN emulates the map of traditional approaches. This requires a training set containing $N_{T}$ multiple samples $\{{{\boldsymbol{\rho}}}_{j}^{\star}(n),{\bf x}(n);n=1,\ldots,N_{T}\}$ , where ${{\boldsymbol{\rho}}}_{j}^{\star}(n)$ corresponds to the optimal power allocation for the training input ${\bf x}(n)$ . Denoting by ${\hat{\boldsymbol{\rho}}}_{j}(n)$ the corresponding output of the NN, the learning process consists of minimizing the following loss:

[TABLE]

with $\ell(\cdot,\cdot)$ any suitable distance measure. Once the parameters ${\bf W}$ and ${\bf b}$ are configured, the NN can estimate the optimal power allocation policy also for input vectors that are not part of the training set. Therefore, every time the UEs’ change their positions in the network, the power allocation can be updated by simply feeding the new positions to the NN, without having to actually solve (12) or (13). The complexity reduction granted by this approach is analyzed in more detail in the next section.

IV-A Online implementation and complexity

The complexity of the proposed approach mainly lies in the generation of the training set. Assume that each layer is composed of $N_{i}$ neurons. Computing the output of the NN requires only $\sum_{i=1}^{N+1}N_{i-1}N_{i}$ real multiplications111The complexity related to additions is neglected as it is much smaller than that required for multiplications. and the evaluation of $\sum_{i=1}^{N+1}N_{i}$ activation functions. Also, the training algorithm is conveniently performed by standard (stochastic) gradient descent algorithms coupled with the back-propagation algorithm [5, Ch. 6.5]. Instead, generating the training set requires to actually solve (12) or (13) for many different realizations of ${\bf x}$ , by means of traditional optimization theory methods. However, this is not an issue for at least two reasons:

•

The training set can be generated off-line. Thus, a much higher complexity can be afforded and real-time constraints do not apply.

•

The training set can be updated at a much longer time-scale than the rate at which the UEs’ positions in the network vary. Thus, the training set can be updated at a much longer time-scale than that at which the power control problem should be solved if traditional resource allocation approaches were used.

From the above considerations, it follows that the proposed approach grants a huge complexity reduction, which allows one to update the power allocation based on the UEs’ positions in real time.

V Performance evaluation

We consider the Massive MIMO network reported in Table 1 with $L=4$ cells, with each cell covering a square area of $250\times 250$ m. A wrap around topology is used. We assume that $K=5$ UEs are randomly and uniformly distributed in each cell, at distances larger than 35 m from the BS. Results are averaged over 100 UE distributions. We consider communication over a 20 MHz bandwidth with a total receiver noise power $\sigma^{2}$ of $-94$ dBm. We assume that $\tau_{p}=K$ (i.e., pilot reuse factor of $1$ ) and that the UL transmit power $\rho$ per UE is $20$ dBm.

The NNs were trained based on a dataset of $N_{T}=340000$ samples of independent realizations of the UEs’ positions $\{{\bf x}(n);n=1,\ldots,N_{T}\}$ , and optimal power allocations $\{{{\boldsymbol{\rho}}}_{j}^{\star}(n);n=1,\ldots,N_{T}\}$ for $j=1\ldots,L$ , obtained by solving (12) and (13) with traditional optimization approaches. Particularly, 90% percent of the samples was used for training and 10% for validation. Other $10000$ samples formed the test dataset, which is independent from the training dataset. The training set is available online at https://data.ieeemlc.org while the Matlab code available at https://github.com/lucasanguinetti/ allows to generate further samples. We used the Adam optimizer [14], and chose the relative MSE as loss function $\ell(\cdot,\cdot)$ since numerical results showed that it performed better than the MSE for the problem at hand. The learning rate, batch size, and epochs were adjusted with a trial and error approach. We used the open source python library Keras. The code is available online at https://github.com/lucasanguinetti/.

V-A Max-prod

To evaluate the performance of the NN-based power allocation, we illustrate the cumulative distribution function (CDF) of the DL SE per UE, where the randomness is due to the UE locations and shadow fading realizations. We consider MR, and M-MMSE. The NN used with both precoding schemes is reported in Table II, whose trainable parameters are $6,373$ . The results of Fig. 2(a) show that the NN matches very well the optimal solution with M-MMSE. The average MSE is $0.007$ . With MR precoding, a small mismatch between the two curves is observed. Indeed, the average MSE increases to $0.051$ . Fig. 2(b) illustrates the CDF of the MSE of the SEs. As expected, the CDF curve with M-MMSE is to the left of the MR curve. This basically means that the NN achieves, statistically speaking, better performance with M-MMSE than with MR. This result might seem counterintuitive, since the M-MMSE is algorithmically and computationally more complex than MR and thus its optimal power allocation should in principle be more difficult to learn. A possible explanation for this is that with MR precoding the power is allocated only on the basis of the desired signal gain. On the other hand, with M-MMSE this is accomplished by also taking into account the power of interfering signals. Since the NN receives as input the positions of all UEs in the network, it is able to make the most of this information only when M-MMSE is employed.

To improve the learning capabilities with MR, we also considered the more complex NN reported in Table III. Numerical results show that the average MSE of SEs reduces to $0.003$ and $0.015$ with M-MMSE and MR precoding, respectively. This is achieved at the price of an computational complexity and training time since the number of trainable parameters is $202,373$ , instead of $6,373$ .

To conclude, with the max-prod strategy the proposed deep learning based power allocation has significant computational complexity advantage compared to traditional approaches, while maintaining near-optimal performance with both MR and M-MMSE precoding.

V-B Max-min

The NNs used for the max-prod strategy, revealed to be inadequate with the max-min approach. This is probably due to the fact that the power distribution changes considerably between the two strategies. To overcome this issue, we used a different NN, which consists of two recurrent Long Short-Term Memory (LSTM)1 layers and two dense layers. The NN parameters together with the activation functions are summarized in Table IV. The results of Fig. 2 show that the NN matches almost exactly the theoretical curves with both MR and M-MMSE. Despite providing satisfactory results in terms of accuracy, the NN in Table IV counts a total number of $509,829$ trainable parameters. This is a relatively high number for a Massive MIMO network with $L=4$ and $K=5$ . It lacks scalability when the network size increases.

VI Conclusions

In this work, we proposed a deep learning framework to allocate the power in the DL of a Massive MIMO network with MR and M-MMSE precoding. Two power allocation strategies were considered, namely, max-min and max-prod. We showed that with both strategies a properly trained feed-forward NN is able to learn how to allocate powers to the UEs in each cell. This is achieved by using only the knowledge of the positions of UEs in the network, thereby substantially reducing the complexity and processing time of the optimization process. Numerical results showed that the deep learning framework performs better with M-MMSE rather than with MR. This is likely due to the fact the M-MMSE allows the NN to exploit the most its available information. Moreover, the max-min policy revealed to be harder to learn. In fact, we needed to resort to recurrent neural networks with a relatively high number of trainable parameters.

The analysis was conducted for a relatively small Massive MIMO network with $L=4$ cells and $K=5$ UEs per cell. Further investigations are needed to understand how the developed framework performs as the size of the network increases. Moreover, in practice the number of UEs per cell varies constantly. A simple way to handle this would be to have multiple NNs per BS for all possible configurations of UEs. However, such a solution is not scalable. Besides these and many other open issues, the integration of deep learning tools for real-time power allocation in Massive MIMO seems quite promising.

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” vol. 9, no. 11, pp. 3590–3600, 2010.
2[2] E. G. Larsson, F. Tufvesson, O. Edfors, and T. L. Marzetta, “Massive MIMO for next generation wireless systems,” IEEE Commun. Magazine , vol. 52, no. 2, pp. 186–195, Feb. 2014.
3[3] S. Parkvall, E. Dahlman, A. Furuskär, and M. Frenne, “NR: The new 5G radio access technology,” IEEE Communications Standards Magazine , vol. 1, no. 4, pp. 24–30, Dec 2017.
4[4] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO has unlimited capacity,” IEEE Transactions on Wireless Communications , vol. 17, no. 1, pp. 574–590, 2018.
5[5] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning , MIT Press, 2016.
6[6] K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural Networks , vol. 2, no. 5, pp. 359–366, 1989.
7[7] H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for interference management,” IEEE Transactions on Signal Processing , vol. 66, no. 20, pp. 5438–5453, 2018.
8[8] W. Cui, K. Shen, and W. Yu, “Spatial deep learning for wireless scheduling,” https://arxiv.org/pdf/1808.01486.pdf , 2018.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Deep Learning Power Allocation in Massive MIMO

Abstract

I Introduction

II Massive MIMO network

II-A Channel Estimation

II-B Downlink Spectral Efficiency

Theorem 1**.**

II-C Precoder Design

III Power Allocation

IV Deep Learning based Power Allocation

IV-A Online implementation and complexity

V Performance evaluation

V-A Max-prod

V-B Max-min

VI Conclusions

Theorem 1.