Deep Learning Power Allocation in Massive MIMO
Luca Sanguinetti, Alessio Zappone, Merouane Debbah

TL;DR
This paper proposes a deep learning approach to efficiently perform power allocation in Massive MIMO networks, achieving near-optimal performance with reduced computational complexity by mapping user positions to power policies.
Contribution
It introduces a neural network model that predicts power allocation in Massive MIMO, eliminating the need for statistical averaging and traditional optimization methods.
Findings
Achieves near-optimal power allocation performance.
Reduces computational complexity compared to traditional methods.
Does not require statistical averaging in the optimization process.
Abstract
This work advocates the use of deep learning to perform max-min and max-prod power allocation in the downlink of Massive MIMO networks. More precisely, a deep neural network is trained to learn the map between the positions of user equipments (UEs) and the optimal power allocation policies, and then used to predict the power allocation profiles for a new set of UEs' positions. The use of deep learning significantly improves the complexity-performance trade-off of power allocation, compared to traditional optimization-oriented methods. Particularly, the proposed approach does not require the computation of any statistical average, which would be instead necessary by using standard methods, and is able to guarantee near-optimal performance.
| Cell area (with wrap-around) | km km |
|---|---|
| Bandwidth | MHz |
| Number of cells | |
| Number of UEs per cell | |
| UL noise power | dBm |
| UL transmit power | dBm |
| Samples per coherence block | |
| Pilot reuse factor |
| Size | Parameters | Activation function | |
|---|---|---|---|
| Input | 40 | – | – |
| Layer 1 (Dense) | 64 | 2624 | elu |
| Layer 2 (Dense) | 32 | 2080 | elu |
| Layer 3 (Dense) | 32 | 1056 | elu |
| Layer 4 (Dense) | 32 | 528 | elu |
| Layer 5 (Dense) | 5 | 85 | elu |
| Layer 6 (Dense) | 6 | 36 | linear |
| Size | Parameters | Activation function | |
|---|---|---|---|
| Input | 40 | – | – |
| Layer 1 (Dense) | 512 | 20992 | elu |
| Layer 2 (Dense) | 256 | 131328 | elu |
| Layer 3 (Dense) | 128 | 32896 | elu |
| Layer 4 (Dense) | 128 | 16512 | elu |
| Layer 5 (Dense) | 5 | 645 | elu |
| Layer 6 (Dense) | 6 | 36 | linear |
| Size | Parameters | Activation function | |
|---|---|---|---|
| Input | 40 | – | – |
| Layer 1 (LSTM) | 256 | 204128 | tanh |
| Layer 2 (LSTM) | 128 | 197120 | tanh |
| Layer 3 (Dense) | 64 | 8256 | relu |
| Layer 4 (Dense) | 5 | 325 | relu |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Deep Learning Power Allocation in Massive MIMO
Luca Sanguinetti23, Alessio Zappone3, Merouane Debbah34
The work of L. Sanguinetti work was supported by the University of Pisa under the PRA 2018-2019 Research Project CONCEPT and by the H2020-ERC PoC-CacheMire project (grant 727682). The research of A. Zappone was supported by the H2020 MSCA IF BESMART, grant 749336. The work of M. Debbah was partly supported by the H2020 MSCA IF BESMART, grant 749336, and by the H2020-ERC PoC-CacheMire project, grant 727682.
2Dipartimento di Ingegneria dell’Informazione, University of Pisa, Pisa, Italy
3Large Networks and System Group (LANEAS), CentraleSupélec, Université Paris-Saclay, Gif-sur-Yvette, France
4Mathematical and Algorithmic Sciences Lab, Huawei Technologies, France.
Abstract
This work advocates the use of deep learning to perform max-min and max-prod power allocation in the downlink of Massive MIMO networks. More precisely, a deep neural network is trained to learn the map between the positions of user equipments (UEs) and the optimal power allocation policies, and then used to predict the power allocation profiles for a new set of UEs’ positions. The use of deep learning significantly improves the complexity-performance trade-off of power allocation, compared to traditional optimization-oriented methods. Particularly, the proposed approach does not require the computation of any statistical average, which would be instead necessary by using standard methods, and is able to guarantee near-optimal performance.
I Introduction
Massive MIMO refers to a wireless network technology where the base stations (BSs) are equipped with a very large number of antennas to serve a multitude of user equipments (UEs) by spatial multiplexing [1, 2]. Exciting developments have occurred in the recent year. In industry, the technology has been integrated into the 5G New Radio standard [3]. In academia, the long-standing pilot contamination issue, which was believed to impose fundamental limitations [1], has finally been resolved [4]. More precisely, [4] showed that with optimal minimum mean squared error (MMSE) combining/precoding and a tiny amount of spatial channel correlation, the capacity increases without bound in uplink (UL) and downlink (DL) as the number of antennas increases.
In this work, we propose to use deep learning for solving the max-min and max-prod power allocation problems in the DL of Massive MIMO networks. We are inspired by the recent explosion of successful applications of machine learning techniques [5] that demonstrate the ability of deep neural networks to learn rich patterns and to approximate arbitrary function mappings [5, 6]. Particularly, we aim to demonstrate that the positions of the UEs (which can be easily obtained via global positioning system) can be effectively used by a neural network to obtain near-optimum performance. This allows to reduce substantially the complexity of power allocation (since simple matrix-vector operations are required) and thus makes it possible to perform power allocation in real-time, i.e. following the variations of UEs’ positions. In addition to this, training such a neural network is fairly convenient since training samples are easily obtainable by running off-the-shelf optimization algorithms.
Deep learning for radio resource allocation in wireless networks has been also considered in [7], where the WMMSE algorithm for sum-rate maximization has been emulated by a fully-connected feedforward neural network, and in [8], where a convolutional neural network is used for user-cell association.
II Massive MIMO network
We consider the DL of a Massive MIMO network with cells, each comprising a BS with antennas and UEs [9]. We denote by the channel between UE in cell and BS and assume that
[TABLE]
where is the spatial correlation matrix, known at the BS. The normalized trace accounts for the average channel gain from an antenna at BS to UE in cell and is modelled as (in dB)
[TABLE]
where dB determines the median channel gain at a reference distance of 1 km, and is the pathloss coefficient. Also, is the distance of UE in cell from BS , given by with being the UE location in the Euclidean space. Note that shadowing should also be considered in (2). However, this is usually modeled by a log-normal distribution, resulting into a channel model that is not spatially consistent. In other words, two UEs at almost the same location would not experience the same channel. To overcome this issue, one should resort to channel models based on ray tracing or recorded measurements.
II-A Channel Estimation
Pilot-based channel training is utilized to estimate the channel vectors at BS . We assume that the BS and UEs are perfectly synchronized and operate according to a time-division duplex (TDD) protocol wherein the DL data transmission phase is preceded in the UL by a training phase for channel estimation. There are pilots (i.e., pilot reuse factor of ) and UE in each cell uses the same pilot. Using a total UL pilot power of per UE and standard MMSE estimation techniques [9], BS obtains the estimate of as
[TABLE]
where is noise, , and . The estimation error is independent of .
II-B Downlink Spectral Efficiency
The BS in cell transmits the DL signal where is the DL data signal intended for UE in cell , assigned to a precoding vector that determines the spatial directivity of the transmission and satisfies so that represents the transmit power.
An achievable DL SE can be computed in Massive MIMO by using the following hardening bound [10].
Theorem 1**.**
The DL ergodic channel capacity of UE in cell is lower bounded by
[TABLE]
with
[TABLE]
where the expectations are computed with respect to the channel realizations. The pre-log factor accounts for the fraction of samples per coherence block used for DL data.
Notice that the above lower bound is achieved when the UE treats the mean of its precoded channel as the true one. This is a reasonable assumption for channels that exhibit channel hardening [9, Sec. 2.5], but a certain loss occurs for channels with little or no hardening. An alternative approach (not considered in this work) consists in estimating the precoded channels either explicitly as in [11] or implicitly as in [12].
II-C Precoder Design
Unlike in the UL [9, Sec. 4.1], finding the optimal precoders is a challenging task since the DL SE in (5) depends on the precoding vectors of all UEs in the entire network. Motivated by the UL-DL duality [9, Sec. 4.3], a common heuristic approach is to select as
[TABLE]
where denotes the combining vector used to detect the UL signal transmitted by UE in cell . In this work, we assume that is designed according to MR combining [10]
[TABLE]
[TABLE]
where . This choice is motivated by the fact that M-MMSE is optimal but has high computational complexity. On the other hand, MR is suboptimal (not only for finite values of but also as [4]) but has the lowest complexity among the receive combining schemes.
III Power Allocation
The DL SE of UE in cell can be rewritten as
[TABLE]
where
[TABLE]
and
[TABLE]
are the average channel gains and average interference gains, respectively. The average is computed with respect to the small-scale fading realizations so that the DL SE is only a function of the large scale fading statistics and the choice of precoding. This is a unique feature of Massive MIMO that largely simplifies the power allocation problem compared to single-antenna systems [9, Sec. 7.1].
Among the different power allocation policies, two prominent examples are the max-min fairness and max product SINR strategies, which can be mathematically formalized as follows:
[TABLE]
and the max product SINR, given by
[TABLE]
where denotes the maximum DL transmit power. Irrespective of the strategy, the following Monte Carlo methodology is needed to compute the optimal powers [9].
Macroscopic propagation effects
- (a)
Randomly drop UEs in positions 2. (b)
Compute large-scale fading coefficients 3. (c)
Compute channel correlation matrices 2. 2.
Microscopic propagation effects
- (a)
Generate random estimated channel vectors by using MMSE estimator 3. 3.
SE computation
- (a)
Compute precoding vectors with MR or M-MMSE precoding 2. (b)
Average over estimated channels to obtain and . 4. 4.
Allocate the power by solving (12) or (13).
The solution to (12) can be obtained through a bisection approach in which a sequence of convex problems is solved, while (13) can be solved by geometric programming. Thus, both (12) and (13) require a polynomial or quasi-polynomial complexity to be solved. However, even a polynomial complexity can be too much when the solution must be obtained in real-time; that is, fast enough to be deployed in the system before the UEs’ positions change and the power allocation problem needs to be solved again.
IV Deep Learning based Power Allocation
A central goal of this work is to demonstrate that geographical location information of UEs is already sufficient as a proxy for computing the optimal powers at any given cell. This is in contrast to the traditional optimization approaches for solving (12) and (13) that require knowledge of and in (10) and (11). We advocate using UEs’ positions because they already capture the main feature of propagation channels and interference in the network. Therefore, for any given cell the problem is to learn the unknown map between the solution to (12) or (13) and the geographical UE positions . This is achieved by leveraging the known property of NNs that are universal function approximators [6, 5]. Particularly, we employ a feedforward neural network with fully-connected layers, and consisting of a -dimensional input layer, hidden layers, and a -dimensional output layer yielding an estimate of the optimal power allocation vector . Observe that the output layer has size instead of , since we also make the NN learn so as to satisfy the power constraint and increase the estimation accuracy.
The problem reduces to train the weights and bias terms of the NN so that the input-output map of the NN emulates the map of traditional approaches. This requires a training set containing multiple samples , where corresponds to the optimal power allocation for the training input . Denoting by the corresponding output of the NN, the learning process consists of minimizing the following loss:
[TABLE]
with any suitable distance measure. Once the parameters and are configured, the NN can estimate the optimal power allocation policy also for input vectors that are not part of the training set. Therefore, every time the UEs’ change their positions in the network, the power allocation can be updated by simply feeding the new positions to the NN, without having to actually solve (12) or (13). The complexity reduction granted by this approach is analyzed in more detail in the next section.
IV-A Online implementation and complexity
The complexity of the proposed approach mainly lies in the generation of the training set. Assume that each layer is composed of neurons. Computing the output of the NN requires only real multiplications111The complexity related to additions is neglected as it is much smaller than that required for multiplications. and the evaluation of activation functions. Also, the training algorithm is conveniently performed by standard (stochastic) gradient descent algorithms coupled with the back-propagation algorithm [5, Ch. 6.5]. Instead, generating the training set requires to actually solve (12) or (13) for many different realizations of , by means of traditional optimization theory methods. However, this is not an issue for at least two reasons:
- •
The training set can be generated off-line. Thus, a much higher complexity can be afforded and real-time constraints do not apply.
- •
The training set can be updated at a much longer time-scale than the rate at which the UEs’ positions in the network vary. Thus, the training set can be updated at a much longer time-scale than that at which the power control problem should be solved if traditional resource allocation approaches were used.
From the above considerations, it follows that the proposed approach grants a huge complexity reduction, which allows one to update the power allocation based on the UEs’ positions in real time.
V Performance evaluation
We consider the Massive MIMO network reported in Table 1 with cells, with each cell covering a square area of m. A wrap around topology is used. We assume that UEs are randomly and uniformly distributed in each cell, at distances larger than 35 m from the BS. Results are averaged over 100 UE distributions. We consider communication over a 20 MHz bandwidth with a total receiver noise power of dBm. We assume that (i.e., pilot reuse factor of ) and that the UL transmit power per UE is dBm.
The NNs were trained based on a dataset of samples of independent realizations of the UEs’ positions , and optimal power allocations for , obtained by solving (12) and (13) with traditional optimization approaches. Particularly, 90% percent of the samples was used for training and 10% for validation. Other samples formed the test dataset, which is independent from the training dataset. The training set is available online at https://data.ieeemlc.org while the Matlab code available at https://github.com/lucasanguinetti/ allows to generate further samples. We used the Adam optimizer [14], and chose the relative MSE as loss function since numerical results showed that it performed better than the MSE for the problem at hand. The learning rate, batch size, and epochs were adjusted with a trial and error approach. We used the open source python library Keras. The code is available online at https://github.com/lucasanguinetti/.
V-A Max-prod
To evaluate the performance of the NN-based power allocation, we illustrate the cumulative distribution function (CDF) of the DL SE per UE, where the randomness is due to the UE locations and shadow fading realizations. We consider MR, and M-MMSE. The NN used with both precoding schemes is reported in Table II, whose trainable parameters are . The results of Fig. 2(a) show that the NN matches very well the optimal solution with M-MMSE. The average MSE is . With MR precoding, a small mismatch between the two curves is observed. Indeed, the average MSE increases to . Fig. 2(b) illustrates the CDF of the MSE of the SEs. As expected, the CDF curve with M-MMSE is to the left of the MR curve. This basically means that the NN achieves, statistically speaking, better performance with M-MMSE than with MR. This result might seem counterintuitive, since the M-MMSE is algorithmically and computationally more complex than MR and thus its optimal power allocation should in principle be more difficult to learn. A possible explanation for this is that with MR precoding the power is allocated only on the basis of the desired signal gain. On the other hand, with M-MMSE this is accomplished by also taking into account the power of interfering signals. Since the NN receives as input the positions of all UEs in the network, it is able to make the most of this information only when M-MMSE is employed.
To improve the learning capabilities with MR, we also considered the more complex NN reported in Table III. Numerical results show that the average MSE of SEs reduces to and with M-MMSE and MR precoding, respectively. This is achieved at the price of an computational complexity and training time since the number of trainable parameters is , instead of .
To conclude, with the max-prod strategy the proposed deep learning based power allocation has significant computational complexity advantage compared to traditional approaches, while maintaining near-optimal performance with both MR and M-MMSE precoding.
V-B Max-min
The NNs used for the max-prod strategy, revealed to be inadequate with the max-min approach. This is probably due to the fact that the power distribution changes considerably between the two strategies. To overcome this issue, we used a different NN, which consists of two recurrent Long Short-Term Memory (LSTM)1 layers and two dense layers. The NN parameters together with the activation functions are summarized in Table IV. The results of Fig. 2 show that the NN matches almost exactly the theoretical curves with both MR and M-MMSE. Despite providing satisfactory results in terms of accuracy, the NN in Table IV counts a total number of trainable parameters. This is a relatively high number for a Massive MIMO network with and . It lacks scalability when the network size increases.
VI Conclusions
In this work, we proposed a deep learning framework to allocate the power in the DL of a Massive MIMO network with MR and M-MMSE precoding. Two power allocation strategies were considered, namely, max-min and max-prod. We showed that with both strategies a properly trained feed-forward NN is able to learn how to allocate powers to the UEs in each cell. This is achieved by using only the knowledge of the positions of UEs in the network, thereby substantially reducing the complexity and processing time of the optimization process. Numerical results showed that the deep learning framework performs better with M-MMSE rather than with MR. This is likely due to the fact the M-MMSE allows the NN to exploit the most its available information. Moreover, the max-min policy revealed to be harder to learn. In fact, we needed to resort to recurrent neural networks with a relatively high number of trainable parameters.
The analysis was conducted for a relatively small Massive MIMO network with cells and UEs per cell. Further investigations are needed to understand how the developed framework performs as the size of the network increases. Moreover, in practice the number of UEs per cell varies constantly. A simple way to handle this would be to have multiple NNs per BS for all possible configurations of UEs. However, such a solution is not scalable. Besides these and many other open issues, the integration of deep learning tools for real-time power allocation in Massive MIMO seems quite promising.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” vol. 9, no. 11, pp. 3590–3600, 2010.
- 2[2] E. G. Larsson, F. Tufvesson, O. Edfors, and T. L. Marzetta, “Massive MIMO for next generation wireless systems,” IEEE Commun. Magazine , vol. 52, no. 2, pp. 186–195, Feb. 2014.
- 3[3] S. Parkvall, E. Dahlman, A. Furuskär, and M. Frenne, “NR: The new 5G radio access technology,” IEEE Communications Standards Magazine , vol. 1, no. 4, pp. 24–30, Dec 2017.
- 4[4] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO has unlimited capacity,” IEEE Transactions on Wireless Communications , vol. 17, no. 1, pp. 574–590, 2018.
- 5[5] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning , MIT Press, 2016.
- 6[6] K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural Networks , vol. 2, no. 5, pp. 359–366, 1989.
- 7[7] H. Sun, X. Chen, Q. Shi, M. Hong, X. Fu, and N. D. Sidiropoulos, “Learning to optimize: Training deep neural networks for interference management,” IEEE Transactions on Signal Processing , vol. 66, no. 20, pp. 5438–5453, 2018.
- 8[8] W. Cui, K. Shen, and W. Yu, “Spatial deep learning for wireless scheduling,” https://arxiv.org/pdf/1808.01486.pdf , 2018.
