Large-System Analysis of Massive MIMO with Optimal M-MMSE Processing

Luca Sanguinetti; Emil Björnson; Abla Kammoun

arXiv:1903.09783·cs.IT·June 26, 2019


TL;DR

This paper analyzes the spectral efficiency of Massive MIMO uplink networks with optimal M-MMSE processing when both the number of antennas and users grow large, providing accurate approximations validated by simulations.

Contribution

It extends previous asymptotic analysis to the regime where both antennas and users grow large with a fixed ratio, using random matrix theory for practical system insights.

Findings

1. With M-MMSE processing, the SINR scales with the antenna-UE ratio $M/K$ when $M$ and $K$ grow large together.

2. Low-complexity, deterministic approximations of the SINR and SE are derived.

3. The approximations are validated through simulations for realistic system sizes.

Abstract

We consider the uplink of a Massive MIMO network with $L$ cells, each comprising a BS with $M$ antennas and $K$ single-antenna user equipments. Recently, [1] studied the asymptotic spectral efficiency of such networks with optimal multicell minimum mean-squared error (M-MMSE) processing when $M \to \infty$ and $K$ is kept fixed. Remarkably, [1] proved that, for practical channels with spatial correlation, the spectral efficiency grows unboundedly, even with pilot contamination. In this paper, we extend the analysis from [1] to the alternative regime in which $M, K \to \infty$ with a given ratio. Tools from random matrix theory are used to compute low-complexity approximations that are proved to be asymptotically tight and are also accurate for realistic system dimensions, as shown by simulations.

Tables

Table I: Number of complex multiplications per coherence block to compute (9) and (11).

Expression	Channel estimation	Computation of $\gamma_{jk}$
(9)	$M\tau_{p}+LM^{2}$	$\frac{M^{2}+M}{2}(LK+1)+\frac{M^{3}-M}{3}$
(11)	$M\tau_{p}+LM^{2}$	$\frac{M^{2}+M}{2}\big(L^{2}(K+2)+L\big)+\frac{M^{3}-M}{3}+\frac{L^{3}-L}{3}$

Table II: Network parameters

Parameter	Value
Cell area (with wrap around)	$0.4\,\text{km}\times 0.4\,\text{km}$
Number of cells	$L=4$
Samples per coherence block	$\tau_{c}=200$
Distance between UE $k$ in cell $l$ and BS $j$	$d_{lk}^{j}$
Large-scale fading coefficient for the channel between UE $k$ in cell $l$ and BS $j$	$\beta_{lk}^{j}=-148.1-37.6\log_{10}\big(\frac{d_{lk}^{j}}{1\,\text{km}}\big)+F_{lk}^{j}$ dB
Shadow fading between UE $k$ in cell $l$ and BS $j$	$F_{lk}^{j}\sim\mathcal{N}(0,10)$

Equations

$$\mathbf{y}_{j}=\sum_{l=1}^{L}\sum_{i=1}^{K}\sqrt{\rho}\,\mathbf{h}_{jli}x_{li}+\mathbf{n}_{j} \tag{1}$$

$$\hat{\mathbf{h}}_{jli}=\mathbf{R}_{jli}\mathbf{Q}_{ji}^{-1}\bigg(\sum_{l'=1}^{L}\mathbf{h}_{jl'i}+\frac{1}{\sqrt{\rho^{\mathrm{tr}}}}\mathbf{n}_{ji}\bigg)\sim\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{\Phi}_{jlli}\right) \tag{2}$$

$$\mathbb{E}\{\hat{\mathbf{h}}_{jl'i}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}\}=\mathbf{\Phi}_{jl'li}=\mathbf{R}_{jl'i}\mathbf{Q}_{ji}^{-1}\mathbf{R}_{jli}. \tag{3}$$

$$\mathrm{SE}_{jk}^{\mathrm{ul}}=\Big(1-\frac{K}{\tau_{c}}\Big)\mathbb{E}\{\log_{2}(1+\gamma_{jk})\}\quad\text{[bit/s/Hz]} \tag{4}$$

$$\gamma_{jk}=\frac{|\mathbf{v}_{jk}^{\mathrm{H}}\hat{\mathbf{h}}_{jjk}|^{2}}{\mathbf{v}_{jk}^{\mathrm{H}}\Big(\sum\limits_{(l,i)\neq(j,k)}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}\Big)\mathbf{v}_{jk}} \tag{5}$$

$$\mathbf{Z}_{j}=\sum_{l=1}^{L}\sum_{i=1}^{K}\left(\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli}\right). \tag{6}$$

$$\hat{\mathbf{H}}_{jk}=[\hat{\mathbf{h}}_{j1k},\hat{\mathbf{h}}_{j2k},\ldots,\hat{\mathbf{h}}_{jLk}] \tag{7}$$

$$\mathbf{v}_{jk}=\bigg(\sum_{l=1}^{L}\sum_{i=1}^{K}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}\bigg)^{-1}\hat{\mathbf{h}}_{jjk}. \tag{8}$$

$$\gamma_{jk}=\hat{\mathbf{h}}_{jjk}^{\mathrm{H}}\mathbf{U}_{jk}^{-1}\hat{\mathbf{h}}_{jjk} \tag{9}$$

$$\mathbf{U}_{jk}=\hat{\mathbf{H}}_{jk}^{[j]}(\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}}+\overbrace{\sum_{l}\sum_{i\neq k}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}}^{\triangleq\mathbf{A}_{jk}}. \tag{10}$$

$$\gamma_{jk}=\frac{1}{\mathrm{MSE}_{jk}^{\mathrm{ul}}}-1 \tag{11}$$

$$\mathrm{MSE}_{jk}^{\mathrm{ul}}=\left[\left(\mathbf{I}_{L}+\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\mathbf{A}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\right)^{-1}\right]_{j,j}. \tag{12}$$

$$\mathbf{T}_{j}^{\star}=\bigg(\frac{1}{M}\sum_{l=1}^{L}\sum_{i=1}^{K}\frac{\mathbf{\Phi}_{jlli}}{1+\mu_{jli}^{\star}}+\frac{1}{M}\mathbf{Z}_{j}+\frac{1}{\rho}\mathbf{I}_{M}\bigg)^{-1} \tag{14}$$

$$\mu_{jlk}=\frac{1}{M}\mathrm{tr}\Bigg(\mathbf{\Phi}_{jllk}\bigg(\frac{1}{M}\sum_{l'=1}^{L}\sum_{i=1}^{K}\frac{\mathbf{\Phi}_{jl'l'i}}{1+\mu_{jl'i}}+\frac{1}{M}\mathbf{Z}_{j}+\frac{1}{\rho}\mathbf{I}_{M}\bigg)^{-1}\Bigg). \tag{15}$$

$$\big[\mathbf{B}_{jk}\big]_{l,l'}=\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jl'lk}\mathbf{T}_{j}^{\star}\right) \tag{16}$$

$$\gamma_{jk}\asymp\overline{\gamma}_{jk}=\frac{1}{\big[(\mathbf{I}_{L}+\mathbf{B}_{jk})^{-1}\big]_{j,j}}-1 \tag{17}$$

$$\phantom{\gamma_{jk}\asymp\overline{\gamma}_{jk}}=[\mathbf{B}_{jk}]_{j,j}-\underbrace{\big(\mathbf{b}_{jk}^{[j]}\big)^{\mathrm{H}}\left(\mathbf{I}_{L-1}+\mathbf{B}_{jk}^{[jj]}\right)^{-1}\mathbf{b}_{jk}^{[j]}}_{\triangleq\overline{\zeta}_{jk}} \tag{18}$$

$$\frac{M}{KL}\frac{1}{\varsigma}\leq\frac{\big[\mathbf{B}_{jk}\big]_{j,j}}{\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jjjk}\right)}\leq\frac{M}{KL}\frac{1}{\eta} \tag{19}$$

$$\frac{\big(\frac{M}{KL}\big)^{2}\varsigma'}{1+\frac{M}{KL}\eta'}\leq\overline{\zeta}_{jk}\leq\frac{\big(\frac{M}{KL}\big)^{2}\varsigma'}{1+\frac{M}{KL}\frac{1}{L-1}\eta'} \tag{20}$$

$$\gamma_{jk}\asymp\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jjjk}\mathbf{T}_{j}^{\star}\right)=\mu_{jjk}^{\star} \tag{21}$$

$$\left[\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\right]_{l,l'}\asymp\frac{1}{M}\mathrm{tr}\Big(\mathbf{\Phi}_{jl'lk}\tilde{\mathbf{A}}_{jk}^{-1}\Big)$$

$$\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jl'lk}\tilde{\mathbf{A}}_{jk}^{-1}\right)\asymp\big[\mathbf{B}_{jk}\big]_{l,l'}$$

$$\left\|\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}-\mathbf{B}_{jk}\right\|_{2}\asymp 0$$

$$\left\|\Big(\mathbf{I}_{L}+\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\Big)^{-1}-\Big(\mathbf{I}_{L}+\mathbf{B}_{jk}\Big)^{-1}\right\|_{2}\asymp 0.$$

$$\frac{M}{KL}\,\frac{1}{\underbrace{\max_{j,l,i}\|\mathbf{\Phi}_{jlli}\|_{2}+\max_{j,l,i}\|\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli}\|_{2}+\frac{1}{KL}\frac{1}{\rho}}_{\triangleq\varsigma}}\,\mathbf{I}_{M}\preceq\mathbf{T}_{j}^{\star}\preceq\frac{M}{KL}\,\frac{1}{\underbrace{\min_{j,l,i}\lambda_{\min}(\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli})+\frac{1}{KL}\frac{1}{\rho}}_{\triangleq\eta}}\,\mathbf{I}_{M} \tag{25}$$

$$\frac{M}{KL}\frac{1}{\varsigma}\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jllk}\right)\leq\big[\mathbf{B}_{jk}\big]_{l,l}\leq\frac{M}{KL}\frac{1}{\eta}\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jllk}\right). \tag{26}$$

$$\frac{1}{L-1}\mathrm{tr}\big(\mathbf{B}_{jk}^{[jj]}\big)\mathbf{I}_{L-1}\preceq\mathbf{B}_{jk}^{[jj]}\preceq\mathrm{tr}\big(\mathbf{B}_{jk}^{[jj]}\big)\mathbf{I}_{L-1}. \tag{27}$$


Full text

Large-System Analysis of Massive MIMO with Optimal M-MMSE Processing

Luca Sanguinetti (1), Emil Björnson (2), Abla Kammoun (3)

(1) Dipartimento di Ingegneria dell’Informazione, University of Pisa, Pisa, Italy

(2) Department of Electrical Engineering (ISY), Linköping University, Linköping, Sweden

(3) Electrical Engineering Department, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia


I Introduction

Massive MIMO is a wireless network technology where the base stations (BSs) are equipped with a very large number $M$ of low-power, fully digitally controlled, and physically small antennas to serve a multitude of user equipments (UEs) by spatial multiplexing [2]. A rigorous and mature theory for Massive MIMO has been developed in recent years, as underlined by the recent textbooks [3] and [4].

In industry, exciting developments occurred in 2018. The technology has been integrated into the 5G New Radio standard [5], and the first 64-antenna Massive MIMO BSs have been added to the Ericsson AIR, Huawei AAU, and Nokia AirScale product lines and commercially deployed [6]. This shows that Massive MIMO is no longer merely a promising concept but a reality for cellular networks (below 6 GHz).

In academia, Massive MIMO was originally characterized by the “Marzetta limit” where $M\to\infty$ while the number $K$ of UEs is fixed [2]. This limit is different from the traditional “large-system limit” where $M,K\to\infty$ with a fixed ratio. The Marzetta limit has the practical benefit that the $K$ pilot resources required for channel estimation remain finite even in the asymptotic limit. The Massive MIMO capacity was first believed to be upper bounded because of the coherent interference created by pilot contamination (i.e., reuse of pilots across cells). However, this issue was recently resolved in [1, 7, 8]. More precisely, [1] proved that, with optimal multicell minimum mean-squared error (M-MMSE) processing, the capacity grows unboundedly as $M\to\infty$. The only requirement is that the channel correlation matrices of the contaminating users are asymptotically linearly independent. This was not the case in Marzetta’s original paper [2], but channel measurements show that it is likely the case in practice [9]. Similar results can be obtained by using a generalized matched filter [7, 8].

Any practical system will operate with finite $M$ and $K$. Therefore, the purpose of asymptotic analysis is not the limit itself but to understand the capacity scaling behavior and obtain tight low-complexity performance approximations. To this end, we should choose between the Marzetta limit and the traditional large-system limit depending on whether $M/K$ will be very large or fairly small in practice. Since the sum capacity is often maximized when $M/K$ is fairly small [4, 10], the traditional large-system limit is still of interest.

In this paper, we extend the asymptotic analysis from [1, 7], which considers the Marzetta limit, to the traditional regime in which $M,K\to\infty$ with $\liminf M/K>0$. To the best of our knowledge, only suboptimal schemes such as maximum ratio, zero-forcing, and single-cell MMSE processing are considered in prior work; see e.g., [11, 12]. M-MMSE is investigated in [13] but only for uncorrelated Rayleigh fading channels. This paper fills the gap by providing an analytical framework that allows evaluating the performance of a Massive MIMO network with M-MMSE for practically large values of $M$ and $K$, without the need to carry out computationally demanding Monte Carlo simulations. Moreover, it provides novel insights into the achievable performance when using M-MMSE processing.

II Massive MIMO System Model

We consider a Massive MIMO network with $L$ cells, each comprising a BS with $M$ antennas and $K$ single-antenna UEs. We consider a block-fading system model where each channel takes one realization in a coherence block of $\tau_{c}$ channel uses and independent realizations across blocks. There are $K$ mutually orthogonal pilots and the $k$th UE in each cell uses the same pilot. Following the notation from [11], the received signal ${\bf y}_{j}\in\mathbb{C}^{M}$ at BS $j$ is

$$\mathbf{y}_{j}=\sum_{l=1}^{L}\sum_{i=1}^{K}\sqrt{\rho}\,\mathbf{h}_{jli}x_{li}+\mathbf{n}_{j} \tag{1}$$

where $\rho$ is the normalized transmit power, $x_{li}$ is the signal from UE $i$ in cell $l$ , $\mathbf{n}_{j}\sim\mathcal{N}_{\mathbb{C}}(\mathbf{0},\mathbf{I}_{M})$ is the normalized independent receiver noise at BS $j$ , and $\mathbf{h}_{jli}\sim\mathcal{N}_{\mathbb{C}}(\mathbf{0},\mathbf{R}_{jli})$ is the block-fading channel from this UE to BS $j$ . The covariance/correlation matrix $\mathbf{R}_{jli}\in\mathbb{C}^{M\times M}$ accounts for the large-scale fading, including pathloss and spatial correlation [4]. These matrices are assumed to be known, but practical estimation methods are found in [14, 15, 16].

II-A Channel Estimation and Spectral Efficiency

Using a total uplink pilot power of $\rho^{\rm{tr}}$ per UE and standard MMSE estimation techniques [11], BS $j$ obtains the estimate of $\mathbf{h}_{jli}$ as

$$\hat{\mathbf{h}}_{jli}=\mathbf{R}_{jli}\mathbf{Q}_{ji}^{-1}\bigg(\sum_{l'=1}^{L}\mathbf{h}_{jl'i}+\frac{1}{\sqrt{\rho^{\mathrm{tr}}}}\mathbf{n}_{ji}\bigg)\sim\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{\Phi}_{jlli}\right) \tag{2}$$

where $\mathbf{n}_{ji}\sim\mathcal{N}_{\mathbb{C}}(\mathbf{0},\mathbf{I}_{M})$, $\mathbf{Q}_{ji}=\sum_{l^{\prime}=1}^{L}\mathbf{R}_{jl^{\prime}i}+\frac{1}{\rho^{\rm{tr}}}\mathbf{I}_{M}$, and $\mathbf{\Phi}_{jlli}=\mathbf{R}_{jli}\mathbf{Q}_{ji}^{-1}\mathbf{R}_{jli}$. The estimation error $\tilde{\mathbf{h}}_{jli}=\mathbf{h}_{jli}-\hat{\mathbf{h}}_{jli}\sim\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli}\right)$ is independent of $\hat{\mathbf{h}}_{jli}$. The mutual interference generated by the pilot-sharing UEs is known as pilot contamination and has two main consequences in the channel estimation process [4, Sec. 3.3.2]. The first is the reduced estimation quality, whereas the second is that the estimates $\hat{\mathbf{h}}_{j1i},\ldots,\hat{\mathbf{h}}_{jLi}$ become correlated:

$$\mathbb{E}\{\hat{\mathbf{h}}_{jl'i}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}\}=\mathbf{\Phi}_{jl'li}=\mathbf{R}_{jl'i}\mathbf{Q}_{ji}^{-1}\mathbf{R}_{jli}. \tag{3}$$

Both have an impact on the UEs’ performance, but only the second one is responsible for the so-called coherent interference [4, Sec. 4.2], which might increase linearly with $M$, just as the signal term. This is investigated later in detail.
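The two consequences of pilot contamination can be checked numerically. The following is a minimal sketch, assuming a hypothetical setup with $L=2$ pilot-sharing UEs and exponential correlation matrices; the helper names `exp_corr` and `mmse_statistics` and all numeric values are illustrative and not taken from the paper.

```python
import numpy as np

def exp_corr(M, r, beta=1.0):
    """Illustrative correlation matrix with entries beta * r^{|m-n|}."""
    idx = np.arange(M)
    return beta * r ** np.abs(idx[:, None] - idx[None, :])

def mmse_statistics(R_list, rho_tr):
    """Return Q and Phi[lp][l] = R_lp Q^{-1} R_l for the pilot-sharing UEs at one BS."""
    M = R_list[0].shape[0]
    Q = sum(R_list) + np.eye(M) / rho_tr
    Q_inv = np.linalg.inv(Q)
    Phi = [[Rp @ Q_inv @ R for R in R_list] for Rp in R_list]
    return Q, Phi

M, rho_tr = 8, 10.0
R_list = [exp_corr(M, 0.5, 1.0), exp_corr(M, 0.3, 0.2)]  # hypothetical L = 2 example
Q, Phi = mmse_statistics(R_list, rho_tr)

# First consequence: the error covariance R - Phi_ll is nonzero but positive semidefinite
err_cov = R_list[0] - Phi[0][0]
min_eig_err = np.linalg.eigvalsh(err_cov).min()

# Second consequence: the cross-covariance of the two estimates is nonzero
cross_norm = np.linalg.norm(Phi[1][0])
```

A nonzero `cross_norm` reflects the correlation between the estimates of pilot-sharing UEs described above.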

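We call ${\bf v}_{jk}\in\mathbb{C}^{M}$ the receive combining vector associated with UE $k$ in cell $j$. The uplink ergodic capacity can be lower bounded by the achievable spectral efficiency (SE) [3, 4]

$$\mathrm{SE}_{jk}^{\mathrm{ul}}=\Big(1-\frac{K}{\tau_{c}}\Big)\mathbb{E}\{\log_{2}(1+\gamma_{jk})\}\quad\text{[bit/s/Hz]} \tag{4}$$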

with the instantaneous effective SINR

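$$\gamma_{jk}=\frac{|\mathbf{v}_{jk}^{\mathrm{H}}\hat{\mathbf{h}}_{jjk}|^{2}}{\mathbf{v}_{jk}^{\mathrm{H}}\Big(\sum\limits_{(l,i)\neq(j,k)}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}\Big)\mathbf{v}_{jk}} \tag{5}$$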

where ${\mathbb{E}}\{\cdot|\{\hat{\mathbf{h}}_{jli}\}\}$ denotes the conditional expectation given the MMSE estimates $\{\hat{\mathbf{h}}_{jli}:\forall l,i\}$ available at BS $j$ and

$$\mathbf{Z}_{j}=\sum_{l=1}^{L}\sum_{i=1}^{K}\left(\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli}\right). \tag{6}$$

II-B Optimal Receive Combining: M-MMSE

For notational convenience, we define $\hat{\mathbf{H}}_{jk}\in\mathbb{C}^{M\times L}$ as

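$$\hat{\mathbf{H}}_{jk}=[\hat{\mathbf{h}}_{j1k},\hat{\mathbf{h}}_{j2k},\ldots,\hat{\mathbf{h}}_{jLk}] \tag{7}$$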

the matrix collecting channel estimates of pilot sharing UEs and call $\hat{\mathbf{H}}_{jk}^{[j]}\in\mathbb{C}^{M\times(L-1)}$ the matrix obtained from $\hat{\mathbf{H}}_{jk}$ after removing the vector $\hat{\mathbf{h}}_{jjk}$ .

As shown in [1, 13], the instantaneous effective SINR in (5) is a generalized Rayleigh quotient with respect to $\mathbf{v}_{jk}$ and thus is maximized by the M-MMSE combining vector:

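$$\mathbf{v}_{jk}=\bigg(\sum_{l=1}^{L}\sum_{i=1}^{K}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}\bigg)^{-1}\hat{\mathbf{h}}_{jjk}. \tag{8}$$

Plugging (8) into (5) yields

$$\gamma_{jk}=\hat{\mathbf{h}}_{jjk}^{\mathrm{H}}\mathbf{U}_{jk}^{-1}\hat{\mathbf{h}}_{jjk} \tag{9}$$

where

$$\mathbf{U}_{jk}=\hat{\mathbf{H}}_{jk}^{[j]}(\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}}+\overbrace{\sum_{l}\sum_{i\neq k}\hat{\mathbf{h}}_{jli}\hat{\mathbf{h}}_{jli}^{\mathrm{H}}+\mathbf{Z}_{j}+\frac{1}{\rho^{\mathrm{ul}}}\mathbf{I}_{M}}^{\triangleq\mathbf{A}_{jk}}. \tag{10}$$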

It can be shown that (8) also minimizes ${\rm{MSE}}_{jk}^{\rm{ul}}=\mathbb{E}\{|s_{jk}-\mathbf{v}_{jk}^{\mbox{\tiny$ \mathrm{H} $}}\mathbf{y}_{j}|^{2}\,|\,\{\hat{\mathbf{h}}_{jli}\}\}$ which represents the conditional MSE between the data signal $s_{jk}$ and the received signal $\mathbf{v}_{jk}^{\mbox{\tiny$ \mathrm{H} $}}\mathbf{y}_{j}$ after receive combining. By using standard calculus, (9) can be equivalently expressed as

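$$\gamma_{jk}=\frac{1}{\mathrm{MSE}_{jk}^{\mathrm{ul}}}-1 \tag{11}$$

where ${\rm{MSE}}_{jk}^{\rm{ul}}$ (as obtained after plugging (8) into its definition) reads

$$\mathrm{MSE}_{jk}^{\mathrm{ul}}=\left[\left(\mathbf{I}_{L}+\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\mathbf{A}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\right)^{-1}\right]_{j,j}. \tag{12}$$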
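The equivalence between the SINR in (5) with the combiner (8), the quadratic form (9), and the MSE form (11)-(12) can be checked numerically. The sketch below assumes random stand-ins for the channel estimates and for $\mathbf{Z}_{j}$; the dimensions, the value of `rho_ul`, and the helper `outer_sum` are illustrative and not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
M, L, K, rho_ul = 16, 3, 4, 2.0
j, k = 0, 0  # cell and UE of interest (0-based indices)

# Hypothetical stand-ins: random "estimates" h_hat[l, i] and a PSD matrix Z_j
h_hat = (rng.standard_normal((L, K, M)) + 1j * rng.standard_normal((L, K, M))) / np.sqrt(2)
G = rng.standard_normal((M, M))
Z = G @ G.T / M

def outer_sum(pairs):
    """Sum of h h^H over the given (cell, UE) pairs."""
    return sum(np.outer(h_hat[l, i], h_hat[l, i].conj()) for l, i in pairs)

noise = Z + np.eye(M) / rho_ul
all_pairs = [(l, i) for l in range(L) for i in range(K)]
h = h_hat[j, k]

# (8): M-MMSE combining vector
v = np.linalg.solve(outer_sum(all_pairs) + noise, h)

# (5): SINR achieved by this v; the denominator matrix equals U_jk in (10)
U = outer_sum([p for p in all_pairs if p != (j, k)]) + noise
sinr_5 = abs(v.conj() @ h) ** 2 / (v.conj() @ U @ v).real

# (9): quadratic form in U_jk
sinr_9 = (h.conj() @ np.linalg.solve(U, h)).real

# (11)-(12): L x L MSE matrix, with A_jk excluding every pilot-sharing UE
A = outer_sum([(l, i) for l in range(L) for i in range(K) if i != k]) + noise
H = np.stack([h_hat[l, k] for l in range(L)], axis=1)  # M x L
mse = np.linalg.inv(np.eye(L) + H.conj().T @ np.linalg.solve(A, H))[j, j].real
sinr_11 = 1.0 / mse - 1.0
```

The form (11)-(12) only requires the inverse of $\mathbf{A}_{jk}$ and of an $L\times L$ matrix, which is what makes the later asymptotic analysis convenient.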

Notice that the right-hand side of (12) can be rewritten in many equivalent forms by collecting the channel estimate vectors in (10) in different matrices. The reason that we consider the form in (12) is that ${\bf A}_{jk}$ is independent of $\hat{\mathbf{H}}_{jk}$. This not only makes the asymptotic analysis of (11) rather simple (as shown later) but also allows us to gain the following insights. By using the same steps as in [17, Eq. (8)], (11) can be equivalently rewritten as in (13), which is obtained as the difference between two terms. The first depends on the inverse of the matrix ${\bf A}_{jk}$ defined in (10), which is obtained from all the UE channels that do not cause pilot contamination to UE $k$ in cell $j$. The second term in (13) depends not only on ${\bf A}_{jk}$ but also on the channel estimates of all the pilot-sharing UEs, which enter into $\hat{\mathbf{H}}_{jk}^{[j]}$. Therefore, it can be seen as the loss induced in the effective instantaneous SINR by the correlation among pilot-contaminating channels. Notice that, although independent of $\hat{\mathbf{H}}_{jk}^{[j]}$, the first term is also affected by pilot contamination due to the reduced channel estimation quality. As shown later by simulations, both terms grow with $M/K$ when $M,K\to\infty$.
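Equation (13) is not reproduced in this copy of the text. As a sketch consistent with (9), (10), and the description above, applying the matrix inversion lemma to $\mathbf{U}_{jk}=\mathbf{A}_{jk}+\hat{\mathbf{H}}_{jk}^{[j]}(\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}}$ in (9) gives a difference of two terms of the kind described; the exact typeset form in the paper may differ.

```latex
\gamma_{jk}
= \hat{\mathbf{h}}_{jjk}^{\mathrm{H}} \mathbf{A}_{jk}^{-1} \hat{\mathbf{h}}_{jjk}
- \hat{\mathbf{h}}_{jjk}^{\mathrm{H}} \mathbf{A}_{jk}^{-1} \hat{\mathbf{H}}_{jk}^{[j]}
  \Big( \mathbf{I}_{L-1} + (\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}} \mathbf{A}_{jk}^{-1} \hat{\mathbf{H}}_{jk}^{[j]} \Big)^{-1}
  (\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}} \mathbf{A}_{jk}^{-1} \hat{\mathbf{h}}_{jjk}
```

The first term involves only $\mathbf{A}_{jk}$, while the subtracted term is a nonnegative quadratic form that vanishes when $(\hat{\mathbf{H}}_{jk}^{[j]})^{\mathrm{H}}\mathbf{A}_{jk}^{-1}\hat{\mathbf{h}}_{jjk}=\mathbf{0}$.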

Table I summarizes the total complexity for evaluating (9) and (11) (in terms of number of complex multiplications) for each coherence block, under the assumption that the statistical matrices $\{{\bf Z}_{j}\}$ and $\{\mathbf{R}_{jli},{\bf{Q}}_{ji}^{-1}\}$ are precomputed and stored at the BSs. Clearly, the computation of the effective SINR is very involved in all cases. In particular, the complexity scales as $M^{3}$ and $M^{2}K$, which are of the same order when $M$ and $K$ grow with a fixed ratio. Notice also that these operations must be performed over hundreds of coherence blocks to obtain a good estimate of the SE as given by (4). This makes it hard to evaluate the SE when $M$ and $K$ grow large, as envisioned in future Massive MIMO networks. Nevertheless, the evaluation of the effective SINR can be crucial for both physical layer (link-level) and network layer (system-level) simulations and optimization. While the former aims at investigating issues such as adaptive modulation and coding, feedback, channel encoding and decoding, the latter focuses on network-related issues such as scheduling and mobility management [18].
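The operation counts in Table I are easy to evaluate for a given configuration. The following sketch simply transcribes the two rows of the table; the function names and the example values are illustrative.

```python
def mults_eq9(M, K, L, tau_p):
    """Complex multiplications per coherence block for (9), following Table I."""
    estimation = M * tau_p + L * M**2
    sinr = (M**2 + M) // 2 * (L * K + 1) + (M**3 - M) // 3
    return estimation, sinr

def mults_eq11(M, K, L, tau_p):
    """Complex multiplications per coherence block for (11), following Table I."""
    estimation = M * tau_p + L * M**2
    sinr = ((M**2 + M) // 2 * (L**2 * (K + 2) + L)
            + (M**3 - M) // 3 + (L**3 - L) // 3)
    return estimation, sinr

# Hypothetical configuration with M/K = 4 and tau_p = K pilots
est9, sinr9 = mults_eq9(M=100, K=25, L=4, tau_p=25)
est11, sinr11 = mults_eq11(M=100, K=25, L=4, tau_p=25)
```

Integer division is exact here because $M^{2}+M$ is always even and $M^{3}-M$ and $L^{3}-L$ are always divisible by 3.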

III Asymptotic Analysis

As mentioned in the introduction, we want to analyze $\gamma_{jk}$ in the regime where $M,K\to\infty$ with $\liminf M/K>0$, which might provide better approximations of practical setups where both $M$ and $K$ are large. To this end, we assume that $\rho^{\rm{ul}}=\rho/M$ with $\rho$ being fixed and make the following assumptions.

Assumption 1.

As $M\to\infty$, $\forall j,l,i$, $\liminf_{M}\;\frac{1}{{M}}\mathrm{tr}(\mathbf{R}_{jli})>0$ and $\limsup_{M}\;\|\mathbf{R}_{jli}\|_{2}<\infty$.

These conditions are widely used for the asymptotic analysis [11, 4] of Massive MIMO. The first implies that the array gathers more energy as $M$ increases, whereas the second implies that the energy is spread over many spatial dimensions.

For convenience, we define

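$$\mathbf{T}_{j}^{\star}=\bigg(\frac{1}{M}\sum_{l=1}^{L}\sum_{i=1}^{K}\frac{\mathbf{\Phi}_{jlli}}{1+\mu_{jli}^{\star}}+\frac{1}{M}\mathbf{Z}_{j}+\frac{1}{\rho}\mathbf{I}_{M}\bigg)^{-1} \tag{14}$$

where the coefficients $\{\mu_{jli}^{\star}:\forall l,i\}$ are solutions of the following system of equations:

$$\mu_{jlk}=\frac{1}{M}\mathrm{tr}\Bigg(\mathbf{\Phi}_{jllk}\bigg(\frac{1}{M}\sum_{l'=1}^{L}\sum_{i=1}^{K}\frac{\mathbf{\Phi}_{jl'l'i}}{1+\mu_{jl'i}}+\frac{1}{M}\mathbf{Z}_{j}+\frac{1}{\rho}\mathbf{I}_{M}\bigg)^{-1}\Bigg). \tag{15}$$

Moreover, we define $\mathbf{B}_{jk}\in\mathbb{C}^{L\times L}$ with entries

$$\big[\mathbf{B}_{jk}\big]_{l,l'}=\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jl'lk}\mathbf{T}_{j}^{\star}\right) \tag{16}$$

where $\mathbf{\Phi}_{jl^{\prime}lk}$ is given by (3), and denote by $\mathbf{B}_{jk}^{[jj]}\in\mathbb{C}^{(L-1)\times(L-1)}$ the matrix obtained from $\mathbf{B}_{jk}$ after removing the $j$th column and $j$th row. Also, $\mathbf{b}_{jk}^{[j]}\in\mathbb{C}^{L-1}$ is obtained from the $j$th column of $\mathbf{B}_{jk}$ after removing $[\mathbf{B}_{jk}]_{j,j}$.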

Theorem 1.

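If Assumption 1 holds and M-MMSE combining is used with $\rho^{\rm{ul}}=\rho/M$, then

$$\gamma_{jk}\asymp\overline{\gamma}_{jk}=\frac{1}{\big[(\mathbf{I}_{L}+\mathbf{B}_{jk})^{-1}\big]_{j,j}}-1 \tag{17}$$

$$\phantom{\gamma_{jk}\asymp\overline{\gamma}_{jk}}=[\mathbf{B}_{jk}]_{j,j}-\underbrace{\big(\mathbf{b}_{jk}^{[j]}\big)^{\mathrm{H}}\left(\mathbf{I}_{L-1}+\mathbf{B}_{jk}^{[jj]}\right)^{-1}\mathbf{b}_{jk}^{[j]}}_{\triangleq\overline{\zeta}_{jk}} \tag{18}$$

when $M,K\to\infty$ with $\liminf M/K>0$.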

Proof:

The proof of (17) is given in the appendix by applying tools from random matrix theory to (11). Simple arguments (e.g., [17]) can be used to obtain (18) from (17), which can be seen as an asymptotic approximation of (13). Interestingly, the asymptotic analysis is much simpler than that for S-MMSE [11], where similar tools can be used. This is because with S-MMSE, $\gamma_{jk}$ does not reduce to the quadratic form in (9) (from which (11) follows) as with M-MMSE, and thus an asymptotic approximation can only be obtained by deriving asymptotic expressions for each single term in (5). This latter approach was also taken in [13], even though M-MMSE was considered (but for uncorrelated channels). ∎
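The step from (17) to (18) is a Schur-complement identity for the $(j,j)$ entry of $(\mathbf{I}_{L}+\mathbf{B}_{jk})^{-1}$. A minimal numerical check, assuming a random Hermitian positive semidefinite matrix as a hypothetical stand-in for $\mathbf{B}_{jk}$ (the function names are illustrative):

```python
import numpy as np

def sinr_17(B, j):
    """Approximation in the form (17): 1 / [(I + B)^{-1}]_{j,j} - 1."""
    L = B.shape[0]
    return 1.0 / np.linalg.inv(np.eye(L) + B)[j, j].real - 1.0

def sinr_18(B, j):
    """Approximation in the form (18): [B]_{j,j} minus the loss term zeta."""
    L = B.shape[0]
    keep = [l for l in range(L) if l != j]
    b = B[keep, j]                  # j-th column without [B]_{j,j}
    B_jj = B[np.ix_(keep, keep)]    # B with j-th row and column removed
    zeta = (b.conj() @ np.linalg.solve(np.eye(L - 1) + B_jj, b)).real
    return B[j, j].real - zeta, zeta

rng = np.random.default_rng(1)
G = rng.standard_normal((4, 6)) + 1j * rng.standard_normal((4, 6))
B = G @ G.conj().T / 6              # hypothetical Hermitian PSD stand-in for B_jk
g17 = sinr_17(B, j=0)
g18, zeta = sinr_18(B, j=0)
```

The loss term `zeta` is nonnegative, so the approximation never exceeds $[\mathbf{B}_{jk}]_{j,j}$.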

Theorem 1 provides asymptotic approximations of $\gamma_{jk}$ that are deterministic and thus can be inserted into (4) to directly obtain approximations of the SE, without the need to evaluate the expectation. The computation requires first to obtain the coefficients $\{\mu_{jli}^{\star}:\forall j\}$ by solving $L$ sets of $KL$ fixed-point equations. In [19], it is proved that $\{\mu_{jli}^{\star}:\forall j\}$ can be efficiently obtained by an iterative algorithm, which needs only a few iterations to converge. We notice that $\{\mu_{jli}^{\star}\}$ only depend on the channel statistics and, therefore, can be precomputed and only updated when the channel statistics change substantially (e.g., due to UE mobility or new scheduling decisions).
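The coefficients in (15) can be obtained by iterating the right-hand side until it stops changing. The sketch below is a plain fixed-point iteration under illustrative assumptions: the statistics `Phi` and `Z` are synthetic stand-ins, the starting point $\mu=1$ and the stopping tolerance are arbitrary choices, and it is not claimed to be the specific algorithm of [19].

```python
import numpy as np

def t_star(Phi, Z, rho, mu):
    """T_j from (14) for given coefficients mu; Phi has shape (L, K, M, M)."""
    M = Phi.shape[-1]
    S = np.einsum('lkmn,lk->mn', Phi, 1.0 / (1.0 + mu)) / M
    return np.linalg.inv(S + Z / M + np.eye(M) / rho)

def solve_mu(Phi, Z, rho, tol=1e-10, max_iter=1000):
    """Plain fixed-point iteration on (15), started from mu = 1."""
    M = Phi.shape[-1]
    mu = np.ones(Phi.shape[:2])
    for _ in range(max_iter):
        T = t_star(Phi, Z, rho, mu)
        mu_new = np.einsum('lkmn,nm->lk', Phi, T).real / M  # (1/M) tr(Phi T)
        converged = np.max(np.abs(mu_new - mu)) < tol
        mu = mu_new
        if converged:
            break
    return mu, t_star(Phi, Z, rho, mu)

# Hypothetical statistics for one BS: L = 2 cells, K = 3 UEs, M = 8 antennas
M, rho = 8, 10.0
idx = np.arange(M)
R0 = 0.5 ** np.abs(idx[:, None] - idx[None, :])
betas = np.array([[1.0, 0.5, 0.8], [0.1, 0.05, 0.2]])
Phi = 0.8 * betas[:, :, None, None] * R0   # stand-in for Phi_{jlli}
Z = 0.2 * betas.sum() * R0                 # stand-in for Z_j
mu_star, T_star = solve_mu(Phi, Z, rho)
residual = np.max(np.abs(np.einsum('lkmn,nm->lk', Phi, T_star).real / M - mu_star))
```

Because the inputs are channel statistics only, the result can be cached and reused across coherence blocks, in line with the remark above.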

Once $\{\mu_{jli}^{\star}\}$ are computed, we need roughly $\frac{4M^{3}-M}{3}KL^{2}$ complex multiplications to compute (17), which is not too different from the complexity of computing (9) and (11) (see Table I). The key difference is that the latter ones need to be computed for every channel realization (or at least very many realizations to approximate the expectation in (4) by Monte Carlo simulations). Hence, the asymptotic approximation $\overline{\gamma}_{jk}$ will substantially reduce the computational burden. Moreover, the numerical results in Section IV show that it is accurate for systems with finite dimensions, in addition to being asymptotically tight.

In the appendix, it is shown that the two terms in (18) can be bounded as follows:

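$$\frac{M}{KL}\frac{1}{\varsigma}\leq\frac{\big[\mathbf{B}_{jk}\big]_{j,j}}{\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jjjk}\right)}\leq\frac{M}{KL}\frac{1}{\eta} \tag{19}$$

and

$$\frac{\big(\frac{M}{KL}\big)^{2}\varsigma'}{1+\frac{M}{KL}\eta'}\leq\overline{\zeta}_{jk}\leq\frac{\big(\frac{M}{KL}\big)^{2}\varsigma'}{1+\frac{M}{KL}\frac{1}{L-1}\eta'} \tag{20}$$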

where $\eta,\eta^{\prime},\varsigma$ and $\varsigma^{\prime}$ are defined in the appendix. As seen, both terms are bounded below and above by $M/K$ (up to constant factors), as validated later by numerical results.

Remark 1 (Orthogonal correlation matrices).

It is known that the SE increases when the interfering UEs have different spatial correlation properties [4]. This is confirmed by the expression in (17). In the extreme case of $\mathbf{R}_{jl^{\prime}k}\mathbf{R}_{jlk}={\bf 0}_{M}$ $\forall l^{\prime}\neq l$, we have that $\mathbf{B}_{jk}$ becomes diagonal and thus

$$\gamma_{jk}\asymp\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jjjk}\mathbf{T}_{j}^{\star}\right)=\mu_{jjk}^{\star} \tag{21}$$

where $\mu_{jjk}^{\star}$ is obtained from (15) after replacing $\mathbf{\Phi}_{jlli}$ with $\mathbf{\Phi}_{jlli}=\mathbf{R}_{jli}\big{(}\mathbf{R}_{jli}+\frac{1}{\rho^{\rm{tr}}}\mathbf{I}_{M}\big{)}^{-1}\mathbf{R}_{jli}$ [4, Lemma B.6], which does not depend on the pilot-sharing UEs. A similar result holds if $\{\mathbf{R}_{jl^{\prime}k}:\forall l^{\prime}\neq l\}$ are asymptotically spatially orthogonal, i.e., $\frac{1}{M}\mathrm{tr}\big{(}\mathbf{R}_{jl^{\prime}k}\mathbf{R}_{jlk}\big{)}\asymp 0$. This implies that the loss due to correlation among pilot-contaminating channels in (13) can be avoided if their correlation matrices are (asymptotically) spatially orthogonal. However, this condition only appears in special cases [4] and thus the SINR will always be affected by pilot contamination in practice.
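The orthogonal case of Remark 1 can be illustrated with a toy example. The sketch below assumes two hypothetical diagonal correlation matrices with disjoint supports, so that their product is zero; the values are illustrative only.

```python
import numpy as np

M, rho_tr = 8, 10.0
# Hypothetical correlation matrices with orthogonal supports: R1 @ R2 = 0
R1 = np.diag([1.0] * 4 + [0.0] * 4)
R2 = np.diag([0.0] * 4 + [0.5] * 4)

Q_inv = np.linalg.inv(R1 + R2 + np.eye(M) / rho_tr)
Phi_12 = R1 @ Q_inv @ R2   # cross term of (3) for two different cells
Phi_11 = R1 @ Q_inv @ R1   # own term, computed with the pilot-sharing UE present
Phi_11_alone = R1 @ np.linalg.inv(R1 + np.eye(M) / rho_tr) @ R1  # computed without it
```

In this example the cross term vanishes, which makes $\mathbf{B}_{jk}$ diagonal, and the own term is the same with or without the pilot-sharing UE.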

IV Numerical results

The asymptotic analysis is now validated by using the network setup in Table II. Each BS is equipped with a uniform linear array with half-wavelength antenna spacing. The correlation matrices $\{\mathbf{R}_{li}^{j}\}$ are generated by using the exponential correlation model with correlation factor $r=0.5$ between adjacent antennas. The large-scale fading coefficient $\beta_{li}^{j}$ is reported in Table II. The normalized transmit power is $\rho=114$ dBm, while $\rho^{\rm{tr}}=\rho K$.
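The channel statistics of this setup can be generated in a few lines. The sketch below assumes a real-valued correlation factor, so that the entries of the correlation matrix are $\beta\, r^{|m-n|}$, and takes the shadow fading term as an input rather than sampling it; the function names and the example distance are illustrative.

```python
import numpy as np

def large_scale_fading_db(d_km, shadow_db=0.0):
    """Large-scale fading in dB from Table II: -148.1 - 37.6 log10(d / 1 km) + F."""
    return -148.1 - 37.6 * np.log10(d_km) + shadow_db

def exp_corr_matrix(M, r, beta_db):
    """Exponential correlation model scaled by the linear-scale fading coefficient."""
    beta = 10.0 ** (beta_db / 10.0)
    idx = np.arange(M)
    return beta * r ** np.abs(idx[:, None] - idx[None, :])

beta_db = large_scale_fading_db(d_km=0.1)           # hypothetical UE at 100 m, no shadowing
R = exp_corr_matrix(M=16, r=0.5, beta_db=beta_db)   # r = 0.5 as in the text
```

With this normalization, $\frac{1}{M}\mathrm{tr}(\mathbf{R})$ equals the linear-scale fading coefficient.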

Fig. 1 plots the average sum SE per cell as a function of $K$ when $M$ is increased proportionally to $K$ with $M/K=2,4$. The curve ‘Sim’ refers to the SE obtained with M-MMSE by Monte Carlo simulations, while ‘Approx’ is computed by means of the asymptotic approximation provided in Theorem 1. As seen, the SE obtained with the asymptotic approximation perfectly matches the Monte Carlo simulations in all investigated cases. While the SINR (not shown for space limitations) grows linearly with $K$ in both cases, the SE starts decreasing because of the pilot overhead, which enters in (4) through the pre-log factor. To quantify the impact of the SINR loss caused by the pilot-contaminating UEs, we also report the SE as obtained with (13) after neglecting the second term. Only a negligible difference is observed. This means that the correlation among channel estimates of pilot-sharing UEs has a very minor impact on SE.
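The effect of the pre-log factor in (4) can be isolated with a toy calculation. The sketch below assumes, purely for illustration, that all $K$ UEs see the same deterministic SINR and that this SINR is held fixed as $K$ varies; it does not reproduce the simulated curves.

```python
import math

def sum_se_per_cell(K, tau_c, sinr):
    """Sum SE of K UEs with a common deterministic SINR, using the pre-log factor of (4)."""
    return K * (1.0 - K / tau_c) * math.log2(1.0 + sinr)

tau_c = 200   # samples per coherence block (Table II)
sinr = 10.0   # hypothetical SINR, held fixed for illustration
curve = {K: sum_se_per_cell(K, tau_c, sinr) for K in range(1, tau_c)}
best_K = max(curve, key=curve.get)
```

Under this simplification the sum SE is proportional to $K(1-K/\tau_{c})$, so it increases up to $K=\tau_{c}/2$ and decreases afterwards because the pilot overhead dominates.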

To validate the scaling behaviour of the two terms in (13) and quantify their relative importance, Fig. 2 plots their average values in dB for an arbitrary UE in the cell. The results obtained with the asymptotic approximations in (18) perfectly match the Monte Carlo simulations. Moreover, both terms remain constant as $K$ grows but increase with $M/K$. The first term is roughly $40$ to $50$ dB higher than the second one for both antenna-UE ratios. Although the situation is different if a specific UE in the cell is considered, the loss caused by the correlation among pilot-contaminating channels is always several dB lower. This implies that it has a minor impact compared to intra- and inter-cell interference.

V Conclusions

We analyzed Massive MIMO in the traditional large-system limit where the numbers of antennas and UEs grow with a fixed ratio, which is different from the “Marzetta limit” where only the number of antennas grows. We provided an asymptotically tight low-complexity approximation of the uplink SINR in Massive MIMO networks with the optimal M-MMSE combiner and arbitrary correlated Rayleigh fading channels. Numerical results were used to validate the high accuracy of this approximation for realistic system dimensions. When applied to practical networks, such a result can be used to evaluate the SE of the network and/or the effective SINR without carrying out extensive Monte Carlo simulations. In particular, expressions like this are valuable for resource allocation and optimization, as exemplified in [13].

Appendix

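Since ${\bf A}_{jk}$ is independent of $\hat{\mathbf{H}}_{jk}$ and $\hat{\mathbf{h}}_{jlk}\sim\mathcal{N}_{\mathbb{C}}\left(\mathbf{0},\mathbf{\Phi}_{jllk}\right)$, under Assumption 1 it follows from the trace lemma [19] that [footnote 1: it can be shown that the matrices $\mathbf{\Phi}_{jllk}$ have uniformly bounded spectral norm due to Assumption 1]

$$\left[\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\right]_{l,l'}\asymp\frac{1}{M}\mathrm{tr}\Big(\mathbf{\Phi}_{jl'lk}\tilde{\mathbf{A}}_{jk}^{-1}\Big)$$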

with $\tilde{\bf A}_{jk}=\frac{1}{M}{\bf A}_{jk}$ and $\mathbf{\Phi}_{jl^{\prime}lk}$ given by (3). By using [11, Th. 1] under Assumption 1, we obtain

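$$\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jl'lk}\tilde{\mathbf{A}}_{jk}^{-1}\right)\asymp\big[\mathbf{B}_{jk}\big]_{l,l'}$$

where the entries of $\mathbf{B}_{jk}$ are defined in (16). Since each of the entries of $\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mbox{\tiny$ \mathrm{H} $}}\tilde{\bf A}_{jk}^{-1}\hat{\mathbf{H}}_{jk}$ converges, we have that

$$\left\|\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}-\mathbf{B}_{jk}\right\|_{2}\asymp 0$$

from which it follows that

$$\left\|\Big(\mathbf{I}_{L}+\frac{1}{M}\hat{\mathbf{H}}_{jk}^{\mathrm{H}}\tilde{\mathbf{A}}_{jk}^{-1}\hat{\mathbf{H}}_{jk}\Big)^{-1}-\Big(\mathbf{I}_{L}+\mathbf{B}_{jk}\Big)^{-1}\right\|_{2}\asymp 0.$$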

Plugging this result into (11) we obtain (17) from the continuous mapping theorem.
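The trace lemma used at the start of this proof can be illustrated with a single random draw. The sketch below assumes a synthetic covariance matrix and a synthetic matrix playing the role of $\tilde{\mathbf{A}}_{jk}$, generated independently of the channel vector; the dimension, seed, and matrices are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)
M = 600
idx = np.arange(M)
Phi = 0.5 ** np.abs(idx[:, None] - idx[None, :])  # hypothetical covariance with bounded norm
G = (rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M))) / np.sqrt(2)
A_tilde = G @ G.conj().T / M + np.eye(M)          # stand-in for A_jk / M, independent of h

# Draw h ~ CN(0, Phi) through the Cholesky factor of Phi
w = (rng.standard_normal(M) + 1j * rng.standard_normal(M)) / np.sqrt(2)
h = np.linalg.cholesky(Phi) @ w

quad_form = (h.conj() @ np.linalg.solve(A_tilde, h)).real / M
trace_term = np.trace(np.linalg.solve(A_tilde, Phi)).real / M
rel_gap = abs(quad_form - trace_term) / trace_term
```

The random quadratic form concentrates around the deterministic trace as $M$ grows, with a gap that shrinks roughly like $1/\sqrt{M}$.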

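Under Assumption 1, the matrix ${\bf T}_{j}^{\star}$ can be bounded as in (25):

$$\frac{M}{KL}\,\frac{1}{\underbrace{\max_{j,l,i}\|\mathbf{\Phi}_{jlli}\|_{2}+\max_{j,l,i}\|\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli}\|_{2}+\frac{1}{KL}\frac{1}{\rho}}_{\triangleq\varsigma}}\,\mathbf{I}_{M}\preceq\mathbf{T}_{j}^{\star}\preceq\frac{M}{KL}\,\frac{1}{\underbrace{\min_{j,l,i}\lambda_{\min}(\mathbf{R}_{jli}-\mathbf{\Phi}_{jlli})+\frac{1}{KL}\frac{1}{\rho}}_{\triangleq\eta}}\,\mathbf{I}_{M} \tag{25}$$

Hence, from (16) we have that

$$\frac{M}{KL}\frac{1}{\varsigma}\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jllk}\right)\leq\big[\mathbf{B}_{jk}\big]_{l,l}\leq\frac{M}{KL}\frac{1}{\eta}\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jllk}\right). \tag{26}$$

For the second term in (18), we notice that

$$\frac{1}{L-1}\mathrm{tr}\big(\mathbf{B}_{jk}^{[jj]}\big)\mathbf{I}_{L-1}\preceq\mathbf{B}_{jk}^{[jj]}\preceq\mathrm{tr}\big(\mathbf{B}_{jk}^{[jj]}\big)\mathbf{I}_{L-1}. \tag{27}$$

By using (26) and (27) together with the fact that ${\bf x}^{\mathrm{H}}\mathbf{A}{\bf x}\leq{\bf x}^{\mathrm{H}}\mathbf{C}{\bf x}$ if $\mathbf{C}-\mathbf{A}\succeq\mathbf{0}$, we thus obtain (20) with $\eta^{\prime}=\frac{1}{\eta}\sum\nolimits_{l=1,l\neq j}^{L}\frac{1}{M}\mathrm{tr}\left(\mathbf{\Phi}_{jllk}\right)$ and $\varsigma^{\prime}=\frac{1}{\varsigma^{2}}\sum\nolimits_{l=1,l\neq j}^{L}\big{(}\frac{1}{M}\mathrm{tr}(\mathbf{\Phi}_{jllk})\big{)}^{2}$.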

Bibliography

[1] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO has unlimited capacity,” IEEE Trans. Wireless Commun., vol. 17, no. 1, pp. 574–590, 2018.
[2] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11, pp. 3590–3600, Nov. 2010.
[3] T. L. Marzetta, E. G. Larsson, H. Yang, and H. Q. Ngo, Fundamentals of Massive MIMO. Cambridge University Press, 2016.
[4] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks: Spectral, energy, and hardware efficiency,” Foundations and Trends® in Signal Processing, vol. 11, no. 3-4, pp. 154–655, 2017. [Online]. Available: http://dx.doi.org/10.1561/2000000093
[5] S. Parkvall, E. Dahlman, A. Furuskär, and M. Frenne, “NR: The new 5G radio access technology,” IEEE Commun. Std. Mag., vol. 1, no. 4, pp. 24–30, Dec. 2017.
[6] “Sprint unveils six 5G-ready cities; significant milestone toward launching first 5G mobile network in the U.S.” https://newsroom.sprint.com/sprint-unveils-5g-ready-massive-mimo-markets.htm
[7] D. Neumann, T. Wiese, M. Joham, and W. Utschick, “A bilinear equalizer for massive MIMO systems,” IEEE Trans. Signal Process., vol. 66, no. 14, pp. 3740–3751, July 2018.
[8] L. Sanguinetti, E. Björnson, and J. Hoydis, “Fundamental asymptotic behavior of (two-user) distributed massive MIMO,” in IEEE Global Communications Conference, Dec. 2018.
