Asymptotic Analysis of RZF over Double Scattering Channels with MMSE Estimation
Qurrat-Ul-Ain Nadeem, Abla Kammoun, Merouane Debbah, Mohamed-Slim, Alouini

TL;DR
This paper provides an asymptotic analysis of RZF precoding in multi-user MISO systems with double scattering channels, deriving deterministic approximations for SINR and ergodic rate, and revealing the maximum number of supported users.
Contribution
It introduces the first MMSE channel estimate for double scattering models and derives tight deterministic approximations for system performance metrics in large-scale regimes.
Findings
Maximum supported users equals the number of scatterers.
Asymptotic approximations closely match simulations for moderate sizes.
Supports large-scale MIMO gains with double scattering channels.
Abstract
This paper studies the ergodic rate performance of regularized zero-forcing (RZF) precoding in the downlink of a multi-user multiple-input single-output (MISO) system, where the channel between the base station (BS) and each user is modeled by the double scattering model. This non-Gaussian channel model is a function of both the antenna correlation and the structure of scattering in the propagation environment. This paper makes the preliminary contribution of deriving the minimum-mean-square-error (MMSE) channel estimate for this model. Then under the assumption that the users are divided into groups of common correlation matrices, this paper derives deterministic approximations of the signal-to-interference-plus-noise ratio (SINR) and the ergodic rate, which are almost surely tight in the limit that the number of BS antennas, the number of users and the number of scatterers in each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Asymptotic Analysis of RZF over Double Scattering Channels with MMSE Estimation
Qurrat-Ul-Ain Nadeem, Abla Kammoun, Mérouane Debbah, and Mohamed-Slim Alouini Q.-U.-A. Nadeem, A. Kammoun and M.-S. Alouini are with the Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, Building 1, Level 3, King Abdullah University of Science and Technology (KAUST), Thuwal, Makkah Province, Saudi Arabia 23955-6900 (e-mail: {qurratulain.nadeem,abla.kammoun,slim.alouini}@kaust.edu.sa)M. Debbah is with Supélec, Gif-sur-Yvette, France and Mathematical and Algorithmic Sciences Lab, Huawei France R&D, Paris, France (e-mail: [email protected], [email protected]).A part of this paper is submitted to IEEE Global Communications Conference (GLOBECOM), Abu Dhabi, UAE, Dec. 2018.
Abstract
This paper studies the ergodic rate performance of regularized zero-forcing (RZF) precoding in the downlink of a multi-user multiple-input single-output (MISO) system, where the channel between the base station (BS) and each user is modeled by the double scattering model. This non-Gaussian channel model is a function of both the antenna correlation and the structure of scattering in the propagation environment. This paper makes the preliminary contribution of deriving the minimum-mean-square-error (MMSE) channel estimate for this model. Then under the assumption that the users are divided into groups of common correlation matrices, this paper derives deterministic approximations of the signal-to-interference-plus-noise ratio (SINR) and the ergodic rate, which are almost surely tight in the limit that the number of BS antennas, the number of users and the number of scatterers in each group grow infinitely large. The derived results are expressed in a closed-form for the special case of multi-keyhole channels. Simulation results confirm the close match provided by the asymptotic analysis for moderate system dimensions. We show that the maximum number of users that can be supported simultaneously, while realizing large-scale MIMO gains, is equal to the number of scatterers.
I Introduction
Large-scale multiple-input multiple-output (MIMO) is widely considered as a promising technology for next generation wireless communication systems [1, 2, 3, 4, 5, 6, 7, 8, 9]. However, most works on this subject share the underlying assumption of rich scattering conditions and, thus, work with full rank Rayleigh or Rician fading channel matrices. Although the use of full rank channel matrices facilitates the derivation of closed-form capacity bounds and approximations, these models do not capture the characteristics of realistic propagation environments, where the presence of spatial correlation and poor scattering conditions significantly affects the system performance [10, 11, 12].
Many works have already considered correlated Rayleigh fading channel models in their analysis of large-scale and Massive MIMO systems [2, 7, 3, 4, 5, 8]. The authors in [2] show that in the limit of an infinite number of antennas, the effects of noise, channel estimation errors, and interference vanish, while pilot contamination remains the only performance limitation. However, very recently the authors in [3] show in the same limit that the capacity increases without bound even under pilot contamination, provided that the spatial correlation matrices of the pilot contaminating users are asymptotically linearly independent and multi-cell minimum-mean-square-error (MMSE) precoding/combining techniques are employed.
Despite the important role played by spatial correlation in determining large-scale MIMO performance, low rank channels have been observed in MIMO systems that have low antenna correlation at both ends of the transmission link [13]. In fact, Gesbert et al. show in [10] that MIMO capacity is governed by both the spatial correlation at the communication ends and the structure of scattering in the propagation environment. Motivated by this, the authors devise a “double scattering channel model”, which utilizes the geometry of the propagation environment to model spatial correlation, rank deficiency and limited scattering. Unlike the commonly utilized MIMO channel models, the double scattering model is non-Gaussian, rendering the theoretical analysis of techniques designed using this model extremely hard.
I-A Related Literature
The main literature related to the double scattering channel model is scarce and is mainly represented by [14, 15, 16, 17, 18, 19, 13, 20, 21]. The authors in [14] study its diversity order and show that a MIMO system with transmit (Tx) and receive (Rx) antennas and scatterers achieves a diversity of order . The authors in [16] analyze the ergodic MIMO capacity taking into account the presence of double scattering and keyhole effects and show that the use of multiple antennas in keyhole channels only offers diversity gains, but no spatial multiplexing gains. Closed-form upper-bounds on the sum-capacity are obtained in [19] for the MIMO multiple access channel with double scattering fading.
A few papers have analyzed the double scattering model in large-scale MIMO settings [13, 21, 22]. The authors in [13] study this model without Tx and Rx correlation using tools from free probability theory and derive implicit expressions for the asymptotic mutual information (MI) and signal-to-interference-plus-noise ratio (SINR) of the MMSE detector. The authors in [21] study a MIMO multiple access system with double-scattering channels and use random matrix theory (RMT) tools to derive an almost surely tight deterministic approximation of the MI. However, none of these papers consider practical large-scale multi-user MIMO systems with linear signal processing schemes and channel estimation. The authors are only aware of [23], that deals with these aspects using numerical examples instead of theoretical analysis.
I-B Main Contributions
The focus of this work is on the downlink (DL) of a single-cell multi-user multiple-input single-output (MISO) system. We consider double scattering fading between the BS and the users in its most general form. The users are further divided into groups, such that the users in the same group are characterized by common correlation matrices. This assumption is needed to provide some level of analytical tractability for an, otherwise, extremely difficult to study non-Gaussian channel model. We stress here that having variation across the correlation matrices is important in realizing the fundamental performance of large-scale MIMO systems as shown in [3, 4, 9]. However, these works base their analysis on the correlated Rayleigh channel model, which encourages us to study the benefits of having a large number of antennas under the more realistic double scattering model, where we focus on studying the effects of the scattering conditions (instead of the structure of the correlation matrices) on the system performance.
We consider a realistic large-scale MIMO system where the BS obtains channel state information (CSI) from uplink pilot transmissions and applies MMSE estimation technique. Under the assumption that the number of BS antennas, scatterers and users grow large, we derive asymptotically tight deterministic approximations of the SINR and the ergodic rate with regularized zero-forcing (RZF) precoding, which are accurate for moderate system dimensions as well, as shown through simulations. The deterministic approximations are expressed in a closed-form for the multi-keyhole channel and some important insights into the impact of different system parameters on the sum rate performance are drawn. We show that the number of users scheduled simultaneously should be less than the number of scatterers to see ‘large-scale MIMO gains’.
I-C Outline and Notation
The rest of the paper is organized as follows. Section II presents the transmission model and introduces the double scattering channel model along with its MMSE estimate. In Section III, the asymptotically tight deterministic equivalents of the SINR and user rates with RZF precoding are derived. Simulation results are provided in Section IV and Section V concludes the paper.
The following notation is used. Boldface lower-case and upper-case characters denote vectors and matrices respectively. The operator denotes the trace of a matrix X. The spectral norm of a matrix X is denoted by and the diagonal matrix of entries is denoted by . A random vector is complex Gaussian distributed with mean m and covariance matrix . The notation denotes almost sure convergence.
II System Model
Consider a single-cell multi-user MISO system where a BS equipped with antennas serves single-antenna users, who are divided into groups of , , users such that the users in the same group experience similar propagation conditions. In this section, we outline the transmission model, and introduce the double scattering channel along with its MMSE estimate.
II-A Transmission Model
The received complex baseband signal at user in group is given as,
[TABLE]
where is the channel vector from the BS to user in group , is the Tx signal vector and is the receiver noise. The Tx signal vector x is given as,
[TABLE]
where , and are the precoding vector, signal power and data symbol for user in group respectively. The precoding vectors satisfy,
[TABLE]
where is the average total Tx power, and is the precoding matrix, where .
II-B Double Scattering Channel Model
A main contribution of this paper is to apply the double scattering channel model proposed in [10] to a multi-user MISO system. This non-Gaussian model has rank that is determined by both the spatial correlation between the antennas at the BS and the structure of scattering in the propagation environment. The double scattering channel vector is given as [10],
[TABLE]
where is the number of scatterers at the Tx and the Rx sides in group , is the correlation matrix between the BS antennas and the Tx scatterers, is the correlation matrix between the Tx and Rx scatterers, is a standard complex Gaussian matrix that describes the small-scale fading between the BS and the scattering cluster at the Tx side and describes the small-scale fading between the user in group and the scattering cluster at the Rx side. Note that we can assume to be diagonal without any loss of generality for the statistics of the received signal.
Schematic of the double scattering channel is shown in Fig. 1, where and represent the angular spread of the radiated signal from the BS array and the Tx scatterers respectively in group , and and represent the mean angle of departure (AoD) of the radiated signal from the BS array and the Tx scatterers respectively in group , where .
II-C Channel Estimation
During a dedicated uplink training phase, the users transmit mutually orthogonal pilot sequences that allow the BS to compute the MMSE estimates of the channel vectors . After correlating the received training signal with the pilot sequence of user in group , the BS estimates the channel vector based on the received observation, , given as,
[TABLE]
where and is the effective training SNR, assumed to be given here.
*Lemma 1: * The MMSE estimate of the channel in (4) is given as,
[TABLE]
where and .
The proof is postponed to Appendix A.
We stress that Lemma 1 has been derived for a non-Gaussian channel. Under the orthogonality property of the MMSE estimate, , where is the uncorrelated channel estimation error. Since our focus is on a single-cell system so the derived channel estimate is not corrupted by pilot contamination from adjacent cells. Therefore the considered system is referred to as a large-scale MIMO system instead of a Massive MIMO system.
II-D Achievable Rates
Since the BS does not have CSI, so it utilizes the MMSE estimate in (6) to implement digital precoding. The analysis presented in this paper is, therefore, a function of , which determines the error that is incurred in channel estimation. However, to compute the SINR in the downlink, the user needs to have CSI too. A well-known feature of large-scale MIMO systems is channel hardening, which means that the effective useful channel of a user converges to its average value when grow large. Hence, it is sufficient for each user to have only the statistical CSI and the resulting performance loss vanishes in the large system limit. Using this idea, an ergodic achievable user rate can be computed using a technique from [24], widely applied to large-scale MIMO systems in [1, 25, 2, 8]. The main idea is to decompose in (1) as and assume that the average effective channel is perfectly known at the corresponding user. By treating the inter-user interference and channel uncertainty as worst-case Gaussian noise, the user in group can achieve the ergodic rate,
[TABLE]
without knowing the instantaneous values of . The parameter can be interpreted as the effective average SINR of user in group and is defined as,
[TABLE]
The ergodic achievable sum rate is given as,
[TABLE]
This paper considers RZF precoding, which is a state-of-the-art heuristic precoding scheme with a simple closed-form expression given as [2, 7],
[TABLE]
where , where , is the regularization parameter and ensures that the power constraint in (3) is satisfied as,
[TABLE]
where , where . The SINR in (8) is now defined as,
[TABLE]
where , and .
III Main Results
The performance of the users are characterized by their ergodic rates, , which depend on their SINRs. These ergodic rates, with RZF precoding, are difficult to compute for finite system dimensions. However, they tend to deterministic quantities in the large () regime, as shown in [2, 8, 7]. These quantities depend only on the statistics of the channels and are referred to as deterministic equivalents. These deterministic equivalents are almost surely (a.s.) tight in the asymptotic limit. By a.s. tight in the asymptotic limit we mean that as the system dimensions grow to infinity, the approximations yielded by these deterministic equivalents tend to the actual values with probability one. Deterministic equivalents were first proposed by Hachem et al. in [26], who showed their ability to capture important system performance indicators.
In the sequel, the asymptotic limit, denoted as , represents the following assumption.
A-1. For all , , , and tend to infinity such that,
[TABLE]
Also, in the sequel, the deterministic equivalent of a sequence of RVs is represented by the deterministic sequence , which approximates such that,
[TABLE]
The objective of this section is to derive the deterministic equivalent of the SINR with RZF precoding, such that,
[TABLE]
Note that is a function of both and . From now on, these random vectors and matrices must be understood as sequences of vectors and matrices of growing dimensions. For the sake of simplicity, their dependence on , and is not explicitly shown.
We stress again that the result in (14) must be understood in the way that, for each given set of system parameters , , and , we provide approximations of the SINR and the user rates that become increasingly tight as , and grow large. We will show later by simulations that these approximations are very accurate for moderate system dimensions - an observation made in [2, 7, 3, 8, 26] as well. The asymptotic analysis requires the following two assumptions.
A-2. For all , and .
A-3. For all , and .
We first present the Fubini theorem, which is a key mathematical idea behind our derivations.
III-A Fubini Theorem
Most works derive deterministic equivalents for random matrix models created from sums of independent random matrices. In many practical cases, like for the double scattering model, it is necessary to consider products of random matrices. To handle such models, we rely on the concept of iterative deterministic equivalents which utilizes the Fubini theorem [27, 21].
A consequence of the Fubini theorem is that if we have a function of two independent random sequences of matrices, and , then we can condition on one sequence, let’s say , and find the deterministic equivalent for . If it can be proved that this deterministic equivalent holds true for every generated by a space , then it is also valid for the random sequence [27]. However is still random due to dependence on , so we need to get a deterministic equivalent for it.
This is the main mathematical idea behind our derivation of the deterministic equivalent of which is a function of the random matrices and the random vectors . First, we interpret the double-scattering channel model in (4) as,
[TABLE]
where , and we assume to be deterministic. Under this setting, the estimate of the double scattering model in (6) can be interpreted as,
[TABLE]
where and is the covariance matrix of the channel estimate given as,
[TABLE]
We obtain the deterministic equivalent of in terms of certain fixed point equations that depend on the deterministic matrices ’s using RMT results from [28]. We then extend the analysis by allowing ’s to be random based on the Fubini theorem. In this second step, we derive the deterministic equivalents of the fixed point equations under the actual random ’s.
To summarize, the “randomness" related to is first removed and then introduced later after we have the deterministic equivalent of the SINR as a function of the random vector and the deterministic matrix . From this construction, the resulting deterministic approximation is referred to as an iterative deterministic equivalent.
III-B New and Useful Results
The two theorems in this section represent the major contributions of this work as they are required to cope with the non-Gaussian channel model in (4) and its estimate in (6) and form the mathematical basis of the subsequent large system analysis of RZF precoding. They provide deterministic equivalents of normalized traces and quadratic forms involving single and double occurrences of the resolvent matrix , which is defined as .
The resolvent is of constant use in this paper as it arises from the expression of in the RZF precoder in (10) and appears in all the terms of the SINR in (12). The trace of the resolvent is characterized by the quantities , in the asymptotic limit. The deterministic approximation of the SINR explicitly depends on these quantities, which is why they are introduced first in the following theorem.
Theorem 1: Consider the resolvent matrix, , where the columns of are distributed as (6). Then the following system of implicit equations in and ,
[TABLE]
has a unique solution satisfying , for all and , where is an arbitrary matrix with a uniformly bounded spectral norm, and . Let U be any deterministic matrix with a uniformly bounded spectral norm. Under assumptions A-1, A-2, A-3 and for , we have
[TABLE]
Proof: The proof of Theorem 1 is postponed to Appendix B.
Next we present the deterministic equivalents of and , where , where is defined in Section II-D. These quantities arise in the expression of and appear repeatedly in our analysis.
Lemma 2: Under the setting of assumptions A-1, A-2 and A-3 and for ,
[TABLE]
where is obtained as the unique solution of (18) and .
Proof: The proof of Lemma 2 is provided in Appendix C.
Theorem 1 and Lemma 2 approximate quantities with one occurrence of the resolvent matrix. However, with RZF, random terms involving two resolvents arise in the expression of the SINR, a case which is out of the scope of Theorem 1. We therefore develop the result for this case.
Theorem 2: Define , where =, where is given by (17). Under the setting of assumptions A-1, A-2 and A-3 and for ,
[TABLE]
where,
[TABLE]
[TABLE]
where for arbitrary matrices and L having uniformly bounded spectral norm,
[TABLE]
and , which can be expressed as a system of linear equations as follows,
[TABLE]
for .
Proof: The proof of Theorem 2 is provided in Appendix D.
III-C Deterministic Approximation of the SINR
Based on the results in Theorem 1, Lemma 2 and Theorem 2, the deterministic equivalent of the SINR in (12) can be derived and is presented in the next theorem.
Theorem 3: Under the setting of assumptions A-1, A-2 and A-3 and for , the downlink SINR of user in group defined in (12) converges almost surely as,
[TABLE]
where,
[TABLE]
where,
[TABLE]
Proof: The proof of Theorem 3 is given in Appendix F.
Corollary 1: Assume that A-1, A-2 and A-3 hold true and . Then the individual downlink rates of users converge as,
[TABLE]
where,
[TABLE]
where is given by (33).
Proof: The proof follows from the application of the continuous mapping theorem [29] to the logarithm function and the almost sure convergence of in (32).
An approximation of the average system sum rate can be obtained by replacing in (9) with its asymptotic approximation as,
[TABLE]
These asymptotic expressions will be shown to provide good approximations even for moderate system sizes by the means of simulations in Section IV. This means they can be used for evaluating the performance of practical systems without relying on time-consuming Monte-Carlo simulations. These expressions are also useful in performing different optimization tasks, for example, determining the optimal and power allocation as shown in [7].
The deterministic equivalents depend only on the ‘slowly varying’ covariance matrices and , instead of the instantaneous channels that vary very fast. These correlation matrices, although huge in size in the large regime, can be computed at the BS using knowledge of only the large-scale channel statistics. In particular, each correlation matrix, can be computed using estimates of the angular spreads, , and the mean AoD, . These parameters are illustrated in Fig. 1. The spreads can be locally estimated while depends on the line of sight (LoS) angles of the users within a group, that have to be fed to the BS. However, the estimation of these large-scale parameters must be performed only once per coherence period (rather than at the same pace as the small-scale fading) and the number of required estimated parameters depends on and (much less than and in practical systems). Discussion on the estimation of large-scale parameters can be found in [30, 31], where the authors reason that the estimation of channel covariance matrices imposes a low CSI feedback overhead.
We would also point out here that the considered RZF precoding, known for its high performance, also imposes a high computational complexity in large-scale MIMO systems due to the matrix inversion operation. It is therefore interesting to study the double scattering model with other precoding schemes that reduce this prohibitively high computational complexity. One way is to employ truncated polynomial expansion (TPE) precoding proposed in [8], which replaces the matrix inversion by a TPE. The TPE precoding can perform close to RZF, with a reduced computational burden. Theorem 1 and Theorem 2 developed in this work will play a direct role in the asymptotic analysis of the SINR of the double scattering model with TPE.
Despite being useful, the expressions in this work are quite involved and do not provide direct insights into the interplay between different system parameters. We therefore focus on Rayleigh product channel and obtain some simplifications under different operating conditions.
III-D Rayleigh Product Channel
A special case of the double-scattering channel is the Rayleigh product channel which does not exhibit any form of correlation [18]. For this model, popularly known as the multi-keyhole channel, we find that Theorem 3 can be given in a closed-form as shown in the next corollary.
Corollary 2: For , let , , and assume and . Then in Theorem 3 can be given in a closed-form as,
[TABLE]
and is given as the unique root to,
[TABLE]
Sketch of Proof: One can show by straightforward but tedious calculations that the fundamental equations in (18) can be reduced to a single polynomial equation (III-D) in , for a single group and and . Simplifying (120) for the setting of Corollary 2, writing it as a polynomial in and applying implicit function theorem with respect to yields the expression of . After some tedious calculations, we show that Theorem 2 can be given in a closed form and the SINR expressions in Theorem 3 can be greatly simplified.
Corollary 3: Under the setting of Corollary 2, let . Then defined in Theorem 3 approaches the limit which is given by,
[TABLE]
where , , and .
Note that as , h behaves as a Rayleigh fading channel, whose SINR is given as (45). Setting, , we obtain the following corollary for the perfect CSI case.
Corollary 4: Under the setting of Corollary 3, let . Then is given by,
[TABLE]
where .
This result has also been obtained in Corollary 2 of [7], where the authors derive the deterministic equivalent of the SINR under RZF, with the channels modeled as correlated Rayleigh.
Corollary 5: Under the setting of Corollary 2, let and with . Then defined in Theorem 3 approaches the limit which is given by,
[TABLE]
Corollary 6: If in the setting of Corollary 5, then and as a consequence, the interference term and the power normalization term in Theorem 3.
Remark 1: Corollary 6 implies for with , grows unboundedly.
Two very important observations are in order now. First, the performance of a large-scale MIMO system is limited by the number of scatterers in the propagation environment. Recent works that show the spectral efficiency of large-scale MIMO systems to be unlimited base their analysis on correlated Rayleigh channels and neglect the impact of limited scattering, that may exist in practice. Even though some of these works realize that poor scattering will deteriorate the performance but they do not account for it in the theoretical analysis. This is the first paper that derives a bound on the asymptotic ergodic rate that can be achieved under limited scattering.
The second observation is that the maximum number of users that can be served simultaneously while achieving large-scale MIMO gains is less than or equal to the number of scatterers. In fact, we find through Corollary 6 that if , grows unboundedly with . This ‘large-scale MIMO gain’ is in accordance with the results in [2] and [1], which when simplified for a single cell case, reveal that for , . The inherent assumption in these works that rely on the correlated Rayleigh channel is that there is enough scattering to serve all the users. However, as exceeds , we start to see a significant performance loss. For example, with uniform power allocation, serving users would result in the SINR of each user approaching as , according to Corollary 5. If we serve users, the SINR will approach , resulting in a loss. Thus, the value of is what determines the maximum number of users that can be served while realizing large-scale MIMO gains.
IV Simulation Results
Under the double scattering model, the correlation matrices and are given as and , where is defined as [21],
[TABLE]
All parameters have already been defined in Fig. 1. The parameter values are set as , , , =, =, = and =. Also, and for all . We assume an equal number of users in each group, i.e. , with uniform power allocation, . Fig. 3 compares the downlink system sum rate obtained using 2000 Monte-Carlo realizations of the SINR in (12) to the asymptotic approximation provided in (39), where is given by (33). It can be seen that the asymptotic result yields a very good approximation for moderate system dimensions.
To explain the decrease in sum-rate at high values of for , note that the goal of the regularization parameter in the RZF precoder in (10) is to improve the conditioning of the random matrix and hence it should depend on the channel parameters, including the training SNR and the downlink SNR . Finding the optimal that maximizes is outside the scope of this work. Now for the chosen (optimal under perfect CSI), the sum rate is decreasing at high SNR values for . This is because does not account for and thus the matrix in the RZF precoder becomes ill-conditioned as the quality of the estimate deteriorates and decreases (due to increase in ). This results in a loss in the performance. This effect has also been observed in ([7], Fig. 2). In fact, the authors there obtain the optimal as a solution of a fixed point equation that depends on and show that for , the sum rate continues to increase with for any quality of channel estimate.
Furthermore, note that the mismatch between the theoretical and the Monte-Carlo results starts to increase for high SNR values due to the slower convergence of to its deterministic approximation as well documented in RMT literature [7, 32]. A better approximation at high SNR can be seen in systems with higher values of and .
Next we plot in Fig. 3 the average achievable rate for , , , and users having mean AoDs generated between to and spreads between and . We employ the fixed quantization method from [31] to group the users, based on the criteria of the minimum chordal distance between the users’ correlation eigenspaces and the group subspaces. The Monte-Carlo and deterministic average rates when common correlation matrices are assumed are shown to be close. In fact, the average (and sum) rate now continues to increase for because for , is well-conditioned. The Monte-Carlo average rate under the actual different correlation matrices of the users is plotted in black to highlight the importance of having variation in correlation matrices. However, for us to analyze this difficult non-Gaussian model, it was necessary to assume common correlation matrices within groups. The performance loss due to this assumption can be reduced by increasing the number of groups.
Fig. 5 studies the effect of the number of scatterers on the system sum rate for a single group with multi-keyhole channel, i.e. , , , with , , and . The downlink sum rate in (39) plotted using the closed-form expression of in Corollary 2 is close to the Monte-Carlo result even for a very low number of scatterers. The spatial multiplexing gains are seen to increase linearly with . However, for , the gains start to decrease since the degrees of freedom are limited by the number of antennas at the BS. The limiting sum rate as is also plotted using the SINR in (45). As the number of scatterers increases, the performance approaches to that of a Rayleigh fading channel.
Finally, we study the performance of the Rayleigh product channel in Fig. 5 as the number of BS antennas increases for a single group. It can be seen that the performance for the case when , saturates to a limiting sum rate, given by substituting (47) in (39), which is also plotted in black on the figure. This result confirms that it is useless to deploy more antennas when the number of scatterers in the environment is limited. The other curves highlight that when , we see the ‘large-scale MIMO effect’ and the performance continues to grow with because the channel has enough degrees of freedom to support all the users.
V Conclusion
In this paper, we studied a large-scale multi-user MISO system with double-scattering channels that are more realistic than the commonly used Gaussian channels. We first derived the MMSE estimate for this channel. Then under the assumption of per-group channel correlation matrices, we derived the deterministic approximations of the SINR and ergodic rates with RZF precoding, that are tight in the large system limit. Simulation results showed a close match between the asymptotic and the Monte-Carlo simulated sum rate and provided insights into the performance of multi-keyhole channels. We showed that large-scale MIMO gains can only be realized when the number of scheduled users is less than the number of scatterers in the environment.
Appendix A Proof of Lemma 1
The MMSE estimate of is computed as where is obtained as the solution of resulting in,
[TABLE]
Using the expression of in (5) and the independence of and we get,
[TABLE]
Conditioning the expectation on first, we obtain,
[TABLE]
Therefore , where .
Appendix B Proof of Theorem 1
As a starting point we assume to be deterministic such that . Under this setting, we use the expression of given by (16) in the resolvent matrix, , to have,
[TABLE]
where the entries of and , where is given by (17). Note that is a double scattering model with deterministic s. Under this setting, the deterministic equivalent of is obtained using Corollary 1 from [28] as,
[TABLE]
where are given as a unique solution to the following set of implicit equations,
[TABLE]
[TABLE]
Now for the actual double-scattering channel model, s are random and modeled as
. Using the Fubini Theorem [27] we can extend the result in (55) to random s by finding deterministic equivalents, denoted as and , for and respectively.
To do this, we first define quantities and for , as and , where,
[TABLE]
It can be shown following the techniques used in Appendix E of [27] that for all ,
[TABLE]
Now using the expression of from (17) in the expression of in (57) we have,
[TABLE]
[TABLE]
where . First we treat using (61) and matrix inversion lemma (Lemma 1 in [2]) to remove the dependence of on the vector as,
[TABLE]
Then from trace lemma and rank-one perturbation lemma (Lemmas 3 and 5 in [33]) we have,
[TABLE]
where .
Notice that is still a function of which is random. To remove the dependence of on , we need the deterministic equivalent of . Using the expression of in (17) and Corollary 1 from [28], we have the following convergence,
[TABLE]
where is any deterministic matrix that satisfies and,
[TABLE]
such that . Substituting in , we have,
[TABLE]
Consequently we get the following convergence for and ,
[TABLE]
Plugging and into the expression of in (63) yields,
[TABLE]
where . Consider the deterministic counterpart of as defined in (18) and define , , and . It can be shown using the techniques from Appendix E of [27] that,
[TABLE]
for sufficiently large and . This result can be extended to all by Vitali convergence theorem [34].
Now combining (55) with the result in (70) for will yield,
[TABLE]
where is given by (19). This completes the proof of Theorem 1. The uniqueness of the solution can be proved by showing that the G-variate function in (18) is a standard interference function as done for Theorem 2 in [35].
Appendix C Proof of Lemma 2
C-A * Deterministic equivalent of :*
It has been shown in ([27], Appendix E) that in (15) satisfies almost surely. We can therefore assume to be deterministic as a starting point and use trace lemma and rank-one lemma (Lemma 3 and Lemma 5 from [33]) to have,
[TABLE]
where is given by (17). Utilizing the result in (55) for yields . The result can be extended to random s using Fubini Theorem and the deterministic equivalent of is obtained as in Appendix B.
C-B * Deterministic equivalent of :*
Utilizing the expressions of the double scattering channel in (15) and its MMSE estimate in (6) and the independence of and , we have,
[TABLE]
Next exploit trace lemma and rank-1 lemma (Lemmas 3 and 5 from [33]) to get,
[TABLE]
where is given by (54) under the assumption of deterministic and , where . Utilizing the result in (55) for , we have,
[TABLE]
where are given as a unique solution to the set of implicit equations in (56), (57), Denote in (82) as and let ’s be random now. A similar development as done for the term in Appendix B will yield,
[TABLE]
Using the convergence in (70) and defined in (72), we obtain,
[TABLE]
Replace and in (84) by their deterministic approximations and respectively from Appendix B. This yields the deterministic equivalent of denoted as .
Appendix D Proof of Theorem 2
We are interested in finding the deterministic equivalent of:
[TABLE]
First we need to control the variance of and prove that converges to zero. This can be done using standard tools from RMT (see [26]) and will imply that,
[TABLE]
where . This allows us to focus directly on .
Using the expression of in (17) we have,
[TABLE]
where,
[TABLE]
and and denote their respective expectations. We therefore start the proof by deriving the deterministic equivalents of and , where A and B are deterministic matrices of uniformly bounded spectral norm. Plugging these results in (88) will complete the proof of Theorem 2.
D-A * Deterministic equivalent of :*
The aim of this section is to derive a deterministic equivalent for the random quantity . We can again control the variance of and show that it goes to zero implying that,
[TABLE]
where . Using the result from last section in (55) that , where and the resolvent identity given as , we decompose as,
[TABLE]
We start by applying matrix inversion lemma (Lemma 1 in [2]) on and re-writing the expression to get,
[TABLE]
Assuming to be deterministic and using the fact that almost surely, we can use the expression of in (16) and show with the help of trace lemma (Lemma 3 from [33]) that the first term on the right side of the above equation is negligible. Now using the rank-one perturbation lemma (Lemma 5 from [33]) and Lemma 2 on the second term yields,
[TABLE]
Using trace lemma (Lemma 3 from [33]) with the expression of in (16) we have,
[TABLE]
Now using the rank-one lemma (Lemma 5 from [33]) and Corollary 1 from [28] we have,
[TABLE]
Note that the analysis in the last two steps can be extended to random s using Fubini Theorem and we need the deterministic equivalent of under the actual random s. Using the expression of from (17) results in
[TABLE]
We analyze the two terms separately. First we use matrix inversion lemma (Lemma 1 from [2]) on to remove the dependence of T on resulting in
[TABLE]
where \tilde{\textbf{T}}_{i}=\Big{(}\sum_{l=1}^{G}\frac{K_{l}}{K}d_{l}^{2}\bar{m}_{l}\textbf{R}_{BS_{l}}\textbf{Q}_{l}\left(\textbf{Z}_{l}\textbf{Z}_{l}^{H}+\frac{1}{\rho_{tr}}\textbf{I}_{N}\right)\textbf{Q}^{H}_{l}\textbf{R}^{H}_{BS_{l}}-\frac{K_{i}}{K}d_{i}^{2}\bar{m}_{i}\textbf{R}_{BS_{i}}\textbf{Q}_{i}\textbf{z}_{i,j}\textbf{z}_{i,j}^{H}\textbf{Q}^{H}_{i}\textbf{R}^{H}_{BS_{i}}\\ +\alpha\textbf{I}_{N}\Big{)}^{-1}. Next we use trace lemma and rank-one lemma (Lemmas 3 and 5 from [33]) as,
[TABLE]
Using matrix inversion lemma to remove the dependence of T on and using from Appendix B, where , yields,
[TABLE]
Finally we use trace lemma and rank-one perturbation lemma from [33] to get,
[TABLE]
Similar steps would yield the following deterministic equivalent for ,
[TABLE]
where .
In order to complete the calculation of the deterministic equivalent of , we need the deterministic equivalent of which is stated in the following Lemma.
Lemma 3: Define where and L, are deterministic matrices with uniformly bounded spectral norm. Then under the setting of assumptions A-1, A-2 and A-3 and for ,
[TABLE]
where and , which can be expressed as a system of linear equations as follows:
[TABLE]
for . The proof of Lemma 3 can be found in Appendix E.
Using Lemma 3, we have the deterministic equivalents of and (denoted as and ) in Theorem 2) and hence . This yields the expression of as,
[TABLE]
We now study . Using matrix inversion lemma and common inverses of resolvents we have,
[TABLE]
We sequentially deal with terms and . Using Lemma 2 and some manipulation we have,
[TABLE]
Assuming to be deterministic, we can use Lemmas 6, 3 and 5 from [33] and Corollary 1 from [28] sequentially to obtain,
[TABLE]
Extending the analysis to random based on the Fubini Theorem and using Lemma 2 we have,
[TABLE]
The term is compensated by . To see this, observe that the first order of the term does not change if we substitute by and by P and then apply trace lemma, rank-one perturbation lemma (Lemmas 3 and 5 from [33]) and Fubini theorem.
Finally, we deal with . Using Lemma 2 and some manipulation we have,
[TABLE]
Analogously to before, can be simplified as,
[TABLE]
[TABLE]
where is defined in (88). Combining (103), (108) and (109), we obtain,
[TABLE]
D-B * Deterministic equivalents of and :*
Using similar argument for as done for in (91), we have , where .
Repeating the same steps as done for , we obtain,
[TABLE]
where , where has been defined in Lemma 3.
Plugging (110) and (111) into (88) yields the deterministic equivalent of as,
[TABLE]
Now can be expressed as a system of linear equations as,
[TABLE]
Also, plugging (112) into (110) and (111) completes the deterministic equivalents of and respectively. These terms will later be needed in the proof of Theorem 3.
Appendix E Proof of Lemma 3
To derive the deterministic equivalent of , where and L, are deterministic matrices, we use the results from Appendix B in (70) and (73) along with to have , where and , where is given by (19). To this end note that,
[TABLE]
The deterministic equivalent of is obtained using Corollary 1 from [28] as,
[TABLE]
where are defined as the unique solution to,
[TABLE]
such that . Substituting in , we have,
[TABLE]
Note that reduces to for .
The expression of can be obtained by using (116) and (E) to get resulting in,
[TABLE]
where .
Note that depends on the values of . The latter can be expressed as a system of linear equations by solving (121) for . This system is represented by (100).
Appendix F Proof of Theorem 3
The deterministic equivalents of the energy term , the term of the power normalization, the interference term and the variance term
are worked out separately to yield the deterministic equivalent of the SINR.
F-A * Deterministic equivalent of :*
Note that can be written as , where . In order to remove the dependency of on , we use common inverses of resolvents lemma (Lemma 2 from [33]) to get,
[TABLE]
The deterministic equivalents of both and have been derived in Lemma 2 as and respectively. Note that Lemma 2 not only implies almost sure convergence but also convergence in mean. Therefore, by the dominated convergence theorem and the continuous mapping theorem, we have
[TABLE]
F-B *Deterministic equivalent of : *
[TABLE]
where was derived in Appendix D and is given by (111).
F-C Deterministic equivalent of :
Denote as and use Lemma 2 from [33] to decompose it as,
[TABLE]
Let us begin by treating . Using trace lemma and results from Appendix D we have,
[TABLE]
Therefore,
[TABLE]
where was derived in the last section and is given by (110).
Next note that,
[TABLE]
Using the definition of and from (15) and (6) respectively, we have
[TABLE]
Using the trace lemma and the independence of and we have,
[TABLE]
Note that . Therefore,
[TABLE]
where was derived in the last section and is given by (110). Combining this with Lemma 2 yields the deterministic equivalent of as,
[TABLE]
Similar analysis yields the deterministic equivalent of as,
[TABLE]
Next note that,
[TABLE]
where . Using Fubini theorem and trace lemma we have . Notice that , where is defined in (85). This yields the deterministic equivalent of as,
[TABLE]
Combining (128), (134), (135) and (137) yields the deterministic equivalent of . The lemmas utilized here not only imply almost sure convergence but also convergence in mean. Therefore,
[TABLE]
F-D Deterministic Equivalent of
Define the quantities and . By matrix inversion lemma we have . Thus . We have already shown that . Since and are bounded, so by dominated convergence theorem we have . Therefore .
Combining the results of the four subsections completes the proof of Theorem 3.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” IEEE Transactions on Wireless Communications , vol. 9, no. 11, pp. 3590–3600, November 2010.
- 2[2] J. Hoydis, S. T. Brink, and M. Debbah, “Massive MIMO in the UL/DL of cellular networks: How many antennas do we need?” IEEE Journal on Selected Areas in Communications , vol. 31, no. 2, pp. 160–171, Feb. 2013.
- 3[3] E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO has unlimited capacity,” IEEE Transactions on Wireless Communications , vol. 17, no. 1, pp. 574–590, 2018.
- 4[4] H. Yin, D. Gesbert, M. Filippou, and Y. Liu, “A coordinated approach to channel estimation in large-scale multiple-antenna systems,” IEEE Journal on Selected Areas in Communications , vol. 31, no. 2, pp. 264–273, 2013.
- 5[5] H. Huh, G. Caire, H. C. Papadopoulos, and S. A. Ramprashad, “Achieving "massive MIMO" spectral efficiency with a not-so-large number of antennas,” IEEE Transactions on Wireless Communications , vol. 11, no. 9, pp. 3226–3239, 2012.
- 6[6] T. L. Marzetta, E. G. Larsson, H. Yang, and H. Q. Ngo, Fundamentals of Massive MIMO . Cambridge, UK: Cambridge University Press, 2016.
- 7[7] S. Wagner, R. Couillet, M. Debbah, and D. T. M. Slock, “Large system analysis of linear precoding in correlated MISO broadcast channels under limited feedback,” IEEE Transactions on Information Theory , vol. 58, no. 7, pp. 4509–4537, 2012.
- 8[8] A. Kammoun, A. Müller, E. Björnson, and M. Debbah, “Linear precoding based on polynomial expansion: Large-scale multi-cell MIMO systems,” IEEE Journal of Selected Topics in Signal Processing , vol. 8, no. 5, pp. 861–875, Oct. 2014.
