A New Class of Nonlinear Precoders for Hardware Efficient Massive MIMO Systems
Mohammad A. Sedaghat, Ali Bereyhi, Ralf R. M\"uller

TL;DR
This paper introduces a new class of nonlinear precoders for massive MIMO systems, analyzing their performance using advanced statistical mechanics methods and demonstrating their effectiveness in reducing peak power without significant loss.
Contribution
It develops a comprehensive analysis of nonlinear LSE precoders with various constraints using the replica method, including novel insights into peak power reduction and performance under different assumptions.
Findings
LSE precoders can reduce peak to average power ratio to 3dB.
RS assumption fails for PSK, 1-RSB provides more accurate predictions.
Nonlinear precoders improve hardware efficiency in massive MIMO.
Abstract
A general class of nonlinear Least Square Error (LSE) precoders in multi-user multiple-input multiple-output systems is analyzed using the replica method from statistical mechanics. A single cell downlink channel with transmit antennas at the base station and single-antenna users is considered. The data symbols are assumed to be iid Gaussian and the precoded symbols on each transmit antenna are restricted to be chosen from a predefined set . The set encloses several well-known constraints in wireless communications including signals with peak power, constant envelope signals and finite constellations such as Phase Shift Keying (PSK). We determine the asymptotic distortion of the LSE precoder under both the Replica Symmetry (RS) and the one step Replica Symmetry Breaking (1-RSB) assumptions. For the case of peak power constraint on each transmit antenna,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A New Class of Nonlinear Precoders for Hardware Efficient Massive MIMO Systems
Mohammad A. Sedaghat, Ali Bereyhi, Ralf R. Müller
Emails: {mohammad.sedaghat, ali.bereyhi, ralf.r.mueller}@fau.de
∗ Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen, Germany
Abstract
A general class of nonlinear Least Square Error (LSE) precoders in multi-user multiple-input multiple-output systems is analyzed using the replica method from statistical mechanics. A single cell downlink channel with transmit antennas at the base station and single-antenna users is considered. The data symbols are assumed to be iid Gaussian and the precoded symbols on each transmit antenna are restricted to be chosen from a predefined set . The set encloses several well-known constraints in wireless communications including signals with peak power, constant envelope signals and finite constellations such as Phase Shift Keying (PSK). We determine the asymptotic distortion of the LSE precoder under both the Replica Symmetry (RS) and the one step Replica Symmetry Breaking (1-RSB) assumptions. For the case of peak power constraint on each transmit antenna, our analyses under the RS assumption show that the LSE precoder can reduce the peak to average power ratio to 3dB without any significant performance loss. For PSK constellations, as grows, the RS assumption fails to predict the performance accurately and therefore, investigations under the 1-RSB assumption are further considered. The results show that the 1-RSB assumption is more accurate.
I Introduction
Massive Multiple-Input Multiple-Output (MIMO) is among the main technologies for the next generation of wireless networks [1, 2]. In massive MIMO systems, multi-antenna base stations utilize precoding techniques to focus the power to desired users. This shifts a large portion of the system’s overall processing to base stations which in contrast to power-limited users, do not face a computing power constraint.
So far, several precoding schemes have been proposed including linear and nonlinear schemes. Linear schemes mainly consist of Match Filtering (MF), Zero Forcing (ZF) and Regularized Zero Forcing (RZF), where in practice each of them could be preferred regarding the desired tradeoff between the complexity and performance [3]. As examples of nonlinear schemes, one names Tomlinson-Harashima [4] and vector precoding [5]. Regarding data support, precoding schemes can be designed for finite or infinite input alphabets. In finite alphabet cases, the users data symbols are chosen from a finite and countable set, e.g., Phase Shift Keying (PSK) constellation [6].
In this paper, we consider a nonlinear Least Square Error (LSE) precoder in which the signal on each transmit antenna is restricted to be chosen from a predefined set. The main motivation for investigating the LSE precoder is to model a large variety of signal constraints at massive MIMO base stations which allow us to use more efficient hardware. As an example, the LSE precoder can be designed to fulfill an instantaneous peak power constraint on each transmit antenna which avoids clipping at power amplifiers. Furthermore, the LSE precoder are able to fix the envelope of transmitted signals to increase the power efficiency of power amplifiers which was initially investigated in [7]. The LSE precoder also enable us to have finite alphabet signals on antennas111Note that the constraints considered in this paper are on the signals on the antennas and this is different than the case of having constraint on the input data signal of users. which are required in the recently introduced Load Modulated Single-RF (LMSRF) MIMO transmitters [8]. In LMSRF, the signal on each antenna is taken from a discrete constellation due to using limited number of switches .
Considering a large number of transmit antennas, we study the LSE precoder in the asymptotic regime. Our performance analyses are based on the replica method introduced in the context of statistical mechanics. The LSE precoder is analyzed in frequency-flat fading channels and it is shown that the same performance is achieved in frequency-selective fading channels utilizing Orthogonal Frequency Division Multiplexing (OFDM). Both the Replica Symmetry (RS) and one-step Replica Symmetry Breaking (1-RSB) assumptions are applied under some known signal constraints such as signals with peak and average power constraints, constant envelope signals and PSK signals. It is shown that in the case of peak power constraint, the RS prediction is consistent with the numerical results, although in some cases it might not be exact. However, the RS assumption does not give an accurate solution for Binary Shift Keying (BPSK) and Quadrature Phase Shift Keying (QPSK) signals on the transmit antennas and 1-RSB improves the prediction in these cases.
Notation: We use bold lowercase letters for vectors and bold uppercase letters for matrices. Conjugate transpose of a matrix is denoted by , the transpose itself is shown by . Moreover, the complex set is shown by . F_{{\mathchoice{\mbox{\boldmath\displaystyle b}}{\mbox{\boldmath\textstyle b}}{\mbox{\boldmath\scriptstyle b}}{\mbox{\boldmath\scriptscriptstyle b}}}}(\cdot) denotes the cumulative distribution function (cdf) of and the Kronecker product is shown by . The real and imaginary parts are denoted by and , respectively, and \mathop{\mbox{\mathsf{E}}} represents the mathematical expectation. Furthermore, we define and {\rm Vec}({\mathchoice{\mbox{\boldmath\displaystyle A}}{\mbox{\boldmath\textstyle A}}{\mbox{\boldmath\scriptstyle A}}{\mbox{\boldmath\scriptscriptstyle A}}}) to be the vector obtained by stacking the columns of .
II Problem formulation
Let’s consider the general problem of designing a precoder for a massive MIMO system with transmit antennas and single-antenna users. The channel is assumed to be a frequency-flat fading channel. The generalization to the case of frequency-selective fading channels and OFDM signal is presented in Appendix A where we show that the same result holds for the case of frequency-selective channels. The inter-cell interference is neglected and it is assumed that the channel matrix is perfectly known at the base station. Let {\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}\in\mathbbmss{C}^{K} and {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}\in\mathbbmss{C}^{K\times N} be the data vector of the users and the channel matrix, respectively, and {\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}\in\mathbbmss{X}^{N} denotes the precoded vector signal where is a predefined set. The received vector at the user terminals reads
[TABLE]
where {\mathchoice{\mbox{\boldmath\displaystyle y}}{\mbox{\boldmath\textstyle y}}{\mbox{\boldmath\scriptstyle y}}{\mbox{\boldmath\scriptscriptstyle y}}}=[y_{1},\cdots,y_{K}]^{\rm T} with being the received signal at the th user terminal, and being the zero mean additive white Gaussian noise vector whose elements have the variance of . We define the non-linear LSE precoder with the following rule
[TABLE]
where is a positive constant and is a tuning parameter (Lagrange multiplier) controlling the total transmit power. In case of , our nonlinear LSE precoding scheme reduces to the linear scheme
[TABLE]
which is known as the regularized zero-forcing precoder [9]. The precoding procedure, however, does not take a simple form for a general . As an example, consider the LMSRF MIMO transmitter which chooses a finite constellation with respect to the number of discrete load modulators’ states [10, 8]. Another example is the case of constant envelope precoding on each antenna [7] where
[TABLE]
Here, the Peak to Average Power Ratio (PAPR) is small, around or decibels depending the pulse shaping filter. In these cases, the classical tools fail to analyze the optimization problem in (2). We therefore invoke the replica method developed in statistical mechanics to determine the large system performance of the precoder by calculating the asymptotic distortion defined as
[TABLE]
when the inverse load factor defined as , is kept fixed. Our analysis determines the asymptotic distortion defined in (5) without finding the explicit solution of the optimization problem (2). This allows us to have an estimate of the best performance in order to have a reference performance measure for comparing the practical algorithms.
Throughout the analysis, we set the data symbols of the users to be independent and identically distributed (iid) Gaussian, i.e., {\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}\sim\mathcal{CN}({\mathchoice{\mbox{\boldmath\displaystyle 0}}{\mbox{\boldmath\textstyle 0}}{\mbox{\boldmath\scriptstyle 0}}{\mbox{\boldmath\scriptscriptstyle 0}}},\sigma_{u}^{2}{\mathchoice{\mbox{\boldmath\displaystyle I}}{\mbox{\boldmath\textstyle I}}{\mbox{\boldmath\scriptstyle I}}{\mbox{\boldmath\scriptscriptstyle I}}}).
The asymptotic distortion measure can be used to derive a lower bound for the ergodic achievable rate of the users in the downlink channel as follows. Let be the ergodic achievable rate of the th user. A lower bound for the ergodic rate is obtained when we impose the worst case scenario for the interference at each user terminal which implies the Gaussian distributed interference at each user terminal. Note that this is true only in the case of Gaussian distributed input signals. Then, we obtain the following bound on the average ergodic rate of the users
[TABLE]
where I_{i}({\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}) is the interference power at the th user terminal. Using Jensen’s inequality and the fact that the function is convex, we obtain
[TABLE]
It is easy to show that \frac{1}{K}\sum\limits_{i=1}^{K}\mathop{\mbox{\mathsf{E}}}\limits_{{\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}}I_{i}({\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}})=\mathsf{D} for , and therefore,
[TABLE]
In the case that users have symmetry, e.g., when the users are uniformly distributed in an area, it is easy to show that the ergodic rate of each user is also larger than .
In this paper we use the replica method to analyze the LSE precoding in a very general case. The replica method also known as the “replica trick” is a non-rigorous method developed for asymptotic analysis in statistical mechanics. The method has been rigorously justified in some few cases, e.g., for the system whose matrix has a semicircular eigenvalue distribution. Furthermore, it has been shown that the replica method gives valid predictions for several known problems, and thus it is widely employed for large system analysis of communication systems [11, 12, 13].
III Large System Analysis
Define {\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}\stackrel{{\scriptstyle\triangle}}{{=}}{\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}^{\dagger}{\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}. We start our analysis by determining
[TABLE]
which reads
[TABLE]
with the function \mathsf{g}({\mathchoice{\mbox{\boldmath\displaystyle x}}{\mbox{\boldmath\textstyle x}}{\mbox{\boldmath\scriptstyle x}}{\mbox{\boldmath\scriptscriptstyle x}}}) being defined as
[TABLE]
Using Varadhan’s theorem, one can write
[TABLE]
From (10) and (12), the evaluation of needs a logarithmic expectation to be determined which is not a trivial task. Thus, we use
[TABLE]
for some positive random variable . Consequently, we have
[TABLE]
where denotes the corresponding term in the first line. Here, the replica method suggests us to consider the replica continuity assumption and do the further analysis as follows: First, determine for an integer , and then, assume that the analytic continuation of onto the real line gives the same result. For details about the validity of this assumption, the reader is referred to [14]. Thus, we obtain
[TABLE]
where \{{\mathchoice{\mbox{\boldmath\displaystyle x}}{\mbox{\boldmath\textstyle x}}{\mbox{\boldmath\scriptstyle x}}{\mbox{\boldmath\scriptscriptstyle x}}}_{a}\} denotes the replicas \{{\mathchoice{\mbox{\boldmath\displaystyle x}}{\mbox{\boldmath\textstyle x}}{\mbox{\boldmath\scriptstyle x}}{\mbox{\boldmath\scriptscriptstyle x}}}_{1},\ldots,{\mathchoice{\mbox{\boldmath\displaystyle x}}{\mbox{\boldmath\textstyle x}}{\mbox{\boldmath\scriptstyle x}}{\mbox{\boldmath\scriptscriptstyle x}}}_{n}\}\in\mathbbmss{X}^{N}\times\ldots\times\mathbbmss{X}^{N}. Using the independency of and , the expectations over and separate, and thus, the summation on the right hand side of (15) reduces to
[TABLE]
By defining the matrix as
[TABLE]
with being an matrix with entries
[TABLE]
is found as
[TABLE]
Suppose that the empirical distribution of the eigenvalues of converges to a deterministic distribution, and denote the corresponding cdf with F_{{\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}}(\lambda). The Stieltjes transform of the distribution F_{{\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}}(\lambda) is defined as {\mathrm{G}}_{{\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}}(s)=\mathop{\mbox{\mathsf{E}}}(\lambda-s)^{-1}. The corresponding -transform is then defined as
[TABLE]
where \mathrm{G}_{{\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}}^{-1}(w) denotes the inverse with respect to composition. Noting that the expectation in (19) is a spherical integral, we use the results from [15] which state
[TABLE]
as with being the eigenvalues of . The matrix has only nonzero eigenvalues which are equal to the nonzero eigenvalues of
[TABLE]
Consequently, denoting the eigenvalues of by ,
[TABLE]
where tends to zero when . In order to find , we need to sum over the -dimensional space. We determine the summation by taking the same approach as in [12]. We split the space into the subshells
[TABLE]
where is the th entry of the matrix
[TABLE]
Substituting in (19), reduces to
[TABLE]
where the function \mathcal{G}({\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}}) is defined as
[TABLE]
and {\rm e}^{N\mathcal{I}({\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}})} is the Jacobian of the integral; moreover, we define
[TABLE]
In order to calculate the Jacobian term in (26), we firstly write
[TABLE]
Then, we introduce a new matrix \tilde{{\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}}} in the complex frequency domain. Following the lines in [12, eq.(52)-(58)] and defining for some , we obtain
[TABLE]
where the function \mathcal{M}(\tilde{{\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}}}) is defined as
[TABLE]
In the large system limit, the integration in (26) is dominated by the integrand at the saddle point. In order to calculate the saddle point of the integrand, one needs to impose a structure on the matrices and \tilde{{\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}}}. The most primary structure is imposed by the RS assumption. In the RS assumption, it is postulated that the saddle point matrices which dominate the integration in (26) are invariant to permutation of the replica indices. Therefore, following [12], under the RS assumption we set and for all and and for all where are some positive finite constants. Substituting in (26), can be analytically calculated, and consequently, is determined accordingly. It is then straightforward to evaluate the asymptotic distortion explicitly. The final expression for the asymptotic distortion under the RS assumption is stated in Proposition 1.
Proposition 1
Under the RS assumption, the asymptotic distortion converges to
[TABLE]
as and the inverse load factor is kept fixed. and are solutions to the following two coupled equations
[TABLE]
where
[TABLE]
Although the replica symmetry assumption has given the exact solution for some problems [16, 17], there are several examples in which this assumption fails to give valid prediction of the solution [13]. For these cases in order to have a more precise prediction, one needs to employ the -step RSB assumption which imposes a more generalized structure on the matrices and \tilde{{\mathchoice{\mbox{\boldmath\displaystyle Q}}{\mbox{\boldmath\textstyle Q}}{\mbox{\boldmath\scriptstyle Q}}{\mbox{\boldmath\scriptscriptstyle Q}}}}. Here, we consider the 1-RSB assumption which postulates [13]
[TABLE]
where are non-negative real numbers, {\mathchoice{\mbox{\boldmath\displaystyle 1}}{\mbox{\boldmath\textstyle 1}}{\mbox{\boldmath\scriptstyle 1}}{\mbox{\boldmath\scriptscriptstyle 1}}}_{n} is an matrix with all elements equal to and {\mathchoice{\mbox{\boldmath\displaystyle I}}{\mbox{\boldmath\textstyle I}}{\mbox{\boldmath\scriptstyle I}}{\mbox{\boldmath\scriptscriptstyle I}}}_{n} is the identity matrix. With same steps as for Proposition 1, the asymptotic distortion is determined under the 1-RSB assumption as in the following proposition.
Proposition 2
Under the 1-RSB assumption, the asymptotic distortion converges to
[TABLE]
as and the inverse load factor is kept fixed. The set of scalars is calculated through the coupled equations
[TABLE]
and
[TABLE]
where ,
[TABLE]
and the function being defined as
[TABLE]
Moreover, the parameters are determined as
[TABLE]
Note that letting reduces the 1-RSB solution to RS. This means that one of the 1-RSB solutions is always the RS solution. In fact, the coupled equations in both the RS and 1-RSB cases may have multiple solutions in which one of them is the valid saddle point of (26). In this case, the saddle point is the solution which minimizes a function corresponding to the system, known as the free energy. It can be shown that the free energy of the nonlinear LSE precoder is . This means that the saddle point is the solution which maximizes . The explicit expression for can be found in terms of the replica parameters and is skipped here due to lack of space.
IV Numerical results
In this section, we numerically investigate several examples of the nonlinear LSE precoder. Throughout our investigations, we consider and a channel matrix whose entries are iid with variance . Note that Propositions 1 and 2 are given for a general channel matrix and are not restricted to the iid case. In Appendix A, we show that the results can be generalized to the case of frequency-selective channels and OFDM signal. For the sake of simplicity, we assume that all the users have the same path loss and consider the extension of the analysis for more general setups in the extended version of the paper. For an iid matrix, we have [12]
[TABLE]
In the following, we consider two examples of massive MIMO systems with input constraint; namely transmitters with peak power constraint on each antenna and systems with PSK constellation on the transmit antennas. We determine the performance of the nonlinear LSE precoder using our replica results.
IV-A Per-antenna peak and total average power constraints
Consider a setup with constraints on the instantaneous power on each transmit antenna and total average power. The average power is set by choosing properly. In this case, reads
[TABLE]
where is the instantaneous peak power. We invoke the RS solution for investigation. Using Proposition 1, it can be shown that the parameter in the RS solution represents the average power per antenna. Moreover, the RS coupled equations become
[TABLE]
for being defined as
[TABLE]
and denoted by
[TABLE]
Furthermore, the asymptotic distortion is determined as
[TABLE]
Fig. 1 represents the asymptotic distortion versus the inverse load factor for a fixed total average power , different PAPR defined as and . To validate the results by the replica method, we have also plotted some simulation results obtained by CVX, a package for specifying and solving convex programs [18], for . It is observed that the analytical results are consistent with the simulation results although RS may not be exact in some cases. Furthermore, for PAPRs equal to or more than , the asymptotic distortion is sufficiently close to the case without peak power constraint.
In order to describe the variation of the required average power for a fixed asymptotic distortion with respect to the number of transmit antennas, we consider the case with unit per-antenna peak power constraint and plot the average power per-antenna for given asymptotic distortions. The parameter is set to . The results are shown in Fig. 2. It is observed that the per-antenna average power decays by increasing . By numerical curve fitting, it can be observed that the per-antenna average power converges to for some constants and as grows unbounded. For bounded , however, . For massive MIMO systems, i.e., , with average power constraint, it has been shown that when the base station has perfect channel state information, signal to interference plus noise ratio can be improved by a factor of , asymptotically [3] which agrees the result given here for the peak power constraint.
Next, the lower bound for the ergodic rate per user is investigated. The noise variance and the average transmitted power are set to 1. An important parameter in this case is . Increasing outperforms the received SNR but at the same time it also affects the power of interference. The numerical results show that there is an optimum value for for every and the average transmitted power . In Fig. 3, the lower bound for the ergodic rate per user is plotted versus the inverse load factor for different peak to average power constraints when the rate is optimized over . Note that although the replica method also predicts the results for , but the valid system assumption here is since the number of base station antennas should be larger than the number of users. The rate for different PAPRs are quit close. At around , for the case of constant envelope signal we need about more antennas to obtain the same performance as in the case of no peak power constraint. Further simulations for the case of constant envelope signal, which are not presented here due to space limitation, show that for , about more transmit power is required to get the same performance compared to the case of no peak power constraint.
IV-B M-PSK signals on antennas
Let’s consider the case of which the signal on each transmit antenna is selected from -PSK constellation. In this case, we have
[TABLE]
The constant envelope constraint is obtained easily by letting . Under the RS assumption, the unit per-antenna average power results in and the parameter reads
[TABLE]
Then, the RS prediction for the asymptotic distortion is
[TABLE]
Fig. 4 shows the asymptotic distortion for BPSK and QPSK constellations. For the sake of comparison, a lower bound for the asymptotic distortion is also plotted222Due to lack of space, derivation of the lower bound is omitted here. . For a BPSK constellation, the simulation results using an integer programming algorithm is also plotted considering .
For the BPSK case, it is observed that the RS prediction starts to deviate from the simulation results as increases. The RS prediction even violates the lower bound for . This observation clarifies the failure of the RS assumption in this case. To have a better approximation of the exact solution, we have also plotted the 1-RSB prediction in Fig. 4. The 1-RSB prediction meets RS for small and deviates as grows. However, the 1-RSB prediction also fails to approximate the simulation results and violates the lower bound for large . This observation brings this conjecture into mind that the precise approximation of the asymptotic distortion is given by the infinite number of replica breaking steps. Similar results are observed for the QPSK constellation.
Fig. 5 illustrates the RS prediction for PSK constellations. The case of constant envelope signal is also plotted by letting . For sake of comparison, we have shown the result for the unit peak power constraint as well. For , the results for -PSK and the constant envelope constellations are sufficiently close. Furthermore, the constant envelope constellation and the unit peak power constraint give almost the same asymptotic distortion under the RS assumption.
V Conclusions
The asymptotic performance of the nonlinear LSE precoder was analyzed using the replica method. Under the RS assumption, the asymptotic distortion of the precoder takes a simple form. Based on the investigations, the RS assumption seems to give a valid approximation of the exact solution in the case of a peak power constraint on each antenna and constant envelope signals. For the case with peak power constraint, the numerical results show that the transmit signals with PAPR of about perform sufficiently close to the case without peak power constraint. This plays a very important role in practice where low PAPR signals at the transmitter enable us to employ highly efficient nonlinear power amplifiers.
The RS prediction for an -PSK constellation, however, violates the theoretically rigorous bounds as the inverse load factor , defined as the number of transmit antennas to the number of users, increases. This implies that further exploration based on RSB assumptions is necessary for accurately approximating the performance. We considered the 1-RSB assumption and observed that the solution predicts the simulation results for a larger interval of . The results show that for the first time even 1-RSB can be unreliable in wireless communications and more RSB steps need to be considered.
Appendix A Genealization to frequency-selective fading channels
Let be the number of subcarriers and also assume that the channel is frequency-flat at each frequency sub-band. Furthermore, let {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{j} be the channel matrix at th sub-band. The data input vector at the th subcarrier is {\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}_{j}. The LSE precoder in this case determines vectors {\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}_{1},\cdots,{\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}_{L} to be given to the Inverse Fast Fourier Transform (IFFT) blocks as inputs. Let be the IFFT matrix and {\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}_{\rm t}\stackrel{{\scriptstyle\triangle}}{{=}}{\rm Vec}\left([{\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}_{1},\ldots,{\mathchoice{\mbox{\boldmath\displaystyle v}}{\mbox{\boldmath\textstyle v}}{\mbox{\boldmath\scriptstyle v}}{\mbox{\boldmath\scriptscriptstyle v}}}_{L}]^{\rm T}\right). The LSE precoder rule is
[TABLE]
where {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{\rm t} is a matrix whose th part of its columns is the th column of the {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{k} and the remained entries are zero, {\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}_{\rm t}\stackrel{{\scriptstyle\triangle}}{{=}}[{\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}_{1}^{\rm T},\ldots,{\mathchoice{\mbox{\boldmath\displaystyle u}}{\mbox{\boldmath\textstyle u}}{\mbox{\boldmath\scriptstyle u}}{\mbox{\boldmath\scriptscriptstyle u}}}_{L}^{\rm T}]^{\rm T} and {\mathchoice{\mbox{\boldmath\displaystyle W}}{\mbox{\boldmath\textstyle W}}{\mbox{\boldmath\scriptstyle W}}{\mbox{\boldmath\scriptscriptstyle W}}}_{\rm t} is an block-diagonal matrix whose diagonal blocks are equal to .
One can reformulate (53) as
[TABLE]
using the fact that {\mathchoice{\mbox{\boldmath\displaystyle W}}{\mbox{\boldmath\textstyle W}}{\mbox{\boldmath\scriptstyle W}}{\mbox{\boldmath\scriptscriptstyle W}}}_{\rm t}{\mathchoice{\mbox{\boldmath\displaystyle W}}{\mbox{\boldmath\textstyle W}}{\mbox{\boldmath\scriptstyle W}}{\mbox{\boldmath\scriptscriptstyle W}}}_{\rm t}^{\dagger}={\mathchoice{\mbox{\boldmath\displaystyle I}}{\mbox{\boldmath\textstyle I}}{\mbox{\boldmath\scriptstyle I}}{\mbox{\boldmath\scriptscriptstyle I}}}. One can consider an equivalent frequency-flat fading channel with the channel matrix equal to {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{\rm t}{\mathchoice{\mbox{\boldmath\displaystyle W}}{\mbox{\boldmath\textstyle W}}{\mbox{\boldmath\scriptstyle W}}{\mbox{\boldmath\scriptscriptstyle W}}}_{\rm t}^{\dagger}. For the case that {\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{i}s are iid Gaussian matrices, Fig. 6 compares the empirical cumulative distribution of the eigenvalues of {\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}_{\rm t}={\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{\rm t}^{\dagger}{\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{\rm t} and {\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}_{j}={\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{j}^{\dagger}{\mathchoice{\mbox{\boldmath\displaystyle H}}{\mbox{\boldmath\textstyle H}}{\mbox{\boldmath\scriptstyle H}}{\mbox{\boldmath\scriptscriptstyle H}}}_{j} numerically for and . It is observed that the both cases have the same distribution. Since the result derived by the replica method depends only on the eigenvalue distribution of {\mathchoice{\mbox{\boldmath\displaystyle R}}{\mbox{\boldmath\textstyle R}}{\mbox{\boldmath\scriptstyle R}}{\mbox{\boldmath\scriptscriptstyle R}}}_{\rm t}, this proves that the LSE precoder in this case has the same performance as in the case of frequency-flat fading channel.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. G. Andrews, S. Buzzi, W. Choi, S. Hanly, A. Lozano, A. C. Soong, and J. C. Zhang, “What will 5G be?” IEEE Journal on Selected Areas in Communications , vol. 32, no. 6, 2014.
- 2[2] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” Wireless Communications, IEEE Transactions on , vol. 9, no. 11, pp. 3590–3600, 2010.
- 3[3] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, O. Edfors, and F. Tufvesson, “Scaling up mimo: Opportunities and challenges with very large arrays,” Signal Processing Magazine, IEEE , vol. 30, no. 1, pp. 40–60, 2013.
- 4[4] R. F. Fischer, C. Windpassinger, A. Lampe, and J. B. Huber, “Space-time transmission using tomlinson-harashima precoding,” ITG FACHBERICHT , pp. 139–148, 2002.
- 5[5] B. M. Hochwald, C. Peel, and A. Swindlehurst, “A vector-perturbation technique for near-capacity multiantenna multiuser communication-Part II: Perturbation,” IEEE Transactions on Communications , vol. 53, no. 3, pp. 537–544, Mar. 2005.
- 6[6] W. Zeng, C. Xiao, M. Wang, and J. Lu, “Linear precoding for finite-alphabet inputs over MIMO fading channels with statistical CSI,” Signal Processing, IEEE Transactions on , vol. 60, no. 6, pp. 3134–3148, 2012.
- 7[7] S. K. Mohammed and E. G. Larsson, “Per-antenna constant envelope precoding for large multi-user MIMO systems,” Communications, IEEE Transactions on , vol. 61, no. 3, pp. 1059–1071, 2013.
- 8[8] M. A. Sedaghat, V. I. Barousis, C. Papadias et al. , “Load modulated arrays: a low-complexity antenna,” IEEE Communications Magazine , vol. 54, no. 3, pp. 46–52, 2016.
