RLS-Based Detection for Massive Spatial Modulation MIMO
Ali Bereyhi, Saba Asaad, Bernhard G\"ade, Ralf R. M\"uller

TL;DR
This paper analyzes the asymptotic performance of RLS-based detection algorithms in massive spatial modulation MIMO systems, providing theoretical insights and validation for various channel matrix types.
Contribution
It derives the asymptotic mean squared error and error rate for RLS detection in SM, extending analysis to non-i.i.d. and non-Gaussian channel matrices.
Findings
Performance characterization holds for non-Gaussian matrices.
Analytical results match numerical simulations.
Supports the validity of the box-LASSO approach.
Abstract
Most detection algorithms in spatial modulation (SM) are formulated as linear regression via the regularized least-squares (RLS) method. In this method, the transmit signal is estimated by minimizing the residual sum of squares penalized with some regularization. This paper studies the asymptotic performance of a generic RLS-based detection algorithm employed for recovery of SM signals. We derive analytically the asymptotic average mean squared error and the error rate for the class of bi-unitarily invariant channel matrices. The analytic results are employed to study the performance of SM detection via the box-LASSO. The analysis demonstrates that the performance characterization for i.i.d. Gaussian channel matrices is valid for matrices with non-Gaussian entries, as well. This justifies the partially approved conjecture given in [1]. The derivations further extend the former studies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
MIMO multiple-input multiple-output CSI channel state information AWGN additive white Gaussian noise i.i.d. independent and identically distributed UT user terminal BS base station SM spatial modulation MA-SM multiple-active spatial modulation GLSE generalized least squared error RLS regularized least-squares r.h.s. right hand side l.h.s. left hand side w.r.t. with respect to RS replica symmetry RSB replica symmetry breaking ML maximum-likelihood RZF regularized zero forcing PSK phase shift keying SNR signal-to-noise ratio RF radio frequency TDD time division duplexing MF match filtering GAMP generalized approximate message passing MAP maximum-a-posterior MSE mean squared error MMSE minimum MSE SSK space shift keying RSS residual sum of squares LASSO least absolute shrinkage and selection operator
RLS-Based Detection for Massive Spatial Modulation MIMO
Ali Bereyhi, Saba Asaad, Bernhard Gäde, Ralf R. Müller
Institute for Digital Communications, Friedrich-Alexander Universität Erlangen-Nürnberg
{ali.bereyhi, saba.asaad, bernhard.gaede, ralf.r.mueller}@fau.de
This work has been accepted for presentation in the IEEE International Symposium on Information Theory (ISIT) 2019 in Paris, France. The link to the final version in the Proceedings of ISIT will be available later.
Abstract
Most detection algorithms in spatial modulation (SM) are formulated as linear regression via the regularized least-squares (RLS) method. In this method, the transmit signal is estimated by minimizing the residual sum of squares penalized with some regularization. This paper studies the asymptotic performance of a generic RLS-based detection algorithm employed for recovery of SM signals. We derive analytically the asymptotic average mean squared error and the error rate for the class of bi-unitarily invariant channel matrices.
The analytic results are employed to study the performance of SM detection via the box-LASSO. The analysis demonstrates that the performance characterization for i.i.d. Gaussian channel matrices is valid for matrices with non-Gaussian entries, as well. This justifies the partially approved conjecture given in [1]. The derivations further extend the former studies to scenarios with non-i.i.d. channel matrices. Numerical investigations validate the analysis, even for practical system dimensions.
I Introduction
Recovery algorithms based on the regularized least-squares (RLS) regression method are doubtless among the most studied schemes for ill-posed signal recovery in linear models. Examples of such algorithms are the least absolute shrinkage and selection operator (LASSO) [2] and Tikhonov regularization method [3] which are used in various applications, e.g. sparse recovery [4]. The main task in these applications is to recover a signal from an underdetermined set of linear and possibly noisy observations . The RLS-based algorithms solve this problem by minimizing the residual sum of squares (RSS) penalized via a regularization term, i.e. for some penalty and the known projection .
This study investigates the characteristics of a generic class of RLS-based algorithms by considering their applications to the spatial modulation (SM) technique recently proposed for multiple-input multiple-output (MIMO) transmission with restricted hardware complexity [5, 6]. For scenarios in which the number of active transmit antennas is smaller than the total number of available antenna elements, SM utilizes the sparsity of the transmit signal to convey information via both the non-zero symbols and the support of the transmit signal. To this end, the information bits are divided into two subsets. One subset is used to select a unique subset of transmit antennas which are set active during transmission, and the other subset is transmitted over these active antennas via a conventional modulation technique, e.g. phase shift keying (PSK). At the receive side, the receiver needs to jointly recover the index of the active antennas, and the transmitted symbols.
Following analogy between the detection task in SM and compressive sensing [4, 7], several detection schemes were proposed in the literature based on sparse recovery techniques, e.g. [8, 9, 1]. These schemes often lie in the class of RLS-based recovery algorithms; hence, their performance is characterized by studying these recovery algorithms.
Contributions and Related Work
Recent developments in multi-antenna technologies suggest the utilization of massive MIMO settings in the next generation of mobile networks [10]. In this respect, we characterize the RLS-based recovery algorithms in the large-system limit. The analysis differs from earlier studies on mathematically similar models, e.g. [11], in the fact that the SM* signals are not independent and identically distributed (i.i.d.)* in general. We address this via a conditional form of the asymmetric decoupling property derived for RLS recovery in [12]. Closed-form expressions for the average mean squared error (MSE) and error rate in a massive multiuser multiple-active spatial modulation (MA-SM) MIMO system are derived.
Our derivations extend available results in the literature in various respects. We demonstrate this by studying the example of box-LASSO recovery. For this example, our results extend the earlier derivations for i.i.d. Gaussian channel matrices in [1] to the class of bi-unitarily invariant random matrices. Our investigations further show that the asymptotic distortion is of the same form for all i.i.d. channel matrices. This justifies the universality conjecture which was partially approved in [1].
Notations
We represent scalars, vectors and matrices with non-bold, bold lower case and bold upper case letters, respectively. denotes the identity matrix. and are the transposed and transposed conjugate of . The real axes and its non-negative subset are denoted by and , respectively. The complex plane is shown by . We use and to show the - and -norm, respectively. denotes the “-norm” of defined as the number of non-zero entries. For random variable , the probability distribution is shown by . is the expectation operator. We use the shortened notation to represent .
II Problem Formulation
Consider uplink transmission in a Gaussian multiple access MIMO channel with transmitters and a single receiver. Each transmitter is equipped with antennas and radio frequency (RF) chains. Hence, in each channel use, only transmit antennas are active at transmit terminals. We denote the fraction of active antennas by .
At the receive side, an antenna array of size is employed. Hence, the receive vector reads
[TABLE]
Here, represents the vector of transmit signals, i.e.
[TABLE]
where is the signal transmitted by terminal . As a result, is of size . denotes the channel matrix. It is assumed that the channel is estimated prior to data transmission, and hence is known at the receiver.
represents noise and reads .
II-A Channel Model
The channel is assumed to experience frequency-flat fading with slow time variations. We consider a generic fading model in which is a bi-unitarily invariant random matrix. This means that for any pair of unitary matrices and , which are independent of , the joint distribution of the entries of is identical to that of [13].
This ensemble comprises a variety of fading models including the well-known i.i.d. Rayleigh fading model.
II-B Spatial Modulation
Let denote a sequence of information bits at terminal . Without loss of generality, let be an i.i.d. binary sequence with uniform distribution. Assume that the active antennas at each terminal transmit symbols from alphabet which contains symbols. is then constructed as follows:
Assigning modulation indices: Consider all possible subsets of transmit antennas selected out of available antennas at terminal . The transmitter selects distinct subsets randomly and uniformly, where
[TABLE]
To each of these subsets a modulation index is assigned. 2. 2.
Index modulation: For given sequence , the transmitter chooses index , such that the first bits of be the binary representation of . We denote the subset of antenna indices corresponding to with . 3. 3.
Modulating multiple streams: Terminal considers independent blocks of , each of length , and maps them into the symbols for , using a standard modulation scheme, e.g. PSK. These symbols are then transmitted on the antennas corresponding to .
Considering the above SM scheme, the -th transmit signal entry of terminal , i.e. for , reads , where is the indicator function, returning one, if and zero, if . Hence, is an -sparse vector, i.e. only entries are non-zero. As a result, is -sparse, where .
We define the activity factor as . This factor describes the total fraction of active transmit antennas in the network. Noting that , we have .
II-C RLS-Based Detection Algorithms
For data recovery, the receiver requires to detect both the support of and the transmitted symbols from . To this end, an RLS-based algorithm is employed. This algorithm determines a soft estimation of , for given , as
[TABLE]
where is a regularization function, and is a subset of including the alphabet , i.e. . The notation is further defined for brevity.
Given , the detected vector is then given by mapping the soft estimation into a vector in . This means where is a decisioning function.
Special Cases
The RLS-based recovery schemes recover various MA-SM detection algorithms. As an example, let and with denoting the prior distribution of . Moreover, set . The algorithm in this case reduces to the optimal Bayesian detection scheme, i.e. * maximum-a-posterior (MAP)*.
Noting that dimensions are rather large in massive MIMO systems, the non-convex choices of and result in computationally intractable detection algorithms. As a result, convex forms are considered in practice. A well-known example is the so-called box-LASSO. In box-LASSO, is set to a convex subset of the complex plane which contains the symbols in . is further chosen proportional to the -norm following the regularization approach proposed by Tibshirani in [2]. In this case, the soft estimation comprises symbols from . is hence set to be an entry-wise thresholding operator which maps each estimated entry into either zero or a transmit symbol in .
II-D Performance Metrics
Using the MA-SM scheme, each terminal transmits information bits per channel use. Comparing to conventional modulation schemes, the spectral efficiency in this case is increased by bits per channel use. This is acquired at the expense of diversity. In fact, the index modulation in MA-SM performs as a random selection algorithm which reduces the diversity gain, compared to other selection techniques.
In order to characterize the performance of MA-SM transmission, a distortion metric is further required. The common metric is the average error rate defined as the probability of bit flip averaged over the transmit block size, i.e. .
Definition 1** (Average error rate):**
Let denote the vector of detected signals. The average error rate is defined as
[TABLE]
For RLS-based detectors, the distortion can also be defined with respect to the soft estimation. In this respect, we further consider the average MSE as another distortion metric.
Definition 2** (Average MSE):**
Consider the soft estimation . The average MSE is defined as
[TABLE]
II-E The Large-System Limit
We aim to determine the asymptotic limit of the distortion metrics. In this respect, for bounded and fixed and , we consider a sequence of settings with receive antennas and transmit terminals. We assume that is a deterministic sequence of , such that
[TABLE]
is bounded. As a result, the sequence of channel loads, defined as , converges to , as . The activity factor of this sequence of settings is constant in and reads for all choices of .
For a given setting with receive antennas, we define to be the empirical cumulative distribution of the eigenvalues of . It is assumed that the sequence of has a deterministic limit , when grows large. The Stieltjes transform of is given by
[TABLE]
for some in the upper half complex plane. The -transform is further defined as for some , such that . Here, denotes the inverse with respect to composition.
III Main Results
The analytic derivations in this section follows the results given in [12] via the replica method. The consistency of the results relies on the validity of some conjectures, such as replica continuity and replica symmetry. Although these conjectures lack mathematical proofs, several studies have confirmed the consistency for some particular examples, e.g. [14].
We state the main results using the decoupled setting. This is a tunable scalar setting whose distortion metrics are analytically calculated for all tuning parameters. It is shown that for a specific tuple of the tuning parameters, the distortion metrics of this setting give the large-system limits of the average error rate and MSE.
Definition 3** (Decoupled setting):**
For given and , define
[TABLE]
Let , where is uniformly distributed on , and is a Bernoulli random variable for which . Define the decoupled output as
[TABLE]
with . Then, the decoupled soft estimation is
[TABLE]
The decoupled detected symbol is moreover defined in terms of as . For this setting, the error probability reads and the MSE is given by .
Unlike and , the derivation of and deals with a scalar optimization problem which is tractable for any and . Proposition 1 indicates that for specific choices of and , the average error rate and the average MSE are given by the error probability and the MSE of the decoupled setting, respectively. The values of and , for which this equivalency happens, is given in the proposition via a system of fixed-point equations.
Proposition 1**:**
Consider the sequence of settings illustrated in Section II-E. As grows large, we have
[TABLE]
and
[TABLE]
where and are solutions to the fixed-point equations
[TABLE]
Sketch of the Proof.
The proof follows the asymmetric form of the decoupling property derived for RLS recovery in [12]. To start with the proof, let be the vector of modulation indices for a setting with transmit dimension . For a given distortion function , define the conditional average distortion as
[TABLE]
For a given , let contain the indices of the non-zero entries in . It is then straightforward to conclude that conditioned to , the entries for are independent and uniformly distributed on . As a result, the conditional distribution reads
[TABLE]
This distribution follows the asymmetric signal model that considered in [12] with two blocks of i.i.d. sequences.
By standard derivations, it is concluded from the asymptotic decoupling property in [12] that for and , converges to and , respectively, as tends to . Here, denotes the conditional distribution of the decoupled soft estimation defined in (10), for and , given that the decoupled input is set to . is moreover distributed uniformly over . Substituting into (14), after some lines of calculations, we have
[TABLE]
Noting that the limit in (16) is constant in , we finally have
[TABLE]
where .
By setting the distortion function to and , the asymptotic average MSE and error rate are derived from (16), respectively. More details are given in the extended version of the paper. ∎
IV A Classic Application of the Results
The asymptotic characterization of the RLS-based detection schemes enables us investigating MA-SM in various aspects. In this section, we discuss a particular application; namely, we study a box-LASSO detector. This special form was investigated earlier in [1], where its asymptotic performance was characterized analytically for i.i.d. Gaussian channel matrices. The analysis in this section hence extends the derivations in [1] in several aspects, e.g. channel model.
Applications of the results are not restricted to this particular example. For instance, the asymptotic characterizations can be utilized to derive theoretical bounds on the performance of MA-SM transmission. Due to page limitation, we skip further applications and present them later in an extended version.
IV-A Box-LASSO Detection for SSK Transmission
We consider the following special case of the setting:
for some power . This equivalently means . This is an extreme case of SM, known as space shift keying (SSK), in which the information symbols are completely conveyed via index modulation. 2. 2.
The RLS-based detector has the following specifications:
- •
The regularization function is set to for some regularization parameter which is tunable.
- •
for some and .
- •
The decisioning function is for some threshold .
This setting describes box-LASSO detection for SSK signaling. The detector, in this case, relaxes the optimal Bayesian detector by approximating the exponent of the transmit signal’s prior distribution with the -norm and convexifying the set with the interval .
The asymptotic performance of this particular setting was investigated in [1] for i.i.d. Gaussian channel matrices. Based on simulations and universality results111See [15] for some results on universality., the authors conjec-tured that the analysis is further valid for non-Gaussian i.i.d. channel matrices. This conjecture was partially approved in [1] using the Lindeberg principle.
Invoking our asymptotic derivations, the conjecture in [1] is straightforwardly justified. In fact, from Proposition 1, it is observed that the channel matrix plays rule in the asymptotic characterization via the limiting distribution . From random matrix theory, we know that this distribution is the same for all i.i.d. channel matrices and follows the Marcenko-Pastur law [13]. It is hence concluded that the asymptotic characterizations of the performance for i.i.d. Gaussian matrices extends to i.i.d. matrices with other entry distributions, as well.
Remark 1**:**
The asymptotic results in this paper are given for bi-unitarily invariant random matrices. Hence, the analysis not only justifies the conjecture in [1], but also extends the results beyond the i.i.d. matrices. For non-i.i.d. matrices, it is obvious from Proposition 1, that the derivations in [1] are not valid anymore. The performance in such cases is straightforwardly characterized via Proposition 1.
IV-B Decoupled Setting of the Box-LASSO Detector
For the box-LASSO detector, the decoupled input reads with being a Bernoulli random variable described in Definition 3. The decoupled soft estimation is further given by
[TABLE]
where and .
The asymptotic values for the MSE and error rate are derived by substituting (18) into the fixed point equations, given in Proposition 1, and calculating and .
Noting that is a zero-mean unit-variance complex Gaussian random variable, the expectations on the right hand side of (13a) and (13b) are analytically derived for a given pair of and as sums of Gaussian integrals. Hence, they are straightforwardly calculated and replaced into the fixed-point equations. The resulting equations are then solved numerically.222An alternative approach is to iteratively find the stability point of the corresponding replica simulator; see [11] for detailed discussions.
IV-C Numerical Results
The analytic derivations are validated via numerical investigations considering the example of box-LASSO detection. To this end, we consider a scenario with transmit terminals, each equipped with antennas and a single RF chain, i.e. . Consequently, . Using the single RF-chain, each terminal transmits three information bits in each channel use by SSK signaling. We also set .
At the receiver, an antenna array of size is employed. Thus, , and the channel load is . For signal recovery, a box-LASSO detector is used in which and . The threshold in the decisioning function is set to .
The analytical results are given for an i.i.d. channel matrix whose entries are zero-mean with variance .333The entries can have an arbitrary distribution. In this case, we have [13]. The simulations are given for Rayleigh fading model in which the channel entries are Gaussian. We set the noise variance with respect to the signal-to-noise ratio (SNR) which is defined as .
Fig. 1 and 2 depict the average MSE and the error rate, achieved by the box-LASSO detector, against regularization parameter , when dB. For sake of comparison, the plots for standard LASSO are further given. By standard LASSO, we mean the extreme case of box-LASSO detection in which . The solid lines in these figures indicate the analytic results given by Proposition 1. The squares are given by numerical simulations with realizations. As the figures depict the simulated points closely track the large-system characterization. This observation validates the consistency of the analytic results in practical dimensions.
As Fig. 1 and 2 demonstrate, for a given SNR, the performance is optimized at some . This value is analytically found via Proposition 1, as Fig. 3 illustrates. In this figure, is plotted against the SNR. At each SNR, is found such that the asymptotic MSE, derived via Proposition 1, is minimized. The regularization parameter in the box-LASSO detector is then set to and the achieved MSE is plotted against the SNR. As the figure depicts, numerical simulations are tightly consistent with the analytical derivations. This implies that the analytic derivations provide an easy and fast tool for efficient tuning of RLS-based detectors, even in practical dimensions.
V Conclusion
The average MSE and error rate were analytically derived for massive SM MIMO settings when a generic RLS-based algorithm is employed for detection. The analysis was given for the large class of bi-unitarily invariant random matrices which includes various fading models. The asymptotic results were employed to study box-LASSO detectors for SSK signaling. This particular application of the results extended the analysis in [1] to a larger set of channel matrices. Numerical simulations showed close consistency with the analytic results, even in practical dimensions. Other applications of the results are skipped due to the page limitation and will be presented in an extended version which is currently under preparation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] I. B. Atitallah, C. Thrampoulidis, A. Kammoun, T. Y. Al-Naffouri, M.-S. Alouini, and B. Hassibi, “The box-lasso with application to GSSK modulation in massive MIMO systems,” IEEE ISIT, 2017.
- 2[2] R. Tibshirani, “Regression shrinkage and selection via the lasso,” J. of the Royal Stat. Soc. Series B (Methodological) , pp. 267–288, 1996.
- 3[3] A. Tikhonov and V. Y. Arsenin, Methods for solving ill-posed problems, John Wiley and Sons, Inc, 1977.
- 4[4] E. J. Candès, J. Romberg, and T. Tao, “Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information,” IEEE Trans. on Inf. Th. , vol. 52, no. 2, pp. 489–509, 2006.
- 5[5] J. Jeganathan, A. Ghrayeb, L. Szczecinski, and A. Ceron, “Space shift keying modulation for MIMO channels,” IEEE Trans. on Wir. Comm. , vol. 8, no. 7, pp. 3692–3703, 2009.
- 6[6] A. Younis, N. Serafimovski, R. Mesleh, and H. Haas, “Generalised spatial modulation,” in 44 th superscript 44 th 44^{\rm th} IEEE Asilomar Conference, 2010.
- 7[7] D. L. Donoho, “Compressed sensing,” IEEE Trans. on Inf. Th. , vol. 52, no. 4, pp. 1289–1306, 2006.
- 8[8] C.-M. Yu, S.-H. Hsieh, H.-W. Liang, C.-S. Lu, W.-H. Chung, S.-Y. Kuo, and S.-C. Pei, “Compressed sensing detector design for space shift keying in MIMO systems,” IEEE Comm. Letters , vol. 16, no. 10, pp. 1556–1559, 2012.
