Spatial Channel Covariance Estimation for Hybrid Architectures Based on Tensor Decompositions
Sungwoo Park, Anum Ali, Nuria Gonz\'alez-Prelcic, Robert W., Heath Jr

TL;DR
This paper introduces a tensor decomposition-based method for estimating spatial channel covariance in hybrid architectures, improving accuracy especially at low SNR levels by leveraging low-rank tensor representations.
Contribution
It proposes a novel tensor decomposition approach for covariance estimation in hybrid MIMO systems, outperforming existing compressive sensing and angle-of-arrival methods.
Findings
Achieves higher estimation accuracy than prior methods.
Performs better at low SNR conditions.
Provides theoretical bounds on estimation accuracy.
Abstract
Spatial channel covariance information can replace full instantaneous channel state information for the analog precoder design in hybrid analog/digital architectures. Obtaining spatial channel covariance estimation, however, is challenging in the hybrid structure due to the use of fewer radio frequency (RF) chains than the number of antennas. In this paper, we propose a spatial channel covariance estimation method based on higher-order tensor decomposition for spatially sparse time-varying frequency-selective channels. The proposed method leverages the fact that the channel can be represented as a low-rank higher-order tensor. We also derive the Cram\'er-Rao lower bound on the estimation accuracy of the proposed method. Numerical results and theoretical analysis show that the proposed tensor-based approach achieves higher estimation accuracy in comparison with prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Spatial Channel Covariance Estimation
for Hybrid Architectures Based on Tensor Decompositions
Sungwoo Park, Anum Ali, Nuria González-Prelcic, and Robert W. Heath Jr S. Park, A. Ali, N. González-Prelcic, and R. W. Heath Jr. are with the Wireless Networking and Communication Group (WNCG), Department of Electrical and Computer Engineering, The University of Texas at Austin, TX, 78701 USA. (e-mail: {swpark96,anumali,ngprelcic,rheath}@utexas.edu). This work is supported in part by the National Science Foundation under Grant No. 1514275, and by a gift from Huawei Technologies.
Abstract
Spatial channel covariance information can replace full instantaneous channel state information for the analog precoder design in hybrid analog/digital architectures. Obtaining spatial channel covariance estimation, however, is challenging in the hybrid structure due to the use of fewer radio frequency (RF) chains than the number of antennas. In this paper, we propose a spatial channel covariance estimation method based on higher-order tensor decomposition for spatially sparse time-varying frequency-selective channels. The proposed method leverages the fact that the channel can be represented as a low-rank higher-order tensor. We also derive the Cramér-Rao lower bound on the estimation accuracy of the proposed method. Numerical results and theoretical analysis show that the proposed tensor-based approach achieves higher estimation accuracy in comparison with prior compressive-sensing-based approaches or conventional angle-of-arrival estimation approaches. Simulation results reveal that the proposed approach becomes more beneficial at low signal-to-noise (SNR) region.
I Introduction
Hybrid analog/digital precoding uses a smaller number of RF chains to reduce the number of power-consuming devices like analog-to-digital converters (ADCs) or digital-to-analog converters (DACs). Consequently, the hybrid approach can reduce power consumption and implementation complexity in millimeter wave multiple-input-multiple-output (MIMO) systems [1, 2, 3, 4] and massive MIMO systems [5, 6, 7]. The rate loss incurred by the hybrid architecture is insignificant for spatially sparse channels such as in millimeter wave systems or in suburban/rural areas in sub-6 GHz systems [1, 2, 3, 4, 7].
A main challenge in the hybrid architecture is to configure the analog and digital precoding stages. Many previous methods accomplish this task based on full CSI [1, 4, 8]. These approaches require frequent estimation of the channel, obtained for example via sparse recovery techniques. An alternative is to use only long-term statistical knowledge such as that contained in spatial channel covariance matrices, for the analog precoder design [9, 10, 7, 11]. Once the analog precoder is determined based on the spatial channel covariance, the digital precoder, of a much smaller dimension is designed by using instantaneous full CSI of the low-dimensional effective channel, i.e., the propagation channel combined with the analog precoder. While the accurate estimation of the full CSI is difficult for time-varying frequency-selective channels, the long-term statistical CSI can be efficiently estimated. It was shown in [9, 10, 7, 11] that the hybrid precoding methods based on spatial channel covariance achieve spectral efficiency close to that of the hybrid precoding obtained from full CSI when the channels are spatially sparse.
Although the use of the spatial channel covariance matrix helps the hybrid precoding design to be simpler and more practical, the hybrid architecture makes it difficult to estimate the covariance matrix. Since there is no digital access to the outputs of every antenna, and only the signals combined in an analog way are available at baseband, it is difficult to estimate the spatial covariance of the high dimension channel. Different approaches have been suggested to solve the spatial channel covariance estimation problem under such an environment. In [6], a least-squares-based covariance estimation method was proposed by using time-varying analog beamforming matrices. Since the method does not exploit the sparse channel property, it is not an efficient method for sparse channels, which are of our interest in this paper. The sparse ruler array in [12, 13, 14] and the coprime sensor array [15] can omit measurements on some antenna elements by leveraging the fact that correlations between antenna elements are wide-sense stationary in spatial domain. Although these so-called compressive covariance sensing (CCS) methods can reduce the number of RF chains, the methods have a limitation on the configuration of the number of RF chains and antennas.
The CCS methods were initially developed by using only a subset of antennas. It is, however, possible to extend the work to general hybrid architecture where the analog part is composed of phase shifters and thus is represented as a dense matrix [14]. In this dense sensing matrix case, the CCS methods become closely related to typical compressive sensing (CS) techniques. For example, in [11], the spatial channel estimation method was developed by adopting the time-varying analog combiners used in [6], which are dense matrices. Instead of the least-squares method, one of the well-known conventional vector-type CS techniques, orthogonal matching pursuit (OMP), was adopted to exploit the sparse channel property. It is worthwhile to note that conventional vector-type CS techniques were typically developed for channel estimation [16, 17, 18] but can be extended to spatial channel covariance estimation as well. The vector-type CS techniques for covariance estimation, however, are outperformed by matrix-type CS techniques developed for so-called multiple measurement vector (MMV) problems [19, 20, 21] that enable the joint spare recovery. An advanced spatial channel covariance estimation method based on the MMV approach was proposed in [22] by applying time-varying sensing matrices and exploiting the Hermitian property of the covariance matrix. The performance of the CS-based methods, however, is acceptable only in the moderate or high SNR region.
Besides the CS-based work, conventional AoA estimation methods such as the multiple signal classification (MUSIC) [23] and the estimation of signal parameters via rotational invariance technique (ESPRIT) [24] can also be applied to the spatial channel covariance estimation problem via some modification. In the conventional AoA estimation methods for fully-digital architectures, the spatial channel covariance is directly calculated from the received signal vectors, and the AoA is estimated from the obtained covariance matrix. A covariance estimation based on AoA estimation for the hybrid architecture requires the estimation process in the opposite direction. First, the covariance matrix of the low-dimensional baseband received signal is calculated. Second, the AoAs are estimated from the covariance of the baseband received signal vectors. Finally, the covariance of the actual channel is reconstructed from the estimated AoAs. This basic approach has been adopted for different scenarios with some variations [25, 26]. This approach, however, has a weak point: the estimation accuracy rapidly decreases as the number of channel paths increases toward the number of RF chains. In addition, the methods based on this approach do not work properly when the number of channel paths exceeds that of RF chains.
In this paper, we propose a spatial channel covariance estimation method for the hybrid architecture over uplink time-varying frequency-selective channels. We consider a time division duplex (TDD) system where the estimated covariance over uplink channels can be used for the downlink precoding design at a base station (BS). We represent the channel and the received baseband signal as higher-order tensors. Considering spatially sparse channels, we use the fact that these higher-order tensors have a low tensor rank and their canonical polyadic decomposition (CPD) forms are unique up to a common permutation and scaling of columns under some mild conditions [27, 28, 29]. We also analyze the theoretical performance by adopting the performance metric in [22, 30] that is associated with the dominant eigenvalues and their eigenspaces of the spatial channel covariance matrix. We will call this performance metric the relative precoding efficiency (RPE) in this paper. After showing that the RPE is closely related to the mean squared error (MSE) of the AoA estimation, we derive Cramér-Rao lower bound (CRLB) for the AoA estimation and its associated bound for the performance metric. Using numerical results, we first show that the performance of the proposed method approaches the performance bound as SNR increases. We also show that the lower bound of the tensor-based method is lower than that of the MUSIC-based method, which provides insight into the benefits of using tensor-based methods. Simulations show that the proposed tensor-based method outperforms CS-based methods as well as MUSIC-based methods, and its gain becomes more significant in the low SNR regime.
The rest of the paper is organized as follows. Section II briefly introduces the basics of high-order tensor algebra. Section III provides a system and channel model by using tensor representations. Section IV describes the proposed spatial channel covariance method. The performance metric is analyzed in Section V, and the CRLB is derived in Section VI. In Section VII, the proposed tensor-based work is compared with prior work based on CS or MUSIC. Simulation results are presented in Section VIII, and conclusions are drawn in Section IX. This paper is the journal version of [31] with theoretical analysis added.
Notation: We use the following notation throughout this paper: is a scalar, is a vector, is a matrix, and is a tensor. , , , and are transpose, conjugate, conjugate transpose, and Moore-Penrose pseudoinverse. and are the -th row and the -th column of the matrix . , , and denote the Kronecker product, the Hadamard product, and the column-wise Khatri-Rao product. denotes the outer product, which is also known as the tensor product. and denote the real part and the imaginary part of . is a column vector whose elements are composed of the diagonal elements of .
II Preliminaries: overview of tensor algebra and canonical polyadic decomposition
In this section, we review the basics of tensor algebra that will be used in this paper. Readers who are interested in more details about tensors can refer to [27, 28, 29] and the references therein. A tensor denotes a multi-dimensional (a.k.a. multi-way or multi-mode) array. The order of a tensor is defined as the number of dimensions of the tensor. A vector and a matrix are special cases of a tensor, i.e., a vector is a tensor of order one, and a matrix is a tensor of order two.
Given an -th order tensor , let its -th element be denoted by . The mode- fibers of are defined as vector-valued sub-tensors obtained by fixing all but one index associated with mode-, i.e., . The number of mode- fibers in is .
The mode- matricization (a.k.a. unfolding) is a process that transforms a tensor into a matrix whose columns are composed of mode- fibers of the tensor. The mode- unfolding matrix of is denoted by . The tensor maps to such that .
The mode- product of a tensor and a matrix is denoted by . Let . Then, the elements of the tensor are given by f. The mode- product representation can also be expressed by using the mode- matricization as
[TABLE]
Given a tensor and matrices for , their full multilinear product is defined as
[TABLE]
For a special case where and is a diagonal tensor that has zero off-diagonal elements and unit diagonal elements, there exists a simplified notation of the full multilinear product as
[TABLE]
The norm of a tensor is defined as
[TABLE]
which is analogous to the Frobenius norm in the matrix case.
Let and . Then, the outer product (a.k.a. tensor product) of and is denoted by . Let . Then, the elements of the tensor are given by .
A tensor is called a rank-one tensor if it can be written as the outer product of vectors as
[TABLE]
where .
The canonical polyadic decomposition (CPD), which is also known as CANDECOMP/PARAFAC decomposition, factorizes a tensor into a sum of component rank-one tensors. The CPD of has a form
[TABLE]
where for . The minimum possible value of the number of rank-one tensors that constitute , which is in (6), is called the rank of .
III Channel model and system model
Consider a TDD system where a base station with antennas and RF chains communicates with a mobile station that has a single antenna.
III-A Channel model
We consider a spatially sparse channel that has paths between the BS and mobile user. Let and denote the path delay and AoA of the path. Let denote the short-term fading complex path gain of the path at frame . Let denote the low pass filter including pulse shaping and analog filters. We assume a uniform linear array (ULA) with antenna element spacing and signal wavelength . It is possible to extend the proposed method to a uniform planar array (UPA). The array response vector associated with the AoA is expressed as
[TABLE]
Let and be the sampling duration and the cyclic prefix length. We assume that the large-scale fading parameters, ’s and ’s, are constant during the estimation process. By using the delay- channel model [32, 33, 34], the uplink channel at frame can be represented as
[TABLE]
By letting , the channel frequency response vector can be expressed as
[TABLE]
at frame and subcarrier .
III-B System model
Let be an uplink training symbol at frame and subcarrier with , and be a circularly symmetric Gaussian noise. The received signal vector at each frame and subcarrier can be represented as
[TABLE]
Let and be an analog combining matrix and a digital baseband processing matrix. Similar to a sensing matrix in prior CS-based channel estimation work [35, 36], we assume that the elements of have random phases with a unit amplitude. Let denote the hybrid combining matrix as . After combining with the hybrid combiner and multiplying by , the baseband received signal vector becomes
[TABLE]
where denotes the signal part of , and denotes the noise part. From the viewpoint of the estimator at baseband, the effective noise becomes colored for an arbitrary hybrid combiner . It is possible to whiten the effective noise by using the baseband combiner . With this choice of , the hybrid combiner satisfies for any . We will use this unitary hybrid combiner throughout this paper.
III-C Tensor representation of channels and received signals
In this subsection, we show that the time-varying frequency-selective channel in (9) can be represented as a low-rank third-order tensor. Let , , and for . With , , and , let us define , , and as , , and .
The channel frequency response vectors for and in (9) can be regarded as mode-1 fibers in a third-order tensor as shown in Fig. 1. In CPD form, the rank- third-order tensor is
[TABLE]
In this CPD form in (12), the channel tensor is factorized into the three matrices, , and , which are called factor matrices. The mode-1 factor matrix is associated with antennas in the space domain, the mode-2 factor matrix with subcarriers in the frequency domain, and the mode-3 factor matrix with frames in the time domain.
From the channel frequency response tensor model, the received signal at baseband, in (11) can also be represented as a third-order tensor as
[TABLE]
where is the noise tensor whose mode-1 fibers are IID Gaussian vectors with . The signal tensor is given by
[TABLE]
The tensor representation in (14) can also be expressed by using a mode-1 matricization as where is the mode-1 matricization of shown in Fig. 1. Let , which can be regarded as the effective array response matrices from the viewpoint of the baseband estimator. The CPD of is given by
[TABLE]
Note that both the time-varying frequency-selective channel and its associated baseband received signal part are third-order tensors with rank .
IV Spatial channel covariance estimation based on tensor decomposition
The spatial channel covariance matrix can be estimated from the sample covariance of mode-1 fibers in . Since we assume that the elements in have zero mean, the sample covariance of the mode-1 fibers becomes
[TABLE]
The goal of the spatial channel covariance estimation for the hybrid architecture is to calculate of from the baseband received signal , which has smaller dimensions than . In this section, we propose an estimation method that has three steps. In the first step, the measurement tensor is decomposed into three factor matrices: , , and in a CPD form. Note that the three factor matrices obtained in the first step are strongly related to the actual factor matrices , , and , but are not identical. The relationship between these matrices is further explained in Section IV-A. In the second step, the estimate of the actual factor matrix of the channel tensor is obtained from , which is denoted by . The spatial channel covariance matrix is calculated from , , and in the last step. Each step is explained in detail in the following subsections.
IV-A First step: factorization of in a CPD form
If the factor matrices , , and are given, we can exactly calculate the signal part of the measurement tensor . The reverse process does not hold in general; the perfect reconstruction of the original , , and from any are not guaranteed. There is, however, a special case where the factor matrices can be reconstructed from . If the tensor rank of a higher-order (more than second-order) is low, its CPD is unique under some mild constraints [28]. The uniqueness of the CPD means that there exists only one possible combination of rank-one tensors that sum to the given tensor subject to two types of indeterminacy: scaling and permutation. The scaling indeterminacy means that the columns in each factor matrix can be scaled arbitrarily, i.e., the CPD form in (15) can be rewritten as
[TABLE]
as long as for . The CPD form in (17) can be expressed in a multilinear product format as
[TABLE]
where , , and are any diagonal matrices satisfying . The permutation indeterminacy means that the column vectors in each factor matrix can be reordered with a permutation matrix that is common all the modes, i.e., the CPD form in (15) can also be represented as
[TABLE]
for any permutation matrix . From (18) and (19), the general form of the CPD of becomes
[TABLE]
If we set the indeterminacy issue aside, we can exactly reconstruct its factor matrices by leveraging the uniqueness of the CPD. In this subsection, we focus on how to find a CPD solution. We will discuss how to deal with the indeterminacy in the following subsections.
Given a received signal tensor , the problem of finding its CPD form is formulated as
[TABLE]
There are many known algorithms to solve (21) for CPD. One algorithm to compute the CPD problem in (21) is the alternating least squares (ALS) [27]. By rewriting the objective function in (21) in matrix form as
[TABLE]
the ALS algorithm first finds the mode-1 factor matrix assuming that the mode-2 and mode-3 factor matrices, and are fixed. This subproblem is formulated as
[TABLE]
The solution to (23) can be found by using the least squares algorithm as
[TABLE]
Similar to (23), the mode-2 factor matrix and the mode-3 factor matrix can be calculated by fixing other factor matrices except for its own factor matrix. The ALS algorithm iterates the three steps until the objective function converges. The convergence is guaranteed although the converged solution may not be a global optimum.
The solution after convergence provides an estimate of the CPD form of as
[TABLE]
Note that is an estimate of the actual in (20). Due to the scaling and permutation indeterminacy, the estimated factor matrices , , and obtained in the first step are related to the actual factor matrices , , and as
[TABLE]
where , , and are complex-valued diagonal matrices that satisfy , and , , and denote the estimation errors caused by CPD.
IV-B Second step: estimation of of the channel tensor
The goal of the second step is to estimate from that is obtained in the first step as described in Section IV-A. Let . We will show in Section IV-C that we do not need to obtain explicitly to estimate the spatial channel covariance. Let denote the -th column in the estimate of . The problem of finding that minimizes the angle between and is represented as
[TABLE]
and its solution can be found by one-dimensional search methods with respect to , which is a continuous variable. Since we assume ULA, the solution can be obtained more efficiently by using a polynomial equation similar to the Root-MUSIC algorithm [37]. By letting , the array response vector can be denoted by . Then, the optimization problem in (27) is rewritten as
[TABLE]
Let . The numerator in (28) is represented as a polynomial with respect to and becomes zero in the noiseless case, i.e.,
[TABLE]
Note that if is a root of (29), then is also its root, and there are roots within the unit circle. Let denote the roots normalized by their amplitudes. Then, the solution to (28) can be obtained by searching over that has discrete elements, i.e., . After obtaining , the diagonal elements in can be estimated as
[TABLE]
Let and denote the estimate of and . Then, and can be represented as and . Note that is the estimate of , in which the permutation matrix is not known.
IV-C Third step: estimation of the spatial channel covariance matrix from , , and
While the estimate of is given by with only permutation indeterminacy at the second step in Section IV-B, the estimated factor matrices and at the first step in Section IV-A still contain both scaling and permutation indeterminacy. Consequently, it is impossible to simply replace and by and in (16) without considering , , and .
Let , and denote the estimate of the actual , and without any indeterminacy, which are defined as , and . Note that for any permutation matrix . Then, the estimate of the sample spatial channel covariance matrix in (16) can be calculated from , , and as
[TABLE]
where comes from the fact that , and is the estimate of .
V Relative precoding efficiency for spatial channel covariance estimation
The MSE or normalized MSE (NMSE) is typically used as a performance metric for channel estimation methods. Other metrics, though, are more relevant for spatial channel covariance estimation. This is because the dominant eigenvalues and their eigenspaces are more useful for hybrid precoding rather than each element in the covariance matrix. In this regard, we adopt the performance metric used in [30, 22], which we call relative precoding efficiency (RPE). Let and be the spatial channel covariance and its estimate. Let and denote the matrices composed of the dominant eigenvectors of and . The RPE is defined as
[TABLE]
This metric lies between zero and one, i.e., , and higher indicates more accurate estimation. The RPE in (32) is closely related to the relative spectral efficiency of the hybrid beamforming based on compared to that of the hybrid beamforming based on . Consider a hybrid beamforming system where the analog part is composed of or with RF chains as in [9, 10]. For analytical tractability, we ignore the fact that phase shifter are typically used for the analog part. At low SNR region, the achievable rate ratio approximates to
[TABLE]
where comes from the fact that for , and comes from and . This result shows that the metric allows us to anticipate how much relative loss will be caused by the estimation error in terms of achievable rate at low SNR region.
The RPE defined in (32) can be analyzed approximately in large antenna array regimes. Let be a matrix composed of array response vectors and be the covariance of channel path gains. The spatial channel covariance matrix is represented as
[TABLE]
As becomes large, , i.e., becomes semi-unitary asymptotically [38]. If we assume that are IID complex random variables with zero mean and variance for analytical tractability, then (34) can be regarded as the approximate eigenvalue decomposition of , i.e., . Let be the AoA estimation error and be the estimated AoA for the -th path. Similar to , we assume that approximates to as increases. Then, the RPE becomes
[TABLE]
where we assume that is small and for .
Let be defined as
[TABLE]
which approximates to for small . In the ULA case, in (35) is given by
[TABLE]
where comes from the second-order approximation of Maclaurin series for small . Consequently, approximately becomes
[TABLE]
where the CRLB of the estimation will be derived in the following section.
VI Cramér-Rao Lower Bound for the AoA estimation
In this section, we derive the CRLB of the MSE of AoAs. The basic tool to derive CRLB is based on the method in [39] and [40]. In [39], all the factor matrices are non-structure matrices, i.e., a factor matrix is determined by complex-valued variables and has no specific structure. In [40], all the factor matrices have a special structure and are determined by only real-valued variables. In this paper, we derive CRLB in the case where the tensor has the combination of two structured factor matrices, and , and one unstructured factor matrix . We also simplify the complicated CRLB expression to a more compact form.
Focusing on the fact that the channel tensor is determined by AoAs, path delays, and time-varying channel path gains, we define three parameter vectors as , , and . Let denote a column vector that includes all the parameters such that . Note that is a complex vector while and are real vectors.
Since the analog combining matrix combined with the baseband post-processing matrix is unitary, the elements in the noise tensor become IID circularly symmetric Gaussian with . Consequently, the log-likelihood function of is given by
[TABLE]
Then, the CRLB with respect to the parameter set can be obtained as
[TABLE]
where is the complex Fisher information matrix (FIM) defined as
[TABLE]
The FIM in (41) is divided into submatrices as
[TABLE]
Each submatrix in (42) is calculated in the following subsections.
VI-A Calculation of
The partial derivative of with respect to is given by
[TABLE]
Let be defined as
[TABLE]
and let be the mode-1 unfolding matrix of .
By using (43) and (44), the partial derivative of with respect to can be represented as
[TABLE]
Let and . Then, can be represented as
[TABLE]
by using the fact that when and have the same number of columns.
Since we consider IID zero mean circularly symmetric complex Gaussian noise, the covariance matrix and the pseudo-covariance matrix of in (46) become
[TABLE]
and
[TABLE]
From (47) and (48), the covariance matrix of the complex vector becomes
[TABLE]
and the pseudo-covariance matrix becomes . Consequently, the submatrix becomes
[TABLE]
VI-B Calculation of
Let be a vector whose -th element is defined as
[TABLE]
where is the first derivative of . Then, the partial derivative of with respect to can be obtained as
[TABLE]
where . Let be the mode-2 unfolding matrix of , and . Let denote
[TABLE]
Similarly to Section VI-A, the submatrix can be calculated as
[TABLE]
VI-C Calculation of , , and
From the fact that , the submatrix becomes
[TABLE]
Since , the submatrix is given by
[TABLE]
The submatrix can be obtained from such that .
VI-D Calculation of
The submatrix is given by
[TABLE]
where and .
To calculate , let us first start with calculating the cross-covariance matrix of and , which are associated with the mode-1 and mode-2 unfolding matrix of . Let be the -th unit coordinate vector and . Using the fact that is expressed in different ways as
[TABLE]
the cross-covariance matrix can be represented as
[TABLE]
Consequently, is a matrix that has only nonzero elements whose amplitudes are equal to . From (59), the cross-covariance matrix of and can be expressed as
[TABLE]
Since , the pseudo-cross-covariance matrix of and becomes
[TABLE]
From (60) and (61), the submatrix in (57) can be rewritten as
[TABLE]
VI-E Calculation of and
The submatrix is expressed as
[TABLE]
where and
[TABLE]
By using (64), we can further simplify in (63) as
[TABLE]
The submatrix is expressed as .
VI-F Calculation of and
Similar to Section VI-E, we can obtain and as
[TABLE]
and .
VI-G CRLB for the estimation
The results of the preceding subsections are summarized as
[TABLE]
Letting and
[TABLE]
the CRLB for the estimation can be expressed in a compact form as
[TABLE]
by using the Schur complement and the matrix inversion lemma.
VII Comparison with CS-based or MUSIC-based methods
In this section, we explain two other approaches that estimate spatial channel covariance or subspace for comparison:
- CS-based methods and 2) MUSIC-based methods.
VII-A Prior work based on CS
The channel frequency response vector in (9) can be represented by using a matrix form as
[TABLE]
where and . Let be a dictionary matrix whose columns are composed of the array response vectors associated with a predefined set of AoAs. In the CS framework, the channel model in (70) is rewritten as
[TABLE]
where and are sparse column vectors with nonzero elements of and in the space domain. The positions of the nonzero elements indicate AoAs, and thus and share the same support for all and . To exploit the joint sparsity of and , we can use CS techniques known as multiple measurement vector (MMV) problems instead of conventional single measurement vector (SMV) problems [19, 20, 21]. While simultaneous orthogonal matching pursuit (SOMP) is known as an adequate algorithm for the general MMV problems, a more advanced CS algorithm was proposed for the spatial channel covariance estimation problem in [22]. We will compare the algorithm in [22] to our tensor-based method.
VII-B Prior work based on MUSIC
In conventional fully-digital architectures, the goal of the MUSIC algorithm is to estimate AoAs from the spatial channel covariance matrix. In other words, the covariance must be known prior to applying the MUSIC algorithms. Note that the spatial channel covariance can be estimated from the covariance of the received signal vectors in fully-digital architectures. Although the spatial channel covariance estimation is not straightforward in the hybrid architectures, the MUSIC algorithm can be applied to the subspace estimation problem for hybrid architectures. It is worthwhile to note that only the subspace can be estimated and the covariance cannot be estimated by using the MUSIC-based approach. Since the subspace is enough for the hybrid precoder design in some cases as in SU-MIMO systems, we will compare our proposed work with the MUSIC-based method in terms of subspace estimation. The overall process for the MUSIC-based method is composed of three steps. First, the sample covariance of the baseband received signal vectors in (11) is estimated for all and as
[TABLE]
where . Let the SVD of be
[TABLE]
where is the signal subspace and is the subspace orthogonal to the signal subspace. Let . Since the RF chains can be regarded as the effective antennas from the viewpoint of the estimator at baseband, the vector can be considered as the effective array response vector with a reduced size. The second step is to find the highest peaks of the function of defined as
[TABLE]
The final step is to reconstruct the subspace of the channel by using the s for that are obtained from the subspace of the baseband received signals. The subspace of the channel is given by the subspace of .
The CRLB for s in the MUSIC-based method case is given by [41]
[TABLE]
where is defined in (44) and is defined as . The bound of the RPE in the MUSIC-based method case can also be obtained from (38) as in the tensor-based method case.
VIII Simulation results
In this section, we numerically evaluate the CRLB analysis in Section VI. We also present simulation results to demonstrate the performance of the proposed spatial channel covariance estimation algorithms based on CPD of higher-order tensors.
VIII-A Analytical results on CRLB
In Fig. 2, we show the MSE of the estimation of . We also compare the MSE results to the CRLB derived in Section VI for , , , , and . We assume that the path gains ’s are generated from and . Since CRLB depends on the deterministic value of AoA and path delays, we set the values as in degrees and for the purpose of reproduction. We can see that the proposed method achieves the MSE that is close to its theoretical lower bound at moderate and high SNR region.
Fig. 2(a) compares the proposed tensor-based method with the MUSIC-based method in terms of the MSE(). In addition to numerical results, the analytical results indicate the superiority of the tensor-based method over the MUSIC-based method. The metric is plotted in Fig. 2(b) with the lower bound of its approximation derived in Section V. As shown in Section V, the RPE is closely related to MSE() and its CRLB.
Fig. 3 shows the relationship between the CRLB of MSE() and other system design parameters such as , , and as well as SNR. Unlike the relationship between CRLB() and SNR as shown in Fig. 3(d), which is a linear relationship in dB, other parameters impact less on the CRLB as the parameter values increase. As shown in Fig. 3(c), the CRLB of the proposed method approaches that of the MUSIC as increases.
VIII-B *Performance evaluation of the spatial channel covariance estimation methods *
In this subsection, we evaluate the performance of the spatial channel covariance estimation in terms of the RPE for and . Unlike Section VIII-A, the AoA s are uniformly distributed in , and the normalized delay s are uniformly distributed in , where the cyclic prefix length is set to . The path gains ’s are IID complex Gaussian random variables as and .
Fig. 4 compares the proposed method with the two other methods explained in Section VII in terms of the RPE when . The two figures, Fig. 4(a), and Fig. 4(b), show the comparison at different SNR values: 0 dB and -10 dB. In each figure, we compare the methods for different values: 2, 3, and 4. It is worthwhile to note that the MUSIC-based method does not work properly if because in (74) must have at least one column. Even when , the figure shows that the performance degradation of the MUSIC-based method is more severe than other two methods as approaches . In addition, as SNR becomes low, the RPE of the MUSIC-based method rapidly decreases compared to that of the tensor-based method. The CS-based method outperforms the MUSIC-based method when relative to is large, but the method shows poor performance at very low SNR. The proposed method based on the higher-tensor CPD shows the best performance in most cases, in particular when is not so small and SNR is low. Fig. 5 shows the results when , and the results have the same trend as in Fig. 4 where .
Fig. 6 shows the dependency of the RPE on the number of channel paths for each method. Fig. 6(a) reveals that the performance of the MUSIC-based method rapidly decreases after becomes larger than six. In Fig. 6(b), the number of frames is fixed at 20, and the RPEs are compared for different SNR values. For SNR 0 dB and 10 dB, the CS-based method has as a high RPE as the tensor-based method, but its performance is significantly degraded as SNR goes to -10 dB. For any values, we can observe that the proposed tensor-based method has a reasonably high RPE.
Until now, we assumed that the channel has only channel paths as in (8). To evaluate performance for more realistic channels, we consider a clustered channel model [34] that has multiple clusters with multiple subrays as
[TABLE]
The channel has clusters whose AoAs are uniformly distributed in , and each cluster has subrays whose AoA offsets are Laplacian distributed with angular spread . All subrays within a cluster are assumed to have the same delay. Although the rank of the channel tensor becomes high in general, we can approximate the channel tensor to be a low-rank tensor for spatially sparse channels. Fig. 7 shows the RPE results when , , , , and , or . Instead of using the actual number of channel paths , the methods based on CPD and CS use for its low-rank (or sparse) approximation while the MUSIC-based method uses due to its inherent limitation of using . The figure shows that, although the multiple subrays result in performance loss compared to the single subray case, the proposed method still works properly even in this case.
IX Conclusions
In this paper, we proposed a spatial channel covariance estimation method for the hybrid analog/digital architecture over time-varying frequency-selective channels. Leveraging the fact that a low-rank higher-order tensor can be uniquely decomposed into factor matrices in each domain, we formulated the estimation problem by using high-order tensors and proposed a solution that achieves performance close to its theoretical bound. We also derived the CRLB of the proposed method and showed that compared it is lower than the CRLB of MUSIC-based approach. Numerical results showed that our proposed work outperforms the MUSIC-based work and the CS-based work. The results also showed that our proposed work has a more significant gain in the low SNR regime and the performance degradation caused by the increase in the number of channel paths is less severe than prior work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. Wireless Commun. , vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
- 2[2] W. Roh, J. Seol, J. Park, B. Lee, J. Lee, Y. Kim, J. Cho, K. Cheun, and F. Aryanfar, “Millimeter-wave beamforming as an enabling technology for 5G cellular communications: theoretical feasibility and prototype results,” IEEE Commun. Mag. , vol. 52, no. 2, pp. 106–113, Feb. 2014.
- 3[3] R. Heath, N. Gonz a ´ ´ a \acute{\textrm{a}} lez-Prelcic, S. Rangan, W. Roh, and A. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process. , vol. 10, no. 3, pp. 436–453, Apr. 2016.
- 4[4] F. Sohrabi and W. Yu, “Hybrid digital and analog beamforming design for large-scale antenna arrays,” IEEE J. Sel. Topics Signal Process. , vol. 10, no. 3, pp. 501–513, Apr. 2016.
- 5[5] X. Zhang, A. Molisch, and S. Kung, “Variable-phase-shift-based RF-baseband codesign for MIMO antenna selection,” IEEE Trans. Signal Process. , vol. 53, no. 11, pp. 4091–4103, Nov. 2005.
- 6[6] V. Venkateswaran and A. J. van der Veen, “Analog beamforming in MIMO communications with phase shift networks and online channel estimation,” IEEE Trans. Signal Process. , vol. 58, no. 8, pp. 4131–4143, Aug. 2010.
- 7[7] A. Adhikary, J. Nam, J. Ahn, and G. Caire, “Joint spatial division and multiplexing: The large-scale array regime,” IEEE Trans. Inf. Theory , vol. 59, no. 10, pp. 6441–6463, Oct. 2013.
- 8[8] J. González-Coma, J. Rodríguez-Fernández, N. González-Prelcic, L. Castedo, and R. Heath, “Channel estimation and hybrid precoding for frequency selective multiuser mmwave MIMO systems,” IEEE J. Sel. Topics Signal Process. , vol. 12, no. 2, pp. 353–367, May 2018.
