Block- and Rank-Sparse Recovery for Direction Finding in Partly Calibrated Arrays
Christian Steffens, Marius Pesavento

TL;DR
This paper introduces a novel sparse recovery method for direction finding in partly calibrated arrays, leveraging block-sparsity and low-rank structures, with improved performance over existing methods in challenging scenarios.
Contribution
The paper presents a new technique combining nuclear norm and 1 norm minimization for direction finding in partly calibrated arrays, including a reformulation for efficient implementation and a gridless extension.
Findings
Outperforms the RARE method in low SNR scenarios
Effective with low sample numbers and correlated signals
Applicable to arbitrary subarray topologies
Abstract
A sparse recovery approach for direction finding in partly calibrated arrays composed of subarrays with unknown displacements is introduced. The proposed method is based on mixed nuclear norm and 1 norm minimization and exploits block-sparsity and low-rank structure in the signal model. For efficient implementation a compact equivalent problem reformulation is presented. The new technique is applicable to subarrays of arbitrary topologies and grid-based sampling of the subarray manifolds. In the special case of subarrays with a common baseline our new technique admits extension to a gridless implementation. As shown by simulations, our new block- and rank-sparse direction finding technique for partly calibrated arrays outperforms the state of the art method RARE in difficult scenarios of low sample numbers, low signal-to-noise ratio or correlated signals.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Block- and Rank-Sparse Recovery for Direction Finding
in Partly Calibrated Arrays
Christian Steffens and Marius Pesavento
Communication Systems Group
Darmstadt University of Technology, Germany
e-mail: {steffens, pesavento}@nt.tu-darmstadt.de
(February 2017)
Abstract
A sparse recovery approach for direction finding in partly calibrated arrays composed of subarrays with unknown displacements is introduced. The proposed method is based on mixed nuclear norm and norm minimization and exploits block-sparsity and low-rank structure in the signal model. For efficient implementation a compact equivalent problem reformulation is presented. The new technique is applicable to subarrays of arbitrary topologies and grid-based sampling of the subarray manifolds. In the special case of subarrays with a common baseline our new technique admits extension to a gridless implementation. As shown by simulations, our new block- and rank-sparse direction finding technique for partly calibrated arrays outperforms the state of the art method RARE in difficult scenarios of low sample numbers, low signal-to-noise ratio or correlated signals.
1 Introduction
Direction finding with sensor arrays has applications in various fields of signal processing such as wireless communications, radar, sonar or astronomy. In these types of applications it is desired to achieve a high angular resolution and to identify a large number of sources. This can be achieved by sensor arrays with a large aperture and a large number of sensors [1]. However, a large aperture size makes it difficult to achieve and maintain precise array calibration. Possible reasons for imperfect calibration are inaccuracies in the sensor positions, timing synchronization errors, or other unknown gain and phase offsets among sensors [2, 3, 4, 5, 6, 7, 8]. Standard approaches to this problem usually rely on either offline or online calibration. Offline calibration of the overall array is performed using reference sources at known positions and can easily become a challenging and time consuming task [2]. Alternatively, several online calibration techniques have been proposed which use calibration sources at unknown positions [3, 4, 5, 6, 7], but the complexity of these techniques is prohibitively high, and performance can be severely limited in the case of large sensor position errors [7]. Moreover, these techniques cannot be employed in scenarios with imperfect time synchronization of sensors or other unknown sensor gain and phase offsets.
One way to overcome the calibration problem is to partition the overall array into smaller subarrays which are themselves comparably easy to calibrate. This type of array is referred to as partly calibrated array (PCA) and has received considerable interest in recent years. Generally, direction finding approaches for this type of arrays can be classified into non-coherent and coherent methods. In the non-coherent case the subarrays independently perform estimation of the directions of arrival (DoAs) or the signal covariance matrix to communicate these estimates to a central processor, where further processing is performed to achieve an improved joint estimate [9, 10, 11, 12, 13]. In the coherent approach parameter estimation is performed based on joint coherent processing of all available sensor measurements, e.g., by computing a global sample covariance matrix, and imperfect calibration among the different subarrays is taken account of in the estimation process [14, 15, 16, 17, 18, 19, 8, 20, 21]. In this work we consider the latter of the two approaches.
A prominent class of DoA estimation methods is based on subspace separation. In [14, 15, 16, 17, 18] the authors consider PCAs composed of multiple identical subarrays. Such types of array exhibit multiple shift invariances and methods such as the multiple invariant MUSIC and MODE [14] or multiple invariance ESPRIT [16, 17, 18] can be used to provide DoA estimates in a search-free fashion. In [19] it is assumed that the PCA is composed of identically oriented linear arrays that can be transformed to a uniform linear array by linear translations of the subarrays. The authors present the root-RARE algorithm which admits search-free DoA estimation. The root-RARE method in [19] was modified to the spectral RARE in [8] which admits application to arbitrary array topologies at the cost of increased computational complexity. Subspace-based methods are well investigated and are shown to asymptotically achieve an estimation performance close to the Cramér-Rao bound at low computational complexity. However, these subspace-based methods often have difficulties in certain practical scenarios. First, correlated source signals, e.g., in multipath environments, can significantly reduce the estimation performance. Second, subspace-based methods yield poor performance in the case of low number of snapshots and low signal-to-noise ratio, e.g., in fast changing environments.
Recently, sparse recovery (SR) methods came into focus of DoA estimation studies. As reported in [22, 23], SR methods provide high-resolution parameter estimation performance without the aforementioned shortcomings of subspace-based methods. Moreover, SR methods are computationally tractable since they can be formulated as convex optimization problems. While classical SR methods aim at recovering sparse signal vectors and admit grid-based parameter estimation [24, 25], the special case of fully calibrated arrays (FCAs) of uniform linear topology with possibly missing sensors allows for gridless SR methods as proposed in [26, 27, 28]. In [12, 13] the authors propose grid-based and gridless SR methods applicable for non-coherent processing in PCAs, where joint sparsity in the subarray signal representations is exploited. SR methods for coherent processing in PCAs have been presented in citesteffens2017shiftinvariance, 6882328. The method in [20] is based on the recently proposed SPARROW formulation [23] and exploits multiple shift-invariances in PCAs composed of identical subarrays to provide gridless parameter estimation. In [21] the well-known mixed-norm minimization approach [22] for FCAs is generalized to grid-based SR in PCAs of arbitrary topology by means of a mixed nuclear norm [29, 30] and norm, termed here as mixed-norm. As shown by numerical experiments [21], mixed-norm minimization clearly outperforms the spectral RARE [8] in frequency resolution performance for low signal-to-noise ratio and low number of snapshots.
In this paper we consider the mixed-norm minimization problem proposed in [21] and derive an equivalent compact reformulation, termed as COmpact Block- and RAnk-Sparse recovery (COBRAS). The COBRAS formulation has a reduced number of optimization parameters as compared to the original mixed-norm minimization problem and we provide efficient implementations of the COBRAS formulation by means of semidefinite programming (SDP). While the SDP implementation is based on grid-based sampling of the subarray manifolds and applicable to arbitrary array topologies, we furthermore present a search-free implementation of our COBRAS formulation for the special case of linear subarrays with a common baseline. We show by extensive numerical experiments that the COBRAS approach outperforms the state of the art methods in difficult scenarios. In summary, our main contributions are given as:
- •
We introduce a sparse recovery approach for coherent processing in PCAs using mixed-norm minimization.
- •
We derive a compact reformulation of the mixed-norm minimization problem, termed as COBRAS.
- •
We develop a computationally efficient grid-based SDP implementation of the COBRAS formulation for arbitrary array topologies, and
- •
an efficient gridless SDP implementation of the COBRAS formulation for PCAs composed of subarrays with a common baseline.
The paper is organized as follows: Section 2 introduces the PCA signal model. The and mixed-norm minimization problems for FCAs and PCAs are discussed in Section 3. The COBRAS formulation is derived in Section 4 while grid-based and gridless SDP implementations are provided in Sections 5 and 6. Numerical results are presented in Section 7 before the paper is concluded in Section 8.
Notation: Boldface uppercase letters denote matrices, boldface lowercase letters denote column vectors, and regular letters denote scalars, with j denoting the imaginary unit. Superscripts and denote transpose and conjugate transpose of a matrix , respectively. The term denotes the set of positive semidefinite block-diagonal matrices composed of blocks of size on the main diagonal . We write to indicate the element in the th row and th column of matrix . The statistical expectation of a random variable is denoted as , and the trace of a matrix is referred to as . The Frobenius norm and the mixed-norm of a matrix are referred to as and , respectively, while the norm of a vector is denoted as . The term denotes a diagonal matrix with the elements on its main diagonal while denotes a block-diagonal matrix composed of submatrices on its main block-diagonal.
2 Signal Model
Consider a linear array of arbitrary topology, composed of omnidirectional sensors, as depicted in Figure 1. Assume the overall array is partitioned into subarrays with sensors in subarray , for , such that . We define as the vector containing the unknown inter-subarray displacements expressed in half signal wavelength and relative to the first subarray, i.e., . Furthermore, let , for , , denote the perfectly known intra-subarray position of the th sensor of subarray relative to the first sensor in the subarray, hence , and expressed in half signal wavelength. Consequently, the position of sensor in subarray , relative to the first sensor in the first subarray, can be expressed as
[TABLE]
for and .
Moreover, assume a number of narrowband and far-field sources are illuminating the sensor array from angular directions , as illustrated in Figure 1. The corresponding spatial frequencies are defined as , for , and comprise the vector . A total of signal snapshots are obtained at the output of each subarray and collected in the subarray measurement matrix , for , where denotes the output of the th sensor in the th subarray at time instant . The subarray measurement matrices are collected in the array measurement matrix , which is modeled as
[TABLE]
where is the source signal matrix and denotes a spatio-temporal white Gaussian sensor noise matrix. The array steering matrix in (2) is given by
[TABLE]
and represents the response of the entire array, where denotes the steering vector for spatial frequency and subarray displacements . Based on the sensor position definition in (1), the array steering vectors can be factorized as
[TABLE]
where the block-diagonal matrix
[TABLE]
contains the perfectly known subarray steering vectors
[TABLE]
for , on its diagonal, and the vector
[TABLE]
takes account of the subarray displacement shifts , for , depending on the spatial frequencies in and the subarray displacements in , and further unknown shifts , e.g., gain/phase or timing offsets among the subarrays [8]. In relation to (3), let us define the matrix
[TABLE]
containing all subarray responses for the spatial frequencies in , and the block-diagonal matrix
[TABLE]
composed of the subarray shift vectors in (7). Using (8) and (9), the overall array steering matrix (3) can be factorized as
[TABLE]
such that the overall array measurement matrix in (2) is equivalently modeled as
[TABLE]
which forms the basis for the mixed-norm minimization problem discussed in the following section.
3 State-of-the-Art
In this section we will shortly review the mixed-norm minimization approach for FCAs before turning to the mixed-norm minimization approach for PCAs.
3.1 Fully Calibrated Array
We first consider the case of an FCA where the subarray displacements in are perfectly known. Based on the signal model in (2) we introduce a sparse representation of the measurement matrix as
[TABLE]
The overcomplete dictionary matrix is obtained by sampling the field-of-view in spatial frequencies . For ease of presentation we assume that the frequency grid is sufficiently fine, such that the true frequencies in are contained in the frequency grid , i.e., . In Section 6 we present an extension of our proposed formulation for subarrays with a common baseline which does not rely on the on-grid assumption. The sparse signal matrix in (12) contains elements
[TABLE]
for , . Thus exhibits a row-sparse structure, i.e., the elements in a row of are either jointly zero or primarily non-zero. Based on the sparse representation (12), the frequency estimation problem can be formulated as the mixed-norm minimization problem
[TABLE]
where is a regularization parameter determining the sparsity, i.e., the number of non-zero rows in the minimizer . Row-sparsity is enforced by minimizing the mixed-norm in (14), which is defined as the number of non-zero rows of the matrix , i.e., the cardinality of the union support set according to
[TABLE]
Since the problem in (14) is NP-hard, several approximation methods have been proposed in the literature, including convex relaxation to the well-known mixed-norm minimization problem [31, 32, 22, 23]
[TABLE]
The mixed-norm in (16) is defined as
[TABLE]
and induces a non-linear coupling among the elements in each row , , of the matrix such that the norm, i.e., the nonnegative summation, is performed on the norms of the rows in . Given a minimizer of (16), the frequency estimation problem reduces to finding the local maxima in the vector of the signal row-norms and assigning the corresponding frequency grid points to the set of estimated frequencies.
For the PCA case with uncertain array response due to the unknown displacements in , the mixed-norm minimization approach in (16) cannot be applied and a more sophisticated approach has to be devised, as discussed in the following subsection.
3.2 Partly Calibrated Array
Analogous to the FCA case in (12), we introduce a sparse representation of the signal model in (11) for the PCA case as
[TABLE]
where the row-sparse matrix is defined similarly as for the FCA case in (13). Furthermore, the overcomplete subarray dictionary matrix and the overcomplete subarray shift matrix are defined in correspondence to (8) and (9), respectively.
In the PCA case, the inter-subarray displacements in are unknown and thus represent additional estimation variables, hence the subarray shifts in , which depend on the spatial frequencies in and the subarray displacements , have to be appropriately included in the sparse estimation problem. To this end we introduce a model that couples among the variables in the rows of and the subarray shifts in , for . We define the extended signal matrix as
[TABLE]
containing the products of the subarray shifts and the signal waveforms. Note that in this formulation the number of the unknown complex-valued signal variables is increased to elements in the matrix , as compared to the total complex-valued unknowns in both and . On the other hand, due to the block structure of the subarray shift matrix as defined in (9), the matrix in (19) enjoys a special structure as it is composed of stacked rank-one matrices
[TABLE]
Using the formulation in (19), the sparse representation for the PCA case in (18) is equivalently described by
[TABLE]
where for ease of presentation we use to denote the dictionary matrix in (21) and throughout the paper. An SR approach to take account of the special structure of the signal matrix in (19) is given as
[TABLE]
The formulation in (22) takes twofold advantage of the sparsity assumption. First, minimization of the rank-terms encourages low-rank blocks in the minimizer . Second, minimizing the sum-of-ranks provides a block-sparse structure of , i.e., the elements in each block , for , are either jointly zero or primarily non-zero. However, the problem in (22) is NP-hard and computationally intractable.
The nuclear norm represents a tight convex approximation of the rank function and it has been successfully applied in a variety of rank minimization problems [29, 30]. The definition of the nuclear norm is given as
[TABLE]
where and is the th singular value of . Along these lines it has been proposed in [21] to approximate the sparse estimation problem (22) by the following convex minimization problem
[TABLE]
where denotes the mixed-norm, computed as
[TABLE]
Similar to (22), the problem in (24) motivates low-rank blocks and a block-sparse structure in the minimizer . Note that the PCA formulations in (22) and (24) reduce to the FCA formulations in (14) and (16), respectively, in the case of a single subarray, i.e., .
Performing singular value decomposition on the matrix blocks in , i.e.,
[TABLE]
the signal waveform and subarray shifts corresponding to the spatial frequency can be recovered according to
[TABLE]
The left and right singular vectors and in (27) correspond to the largest singular value of and normalization to the first element of in (27) is performed to take account of the structure of the subarray shift vectors according to (7).
While the mixed-norm minimization problem in (24) provides a tractable approach for SR in PCAs, it suffers from high computational complexity in the case of a large number of snapshots and grid points . To overcome this difficulty we provide in the following section a compact reformulation of problem (24).
4 Compact Block- and Rank-Sparse Recovery
One of the main results of this paper is formulated in the following theorem:
Theorem 1** (Problem Equivalence).**
The block- and rank-sparsity inducing mixed-norm minimization problem
[TABLE]
is equivalent to the convex problem
[TABLE]
with and denoting the sample covariance matrix and the set of positive semidefinite block-diagonal matrices composed of blocks of size , respectively. The equivalence holds in the sense that a minimizer for problem (28) can be factorized as
[TABLE]
where is a minimizer for problem (29).
A proof of the equivalence is provided in Appendix Acknowledgment, while a proof of the convexity of (29) is provided in Section 5.2 by establishing equivalence to a semidefinite program.
In addition to relation (30), it can be shown (see Appendix Acknowledgment) that a minimizer of (29) relates to the signal matrix according to
[TABLE]
for , such that the block-support of is equivalently represented by the block-support of the matrix . Similarly, the rank of the matrix blocks is equivalently represented by the matrix blocks , i.e., , for .
We observe that the problem in (29) only relies on the measurement matrix through the sample covariance matrix , leading to a significantly reduced problem size, especially in the case of large number of snapshots . In this context we term the formulation in (29) as COmpact Block- and RAnk-Sparse recovery (COBRAS). The compact formulation (29) contains real-valued optimization parameters in the positive semidefinite matrix , as opposed to the real-valued optimization parameters in in problem (28). Consequently, in the case of a large number of snapshots , the reformulation (29) has reduced computational complexity as compared to (28).
5 Implementation of the COBRAS Formulation
The mixed-norm minimization problem (28) has been well investigated in literature and implementations based on the coordinate descent method [21], the STELA algorithm [33] and semidefinite programming (SDP) [34] have been proposed. Here we will shortly revise the SDP implementation of the mixed-norm minimization problem (28) to highlight the reduction in computational complexity obtained by employing the COBRAS formulation in (29).
5.1 SDP Form of the Mixed-Norm Minimization Problem
As discussed in [29], minimization of the nuclear norm
[TABLE]
for some convex set , can be expressed as the SDP
[TABLE]
where and are auxiliary variables of size and , respectively. The SDP formulation (33) admits simple implementation of the nuclear norm minimization problem using standard convex solvers, such as SeDuMi [35].
Based on the equivalence of (32) and (33), the mixed-norm minimization problem (28) can be equivalently formulated as
[TABLE]
Note that with the auxiliary variables in and , for , the problem (34) has real-valued optimization variables as opposed to the problem formulation in (28) which has real-valued optimization variables. For a large number of grid points or snapshots the SDP formulation (34) becomes intractable and alternative implementations are required as presented in the next subsection. We remark that the problem of large snapshot number has been addressed in previous literature by matching the signal subspace of the measurements instead of the measurements itself, see [22, 34], leading to a reduced number of effective signal snapshots, however at the expense of potential performance degradation, e.g., in the case of correlated source signals.
5.2 SDP Form of the COBRAS Method
In order to solve the COBRAS formulation in (29) by means of a tractable SDP which can be treated by standard convex solvers consider the following corollaries [36]:
Corollary 1**.**
The COBRAS formulation in (29) is equivalent to the convex semidefinite program
[TABLE]
where is a Hermitian matrix of size .
To see the equivalence between the two problems we note that is positive definite for any and consider the Schur complement of the constraint (35b)
[TABLE]
which implies
[TABLE]
Since in problem (35) is minimized, it can be proved by contradiction that the relation in (37) must hold with equality, proving the equivalence of (29) and (35).
Corollary 2**.**
The COBRAS formulation in (29) admits the equivalent problem formulation
[TABLE]
The proof to Corollary 2 follows similar arguments as in the proof of Corollary 1 and is therefore omitted here. In contrast to (35), the size of the semidefinite constraint in (38) is independent of the number of snapshots . It follows that either problem formulation (35) or (38) can be selected to solve (29), depending on the number of snapshots and the resulting size of the semidefinite constraint. The problems (35) and (38) have real-valued optimization variables in and additional or real-valued parameters in and , respectively. Thus, in the undersampled case it is preferable to use the SDP formulation in (35), while in the oversampled case it is preferable to apply the SDP formulation in (38). We remark that the subspace matching approach discussed in [22, 34] can be applied to formulation (35) as well. A further investigation of the subspace matching approach is, however, beyond the scope this paper. The various equivalent SDP implementations and the corresponding number of variables and constraints are listed in Table 1.
6 Gridless COBRAS Implementation
While the SDP formulations in Section 5 are applicable to arbitrary array topologies, we consider in the following the special case of linear subarrays with a common baseline, where the sensors within each subarray are located at integer multiples of a baseline , i.e., with for and . This type of array topologies admits the extension of the COBRAS formulation to gridless frequency estimation.
We start noting that strong duality holds for problem (38) and consider the Lagrange dual problem, which is given as
[TABLE]
where is an positive semidefinite matrix and is of size and does not exhibit specific structure. Complementary slackness requires that
[TABLE]
for , i.e., if then must be singular, such that
[TABLE]
Condition (41) indicates that instead of solving the primal problem (29) and identifying the block-support from , we can equivalently solve the dual problem (39) and identify the block-support from the roots of (41).
Let us consider the limiting case of an infinitesimal frequency grid spacing, i.e., for , such that the frequency becomes a continuous parameter . By introducing the variable
[TABLE]
the subarray steering matrices in (5) can be equivalently described as
[TABLE]
where the subarray steering vectors are given as
[TABLE]
with for and . By the definition in (43), the matrix product in constraint (39c) constitutes a trigonometric matrix polynomial of degree , according to
[TABLE]
with matrix coefficients of size [37]. In the continuous case the constraint (39c) is replaced by the constraint
[TABLE]
which provides an upper bound on the matrix polynomial (45) and can be implemented by semidefinite programming, e.g., by the problem formulation (84) derived in Appendix Appendix A - Proof of Problem Equivalence or by other techniques discussed in [37]. Once the continuous implementation of problem (39) is solved, the spatial frequencies can be recovered by finding the roots for which the left-hand side of (46) becomes singular, e.g., by rooting the continuous counterpart of (41) using the techniques discussed in [38].
6.1 Related Work
In a recent work [20] it has been shown that PCAs composed of identical subarrays admit gridless compressed sensing by means of the SPARROW formulation [23]. Interestingly, for the special case considered in this section of PCAs composed of linear subarrays with a common baseline (and possibly missing sensors in particular subarrays) the dual problem formulation (39) can equivalently be derived by means of the SPARROW formulation.
Let us consider the mixed-norm minimization problem for FCAs given in (16) which can equivalently be formulated as the SPARROW problem [23]
[TABLE]
where is a regularization parameter and is of size . Problem (47) can be formulated as the semidefinite program
[TABLE]
The Lagrange dual problem of (48) is given as
[TABLE]
and with strong duality it follows from complementary slackness that
[TABLE]
As previously discussed in the context of condition (41), the support of the vector , i.e., the spatial frequency estimates, can equivalently be identified by rooting the function in (50).
Making use of the notation , as introduced in (4), condition (50) can be rewritten as
[TABLE]
where
[TABLE]
Condition (51) is fulfilled if
[TABLE]
which is identical to the constraint (39c) in problem (39). Replacing the constraint (49c) in problem (49) by the condition (54) and further using (53) and the substitutions and shows that for the PCA case the dual problem (49) can be reformulated as the dual problem (39). As demonstrated in the previous section, condition (54) can be extended to an infinitesimal grid spacing, resulting in a matrix polynomial constraint, such that the resulting gridless estimation problem can be implemented by semidefinite programming (see Appendix Appendix A - Proof of Problem Equivalence).
7 Numerical Results
For experimental performance evaluation of our proposed COBRAS method we compare its estimation performance to the state-of-the-art methods spectral RARE [8] and root-RARE [19] as well as the Cramér-Rao bound (CRB) [8].
For all simulations we use circular complex Gaussian source signals with covariance matrix , if not specified otherwise. We further consider spatio-temporal white circular complex Gaussian sensor noise with covariance matrix and define the signal-to-noise ratio (SNR) as . The vector contains the global sensor positions of subarray , for , , expressed in half-wavelength, as defined in (1). If not stated otherwise, we perform Monte Carlo trials for each experimental setup and compute the statistical error.
The estimation performance of the COBRAS method strongly depends on proper selection of a regularization parameter . While regularization parameter selection is a research field of its own, in this paper we follow a heuristic approach and select the regularization parameter as
[TABLE]
which has shown good estimation performance in all investigated scenarios.
We remark that the RARE and COBRAS method make different assumptions on the availability of a-priori knowledge. While the RARE method requires knowledge of the number of source signals, the regularization parameter selection for the COBRAS method according to (55) requires knowledge of the noise power. However, since estimation of these parameters itself might affect the frequency estimation performance of the RARE and COBRAS methods, we apply the standard assumption of perfectly known number of source signals and noise power and investigate the achievable performance under these idealized assumptions.
7.1 Arbitrary Array Topologies and Grid-Based Estimation
In the first scenario we consider a PCA with a large aperture, composed of sensors which are partitioned in linear subarrays with 3,2,3, and 3 sensors, respectively. The sensor positions for each subarray are , , , and , and we assume no additional gain/phase offsets among the subarrays, i.e., in (7). We further consider uncorrelated Gaussian source signals with spatial frequencies .
The array topology does not admit a direct implementation of the gridless COBRAS and the root-RARE methods such that we limit the experiments in this subsection to the investigation of the grid-based COBRAS method and the spectral RARE method. For both grid-based methods we use a gird of grid points according to .
To investigate the frequency estimation performance, we compute the root-mean-square error of the frequency estimates in as
[TABLE]
where denotes the frequency estimate of signal in trial and denotes the wrap-around distance for two frequencies . Since the computation (56) requires the number of estimated source signals to be equal to the true number of source signals , we have to consider two special cases: in the case of overestimation of the model order, , we select the frequency estimates with the largest corresponding magnitudes, whereas we select additional random spatial frequencies in the case of underestimation .
In the first experiment the signal-to-noise ratio () is fixed to , while the number of snapshots is varied. Figure 3 clearly demonstrates that our proposed grid-based COBRAS technique outperforms the spectral RARE for low number of signal snapshots . While the spectral RARE method is not able to always resolve the two closely spaced signals with spatial frequencies and for signal snapshots, our proposed COBRAS method resolves the signals for any snapshots.
In a second experiment we fix the number of snapshots as and vary the SNR. As can be observed from Figure 3 the grid-based COBRAS method shows superior threshold performance as compared to the spectral RARE. While the spectral RARE can reliably resolve the two closely spaced sources only for , our proposed COBRAS can do so for . For high , spectral RARE reaches a bias in the RMSE which is caused mainly by the finite grid. A similar bias effect can be observed for the grid-based COBRAS method. However, for the grid-based COBRAS method the bias is larger than for the spectral RARE method and it is not only caused by the finite grid, as discussed in the following subsection.
7.2 Resolution Performance and Estimation Bias
For further investigation of the spatial frequency estimation bias, we consider a uniform linear array of sensors, partitioned into identical, uniform linear subarrays of 3 sensors each, without additional gain/phase offsets, i.e., in (7). For the experiment we consider uncorrelated signals and fix the spatial frequency of the first signal as while the spatial frequency of the second signal is varied according to with . For all grid-based estimation methods we make use of a uniform grid of points according to . The SNR and number of snapshots are fixed as and .
First, we observe from Figure 6 that the spectral RARE performs significantly worse than root-RARE in terms of threshold performance, i.e., the spectral RARE cannot always resolve the two signals for a frequency separation of while the root-RARE can resolve the signals for . The reason for this difference in resolution performance is that the root-RARE method locates the roots of the corresponding matrix polynomial in the entire complex plane, while the spectral RARE only searches minima on the unit circle (see also [39, 40]). In contrast to that, the grid-based and the gridless COBRAS methods both show rather similar estimation performance, comparable to that of the root-RARE method, and reach the CRB for sufficiently large frequency separation. This observation can be explained by the fact that both dual COBRAS optimization problems provide matrix polynomials with the roots of interest constrained on the unit circle, as discussed in Section 6. This explains the similar performance results of the grid-based and gridless COBRAS methods. The only difference between the grid-based and gridless COBRAS methods is that in the first case the roots are generated on a grid of candidate frequencies on the unit circle, while in the latter case the roots are continuously located on the unit circle.
In a slightly modified experiment we fix the SNR and number of snapshots to and , respectively. While the root-RARE method performs close to the CRB for the region of interest, the spectral RARE can not always resolve the signals for and reaches an estimation bias for large source separation, which is caused by the finite frequency grid. Furthermore, it can be observed that the estimation performance of the COBRAS methods deviates from that of the root-RARE method. For large frequency separation the grid-based COBRAS method reaches the grid bias, similar to the spectral RARE. However, also for low frequency separation both methods do not reach the CRB. This can be explained by an inherent frequency estimation bias for SR methods (see also [22, 23]). For further investigation we compute the spatial frequency estimation bias as
[TABLE]
where the mean estimate for spatial frequency is computed as .
For the given scenario, the estimation bias is displayed in Figure 6. In the case of low frequency separation both COBRAS methods show a relatively large estimation bias of . For larger frequency separation , the bias of the grid-based COBRAS method is mainly determined by the finite grid, while the bias of the gridless COBRAS method shows to be periodic in . In difficult scenarios, with low SNR and low number of snapshots as for the previous setup, the estimation bias is below the CRB, such that it is negligible in the RMSE performance. The frequency estimation bias is a well known phenomenon in SR research [22, 23] and bias mitigation techniques have been discussed, e.g., in [41].
7.3 Correlated Signals
As discussed in the previous subsection, gridless COBRAS and root-RARE show approximately equal resolution performance for uncorrelated signals in difficult scenarios with low SNR, low number of snapshots and uncorrelated signals. This situation changes in the case of correlated signals, where preprocessing in form of subspace separation, as required for the RARE method, becomes difficult. For further investigation of this aspect we consider a PCA of sensors partitioned into subarrays of 3,4 and 2 sensors with positions , and . Furthermore we consider gain/phase offsets among the subarrays according to in (7). The SNR and number of snapshots are selected as and . We consider source signals with spatial frequencies and a source covariance matrix given as
[TABLE]
where the correlation coefficient is assumed to be real-valued and varied in the experiment. For the grid-based estimation methods we consider a grid of candidate frequencies, defined as in the previous subsection. As seen from Figure 7, the spectral and root-RARE methods fail to properly estimate the spatial frequencies for high correlation () while the grid-based and gridless COBRAS methods still show estimation performance close to the CRB, since these methods do not require subspace separation.
7.4 Array Calibration Performance
Besides estimation of the spatial frequencies, the COBRAS method also admits estimation of the subarray shifts in as defined in (7). Since the RARE methods do not provide direct estimation of the subarray shifts, we use the method presented in [15] in equation (11) 111Without the restriction that the complex phase terms must be of unit magnitude. on the basis of the spatial frequency estimates obtained by the RARE methods.
The setup under investigation consists of a PCA of sensors partitioned into subarrays of 3,2,3 and 2 sensors at positions , , and . The subarray gain/phase offsets are set as in (7). We consider uncorrelated source signals with spatial frequencies and the number of snapshot is set to .
Figure 9 displays the frequency estimation error of the different methods for varying SNR, where both COBRAS methods show the best thresholding performance but reach an estimation bias for . Similarly, the spectral RARE algorithm reaches an estimation bias which is caused by the finite grid. On the other hand, the root-RARE performs asymptotically optimal and reaches the CRB for high SNR. The corresponding subarray shift estimation performance is displayed in Figure 9, where the root-mean-square error is computed according to
[TABLE]
with being the displacement phase vector estimate for signal in Monte Carlo trial . As can be observed from Figure 9, the subarray shift estimation method in [15], based on the frequency estimates obtained from the RARE methods, achieves a relatively large estimation bias for high SNR. In contrast to that, the grid-based and gridless COBRAS methods show a significantly reduced estimation error, which demonstrates the advantage of joint frequency and displacement phase estimation.
7.5 Computational Complexity
To investigate the computation time of the COBRAS formulation, we perform simulations in Matlab using the SeDuMi solver [35] with the CVX interface [42, 43] on a machine with an Intel Core i5-760 CPU @ and RAM. We consider a scenario with two independent complex Gaussian sources with static spatial frequencies and and a uniform linear PCA of sensors partitioned into identical and uniform linear subarrays of 3 sensors. We neglect subarray gain/phase offsets, i.e., in (7).
For the first experiment the SNR is fixed at while the number of snapshots is varied. Figure 11 shows the average computation time for Monte Carlo runs of the SDP implementation of the mixed-norm minimization (34) and the grid-based COBRAS formulations (35) and (38) with a grid size of , as well as the gridless (GL-) COBRAS formulation in (84). The computation time is measured only for solving the corresponding optimization problem in CVX. Pre-processing steps, such as computation of the sample covariance matrix, or post-processing steps, such as peak-search or polynomial rooting, are not included into this consideration. As can be observed from Figure 11, for a number of snapshots all grid-based methods exhibit approximately equal computation time. For the mixed-norm minimization problem has largest computation time while the COBRAS formulation (35) requires longest computation time for , due to the large dimension of the semidefinite constraint (35b). Regarding the computation time of the grid-based COBRAS formulation using the sample covariance matrix (38) we observe that it is relatively constant for any number of snapshots and lower than for the other implementations especially for large number of snapshots . The lowest computation time is required for solving the GL-COBRAS implementation (84).
Figure 11 shows the average computation time for Monte Carlo runs for a varying number of grid points and a fixed number of signal snapshots, corresponding to the case where the signal subspace matching techniques are applied (compare [22, 34]). For all grid-based methods the number of SDP constraints grows nearly linear with the number of grid points . Clearly, the mixed-norm minimization approach has the largest computation time for any investigated number of grid points . The grid-based COBRAS formulations (35) and (38) show approximately equal computation time, since both formulations have SDP constraints of identical dimension. Since the GL-COBRAS formulation is independent of the grid size , it is constant for all grid size numbers in Figure 11 and provides the fastest computation of all methods under investigation.
8 Conclusion
Partly calibrated arrays are attractive setups for direction of arrival estimation. Computationally efficient subspace-based methods such as RARE and ESPRIT show asymptotically optimal performance, but have problems in difficult scenarios such as low sample size, low signal-to-noise ratio or correlated source signals. Sparse recovery in the form of mixed-norm minimization has been shown to be an attractive alternative to subspace-based methods in these difficult scenarios. In this paper we have derived a compact, equivalent formulation of the mixed-norm minimization, referred to as COmpact Block- and RAnk-Sparse recovery (COBRAS). The COBRAS formulation is attractive especially in the case of a large number of signal snapshots. For the special case of subarrays with common baseline we have presented an extension to gridless estimation, referred to as gridless COBRAS (GL-COBRAS).
As shown by numerical results, the grid-based COBRAS significantly outperforms the spectral RARE method in terms of thresholding performance for closely spaced source signals. Furthermore, the COBRAS method outperforms the RARE method in the case of strongly correlated source signals and in the calibration (subarray shift estimation) performance. A drawback of the mixed-norm minimization approach and the COBRAS formulation is the estimation bias which becomes significant in the asymptotic case of large number of snapshots or high signal-to-noise ratio. However, if higher estimation accuracy is required, the COBRAS estimates can be used to provide initial estimates, e.g., for a subsequent maximum likelihood estimator.
Acknowledgment
This work was supported by the German Research Foundation (DFG) within the DFG priority program on Compressed Sensing in Information Processing (CoSIP DFG-SPP 1798).
Appendix A - Proof of Problem Equivalence
The proof of Theorem 1 relies on the following lemma, see [44, 30]:
Lemma 1**.**
The nuclear norm of the matrix is equivalently computed by the minimization problem
[TABLE]
where and are complex matrices of dimensions and , respectively, with .
Proof of Lemma 1.
Let us define the compact singular value decomposition of matrix as
[TABLE]
such that the factorization terms of can be expressed as
[TABLE]
for of size and some arbitrary unitary matrix , i.e., . Based on (62) it holds that
[TABLE]
where the inequality stems from the Cauchy-Schwartz inequality and is fulfilled with equality if and only if . In this case, the matrix factors in (62) are given as
[TABLE]
Furthermore, by the arithmetic-geometric-mean inequality it follows that
[TABLE]
where equality holds if , such that the minimum of (60) is given by \left\|\boldsymbol{Q}_{k}\right\|_{*}=\frac{1}{2}\big{(}\|\boldsymbol{\varGamma}_{k}\|_{\text{\sf F}}^{2}+\|\boldsymbol{G}_{k}\|_{\text{\sf F}}^{2}\big{)} with and given by (64).
∎
Proof of Theorem 1.
Based on Lemma 1, the mixed-norm of the source signal matrix , as defined in (25), is equivalently computed by
[TABLE]
where , is taken from the set of block-diagonal matrices composed of blocks of size on the main diagonal, and is a complex matrix composed of blocks , for . Inserting equation (66) into the mixed-norm minimization problem in (28) we formulate the minimization problem
[TABLE]
For a fixed matrix , the minimizer of problem (66) has the closed form expression
[TABLE]
where the last equation is derived from the Woodbury matrix identity [45, p.151]. Reinserting the optimal matrix into equation (67) and using basic reformulations of the objective function results in the concentrated minimization problem
[TABLE]
Upon summarizing and defining the positive semidefinite block-diagonal matrix
[TABLE]
we can rewrite (69) as
[TABLE]
Neglecting the factor in (71), we arrive at formulation (29). Using equations (61), (64) and the definition of in (70) we conclude that
[TABLE]
as given in (31). Making further use of (68) and the factorization in (66) we obtain
[TABLE]
which corresponds to relation (30).
∎
Appendix B - SDP Form of the Matrix Polynomial Constraint
As discussed in Section 6, the matrix represents a matrix polynomial of degree in the variable [37]. To see the relation between matrix and the matrix coefficients , let us define the matrix
[TABLE]
and introduce the permutation and selection matrix such that the subarray steering matrix can be expressed as
[TABLE]
[TABLE]
where is of size and is composed of the blocks , for , as
[TABLE]
Equation (76) is also referred to as the Gram matrix representation of the polynomial , and is referred to as the corresponding Gram matrix [37].
We define the block trace operator for matrix as
[TABLE]
i.e., the summation of the submatrices , for , on the main diagonal of matrix . Furthermore, let us define the elementary Toeplitz matrix , with ones on the th diagonal and zeros elsewhere, as well as the elementary block Toeplitz matrix .
Using the block trace operator (78) and the elementary block Toeplitz matrices , the matrix coefficients in (45) can be computed from the Gram matrix in (76) as
[TABLE]
i.e., the summation of the submatrices on the th block-diagonal of the Gram matrix . Note that the mapping (45) is unique, i.e., for any PCA steering matrix block and matrix the coefficients , , of the matrix polynomial are unique. However, the Gram matrix in (76) is not unique, i.e., a matrix polynomial generally admits different Gram matrix representations.
Let us define a second matrix polynomial which has constant value as
[TABLE]
such that the corresponding Gram matrix of size fulfills
[TABLE]
By using (76), (80) and (81) we can express the constraint (46) as
[TABLE]
which is fulfilled for
[TABLE]
Applying (81) and (83) in problem (39) we can define the gridless frequency estimation problem
[TABLE]
Given a minimizer to problem (84) the frequency estimation problem reduces to finding roots for which the constraint (46) becomes singular, as discussed in Section 6.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Krim and M. Viberg, “Two decades of array signal processing research: the parametric approach,” IEEE Signal Processing Magazine , vol. 13, no. 4, pp. 67–94, Jul 1996.
- 2[2] B. Porat and B. Friedlander, “Accuracy requirements in off-line array calibration,” IEEE Transactions on Aerospace and Electronic Systems , vol. 33, no. 2, pp. 545–556, April 1997.
- 3[3] Y. Rockah and P. Schultheiss, “Array shape calibration using sources in unknown locations–part I: Far-field sources,” IEEE Transactions on Acoustics, Speech and Signal Processing , vol. 35, no. 3, pp. 286–299, Mar 1987.
- 4[4] A. J. Weiss and B. Friedlander, “Self-calibration in high-resolution array processing,” in Advances in Spectrum Estimation and Array Processing , S. Haykin, Ed. Prentice-Hall, 1991, vol. II.
- 5[5] M. Viberg and A. Swindlehurst, “A Bayesian approach to auto-calibration for parametric array signal processing,” IEEE Transactions on Signal Processing , vol. 42, no. 12, pp. 3495–3507, Dec 1994.
- 6[6] B. C. Ng and C. M. S. See, “Sensor-array calibration using a maximum-likelihood approach,” IEEE Transactions on Antennas and Propagation , vol. 44, no. 6, pp. 827–835, Jun 1996.
- 7[7] B. P. Flanagan and K. L. Bell, “Array self-calibration with large sensor position errors,” Signal Processing , vol. 81, no. 10, pp. 2201–2214, 2001.
- 8[8] C. See and A. Gershman, “Direction-of-arrival estimation in partly calibrated subarray-based sensor arrays,” IEEE Transactions on Signal Processing , vol. 52, no. 2, pp. 329–338, 2004.
