Asymptotic Performance of Complex M-estimators for Multivariate Location and Scatter Estimation
Bruno M\'eriaux, Chengfang Ren, Mohammed Nabil El Korso, Arnaud Breloy, and Philippe Forster

TL;DR
This paper derives the asymptotic performance of complex M-estimators for multivariate location and scatter estimation, addressing robustness issues in multivariate analysis, especially in complex data scenarios.
Contribution
It extends the asymptotic analysis of M-estimators from real to complex cases, providing theoretical insights into their performance.
Findings
Asymptotic performance formulas derived for complex M-estimators
Addresses robustness against outliers in complex multivariate data
Enhances understanding of complex estimator behavior in high-dimensional settings
Abstract
The joint estimation of means and scatter matrices is often a core problem in multivariate analysis. In order to overcome robustness issues, such as outliers from Gaussian assumption, M-estimators are now preferred to the traditional sample mean and sample covariance matrix. These estimators are well established and studied in the real case since the seventies. Their extension to the complex case has drawn recent interest. In this letter, we derive the asymptotic performance of complex M-estimators for multivariate location and scatter matrix estimation.
| (17) | ||||
| 0 | (18) | |||
| (19) | ||||
| (20) | ||||
| (21) | ||||
| (22) | ||||
| (23) | ||||
| (24) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Asymptotic Performance of Complex -estimators for Multivariate Location and Scatter Estimation
Bruno Mériaux, Chengfang Ren, Mohammed Nabil El Korso, Arnaud Breloy, and Philippe Forster B. Mériaux and C. Ren are with SONDRA, CentraleSupélec, France. M.N. El Korso and A. Breloy are with LEME, Paris-Nanterre University, France. P. Forster is with SATIE, Paris-Nanterre University, France.This work is financed by the Direction Générale de l’Armement as well as the ANR ASTRID referenced ANR-17-ASTR-0015.
Abstract
The joint estimation of means and scatter matrices is often a core problem in multivariate analysis. In order to overcome robustness issues, such as outliers from Gaussian assumption, -estimators are now preferred to the traditional sample mean and sample covariance matrix. These estimators are well established and studied in the real case since the seventies. Their extension to the complex case has drawn recent interest. In this letter, we derive the asymptotic performance of complex -estimators for multivariate location and scatter matrix estimation.
Index Terms:
Complex observations, Robust estimation of multivariate location and scatter, Complex Elliptically Symmetric distributions.
I Introduction
Several classical methods in multivariate analysis require the estimation of means and scatter matrices from collected observations [1, 2, 3, 4, 5]. In pratice, the Sample Mean and the Sample Covariance Matrix are classically used in such procedures. Indeed, they coincide with the Maximum Likelihood Estimators (MLE) for multivariate Gaussian data. However, they are neither robust to deviation from Gaussianity assumption nor to the presence of outliers, which could lead to a dramatic performance loss. To overcome these problems, several approaches have been proposed in the literature [6, 7] improving the estimator’s behavior under contamination through diverse criteria such as the breakdown point, the contamination bias, the finite-sample efficiency while preserving the computationally feasibility for high dimension. Among the most frequently used robust affine equivariant estimators [8, 9, 10, 11, 12, 13, 14, 15, 16, 17], we focus on the -estimators, which have been first introduced within the framework of elliptical distributions [18]. The latters encompass a large number of classical distributions as for instance the Gaussian one but also non-Gaussian heavy-tailed distributions such as the -, - and -distributions [19]. One of the main interests of the real -estimators is to possess, under mild conditions, good asymptotic properties, namely weak consistency and asymptotic normality over the whole class of elliptical distributions [8, 20, 21, 22]. Their extension to the complex case has drawn recent interest [23], notably with the class of Complex Elliptically Symmetric (CES) distributions [24]. In the context of known mean, their asymptotic properties are established in [25, 26]. In this letter, we extend this result for multivariate location and scatter matrix -estimates under CES distributed data. The achieved outcome is analogous to its real-counterpart [21] and may be used for deriving the performance of adaptive processes in non-zero mean non Gaussian distributed observations [1].
In the following, the notation , and indicate respectively equality in distribution, convergence in law and in probability. The symbol refers to statistical independence. The operator stacks all columns of A, designated by into a vector. The operator refers to Kronecker product. The notation refers to the zero mean non-circular complex Gaussian distribution, where (respectively ) denotes the covariance matrix (respectively pseudo-covariance matrix) [27, 28].
II Problem setup
Let be i.i.d samples of -dimensional vectors, following a complex elliptical distribution, which is denoted by and whose p.d.f. is proportional to
[TABLE]
The vector is the location parameter, denotes the scatter matrix and the function is the density generator [29]. We aim to estimate jointly the location parameter and the scatter matrix from the observations. The complex joint -estimators \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} are solutions of the system [23]:
[TABLE]
with . Let us consider \big{(}\textbf{t}_{e},\textbf{M}_{e}\big{)} a solution related to the system :
[TABLE]
The functions and verify the conditions given in [8] for the real case (for the special case, where , [30] provides more general conditions). The proofs of Lemmas 1 and 2 and Theorems 1-3 of [8], addressing the existence and uniqueness of -estimates, are transposable to the complex field, by following the same methodology as in [8]. The derivations of these proofs require the same conditions on and as the ones needed in the real case. Thus, this ensures the existence of \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} and \big{(}\textbf{t}_{e},\textbf{M}_{e}\big{)} as well as the uniqueness of \big{(}\textbf{t}_{e},\textbf{M}_{e}\big{)}. In this letter, we derive the statistical performance of the complex joint -estimators \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)}, namely consistency and asymptotic distribution.
III Consistency of the joint -estimator
Let \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} be a solution of and \big{(}\textbf{t}_{e},\textbf{M}_{e}\big{)} be the solution of the system .
Theorem III.1**.**
The complex joint -estimators are consistent, i.e.
[TABLE]
with in which is the solution of , for and .
Proof.
First, let us define and the function \boldsymbol{\Psi}_{N}\left(\boldsymbol{\theta}\right)=\left[\begin{array}[]{l}\boldsymbol{\Psi}_{1,N}\left(\boldsymbol{\theta}\right)=\dfrac{1}{N}\sum\limits_{n=1}^{N}u_{1}\left(d\left(\textbf{z}_{n},\textbf{t};\textbf{M}\right)\right)\left(\textbf{z}_{n}-\textbf{t}\right)\\ \boldsymbol{\Psi}_{2,N}\left(\boldsymbol{\theta}\right)=\text{vec}{}\left(\mathcal{H}\left(\textbf{Z}_{N},\textbf{t},\textbf{M}\right)-\textbf{M}\right)\end{array}\right].
The Strong Law of Large Numbers (SLLN) gives
[TABLE]
with \forall\,\boldsymbol{\theta}\in\boldsymbol{\Theta},\;\boldsymbol{\Psi}\left(\boldsymbol{\theta}\right)=\left[\begin{array}[]{l}\boldsymbol{\Psi}_{1}\left(\boldsymbol{\theta}\right)=\mathbb{E}\left[u_{1}\left(d\left(\textbf{z}_{1},\textbf{t};\textbf{M}\right)\right)\left(\textbf{z}_{1}-\textbf{t}\right)\right]\\ \boldsymbol{\Psi}_{2}\left(\boldsymbol{\theta}\right)=\text{vec}{}\left(\mathcal{H}_{\infty}\left(\textbf{t},\textbf{M}\right)-\textbf{M}\right)\end{array}\right].
According to the Theorem 5.9 [31, Chap. 5] and uniqueness of solution, we can show that any \widehat{\boldsymbol{\theta}}_{N}^{T}=\left[\mathbf{\hat{t}}_{N}^{T},\text{vec}{}\big{(}\mathbf{\widehat{M}}_{N}\big{)}^{T}\right] solution of converges in probability to solution of , yielding the intended outcome. Furthermore, the matrix is proportional to through a scale factor [32, Chap. 6]. Multiplying (5) by and taking the trace yields of which is the solution. ∎
IV Asymptotic distribution of the joint -estimators
IV-A Main theorem
We consider \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} a solution of as well as \big{(}\textbf{t}_{e},\textbf{M}_{e}\big{)} the solution of .
Theorem IV.1**.**
Assuming that () are bounded and , the asymptotic distribution of \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} is given by
[TABLE]
where is the commutation matrix satisfying \textbf{K}_{m}\text{vec}{}\left(\textbf{A}\right)=\text{vec}{}\big{(}\textbf{A}^{T}\big{)} [33] and is obtained by
[TABLE]
in which
[TABLE]
with solution of in which .
IV-B Proof of Theorem IV.1
The starting point of the proof is to map the complex joint -estimators \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)} into real-ones, then to study the asymptotic behavior of the latter, and finally to relate the latter to the asymptotic distribution of the complex joint -estimators \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)}.
IV-B1 Complex vector space isomorphism
Let us first introduce functions and with defined by and
[TABLE]
In addition, let be the matrix . Some useful properties of the previous functions are given in [25]. Furthermore, we set , , and . In addition, let us define and for , where , and . The notation refers to real elliptical distributions [18]. Moreover, there exist another relation between the vectors , and , for any Hermitian matrix, [25]:
[TABLE]
Let us apply the function to the equation (3) (respectively to (2)), we obtain
[TABLE]
[TABLE]
where and according to (9). Let be the functions related to by . Finally, we introduce the two following real joint -estimators \big{(}\mathbf{\hat{t}}_{N}^{u},\mathbf{\widehat{M}}_{N}^{u}\big{)} and \big{(}\mathbf{\hat{t}}_{N}^{v},\mathbf{\widehat{M}}_{N}^{v}\big{)} respectively solution of and . From the results in the real case on the consistency [8, 21], we obtain
[TABLE]
in which and are solutions of and and is the solution of , . Thus, we have .
Finally, by applying to the system , we obtain and . Moreover, since , we have and .
IV-B2 Link between asymptotic behaviors of \big{(}\mathbf{\hat{t}}_{N}^{u},\mathbf{\widehat{M}}_{N}^{u}\big{)}, \big{(}\mathbf{\hat{t}}_{N}^{v},\mathbf{\widehat{M}}_{N}^{v}\big{)} and \big{(}\mathbf{\hat{t}}_{N}^{\mathbb{R}},\mathbf{\widehat{M}}_{N}^{\mathbb{R}}\big{)}
Lemma IV.1**.**
* and (respectively and \dfrac{1}{2}\big{(}\mathbf{\widehat{M}}_{N}^{u}+\mathbf{\widehat{M}}_{N}^{v}\big{)}) share the same asymptotic Gaussian law.*
Proof.
See Appendix. ∎
IV-B3 Asymptotic behavior of \big{(}\mathbf{\hat{t}}_{N},\mathbf{\widehat{M}}_{N}\big{)}
Since \text{vec}{}\big{(}\mathbf{\widehat{M}}_{N}^{\mathbb{R}}\big{)} and have an asymptotic Gaussian distribution according to Lemma IV.1, and \text{vec}{}\big{(}\mathbf{\widehat{M}}_{N}\big{)}=\left(\textbf{g}_{m}^{T}\otimes\textbf{g}_{m}^{H}\right)\text{vec}{}\big{(}\mathbf{\widehat{M}}_{N}^{\mathbb{R}}\big{)} and with . Consequently, and \text{vec}{}\big{(}\mathbf{\widehat{M}}_{N}\big{)} have a non-circular complex Gaussian distribution [27]. Additionally, using the same approach as in [25], we obtain
[TABLE]
Regarding the location estimate, we have
[TABLE]
[TABLE]
in which
[TABLE]
with and hence . Furthermore, we have the relations and . Moreover, we have
[TABLE]
Thus, we prove that . Applying Lemma IV.1, we have and consequently,
[TABLE]
thus we prove that .
V Simulations
In order to illustrate our theoretical results, some simulations results are presented. Two scenarios have been considered for the simulations. For , the true location parameter is and the true scatter matrix is , due to the affine equivariance propertie of the -estimators, there is no loss of generality.
- •
Case 1 : the data are generated under a -distribution with degrees of freedom [34].
- •
Case 2 : the data are generated under a -distribution with shape parameter and scale parameter [26].
The complex joint -estimators is obtained with and the reweighting algorithm of [30], whose convergence is established.
The first case coincides with the MLE unlike the case 2, which is a general complex joint -estimator.
In Fig. 1, we plot the trace of the Mean Squared Error of the estimates of the location and the scatter matrix as well as the trace of the theoretical asymptotic covariance matrices and . The results are validated since the drawn quantities are identical when . Moreover these quantities tend asymptotically to zero, which illustrate the consistency.
VI Conclusion
In this letter, we established the asymptotic performance of the joint -estimators for the complex multivariate location and scatter matrix. This statistical study highlights a better understanding on the performance of the -estimators of the complex multivariate location and scatter matrix. Again, the obtained results may be used for conducting a performance analysis of adaptive processes involving non-zero mean observations.
[Proof of Lemma IV.1]
-A Asymptotic behavior of \big{(}\mathbf{\hat{t}}_{N}^{u},\mathbf{\widehat{M}}_{N}^{u}\big{)} and \big{(}\mathbf{\hat{t}}_{N}^{v},\mathbf{\widehat{M}}_{N}^{v}\big{)}
First of all, let us define and . Since \big{(}\mathbf{\hat{t}}_{N}^{u},\mathbf{\widehat{M}}_{N}^{u}\big{)}\overset{\mathbb{P}}{\rightarrow}\left(\textbf{t}_{u},\textbf{M}_{u}\right), we can write for , and . Let us note , , and and use first order expansions for sufficiently large, then we obtain (17) and (18) from . By vectorizing (17) and after some calculus, we obtain
[TABLE]
where and .
Remark**.**
Note that . Let be , thus we have and , , all 3rd-order moments vanish and the only non-vanishing 4th-order moments are and for where .
The SLLN yields to (19)–(22). Furthermore, since and , it yields from the central limit theorem that and with and zero-mean Gaussian distributed. Applying Slutsky’s lemma [32], it comes
[TABLE]
In the same way, we obtain
[TABLE]
-B Asymptotic behavior of \big{(}\mathbf{\hat{t}}_{N}^{\mathbb{R}},\mathbf{\widehat{M}}_{N}^{\mathbb{R}}\big{)}
With the results of Theorem III.1, the continuous mapping theorem implies
[TABLE]
Let us define . Since and , becomes and satisfies . For , we can write and . As previously, from (10) we obtain (23). Since , the Slutsky’s lemma leads to
[TABLE]
Similarly, from (11) we obtain
[TABLE]
and thus , which means that and have the same asymptotic distribution.
Lastly, we also introduce , which are zero mean and i.i.d., then
[TABLE]
Furthermore, since we have (24), we obtain .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Frontera-Pons, J.-P. Ovarlez, and F. Pascal, “Robust ANMF detection in noncentered impulsive background,” IEEE Signal Processing Letters , vol. 24, no. 12, pp. 1891–1895, Dec. 2017.
- 2[2] J. A. Fessler, “Mean and variance of implicitly defined biased estimators (such as penalized maximum likelihood): applications to tomography,” IEEE Transactions on Image Processing , vol. 5, no. 3, pp. 493–506, Mar. 1996.
- 3[3] A. M. Zoubir, V. Koivunen, Y. Chakhchoukh, and M. Muma, “Robust estimation in signal processing: A tutorial-style treatment of fundamental concepts,” IEEE Signal Processing Magazine , vol. 29, no. 4, pp. 61–80, Jul. 2012.
- 4[4] V. Ollier, M. N. E. Korso, R. Boyer, P. Larzabal, and M. Pesavento, “Robust calibration of radio interferometers in non-gaussian environment,” IEEE Transactions on Signal Processing , vol. 65, no. 21, pp. 5649–5660, Nov. 2017.
- 5[5] X. Zhang, M. N. E. Korso, and M. Pesavento, “ MIMO radar target localization and performance evaluation under SIRP clutter,” Elsevier Signal Processing , vol. 130, pp. 217–232, Jan. 2017.
- 6[6] M. Hubert, P. Rousseeuw, D. Vanpaemel, and T. Verdonck, “The det S and det MM estimators for multivariate location and scatter,” Computational Statistics and Data Analysis , vol. 81, pp. 64–75, 2015.
- 7[7] R. A. Maronna and V. J. Yohai, “Robust and efficient estimation of multivariate scatter and location,” Computational Statistics and Data Analysis , vol. 109, pp. 64–75, 2017.
- 8[8] R. A. Maronna, “Robust M -estimators of multivariate location and scatter,” The Annals of Statistics , vol. 4, no. 1, pp. 51–67, Jan. 1976.
