Asymptotic normality of element-wise weighted total least squares estimator in a multivariate errors-in-variables model
Yaroslav Tsaregorodtsev

TL;DR
This paper investigates the asymptotic normality of a weighted total least squares estimator in a multivariate errors-in-variables model with complex error structures, providing conditions for its distribution as data size grows.
Contribution
It establishes conditions under which the element-wise weighted total least squares estimator is asymptotically normal in a multivariable errors-in-variables model.
Findings
Conditions for asymptotic normality are derived.
The limiting Gaussian distribution has a nonsingular covariance.
The model accounts for row-wise correlated errors and varying error covariances.
Abstract
A multivariable measurement error model is considered. Here and are input and output matrices of measurements and is a rectangular matrix of fixed size to be estimated. The errors in are row-wise independent, but within each row the errors may be correlated. Some of the columns are observed without errors and the error covariance matrices may differ from row to row. The total covariance structure of the errors is known up to a scalar factor. The fully weighted total least squares estimator of is studied. We give conditions for asymptotic normality of the estimator, as the number of rows in is increasing. We provide that the covariance structure of the limiting Gaussian random matrix is nonsingular.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and numerical algorithms · Advanced Statistical Methods and Models · Geochemistry and Geologic Mapping
Asymptotic normality of element-wise weighted total least squares estimator in a multivariate errors-in-variables model
Ya. V. Tsaregorodtsev
Department of Mathematical Analysis, Faculty of Mechanics and Mathematics, Taras Shevchenko National University of Kyiv, Building 4-e, Akademika Glushkova Avenue, Kyiv, Ukraine, 03127
Abstract.
A multivariable measurement error model is considered. Here and are input and output matrices of measurements and is a rectangular matrix of fixed size to be estimated. The errors in are row-wise independent, but within each row the errors may be correlated. Some of the columns are observed without errors and the error covariance matrices may differ from row to row. The total covariance structure of the errors is known up to a scalar factor. The fully weighted total least squares estimator of is studied. We give conditions for asymptotic normality of the estimator, as the number of rows in is increasing. We provide that the covariance structure of the limiting Gaussian random matrix is nonsingular.
Key words and phrases:
Asymptotic normality, element-wise weighted total least squares estimator, heteroscedastic errors, multivariate errors-in-variables model
2000 Mathematics Subject Classification:
62E20; 62F12; 62J05; 62H12; 65F20
1. Introduction
We deal with an overdetermined set of linear equations which is common in linear parameter estimation problems [12]. If both the data matrix and observation matrix are contaminated with errors, and all the errors are uncorrelated and have equal variances, the total least squares (TLS) technique is appropriate for solving this set [4], [12]. Under mild conditions, the TLS estimator of is consistent and asymptotically normal, as the number of rows in is increasing [3], [7].
In this paper we consider heteroscedastic errors. The errors in are row-wise independent, but within each row the errors may be correlated. Some of the columns are observed without errors, and the error covariance matrices may differ from row to row. The total error covariance structure is assumed known up to a scalar factor. For this model, the element-wise weighted total least squares (EW-TLS) estimator is introduced and its consistency is proven in [6]. Concerning the computation of the estimator see [10], [5]. The EW-TLS estimator is applied, e.g., in geodesy [9].
Our goal is to extend the asymptotic normality result of [7] to the EW-TLS estimator. We work under the conditions of Theorem 2, [6] about the consistency of We use the objective function of the estimator, see formula (22) in [6], and the rules of matrix calculus [2].
The paper is organized as follows. In section 2, we describe the model, introduce main assumptions, refer to the consistency result for and present the objective function and the matrix estimating function. In Section 3, we state the asymptotic normality result and provide a nonsingular covariance structure for a limiting random matrix. In Section 4, we derive consistent estimators for nuisance parameters of the model in order to estimate consistently the asymptotic covariance structure of and Section 5 concludes. The proofs are given in Appendix.
Throughout the paper all vectors are column ones, stands for expectation and acts as an operator on the total product, denotes the covariance matrix of a random vector and for a sequence of random matrices of the same size, notation means that the sequence is stochastically bounded, and means that denotes the identity matrix of size
2. Observation model and consistency of the estimator
2.1. The EW-TLS promblem
We deal with the model Here and are matrices of observations, and the matrix is to be estimated. Assume that
[TABLE]
and that there exists such that
[TABLE]
Here is nonrandom true input matrix, is a true output matrix, and are error matrices. is the true value of the matrix parameter.
It is useful to rewrite the model (2.1) and (2.2) as a classical errors-in-variables (EIV) model [1]. Denote the rows of and respectively. Then the model above is equivalent to the EIV model
[TABLE]
Vectors are nonrandom and unknown, and vectors are random errors. Based on observations one has to estimate
Rewrite the model (2.1) and (2.2) in an implicit way. Introduce matrices
[TABLE]
Then (2.1), (2.2) is equivalent to the next relations:
[TABLE]
Let Following [6] we state global assumptions of the paper, conditions (i) to (iv).
- (i).
Vectors are independent with zero mean and finite second moments.
Let We allow that some of are vanishing.
- (ii).
For a fixed every and every satisfy Moreover
[TABLE]
with unknown positive factor of proportionality and known matrices 2. (iii).
There exists such that for every it holds
For the matrix given in (2.4) and the set from condition (ii), denote
[TABLE]
- (iv).
[TABLE]
The EW-TLS problem consists in finding the value of the unknown matrix and values of disturbances minimizing the weighted sum of squared corrections:
[TABLE]
subject to constrains
[TABLE]
Here and the column vectors
[TABLE]
2.2. EW-TLS estimator and its consistency
For a random realization, it can happen that the problem (2.5) has no solution. Assume conditions (i) – (iv).
Definition 1**.**
The EW-TLS estimator of in the model (2.1), (2.2) is a Borel measurable mapping of the data matrix into which solves the problem (2.5) under the additional constraint
[TABLE]
if there exists a solution, and otherwise.
The EW-TLS estimator always exists due to [11]. We need more conditions to provide the consistency of
- (v).
There exists with such that
[TABLE] 2. (vi).
as 3. (vii).
as
The next result on weak consistency is stated in Theorem 2, [6].
Theorem 2**.**
Assume conditions (i) to (vii). Then the EW-TLS estimator is finite with probability tending to one, and tends to in probability, as
Notice that under a bit stronger assumptions on eigenvalues of the estimator is strongly consistent, see Theorem 3, [6].
2.3. The estimating function
Remember that error vectors enter condition (i) and the matrix is introduced in Definition 1. Let
[TABLE]
Denote also
[TABLE]
where and
[TABLE]
Notice that due to (iv) and under constraint (2.6) is of full rank. Then, under conditions (i) – (iii) the matrix is nonsingular,
The EW-TLS estimator is known to minimize the objective function (2.7), see Theorem 1, [6].
Lemma 3**.**
Assume conditions (i) to (iv). The EW-TLS estimator is finite if, and only if, there exists an unconditional minimum of the function (2.8), and then is a minimum point of this function.
Introduce an estimating function related to the loss function (2.7):
[TABLE]
Here
[TABLE]
Corollary 4**.**
Assume conditions (i) – (vii). Then the next two statements hold true.
- (a)
With probability tending to one is a solution to the equation
[TABLE] 2. (b)
The function (2.9) is an unbiased estimating function, i.e., for each
For fixed the function (2.9) maps into The derivative is a linear operator in this space.
Lemma 5**.**
Under conditions (i) – (vii), for each and it holds
[TABLE]
3. Asymptotic normality of the estimator
Introduce further assumptions.
- (viii).
For some 2. (ix).
For from the condition (viii),
[TABLE] 3. (x).
as where is a nonsingular matrix.
Notice that condition (x) implies assumptions (vi), (vii).
- (xi).
For matrices from condition the (ii), as where is certain matrix.
Notice that conditions (xi), (iii) imply that is nonsingular.
- (xii).
If (they are not necessarily distinct) and then
[TABLE] 2. (xiii).
If (they are not necessarily distinct), then converges to a finite limit as tends to infinity.
Introduce a random element in the space of couples of matrices:
[TABLE]
Hereafter stands for the convergence in distribution.
Lemma 6**.**
Assume conditions (i), (ii) and (viii) – (xiii). Then
[TABLE]
where is a Gaussian centered random element with independent matrix components and
Now, we state the asymptotic normality of the EW-TLS estimator.
Theorem 7**.**
Assume conditions (i) – (v) and (viii) – (xiii). Then
[TABLE]
where enters condition (x), is the projector with and enter relation (3.2), and
[TABLE]
Moreover the limiting random matrix has a nonsingular covariance structure, i.e., for each nonzero vector is a nonsingular matrix.
4. Construction of confidence region for a linear functional of
4.1. Estimation of nuisance parameters
Theorem 7 can be applied, e.g., to construct a confidence region for a linear functional of For this purpose one has to estimate consistently a covariance structure of the limiting random matrix Such a structure, besides of depends on nuisance parameters. Some of them can be estimated consistently.
Hereafter bar means average for rows e.g.,
[TABLE]
Lemma 8**.**
Assume conditions of Theorem 7. Define
[TABLE]
Then, as
[TABLE]
4.2. Estimation of the asymptotic covariance structure of
Let Theorem 7 implies the convergence
[TABLE]
with nonsingular matrix
We start with the case of normal errors Then condition (xii) holds true, and Theorem 7 is applicable. The asymptotic covariance matrix is a continuous function of unknown parameters (here the limiting covariance matrix could be unknown, though for a given matrices are assumed known). Due to Theorem 2 and Lemma 8 the matrix
[TABLE]
is a consistent estimator of
Now, we do not assume the normality of the errors. Then the exact formula for does not allow to estimate it consistently, because the formula involves higher moments of errors which are difficult to estimate consistently. Instead, we use Corollary 4 to construct the so-called sandwich estimator [1] for Denote
[TABLE]
with introduced in (2.10)
Lemma 9**.**
Assume conditions of Theorem 7. For define
[TABLE]
with given in (4.2), (4.1). Then as
Remark. In the case of normal errors, the estimator (4.4) is asymptotically more efficient than the estimator (4.6), cf. the discussion in [1], p. 369.
Given a consistent estimator of we have from (4.3) that
[TABLE]
Based on (4.7), one can construct in a standard way an asymptotic confidence ellipsoid for Similarly a confidence ellipsoid can be constructed for any finite set of linear combinations of entries.
5. Conclusion
We proved the asymptotic normality of the EW-TLS estimator in a multivariate errors-in-variables model with heteroscedastic errors. We assumed the convergence (xi) of the second error moments, vanishing third moments (xiii), and the convergence of averaged fourth moments (xiii). The condition (xii) ensured that the asymptotic covariance structure of is nonsingular. This condition holds true in two cases: (a) all the error vectors are symmetrically distributed, or (b) for each random variables are independent and have vanishing coefficient of asymmetry.
The obtained asymptotic normality result made it possible to construct a confidence ellipsoid for a linear functional of Another plausible application is goodness-of-fit test in the model with heteroscedastic errors (see [7] for such a test in the model with homoscedastic errors).
The author is grateful to Prof. A. Kukush for the problem statement and fruitful discussions.
Appendix
Proof of Corollary 4
(a) The space is endowed with natural inner product The matrix derivative of the functional (2.7) is a linear functional on and based on the inner product, this functional can be identified with certain matrix from
Remember that is introduced in Definition 1. Using the rules of matrix calculus [2], we have for
[TABLE]
Remember relations (2.11). Collecting similar terms, we obtain:
[TABLE]
and
[TABLE]
Using the inner product in we obtain
[TABLE]
with given in (2.10). Now, Theorem 2 and Lemma 3 imply the statement of Corollary 4(a).
(b) We set
[TABLE]
where is a nonrandom vector and like in (2.3),
[TABLE]
Then
[TABLE]
Therefore, see (2.9),
[TABLE]
The statement (b) of Corollary 4 is proven.
Proof of Lemma 5
The derivative of the function (2.9) with respect to is a linear operator in Denote For it holds:
[TABLE]
We set (A.1), use relations
[TABLE]
and get:
[TABLE]
Next,
[TABLE]
Combining (A.2) and (A.3) we see that on the right-hand side of (A.2) summands containing are cancelled out. We get finally
[TABLE]
which implies the statement, because by Corollary 4(b) it holds
Proof of Lemma 6
The proof is similar to the proof of Lemmas 6 and 7 from [7] and based on Lyapunov’s Central Limit Theorem. We just notice that due to condition (xii) the matrix components of namely and are uncorrelated, and this implies the independence of matrix components and in (3.2).
Proof of Theorem 7
We follow the line of [7], see there the proof of Theorem 8(a). By Corollary 4(a), it holds with probability tending to 1:
[TABLE]
Denote
[TABLE]
Using Taylor’s formula around (see [2], Theorem 5.6.2), we obtain from (A.4) that
[TABLE]
Here is a multiplier of the form
[TABLE]
with positive chosen such that , for all with the choice is possible due to condition (iv), and expression (A.6) is indeed (i.e., stochastically bounded), because is quadratic in and the averaged second moments of are assumed bounded. Thus, the relation (A.5) holds true due to the consistency of stated in Theorem 2.
We have Now, by Lemma 5 and condition (x) and (xi) it holds
[TABLE]
and we derive from (A.5) the relation
[TABLE]
The summands in have zero expectation by Corollary 4(b). Remember that and the projector is introduced in Theorem 7. Then, see (2.9),
[TABLE]
Here are components of (3.1). By Lemma 6 it holds, see (3.4) and condition (xi):
[TABLE]
Now, relations (A.7), (A.8) and nonsingularity of imply and by Slutsky’s lemma
[TABLE]
This implies the desired convergence (3.3) – (3.5).
Let By Lemma 6 the components and are independent. We have
[TABLE]
and the latter matrix is positive definite, because and are positive definite under the conditions of Theorem 7. Therefore, is a positive definite matrix as well.
Proof of Lemma 8
We have
[TABLE]
Relation (A.9) and the convergence imply the desired convergence as
Next,
[TABLE]
Proof of Lemma 9
Denote Then expansion (A.7) implies that
[TABLE]
and by Lemma 8
[TABLE]
Then as because and (see formulas (2.9), (2.10) and (4.5)). Lemma 9 is proven.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. J. Carroll, D. Ruppert, L. A. Stefanski, and C. M. Crainiceanu, Measurement Error in Nonlinear Models: A Modern Perspective. 2nd ed. Boca Raton, Chapman and Hall/CRC, 2006.
- 2[2] H. Cartan, Differential Calculus . Hermann/Houghton Mifflin Co., Paris/Boston, MA. Translated from French, 1971.
- 3[3] L. J. Gleser, Estimation in a multivariate “errors in variables” regression model: large sample results , Ann. Stat. 9 (1981), no. 1, 24–44.
- 4[4] G. H. Golub and C. F. Van Loan, An analysis of the total least squares problem , SIAM J. Numer. Anal. 17 (1980), no. 6, 883–893.
- 5[5] S. Jazaerti, A. R. Amiri-Simkooei, and M. A. Sharifi, Iterative algorithm for weighted total least squares adjustment , Survey Review 46 , (2014), no. 334, 19–27.
- 6[6] A. Kukush and S. Van Huffel, Consistency of elementwise-weighted total least squares estimator in a multivariate errors-in-variables model A X = B 𝐴 𝑋 𝐵 AX=B , Metrika 59 (2004), no. 1, 75–97.
- 7[7] A. Kukush and Ya. Tsaregorodtsev, Asymptotic normality of total least squares estimator in a multivariable errors-in-variables model A X = B 𝐴 𝑋 𝐵 AX=B , Modern Stochastics: Theory and Applications 3 (2016), no. 1, 47–57.
- 8[8] A. Kukush and Ya. Tsaregorodtsev, Goodness-of-fit test in a multivariate errors-in-variables model A X = B 𝐴 𝑋 𝐵 AX=B , Modern Stochastics: Theory and Applications 3 (2016), no. 4, 287–302.
