Asymptotics of an empirical bridge of regression on induced order statistics
Artyom Kovalevskii

TL;DR
This paper introduces a new statistical test for linear regression models based on empirical bridges derived from induced order statistics, with proven convergence properties and chi-square type tests.
Contribution
It presents a novel class of tests using empirical bridges for regression on concomitants, with theoretical proof of weak convergence to Gaussian processes.
Findings
Empirical bridges converge weakly to Gaussian processes.
Tests are of chi-square type and based on residual sums.
Method provides a new approach for regression analysis on induced order statistics.
Abstract
We propose a class of tests for linear regression on concomitants (induced order statistics). These tests are based on sequential sums of regression residuals. We self-center and self-normalize these sums. The resulting process is called an empirical bridge. We prove weak convergence of the empirical bridge in uniform metrics to a centered Gaussian process. The proposed tests are of chi-square type.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Asymptotics of an empirical bridge of regression on induced order statistics
Artyom Kovalevskii [email protected], Novosibirsk State Technical University, Novosibirsk State University. The research was supported by RFBR grant 17-01-00683
Abstract
We propose a class of tests for linear regression on concomitants (induced order statistics). These tests are based on sequential sums of regression residuals. We self-center and self-normalize these sums. The resulting process is called an empirical bridge. We prove weak convergence of the empirical bridge in uniform metrics to a centered Gaussian process. The proposed tests are of chi-square type.
Keywords: concomitants, weak convergence, regression residuals, empirical bridge.
1 Introduction
There are few approaches to regression models testing. An empirical fluctuation process function is implemented in R package by Zeileis at al. [40]. The process uses recursive regression residuals proposed by Brown, Durbin, Ewans [11].
MacNeill [31] studied linear regression for time series. He obtained limit processes for sequences of partial sums of regression residuals. Later Bischoff [10] showed that the MacNeill’s theorem holds in a more general setting. Aue et al. [1] introduced a new test for polynomial regression functions which is analogous to the classical likelihood test. This approach is developed in [2], [3].
Stute [37] proposed a class of tests for one-parametric case.
Our approach is adopted specifically for regression on induced order statistics (concomitants). These models arise in applications [28]. Partial results are proposed in [29], [30]. The proofs use the theory of induced order statistics.
David [12] and Bhattacharya [7] have introduced induced order statistics (concomitants) simultaneously. The asymptotic theory was developed in in [4], [5], [6], [8], [9], [13], [14], [15], [16], [17], [18], [19], [21], [22], [24], [38], [39]. Strong convergence to a corresponding Gaussian process can be proved by methods of [20], [27], [32], [35].
New contributions to the theory deal with extremal order statistics [23], [34], [36].
2 Main result and corollaries
Let be independent and identically distributed random vector rows, has uniform distribution on , . Random variables are not observed. They will be used for ordering.
We assume a linear regression hypothesis . Here
[TABLE]
and are independent, are i.i.d., , .
Vector and constant are unknown.
We order rows by the first component, that is, we change rows and while and . The result is matrix with rows . So a.s. are order statistics from uniform distribution on . Elements of matrix are concomitants. Let . Note that are i.i.d. and have the same distribution as . Sequences and are independent.
Let be LSE:
[TABLE]
It does not depend on the order of rows.
Let be conditional expectation, be induced theoretical generalised Lorentz curve (see [1]),
[TABLE]
be a matrix of conditional covariances.
Let , , Then .
Let , , .
Let be a piecewise linear random function with nodes
[TABLE]
We designate weak convergence in with uniform metrics by .
Theorem 1 *If exists, , then . Here is a centered Gaussian process with covariation function *
[TABLE]
Let be an empirical bridge (see [28], [29], [30]):
[TABLE]
with . Let .
Let be an empirical induced generalised Lorentz curve:
[TABLE]
, .
Corollary 1 Let assumptions of Theorem 1 be held.
*1) Then , a centered Gaussian process with covariation function *
[TABLE]
*2) Let be integer, *
[TABLE]
, ,
[TABLE]
. Then converges weakly to a chi-squared distribution with degrees of freedom.
Note that ordering by , , can be viewed as ordering by with (quantile function). In this case .
The next corollary is proved in [29].
Corollary 2 *Let , , , are order statistics of i.i.d. , random variables are i.i.d. and independent of them, , , . Then , a centered Gaussian process with covariance function *
[TABLE]
The next corollary is a partial case of Theorem 1 in [30].
Corollary 3 Let , , , are order statistics of i.i.d. , random variables are i.i.d. and independent of them, , , . Then , a centered Gaussian process with covariance function
[TABLE]
3 Proof of Theorem 1
Note that
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Note that a.s. uniformely on compact sets, and a.s.
So we study process
[TABLE]
This process is a bounded linear functional of -dimensional process
[TABLE]
We use the functional central limit theorem for induced order statistics by Davydov and Egorov [17].
We assume that , and are independent, are i.i.d., , .
Let see rows . We have
[TABLE]
The conditional covariance matrix of the vector is
[TABLE]
Let be an upper triangular matrix such that . Then
[TABLE]
Here is an upper triangular matrix such that . By Theorem 1 of Davydov and Egorov [17] the process
[TABLE]
converges weakly in the uniform metrics to the Gaussian process
[TABLE]
Here is an -dimensional standard Wiener process.
So process
[TABLE]
converges weakly in the uniform metrics to the Gaussian process ,
[TABLE]
.
By the noted convergencies a.s. uniformely on compact sets, a.s., the process has the same weak limit .
The covariance function of the limiting Gaussian process is
[TABLE]
[TABLE]
[TABLE]
[TABLE]
The proof is complete.
Acknowledgement
The research was supported by RFBR grant 17-01-00683.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aue A., Horvath L., Huskova M., Kokoszka P., 2008. Testing for change in polynomial regression. Bernoulli 14, 637–660.
- 2[2] Aue A., Horvath L., 2013. Structural breaks in time series. Journal of Time Series Analysis 34:1, 1–16.
- 3[3] Aue A., Rice G., Sonmez O., 2018. Detecting and dating structural breaks in functional data without dimension reduction. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 80:3, 509–529.
- 4[4] Balakrishnan N. and Cohen A.C., 1991. Order Statistics and Inference: Estimation Methods, Academic Press, Boston, MA.
- 5[5] Barnett V., 1976. The ordering of multivariate data, J. Roy. Statist. Soc. Ser. A 139, 318–354.
- 6[6] Barnett V., Green P.J. and Robinson A., 1976. Concomitants and correlation estimates, Biometrika 63, 323–328.
- 7[7] Bhattacharya P.K., 1974. Convergence of sample paths of normalized sums of induced order statistics, The Annals of Statist. 2, 1034–1039.
- 8[8] Bhattacharya P.K., 1976. An invariance principle in regression analysis, The Annals of Statist. 4, 621–624.
