On estimation and prediction in spatial functional linear regression model
St\'ephane Bouka, Sophie Dabo-Niang, Guy Martial Nkiet

TL;DR
This paper develops a smoothing spline estimator for spatial functional linear regression, providing finite sample bounds for variance and prediction error, supported by simulation studies.
Contribution
It introduces a novel estimator for the spatial functional linear model and derives finite sample bounds under spatial dependence.
Findings
Finite sample variance bounds for the estimator.
Prediction error bounds established.
Simulation results validate theoretical findings.
Abstract
We consider a spatial functional linear regression, where a scalar response is related to a square integrable spatial functional process. We use a smoothing spline estimator for the functional slope parameter and establish a finite sample bound for variance of this estimator under mixing spatial dependence. Then, we give a bound of the prediction error. Finally, we illustrate our results by simulations
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpatial and Panel Data Analysis · Statistical Methods and Inference · Spondyloarthritis Studies and Treatments
On estimation and prediction in spatial functional linear regression model
Stéphane BOUKA1, Sophie DABO-NIANG2,3 and Guy Martial NKIET1
1 URMI, Université des Sciences et Techniques de Masuku, Franceville, Gabon.
2Laboratoire LEM, CNRS 9221, Université de Lille, France.
3INRIA-MODAL, Lille, France.
E-mail : [email protected]; [email protected]; [email protected].
**Abstract.**We consider a spatial functional linear regression, where a scalar response is related to a square integrable spatial functional process. We use a smoothing spline estimator for the functional slope parameter and establish a finite sample bound for variance of this estimator under mixing spatial dependence. Then, we give a bound of the prediction error. Finally, we illustrate our results by simulations.
**AMS 1991 subject classifications: **60G60; 62F12.
Key words: Functional linear regression; spatial functional process ; mixing spatial dependence
1 Introduction
Consider the following spatial functional linear regression model where the spatial scalar response is related to a square integrable spatial functional process through
[TABLE]
where is a constant, is the domain of , is a space of functions endowed with a semi-norm, is an unknown function representing the slope function, and is a centered random spatial noise and with variance . The functional linear regression with functional or scalar response has been the focus of various investigations. There exist many contributions in this field for non spatial data, and recent references are: [1], [6], [7], [8], [15], [16], [17], [18], [23]. This work is motivated by a large number of applications for which the data are of spatial nature. For example, non-parametric prediction from kriging methods for geostatistical functional data was tackled in [3], [4], [11], [12], [13], [14] and [19] whereas spatial autoregressive functional models were considered in [20, 21]. In this paper, we are interested in estimation of the slope function in model (1). To the best of our knowledge, this problem have not yet been considered for the basic spatial functional linear regression model, but only for non spatial data (e.g [7]) or for spatial linear regression model with derivatives (see [5]). The paper is organized as follow. The section 2 is devoted to the estimation of the estimator that will be use. Assumptions and main results are stated in Section 3, and a simulation study is given in Section 4. The proofs are postponed to Section 5.
2 Smoothing splines estimation of slope function
In this section, we give an estimator of in (1) by using an approach similar to the one of [7]. Since this procedure of estimation does not take into account the nature of the dependence of the data, we obtain an estimator that has the same form than that of [7]. The process is defined on probability space with the same distribution as a couple of variable . For with , let be a grid of points in and consider observations . We assume that the random functions are observed at equidistant points , with for all . By using the lexico-graphic order, the previous sample is rewritten as , then we put (where denotes the transposed of ) and we consider the matrix with general term for , . Then, we consider the estimator of given by
[TABLE]
where is a smoothing parameter, is a symmetric matrix defined from B-splines (see [7] for details), is a functional basis of the -dimensional linear space of functions having a -th order derivative that belongs to , and is the matrix with general term for . For estimating the intercept we take , where denotes the usual inner product of .
3 Assumptions and main results
In this section, we first introduce the assumptions that are needed to obtain the main results of the paper, then theorems that give the rate of convergence of the estimator and also that of the prediction at a non-visited site are established.
3.1 Assumptions
Assumption 1
* is -times differentiable and belongs to .*
Assumption 2
There exist , and such that, for any ,
Assumption 3
For and all there exists a -dimensional linear subspace of and a real such that
[TABLE]
Assumption 4
For any ,
[TABLE]
where and is a complete orthonormal system of eigenfunctions of the operator from to itself defined by:
[TABLE]
each being associated with the -th largest eigenvalue .
Assumptions 1–4 are technical conditions that are similar to the ones considered in [7]. In order to give the remaining assumptions, let us first recall the notion of polynomial mixing dependence. Letting be the -mixing coefficient given, for two sub -algebras and of , by
[TABLE]
we consider the strong mixing coefficient (see [10]) related to a random field , defined as
[TABLE]
where and the distance is defined for any subsets and of by where is the usual Euclidean norm of . Then, is polynomial mixing if the related strong mixing coefficients satisfy , .
Assumption 5
* is a strictly stationary random field, polynomial mixing, independent of and such that almost surely, where is a strictly positive constant.*
Assumption 6
* is a strictly stationary and polynomial mixing random field.*
Assumption 7
There exists such that for all , almost surely.
Assumptions 5 and 6 are classical assumptions (see [2]). Assumption 7 has already been made in some works (see, e.g., [18]).
Assumption 8
* is an isotropic process such that for all , in ,*
[TABLE]
where is a positive function and is a known -valued decreasing function that verified .
The separable covariance structure stated in Assumption 8 has also been used in [17]. Examples on isotropic spatial models can be founded in [9]. We may mention for instance, the exponential spatial model.
3.2 The results
We consider the semi-norm defined by
[TABLE]
and the discretized empirical semi-norm defined for any as
[TABLE]
The following theorem gives a bound of the estimator’s variance. In this theorem, refers to the conditional expectation given .
Theorem 1
Under Assumptions 1, 5, 6 and 7 with , , for all , if the eigenvalues of satisfy with , and , then
[TABLE]
where , and stands the integer part of .
Using Theorem 1 and Arguing as in [7] , we obtain the Corollary below.
Corollary 1
*Under assumptions of Theorem 1 together with
Assumptions 2-4, as well as , , as we have*
[TABLE]
Next, we give a bound for prediction error. For that, we assume what follows
Assumption 9
The non-visited site is such that
[TABLE]
In this Assumption 9, it is sufficient to choice large for doing the prediction at any non-visited site.
we consider the prediction and the ”theoretical” prediction at a non-visited site such that has the same distribution than . In fact,
[TABLE]
We are interested by the bound of the prediction error between and .
Theorem 2
Suppose that assumptions of Corollary 1 together with assumptions 8–9 hold. If , , and is chosen sufficiently large compared to , then
[TABLE]
4 A simulation study
This section presents the results of simulations made in order to evaluate the performances of the proposed methods for slope estimation and prediction in the model (1). We computed estimation and prediction errors from simulated spatial data in . Using the lexico-graphic order, we generated a sample as follows: we consider the -th first elements of the B-splines basis. For , we generate a vector from a normal distribution in , where is the covariance matrix with general term . Further, we generate a vector from a normal distribution in , where is the covariance matrix with general term , and for we take
[TABLE]
Considering equispaced points in , we compute each by approximating the integral in equation (1) using the rectangular method. That gives
[TABLE]
where , , the vector is generated from a normal distribution with controlled by the signal-to-noise ratio (snr) defined by
[TABLE]
and is a given function. We considered two cases for the function given by:
Case A : ;
Case B : .
The estimator of in model is computed by using the function ”fregre.basis” of the fda package. We assess performance of our methods through the semi-norm defined in (15) for evaluating the estimation error between and , and through the mean squared error (MSE) for evaluating the prediction error between the prediction and the ”theoretical” prediction at the non-visited site . is obtained by the ordinary krigging method, and and are obtained as defined in (8). We take and over replications and we obtain the following tables.
[TABLE]
Table 1: Estimation errors
[TABLE]
Table 2: Prediction errors at a non-visited site
Table 1 and Table 2 present, respectively, the obtained estimation errors and prediction mean squared errors for different sample sizes and snr. The site is beyond the grid of size whereas it is inside the grid of size . We remark that when this point is inside the grid the prediction errors decrease as the sample size increases. Also, we see that estimation and prediction errors are small even when the sample size and the snr increase.
Conclusion
In this paper, we propose to study asymptotic properties of a smoothing splines estimator of slope function in a spatial functional linear regression model, where a scalar response is related to a square integrable spatial functional process. The originality of the proposed method is to consider spatially dependent data. The main difficult is technical, especially in the proof of the prediction error because of the presence of the data spatial dependency. The prediction proposed in this work is available as well as for the points inside the grid than those beyond the grid compared to [5] where the prediction is only available for the points beyond the grid. One can then see the proposed methodology as a good alternative to [7] when available data are spatially dependent.
5 Proofs
5.1 A useful lemma
Let
[TABLE]
then we have:
Lemma 1
.
Proof. Since is a symmetric nonnegative matrix, its has a square root, denoted by , that is also a symmetric nonnegative matrix. Denoting by the inverse of and by the identity matrix, we have:
[TABLE]
Then from the spectral decomposition where the ’s are the nonegative eigenvalues and is an orthonormal basis of consisting of eigenvectors, it follows:
[TABLE]
Therefore, since , we deduce that Finally,
[TABLE]
5.2 Proof of Theorem 1
Putting
[TABLE]
we have
[TABLE]
where , with . Putting , we deduce from and the strict stationarity that
[TABLE]
Notice that, putting , we have
[TABLE]
Then, using the Cauchy-Schwartz inequality as well as Lemma in [22], we obtain, under Assumption 1:
[TABLE]
where is a positive constant. Since , this finally gives:
[TABLE]
where is a positive constant. Therefore, from (11), it follows:
[TABLE]
Clearly, , where is defined in (9). Then, from Lemma 1, and the proof of Theorem 1 in Crambes et al (2009) (see p. 55-56) that shows that where and are positive constants, it follows:
[TABLE]
Then, we deduce from (13) and (14) that
[TABLE]
where is a positive constant. On the other hand,
[TABLE]
Then, using (12), we obtain , and
[TABLE]
where is a positive constant. Note that , where
[TABLE]
then
[TABLE]
and, putting , we deduce from this inequality and from (14) and (16) that
[TABLE]
5.3 Proof of Theorem 2
5.3.1 Lemma
Lemma 2
Under assumptions Theorem 2, we have
[TABLE]
5.3.2 Proofs
Proof of Lemma 2 :
We have
[TABLE]
On the one hand, from assumption 1, we have . On the other hand, for large enough, we have
[TABLE]
Set , where , . Then, by definition of , we have
[TABLE]
The first and second term on the right-hand side of (5.3.2) are bounded as in [7] (see p. 57), that is to say
[TABLE]
[TABLE]
Set where
. We have
[TABLE]
from Assumption 5, we have
[TABLE]
and since
[TABLE]
it follows that
[TABLE]
We then obtain the result of Lemma 2.
Proof of Theorem 2 :
[TABLE]
Since, from assumption 7 and Lemma 2, we have
[TABLE]
and , it follows from Lemma in [22], Assumption 7 and Lemma 2 that
[TABLE]
where is a positive constant. However, we have from Lemma in [22] that
[TABLE]
Since if and [math] otherwise, it follows from Lemma 2 and Assumption 7 that
[TABLE]
where is a positive constant. From assumption 9 and Jensen inequality, we have
[TABLE]
where , and are positive constants. From Assumption 7 and Lemma 2, we have
[TABLE]
where , , and are positive constants. Then
[TABLE]
where and are positive constants. Applying Corollary 1 with , , we obtain the result of Theorem 2.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aue A, Norinho DD, Hormann S. On the Prediction of Stationary Functional Time Series. J. Amer. Statist. Assoc. 2015; 110:378–392.
- 2[2] Biau G, Cadre B. Nonparamatric Spatial Prediction. Stat. Inference Stoch. Process. 2004; 7:327–349.
- 3[3] Bohorquez M, Giraldo R, Matheu J. Optimal sampling for spatial prediction of functional data. Stat. Methods Appl. 2016; 25: 39–54.
- 4[4] Bohorquez M, Giraldo R, Matheu J. Multivariate functional random fields: prediction and optimal sampling. Stoch. Environ. Res. Risk Assess. 2017; 31:53–70.
- 5[5] Bouka S, Dabo-Niang S, Nkiet GM. (2018). On estimation in spatial functional regression with derivatives. C. R. Acad. Sci. Paris Ser. I 2018; 356:558–562.
- 6[6] Comte F, Johannes J. Adaptive functional linear regression. Ann. Statist. 2012; 40:2765–2797.
- 7[7] Crambes C, Kneip A, Sarda P. Smoothing splines estimators for functional linear regression. Ann. Statist. 2009; 37: 35–72.
- 8[8] Cuevas A. A partial overview of the theory of statistics with functional data. J. Statist. Plan. Inf. 2014; 147:1-23.
