Classical and bayesian componentwise predictors for non-compact correlated ARH(1) processes
M. Dolores Ruiz-Medina, J. \'Alvarez-Li\'ebana

TL;DR
This paper investigates classical and Bayesian componentwise predictors for a special class of Gaussian ARH(1) processes with non-Hilbert-Schmidt autocorrelation operators, providing theoretical analysis and simulation validation.
Contribution
It introduces and analyzes diagonal classical and Bayesian estimators for non-compact ARH(1) processes, proving their asymptotic efficiency and equivalence.
Findings
Both estimators are asymptotically efficient.
Classical and Bayesian predictors are asymptotically equivalent.
Simulation confirms theoretical results.
Abstract
A special class of standard Gaussian Autoregressive Hilbertian processes of order one (Gaussian ARH(1) processes), with bounded linear autocorrelation operator, which does not satisfy the usual Hilbert-Schmidt assumption, is considered. To compensate the slow decay of the diagonal coefficients of the autocorrelation operator, a faster decay velocity of the eigenvalues of the trace autocovariance operator of the innovation process is assumed. As usual, the eigenvectors of the autocovariance operator of the ARH(1) process are considered for projection, since, here, they are assumed to be known. Diagonal componentwise classical and bayesian estimation of the autocorrelation operator is studied for prediction. The asymptotic efficiency and equivalence of both estimators is proved, as well as of their associated componentwise ARH(1) plugin predictors. A simulation study is undertaken to…
| Sample size | Classical estimator | Bayes estimator |
|---|---|---|
| Sample size | Classical predictor | Bayes predictor |
|---|---|---|
| Sample size | Classical estimator | Bayes estimator |
|---|---|---|
| Sample size | Classical predictor | Bayes predictor |
|---|---|---|
| Sample size | Classical estimator | Bayes estimator | |
|---|---|---|---|
| Sample size | Classical predictor | Bayes predictor | |
|---|---|---|---|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification · Financial Risk and Volatility Modeling · Statistical Methods and Inference
Classical and bayesian componentwise predictors for non-compact correlated ARH(1) processes
M. Dolores Ruiz–Medina1 and Javier Álvarez-Liébana1
Summary
A special class of standard Gaussian Autoregressive Hilbertian processes of order one (Gaussian ARH(1) processes), with bounded linear autocorrelation operator, which does not satisfy the usual Hilbert-Schmidt assumption, is considered. To compensate the slow decay of the diagonal coefficients of the autocorrelation operator, a faster decay velocity of the eigenvalues of the trace autocovariance operator of the innovation process is assumed. As usual, the eigenvectors of the autocovariance operator of the ARH(1) process are considered for projection, since, here, they are assumed to be known. Diagonal componentwise classical and bayesian estimation of the autocorrelation operator is studied for prediction. The asymptotic efficiency and equivalence of both estimators is proved, as well as of their associated componentwise ARH(1) plugin predictors. A simulation study is undertaken to illustrate the theoretical results derived.
Published in REVSTAT (in press)
https://www.ine.pt/revstat/pdf/Classicalandbayesiancomponentwise.pdf
1 Department of Statistics and O. R., University of Granada, Spain.
E-mail: [email protected]
Key words: Asymptotic efficiency; autoregressive Hilbertian processes; bayesian estimation; classical moment-based estimation; functional prediction; non-compact bounded autocorrelation operators.
1 Introduction
Functional time series theory plays a key role in the analysis of high-dimensional data (see, for example, Aue et al. [2015], Bosq [2000], Bosq and Blanke [2007]). Inference for stochastic processes can also be addressed from this framework (see Álvarez-Liébana et al. [2016] in relation to functional prediction of the Ornstein–Uhlenbeck process, in an ARH(1) process framework). Bosq [2000] addresses the problem of infinite–dimensional parameter estimation and prediction of ARH(1) processes, in the cases of known and unknown eigenvectors of the autocovariance operator. Alternative projection methodologies have been adopted, for example, in Antoniadis and Sapatinas [2003], in terms of wavelet bases, and Besse and Cardot [1996], in terms of spline bases. The book by Bosq and Blanke [2007] provides a general overview on statistical prediction, including Bayesian predictors, inference by projection and kernel methods, empirical density estimation, and linear processes in high–dimensional spaces (see also Blanke and Bosq [2015] on Bayesian prediction for stochastic processes). Recently, Bosq and Ruiz-Medina [2014] have derived new results on asymptotic efficiency and equivalence of classical and Bayes predictors for –valued Poisson process, where, as usual, denotes the Hilbert space of square summable sequences. Classical and Bayesian componentwise parameter estimators of the mean function and autocovariance operator, characterizing Gaussian measures in Hilbert spaces, are also compared in terms of their asymptotic efficiency, in that paper.
We first recall that the class of processes studied here could be of interest in applications, for instance, in the context of anomalous physical diffusion processes (see, for example, Gorenflo and Mainardi [2003], Meerschaert et al. [2002], Metzler and Klafter [2004], and the references therein). An interesting example of our framework corresponds to the case of spatial fractal diffusion operator, and regular innovations. Specifically, the class of standard Gaussian ARH(1) processes studied have a bounded linear autocorrelation operator, admitting a weak–sense diagonal spectral representation, in terms of the eigenvectors of the autocovariance operator. The sequence of diagonal coefficients, in such a spectral representation, displays an accumulation point at one. The singularity of the autocorrelation kernel is compensated by the regularity of the autocovariance kernel of the innovation process. Namely, the key assumption here is the summability of the quotient between the eigenvalues of the autocovariance operator of the innovation process and of the ARH(1) process. Under suitable conditions, the asymptotic efficiency and equivalence of the studied diagonal componentwise classical and Bayesian estimators of the autocorrelation operator are derived (see Theorem 4.1 below). Under the same setting of conditions, the asymptotic efficiency and equivalence of the corresponding classical and Bayesian ARH(1) plug–in predictors are proved as well (see Theorem 4.2 below). Although both theorems only refer to the case of known eigenvectors of the autocovariance operator, as illustrated in the simulation study undertaken in Álvarez-Liébana et al. [2017] (see also Álvarez-Liébana [2017], Ruiz-Medina and Álvarez-Liébana [2018a]), a similar performance is obtained for the case of unknown eigenvectors, in comparison with other componentwise, kernel–based, wavelet-based penalized and nonparametric approaches adopted in the current literature (see Antoniadis and Sapatinas [2003], Besse and Cardot [1996], Bosq [2000], Guillas [2001], Mas [1999]).
Note that, for being the unknown parameter, in order to compute with denoting the functional sample, we suppose that
[TABLE]
which leads to
[TABLE]
Here, for each and for each with being the inner product in the real separable Hilbert space . Note that denotes an orthonormal basis of diagonalizing the common autocovariance operator of We can then perform an independent computation of the respective posterior distributions of the projections of parameter with respect to the orthonormal basis of
Finally, some numerical examples are considered to illustrate the results derived on asymptotic efficiency and equivalence of moment–based classical and Beta–prior–based Bayes diagonal componentwise parameter estimators, and the associated ARH(1) plug–in predictors.
2 Preliminaries
The preliminary definitions and results needed in the subsequent development are introduced in this section. We first refer to the usual class of standard ARH(1) processes introduced in Bosq [2000].
Definition 2.1
Let be a real separable Hilbert space. A sequence of –valued random variables on a basic probability space is called an autoregressive Hilbertian process of order one, associated with if it is stationary and satisfies
[TABLE]
where is a Hilbert–valued white noise in the strong sense (i.e., a zero–mean stationary sequence of independent valued random variables with for every ), and with being the space of linear bounded operators on For each and are assumed to be uncorrelated.
If there exists a positive such that then, the ARH(1) process in (1) is standard, and there exists a unique stationary solution to equation (1) admitting a MAH() representation (see [Bosq, 2000, Theorem 3.1, p. 74]).
The autocovariance and cross–covariance operators are given, for each , by
[TABLE]
where, for
[TABLE]
defines a Hilbert–Schmidt operator on The operator is assumed to be in the trace class. In particular,
[TABLE]
It is well-known that, from equations (1)–(2), for all (see, for example, Bosq [2000]). However, since is a nuclear or trace operator, its inverse operator is an unbounded operator in Different methodologies have been adopted to overcome this problem in the current literature on ARH(1) processes. In particular, here, we consider the case where under Assumption A2 below, since is assumed to be strictly positive. That is, its eigenvalues are strictily positive and the kernel space of is trivial. In addition, they are assumed to have multiplicity one. Therefore, for any there exist such that and and
[TABLE]
In particular,
[TABLE]
Assumption A1. The operator in (1) is self–adjoint with
Assumption A2. The operator is strictly positive, and its positive eigenvalues have multiplicity one. Furthermore, and admit the following diagonal spectral decompositions, such that for all
[TABLE]
where and are the respective systems of eigenvalues of and and is the common system of orthonormal eigenvectors of the autocovariance operator
Remark 2.1
As commented before, we consider here the case where the eigenvectors of the autocovariance operator are known. Thus, under Assumption A2, the natural way to formulate a componentwise estimator of the autocorrelation operator is in terms of the respective estimators of its diagonal coefficients computed from the respective projections of the observed functional data, into . We adopt here a moment–based classical and Beta–prior–based Bayesian approach in the estimation of such coefficients
From the Cauchy–Schwarz’s inequality, applying the Parseval’s identity,
[TABLE]
Thus, equation (4) holds in the weak sense.
From Assumption A2, the projection of into the common eigenvector system leads to the following series expansion in
[TABLE]
and, for each and
[TABLE]
where the last equality is obtained from the orthonormality of the eigenvectors Hence, under Assumptions A1–A2, the projection of equation (1) into the elements of the common eigenvector system leads to the following infinite-dimensional system of equations:
[TABLE]
or equivalently,
[TABLE]
where
[TABLE]
Thus, for each
[TABLE]
defines a standard AR(1) process. Its moving average representation of infinite order is given by
[TABLE]
Specifically, under Assumption A2,
[TABLE]
where
[TABLE]
From equation (9), under Assumptions A1–A2,
[TABLE]
with, as before,
[TABLE]
Equation (10) leads to the identity
[TABLE]
from which, we obtain
[TABLE]
Under (11), equation (7) can also be rewritten as
[TABLE]
Assumption A2B. The sequences
[TABLE]
satisfy
[TABLE]
Equation (13) means that and are both summable sequences, with faster decay to zero of the sequence than the sequence leading, from equations (11)–(12), to the definition of as a sequence with accumulation point at one.
Remark 2.2
Under Assumption A2B and A3 below holds.
For each from equations (6)–(8),
[TABLE]
where
[TABLE]
Hence, for every and
Assumption A3. There exists a sequence of real-valued independent random variables such that
[TABLE]
with
[TABLE]
Remark 2.3
Note that the mean value of
[TABLE]
is of order and the mean value of
[TABLE]
is of order Hence, for the almost surely boundedness of the inverse of
[TABLE]
by a suitable sequence of random variables with summable –moments, for the eigenvalues of operator must be close to one but strictly less than one. As commented in Remark 2.2, from Assumption A2B, this condition is satisfied in view of equation (12).
Assumption A4. with, as before, denoting the Kronecker delta function, for every and
Remark 2.4
Assumption A4* implies that the cross–covariance operator admits a diagonal spectral decomposition in terms of the system of eigenvectors Thus, under Assumption A4, the diagonal spectral decompositions (3)–(4) also hold.*
The classical diagonal componentwise estimator of considered here is given by
[TABLE]
From equations (6)–(7) and (11), for each
[TABLE]
[TABLE]
Remark 2.5
It is important to note that, for instance, unconditional bases, like wavelets, provide the spectral diagonalization of an extensive family of operators, including pseudodifferential operators, and in particular, Calderón–Zygmund operators (see Kyriazis and Petrushev [2001], Meyer and Coifman [1997]). Therefore, the diagonal spectral representations (3)–(4), in Assumption A2, hold for a wide class of autocovariance and cross-covariance operators, for example, in terms of wavelets. When the autocovariance and the cross–covariance operators are related by a continuous function, the diagonal spectral representations (3)–(4) are also satisfied (see [Dautray and Lions, 1990, pp. 119, 126 and 140]). Assumption A2 has been considered, for example, in [Bosq, 2000, Theorem 8.5, pp. 215–216; Theorem 8.7, p. 221], to establish strong consistency, although, in this book, a different setting of conditions is assumed. Thus, Assumptions A1–A2 already have been used (e.g., in Bosq [2000], Álvarez-Liébana et al. [2017], Ruiz-Medina and Álvarez-Liébana [2018a]), and Assumptions A2B, A3 and A4 appear in Ruiz-Medina et al. [2016]. Assumptions A2B is needed since the usual assumption on the Hilbert–Schmidt property of made by several authors, is not considered here. At the same type, as commented before, Assumptions A2B implies Assumption A3.
The following lemmas will be used in the derivation of the main results of this paper, Theorems 4.1 and 4.2, obtained in the Gaussian ARH(1) context.
** Lemma 2.1**
Let be the values of a standard zero–mean autoregressive process of order one (AR(1) process) at times and
[TABLE]
with representing the random initial condition. Assume that and that the innovation process is white noise. Then, as
[TABLE]
The proof of Lemma 2.1 can be found in [Hamilton, 1994, p. 216].
** Lemma 2.2**
Let and be two normal distributed random variables having correlation and with means and and variances and respectively. Then, the following identities hold:
[TABLE]
(see, for example, Aroian [1947], Ware and Lad [2003]).
** Lemma 2.3**
For each the following limit is obtained:
[TABLE]
(see, for example, Bartlett [1946]).
3 Bayesian diagonal componentwise estimation
Now let us denote by the functional random variable on the basic probability space characterized by the prior distribution for In our case, we assume that is of the form
[TABLE]
where, for is a real–valued random variable such that almost surely, for every In the following, is assumed to follow a beta distribution with shape parameters and ; i.e., for every We also assume that is independent of the functional components of the innovation process and that the random variables are globally independent. That is, for each
[TABLE]
Thus,
[TABLE]
where the last identity is understood in the weak–sense; i.e., in the sense of equation (19).
In the definition of from we can then apply the Kolmogorov extension Theorem under the condition
[TABLE]
(see, for example, Khoshnevisan [2007]).
As in the real–valued case (see Supplementary Material 7), considering for each the Bayes estimator of is defined by (see Case 2 in Supplementary Material 7)
[TABLE]
with, for every
[TABLE]
where
[TABLE]
4 Asymptotic efficiency and equivalence
In this section, sufficient conditions are derived to ensure the asymptotic efficiency and equivalence of the diagonal componentwise estimators of formulated in the classical (see equation (15)), and in the Bayesian (see equations (20)–(22)) frameworks.
** Theorem 4.1**
Under Assumptions A1–A2, A2B, A3 and A4, let us assume that the ARH(1) process satisfies, for each and, for every
[TABLE]
That is, and are almost surely positive empirically correlated. In addition, for every the hyper–parameters and of the beta prior distribution, are such that Then, the following identities are obtained:
[TABLE]
where is defined in equation (15), and is defined from equations (20)–(22), considering
[TABLE]
with, as before, for each
[TABLE]
and and are given in (22), for every
Proof. Under Assumptions A1–A2, from Remark 8.1 and Corollary 8.1 in Supplementary Material 8, for each and for sufficiently large,
[TABLE]
Also, under (23),
[TABLE]
which is equivalent to
[TABLE]
for every
From (26), to obtain the following a.s. inequality:
[TABLE]
it is sufficient that
[TABLE]
which is equivalent to
[TABLE]
That is, keeping in mind that
[TABLE]
condition (28) can also be expressed as
[TABLE]
i.e.,
[TABLE]
for Since, for each
[TABLE]
it is sufficient that
[TABLE]
to hold to ensure that inequality (27) is satisfied. Furthermore, from Remark 8.1 and Corollary 8.1, in Supplementary Material 8, for each and
[TABLE]
Also, we have, from such remark and theorem, that
[TABLE]
Thus, for each the upper bound, in (29), diverges as which means, that, for sufficiently large, inequality (27) holds, if for each Now, from (27), under Assumption A3, for each
[TABLE]
Furthermore, for each and as almost surely. Hence,
[TABLE]
From equation (25), we then have that, for each
[TABLE]
almost surely. Thus, the almost surely convergence, when of and to the same limit is obtained, for every
From equation (30),
[TABLE]
Since applying the Dominated Convergence Theorem, from equation (32), considering (18) we obtain, for each
[TABLE]
Under Assumptions A3, from (30), for each and for every
[TABLE]
with
[TABLE]
Applying again the Dominated Convergence Theorem (with integration performed with respect to a counting measure), we obtain from (33), keeping in mind relationship (12),
[TABLE]
in view of equation (13) in Assumption A2B. That is, equation (24) holds.
** Theorem 4.2**
Under the conditions of Theorem 4.1,
[TABLE]
Here,
[TABLE]
Proof.
From equation (LABEL:A5:eqfconv), for every
[TABLE]
In addition, from equation (32), for every
[TABLE]
with
[TABLE]
under Assumption A3. Applying the Dominated Convergence Theorem from (36), the almost surely convergence in (35) implies the convergence in mean to zero, when Furthermore, under Assumption A3, for
[TABLE]
From (37), for every
[TABLE]
Equation (38) means that the rate of convergence to zero, as of the functional sequence in the space is of order
From definition of the norm in the space bounded linear operators, applying the Cauchy–Schwarz’s inequality, we obtain
[TABLE]
From the orthogonal expansion (5) of , in terms of the independent real–valued standard Gaussian random variables we have
[TABLE]
[TABLE]
Thus, and have the same limit in the space
We now prove the approximation by of the limit, in equation (34). Consider
[TABLE]
where
[TABLE]
From Lemmas 2.1– 2.2 (see the last identity in equation (17)), for each and for sufficiently large,
[TABLE]
Under Assumption A3, from equations (14)–(16), for every
[TABLE]
[TABLE]
since
[TABLE]
by the trace property of Here, we have applied the Cauchy–Schwarz’s inequality to obtain, for a certain constant
[TABLE]
from the trace property of and since
[TABLE]
under Assumption A3.
From equations (18) and (44), one can get, applying the Dominated Convergence Theorem,
[TABLE]
where we have considered that
[TABLE]
5 Numerical examples
This section illustrates the theoretical results derived on asymptotic efficiency and equivalence of the proposed classical and Bayesian diagonal componentwise estimators of the autocorrelation operator, as well as of the associated ARH(1) plug–in predictors. Under the conditions assumed in Theorem 4.1, three examples of standard zero–mean Gaussian ARH(1) processes are generated, respectively corresponding to consider different rates of convergence to zero of the eigenvalues of the autocovariance operator. The truncation order in Examples 1–2 (see Sections 5.1–5.2) is fixed; i.e., it does not depend on the sample size (see equations (46)–(47) below). While in Example 3 (see Section 5.3), is selected such that
[TABLE]
Specifically, in the first two examples, the choice of is driven looking for a compromise between the sample size and the number of parameters to be estimated. With this aim the value is fixed, independently of This is the number of parameters that can be estimated in an efficient way, from most of the values of the sample size studied. In Example 3, the truncation parameter is defined as a fractional power of the sample size. Note that Example 3 corresponds to the fastest decay velocity of the eigenvalues of the autocovariance operator. Hence, the lowest truncation order for a given sample size must be selected according to the truncation rule (45).
The generation of realizations of the functional values , for
[TABLE]
denoting as before the sample size, is performed, for each one of the ARH(1) processes, defined in the three examples below. Based on those generations, and on the sample sizes studied, the truncated empirical functional mean-square errors of the classical and Bayes diagonal componentwise parameter estimators of the autocorrelation operator are computed as follows:
[TABLE]
where can be the classical or the Bayes diagonal componentwise estimator of the autocorrelation operator, and ω denotes the sample point associated with each one of the realizations generated of each functional value of the ARH(1) process
On the other hand, as assumed in the previous section,
[TABLE]
for each . Thus, parameters are defined as follows:
[TABLE]
where
[TABLE]
with being a random sequence such that its elements tend to be concentrated around point one, when From (49), since
[TABLE]
Assumption A2B is satisfied. In addition, condition (23) is verified in the generations performed in the Gaussian framework.
Example 1
Let us assume that the eigenvalues of the autocovariance operator of the ARH(1) process are given by
[TABLE]
Thus, is a strictly positive and trace operator, where
[TABLE]
Tables 1–2 display the values of the empirical functional mean–square errors, given in (46)–(47), associated with and and with the corresponding ARH(1) plug–in predictors, with, as before,
[TABLE]
considering The respective graphical representations are displayed in Figures 1–2, where, for comparative purposes, the values of the curve are also drawn for the finite sample sizes (51).
Example 2
In this example, a bit slower decay velocity, than in Example 1, of the eigenvalues of the autocovariance operator of the ARH(1) process is considered. Specifically,
[TABLE]
Thus, is a strictly positive self-adjoint trace operator, where and are generated, as before, from (48)-(50).
Tables 3–4 show the values of the empirical functional mean–square errors, associated with and and with the corresponding ARH(1) plug–in predictors, respectively. Figures 3–4 provide the graphical representations in comparison with the values of the curve for given in (51), with, as before, .
Example 3
It is well–known that the singularity of the inverse of the autocovariance operator increases, when the rate of convergence to zero of the eigenvalues of indicates a faster decay velocity, as in this example. Specifically, here,
[TABLE]
As before, and are generated from (48)-(50). The truncation order satisfies
[TABLE]
(see also the simulation study undertaken in Álvarez-Liébana et al. [2017], for the case of being a Hilbert–Schmidt operator). In particular, (52) holds for Thus, and we consider i.e.,
Tables 5–6 show the empirical functional mean–square errors associated with and and with the corresponding ARH(1) plug–in predictors, respectively. As before, Figures 5–6 provide the graphical representations, and the values of the curve for in (51), with the aim of illustrating the rate of convergence to zero of the truncated empirical functional mean quadratic errors.
In Examples 1–2 in Sections 5.1–5.2, where a common fixed truncation order is considered, we can observe that the biggest values of the empirical functional mean–square errors are located at the smallest sample sizes, for which the number of parameters to be estimated is too large, with a slightly worse performance for those sample sizes, in Example 3 in Seciton 5.2, where a slower decay velocity, than in Example 1, of the eigenvalues of the autocovariance operator is considered. Note that, on the other hand, when a slower decay velocity of the eigenvalues of is given, a larger truncation order is required to explain a given percentage of the functional variance. For the fastest rate of convergence to zero of the eigenvalues of the autocovariance operator in Example 3, to compensate the singularity of the inverse covariance operator a suitable truncation order is fitted, depending on the sample size obtaining a slightly better performance than in the previous cases, where a fixed truncation order is studied.
6 Final comments
This paper addresses the case where the eigenvectors of are known, in relation to the asymptotic efficiency and equivalence of and and the associated plug-in predictors. However, as shown in the simulation study undertaken in Álvarez-Liébana et al. [2017], a similar performance is obtained in the case where the eigenvectors of are unknown (see also Bosq [2000] in relation to the asymptotic properties of the empirical eigenvectors of ).
In the cited references in the ARH(1) framework, the autocorrelation operator is usually assumed to belong to the Hilbert–Schmidt class. Here, in the absence of the compactness assumption (in particular, of the Hilbert–Schmidt assumption) on the autocorrelation operator singular autocorrelation kernels can be considered. As commented in the Section 1, the singularity of is compensated by the regularity of the autocovariance kernel of the innovation process, as reflected in Assumption A2B.
Theorem 4.1 establishes sufficient conditions for the asymptotic efficiency and equivalence of the proposed classical and Bayes diagonal componentwise parameter estimators of as well as of the associated ARH(1) plug-in predictors (see Theorem 4.2). The simulation study illustrates the fact that the truncation order should be selected according to the rate of convergence to zero of the eigenvalues of the autocovariance operator, and depending on the sample size Although, a fixed truncation order, independently of has also been tested in Examples 1–2, where a compromise between the rate of convergence to zero of the eigenvalues, and the rate of increasing of the sample sizes is found.
7 Supplementary Material: Bayesian estimation of real–valued autoregressive processes of order one
In this section, we consider the Beta–prior–based Bayesian estimation of the autocorrelation coefficient in a standard AR(1) process. Namely, the generalized maximum likelihood estimator of such a parameter is computed, when a beta prior is assumed for In the ARH(1) framework, we have adopted this estimation procedure in the approximation of the diagonal coefficients of operator with respect to in a Bayesian componentwise context. Note that we also denote by the autocorrelation coefficient of an AR(1) process, since there is no place for confusion here.
Let be an AR(1) process satisfying
[TABLE]
where and is a real–valued Gaussian white noise; i.e., are independent Gaussian random variables, with Here, we will use the conditional likelihood, and assume that are observed for sufficiently large to ensure that the effect of the random initial condition is negligible. A beta distribution with shape parameters and is considered as a-priori distribution on i.e., Hence, the distribution of has density
[TABLE]
where
[TABLE]
is the beta function.
We first compute the solution to the equation
[TABLE]
where
[TABLE]
Thus, the following equation must be solved:
[TABLE]
- Case 1
Considering and we obtain the solution
[TABLE]
- Case 2
The general case where is more intricate, since the solutions are and
[TABLE]
- Case 3
For we have
[TABLE]
8 Supplementary Material 2: strong–ergodic AR(1) processes
This section collects some strong–ergodicity results applied in this paper, for real–valued weak–dependent random sequences. In particular, their application to the AR(1) case is considered.
A real–valued stationary process is strongly–ergodic (or ergodic in an almost surely sense), with respect to if, as
[TABLE]
In particular, the following lemma provides sufficient condition to get the strong–ergodicity for all second–order moments (see, for example, [Stout, 1974, Theorem 3.5.8] and [Billingsley, 1995, p. 495]).
** Lemma 8.1**
Let be an i.i.d. sequence of real–valued random variables. If is a measurable function, then
[TABLE]
is a stationary and strongly–ergodic process for all second–order moments.
Lemma 8.1 is now applied to the invertible AR(1) case, when the innovation process is white noise.
Remark 8.1
If is a real–valued zero–mean stationary AR(1) process
[TABLE]
where is strong white noise, we can define the measurable (even continuous) function
[TABLE]
such that, from Lemma 8.1 and for each ,
[TABLE]
is a stationary and strongly–ergodic process for all second–order moments.
In the results derived in this paper, Remark 8.1 is applied, for each to the real–valued zero–mean stationary AR(1) processes
[TABLE]
with now representing an ARH(1) process.
** Corollary 8.1**
Under Assumptions A1–A2, for each let us consider the real–valued zero–mean stationary AR(1) process , such that, for each
[TABLE]
Here, is a real-valued strong white noise, for any . Thus, for each , is a stationary and strongly-ergodic process for all second-order moments. In particular, for any , as
[TABLE]
Acknowledgments
This work has been supported in part by projects MTM2012–32674 and MTM2015–71839–P (co-funded by Feder funds), of the DGI, MINECO, Spain.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Álvarez-Liébana [2017] \NAT@biblabelnum Álvarez-Liébana 2017 Álvarez-Liébana , J.: Functional time series: a review and comparative study. Submitted (2017)
- 2Álvarez-Liébana et al. [2016] \NAT@biblabelnum Álvarez-Liébana et al. 2016 Álvarez-Liébana , J. ; Bosq , D. ; Ruiz-Medina , M. D.: Consistency of the plug-in functional predictor of the Ornstein-Uhlenbeck in Hilbert and Banach spaces. Statist. Probab. Lett. 117 (2016), pp. 12–22. – DOI: doi.org/10.1016/j.spl.2016.04.023
- 3Álvarez-Liébana et al. [2017] \NAT@biblabelnum Álvarez-Liébana et al. 2017 Álvarez-Liébana , J. ; Bosq , D. ; Ruiz-Medina , M. D.: Asymptotic properties of a componentwise ARH(1) plug-in predictor. J. Multivariate Anal. 155 (2017), pp. 12–34. – DOI: doi.org/10.1016/j.jmva.2016.11.009
- 4Antoniadis and Sapatinas [2003] \NAT@biblabelnum Antoniadis and Sapatinas 2003 Antoniadis , A. ; Sapatinas , T.: Wavelet methods for continuous-time prediction using Hilbert-valued autoregressive processes. J. Multivariate Anal. 87 (2003), pp. 133–158. – DOI: doi.org/10.1016/S 0047-259X(03)00028-9
- 5Aroian [1947] \NAT@biblabelnum Aroian 1947 Aroian , L. A.: The probability function of a product of two normal distributed variables. Ann. Math. Statist. 18 (1947), pp. 256–271. – DOI: doi.org/10.1214/aoms/1177730442
- 6Aue et al. [2015] \NAT@biblabelnum Aue et al. 2015 Aue , A. ; Norinho , D. ; Hörmann , S.: On the prediction of stationary functional time series. J. Amer. Statist. Assoc. 110 (2015), pp. 378–392. – DOI: doi.org/10.1080/01621459.2014.909317
- 7Bartlett [1946] \NAT@biblabelnum Bartlett 1946 Bartlett , M. S.: On the theoretical specification and sampling properties of autocorrelated time series. Supplement to J. Roy. Stat. Soc. 8 (1946), pp. 27–41. – URL http://www.jstor.org/stable/2983611
- 8Besse and Cardot [1996] \NAT@biblabelnum Besse and Cardot 1996 Besse , P. C. ; Cardot , H.: Approximation spline de la prévision d’un processu fonctionnel autoregréssif d’ordre 1. Canad. J. Statist. 24 (1996), pp. 467–487. – DOI: doi.org/10.2307/3315328
