Non-parametric estimation of time varying AR(1)--processes with local   stationarity and periodicity

Jean-Marc Bardet (SAMM); Paul Doukhan (AGM)

arXiv:1705.10140·math.ST·November 13, 2018

Non-parametric estimation of time varying AR(1)--processes with local stationarity and periodicity

Jean-Marc Bardet (SAMM), Paul Doukhan (AGM)

PDF

Open Access

TL;DR

This paper develops a kernel-based non-parametric method for estimating time-varying AR(1) processes with local stationarity and periodicity, providing theoretical guarantees and minimax rates under mild conditions.

Contribution

It introduces a novel estimation approach for a new class of periodic, locally stationary AR(1) processes with proven asymptotic properties.

Findings

01

Kernel estimators reach classical minimax rates.

02

Establishment of central limit theorems for the estimators.

03

Method requires only second-order moments of noise.

Abstract

Extending the ideas of [7], this paper aims at providing a kernel based non-parametric estimation of a new class of time varying AR(1) processes (Xt), with local stationarity and periodic features (with a known period T), inducing the definition Xt = at(t/nT)X t--1 + $ξ$ t for t $\in$ N and with a t+T $\neq \equiv$ at. Central limit theorems are established for kernel estima-tors as(u) reaching classical minimax rates and only requiring low order moment conditions of the white noise ( $ξ$ t)t up to the second order.

Tables2

Table 1. Table 1: Results of the Monte Carlo experiments providing the accuracy of a ^ s subscript ^ 𝑎 𝑠 \widehat{a}_{s} for the three chosen functions the three chosen functions with ξ 0 subscript 𝜉 0 \xi_{0} following a 𝒩 ( 0 , 4 ) 𝒩 0 4 {\cal N}(0,4) distribution, 1000 1000 1000 independent replications are generated.

	$a^{(ρ)}$	$a_{s}^{(2)}$		$a_{s}^{(1.5)}$		$a_{s}^{(0.8)}$		$a_{s}^{(0.5)}$
	Kernel	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$
$n = 100$	$\bar{λ}$	0.243	0.407	0.283	0.450	0.172	0.322	0.235	0.392
	${\bar{M I S E}}^{1 / 2}$	0.248	0.239	0.286	0.282	0.230	0.234	0.354	0.353
$n = 200$	$\bar{λ}$	0.227	0.363	0.278	0.429	0.256	0.392	0.250	0.386
	${\bar{M I S E}}^{1 / 2}$	0.185	0.175	0.219	0.219	0.232	0.232	0.308	0.303
$n = 500$	$\bar{λ}$	0.234	0.320	0.276	0.399	0.321	0.431	0.287	0.406
	${\bar{M I S E}}^{1 / 2}$	0.129	0.119	0.154	0.156	0.213	0.210	0.256	0.254
$n = 1000$	$\bar{λ}$	0.240	0.321	0.270	0.384	0.373	0.476	0.328	0.438
	${\bar{M I S E}}^{1 / 2}$	0.098	0.093	0.124	0.122	0.207	0.202	0.226	0.221

Table 2. Table 2: Results of the Monte Carlo experiments providing the accuracy of a ^ s subscript ^ 𝑎 𝑠 \widehat{a}_{s} for the three chosen functions with ξ 0 subscript 𝜉 0 \xi_{0} following a t ( 3 ) 𝑡 3 t(3) distribution, 1000 1000 1000 independent replications are generated.

	$a^{(ρ)}$	$a_{s}^{(2)}$		$a_{s}^{(1.5)}$		$a_{s}^{(0.8)}$		$a_{s}^{(0.5)}$
	Kernel	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$	$K_{E}$	$K_{G}$
$n = 100$	$\bar{λ}$	0.226	0.394	0.267	0.430	0.161	0.295	0.220	0.360
	${\bar{M I S E}}^{1 / 2}$	0.341	0.320	0.350	0.340	0.311	0.309	0.418	0.405
$n = 200$	$\bar{λ}$	0.207	0.343	0.259	0.402	0.231	0.355	0.225	0.362
	${\bar{M I S E}}^{1 / 2}$	0.261	0.258	0.281	0.287	0.296	0.293	0.353	0.346
$n = 500$	$\bar{λ}$	0.194	0.304	0.252	0.373	0.286	0.383	0.239	0.360
	${\bar{M I S E}}^{1 / 2}$	0.214	0.201	0.213	0.217	0.269	0.261	0.302	0.296
$n = 1000$	$\bar{λ}$	0.193	0.321	0.246	0.342	0.346	0.450	0.258	0.368
	${\bar{M I S E}}^{1 / 2}$	0.166	0.093	0.172	0.181	0.258	0.250	0.262	0.275

Equations251

\sum_{j=0}^{p}\alpha_{j}\Big{(}\frac{t}{n}\Big{)}\,X^{(n)}_{t-j}=\sum_{k=0}^{q}\beta_{k}\Big{(}\frac{t}{n}\Big{)}\,\xi_{t-k},\qquad 1\leq t\leq n,

\sum_{j=0}^{p}\alpha_{j}\Big{(}\frac{t}{n}\Big{)}\,X^{(n)}_{t-j}=\sum_{k=0}^{q}\beta_{k}\Big{(}\frac{t}{n}\Big{)}\,\xi_{t-k},\qquad 1\leq t\leq n,

X^{(n)}_{t}=a_{t}\left(\frac{t}{nT}\right)X^{(n)}_{t-1}+\xi_{t},\quad\mbox{with $a_{t+T}\equiv a_{t}$},\ \mbox{for any}\left\{\begin{array}[]{l}1\leq t\leq nT\\ n\in\mathbb{N}^{*}\end{array}\right.,

X^{(n)}_{t}=a_{t}\left(\frac{t}{nT}\right)X^{(n)}_{t-1}+\xi_{t},\quad\mbox{with $a_{t+T}\equiv a_{t}$},\ \mbox{for any}\left\{\begin{array}[]{l}1\leq t\leq nT\\ n\in\mathbb{N}^{*}\end{array}\right.,

X_{t}^{(n)}

X_{t}^{(n)}

\big{|}f^{(\lceil\rho\rceil)}(u_{1})-f^{(\lceil\rho\rceil)}(u_{2})\big{|}\leq C_{f}\,|u_{1}-u_{2}|^{\rho-\lceil\rho\rceil},\quad\mbox{for any}~{}u_{1},u_{2}\in{\cal V}_{u}.

\big{|}f^{(\lceil\rho\rceil)}(u_{1})-f^{(\lceil\rho\rceil)}(u_{2})\big{|}\leq C_{f}\,|u_{1}-u_{2}|^{\rho-\lceil\rho\rceil},\quad\mbox{for any}~{}u_{1},u_{2}\in{\cal V}_{u}.

\mathbb{E}\big{(}(X^{(n)}_{t})^{2}\big{)}=\gamma_{s}^{(2)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)},\\ \mbox{with}\quad\left\{\begin{array}[]{ccl}\displaystyle\gamma_{s}^{(2)}(v)&=&\displaystyle\sigma^{2}\,\frac{1+\sum_{i=0}^{T-2}\beta_{s,i}(v)}{1-\beta_{s,T-1}(v)},\\ \displaystyle\beta_{s,i}(v)&=&\prod_{j=0}^{i}a^{2}_{s-j}(v)\leq\alpha^{2i}<1.\end{array}\right.

\mathbb{E}\big{(}(X^{(n)}_{t})^{2}\big{)}=\gamma_{s}^{(2)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)},\\ \mbox{with}\quad\left\{\begin{array}[]{ccl}\displaystyle\gamma_{s}^{(2)}(v)&=&\displaystyle\sigma^{2}\,\frac{1+\sum_{i=0}^{T-2}\beta_{s,i}(v)}{1-\beta_{s,T-1}(v)},\\ \displaystyle\beta_{s,i}(v)&=&\prod_{j=0}^{i}a^{2}_{s-j}(v)\leq\alpha^{2i}<1.\end{array}\right.

\mathbb{E}\big{(}(X^{(n)}_{t})^{4}\big{)}=\gamma_{s}^{(4)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)},\\ \quad\mbox{with}\quad\left\{\begin{array}[]{ccl}\gamma_{s}^{(4)}(v)&=&\displaystyle\big{(}\mu_{4}+6\sigma^{2}\gamma_{s}^{(2)}(v)-6\sigma^{4}\big{)}\frac{1+\sum_{i=0}^{T-2}\delta_{s,i}(v)}{1-\delta_{s,T-1}(v)},\\ \delta_{s,i}(v)&=&\prod_{j=0}^{i}a^{4}_{s-j}(v)\leq\alpha^{4i}<1.\end{array}\right.

\mathbb{E}\big{(}(X^{(n)}_{t})^{4}\big{)}=\gamma_{s}^{(4)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)},\\ \quad\mbox{with}\quad\left\{\begin{array}[]{ccl}\gamma_{s}^{(4)}(v)&=&\displaystyle\big{(}\mu_{4}+6\sigma^{2}\gamma_{s}^{(2)}(v)-6\sigma^{4}\big{)}\frac{1+\sum_{i=0}^{T-2}\delta_{s,i}(v)}{1-\delta_{s,T-1}(v)},\\ \delta_{s,i}(v)&=&\prod_{j=0}^{i}a^{4}_{s-j}(v)\leq\alpha^{4i}<1.\end{array}\right.

\mbox{Cov}\,\big{(}(X^{(n)}_{t})^{2},(X^{(n)}_{t^{\prime}})^{2}\big{)}=\Big{(}\gamma_{s^{\prime}}^{(4)}\big{(}\frac{t^{\prime}}{nT}\big{)}+{\cal O}\big{(}\frac{1}{n}\big{)}\Big{)}\prod_{i=1}^{t-t^{\prime}}\,a^{2}_{t^{\prime}+i}(\frac{t^{\prime}+i}{n}).

\mbox{Cov}\,\big{(}(X^{(n)}_{t})^{2},(X^{(n)}_{t^{\prime}})^{2}\big{)}=\Big{(}\gamma_{s^{\prime}}^{(4)}\big{(}\frac{t^{\prime}}{nT}\big{)}+{\cal O}\big{(}\frac{1}{n}\big{)}\Big{)}\prod_{i=1}^{t-t^{\prime}}\,a^{2}_{t^{\prime}+i}(\frac{t^{\prime}+i}{n}).

\displaystyle a_{t}\Big{(}\frac{t}{nT}\Big{)}=a_{s}\Big{(}\frac{t}{nT}\Big{)}=\frac{\mathbb{E}\big{(}X_{t}X_{t-1}\big{)}}{\mathbb{E}\big{(}X_{t-1}^{2}\big{)}}.

\displaystyle a_{t}\Big{(}\frac{t}{nT}\Big{)}=a_{s}\Big{(}\frac{t}{nT}\Big{)}=\frac{\mathbb{E}\big{(}X_{t}X_{t-1}\big{)}}{\mathbb{E}\big{(}X_{t-1}^{2}\big{)}}.

a_{s}\Big{(}\frac{t}{nT}\Big{)}=\frac{\mathbb{E}\big{(}X_{t}X_{t-1}\big{)}}{\mathbb{E}\big{(}X_{t-1}^{2}\big{)}},\qquad\forall t\in I_{n,s}.

a_{s}\Big{(}\frac{t}{nT}\Big{)}=\frac{\mathbb{E}\big{(}X_{t}X_{t-1}\big{)}}{\mathbb{E}\big{(}X_{t-1}^{2}\big{)}},\qquad\forall t\in I_{n,s}.

n \to \infty lim b_{n} = 0, n \to \infty lim n b_{n} = \infty.

n \to \infty lim b_{n} = 0, n \to \infty lim n b_{n} = \infty.

\displaystyle\widehat{a}^{(n)}_{s}(u)=\frac{\widehat{N}^{(n)}_{s}(u)}{\widehat{D}^{(n)}_{s}(u)},~{}\mbox{with}\left\{\begin{array}[]{ccl}\displaystyle\widehat{N}^{(n)}_{s}(u)&=&\frac{1}{nb_{n}}\sum_{j\in I_{n,s}}K\Big{(}\frac{\frac{j}{nT}-u}{b_{n}}\Big{)}X_{j}X_{j-1},\\ \displaystyle\widehat{D}^{(n)}_{s}(u)&=&\frac{1}{nb_{n}}\sum_{j\in I_{n,s}}K\Big{(}\frac{\frac{j}{nT}-u}{b_{n}}\Big{)}X_{j-1}^{2}.\end{array}\right.

\displaystyle\widehat{a}^{(n)}_{s}(u)=\frac{\widehat{N}^{(n)}_{s}(u)}{\widehat{D}^{(n)}_{s}(u)},~{}\mbox{with}\left\{\begin{array}[]{ccl}\displaystyle\widehat{N}^{(n)}_{s}(u)&=&\frac{1}{nb_{n}}\sum_{j\in I_{n,s}}K\Big{(}\frac{\frac{j}{nT}-u}{b_{n}}\Big{)}X_{j}X_{j-1},\\ \displaystyle\widehat{D}^{(n)}_{s}(u)&=&\frac{1}{nb_{n}}\sum_{j\in I_{n,s}}K\Big{(}\frac{\frac{j}{nT}-u}{b_{n}}\Big{)}X_{j-1}^{2}.\end{array}\right.

\sqrt{nb_{n}}\big{(}\widehat{a}_{s}(u)-a_{s}(u)\big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\Big{(}0\,,\frac{\sigma^{2}}{\gamma^{(2)}_{s}(u)}\int_{\mathbb{R}}K^{2}(x)\,dx\Big{)},

\sqrt{nb_{n}}\big{(}\widehat{a}_{s}(u)-a_{s}(u)\big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\Big{(}0\,,\frac{\sigma^{2}}{\gamma^{(2)}_{s}(u)}\int_{\mathbb{R}}K^{2}(x)\,dx\Big{)},

{\cal N}\Big{(}\mu(u)\,,\,\frac{\sigma^{2}}{\gamma^{(2)}_{s}(u)}\int_{\mathbb{R}}K^{2}(x)\,dx\Big{)}

{\cal N}\Big{(}\mu(u)\,,\,\frac{\sigma^{2}}{\gamma^{(2)}_{s}(u)}\int_{\mathbb{R}}K^{2}(x)\,dx\Big{)}

\widehat{CV}(\tau)=\sum_{j=2}^{N}\Big{(}X^{(n)}_{j}-\widehat{a}_{j}^{(\tau)}\big{(}\frac{j}{N}\big{)}X^{(n)}_{j-1}\Big{)}^{2}.

\widehat{CV}(\tau)=\sum_{j=2}^{N}\Big{(}X^{(n)}_{j}-\widehat{a}_{j}^{(\tau)}\big{(}\frac{j}{N}\big{)}X^{(n)}_{j-1}\Big{)}^{2}.

T = Arg 1 \leq τ \leq T_{m a x} min C V (τ) .

T = Arg 1 \leq τ \leq T_{m a x} min C V (τ) .

\sqrt{nb_{n}\,\int_{\mathbb{R}}K^{2}(x)\,dx}\,\sqrt{\frac{1+\sum_{i=0}^{T-2}\prod_{j=0}^{i}\widehat{a}^{2}_{s-j}(u)}{1-\prod_{j=0}^{T-1}\widehat{a}^{2}_{s-j}(u)}}\Big{(}\widehat{a}_{s}(u)-a_{s}(u)\Big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\big{(}0\,,1\big{)}.

\sqrt{nb_{n}\,\int_{\mathbb{R}}K^{2}(x)\,dx}\,\sqrt{\frac{1+\sum_{i=0}^{T-2}\prod_{j=0}^{i}\widehat{a}^{2}_{s-j}(u)}{1-\prod_{j=0}^{T-1}\widehat{a}^{2}_{s-j}(u)}}\Big{(}\widehat{a}_{s}(u)-a_{s}(u)\Big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\big{(}0\,,1\big{)}.

\widehat{A}_{s}=\sqrt{nb_{n}\,\int_{\mathbb{R}}K^{2}(x)\,dx}\,\sqrt{\frac{1+\sum_{i=0}^{T-2}\prod_{j=0}^{i}\widehat{a}^{2}_{s-j}(u)}{1-\prod_{j=0}^{T-1}\widehat{a}^{2}_{s-j}(u)}}\Big{(}\widehat{a}_{s}(u)-c_{a}\Big{)},

\widehat{A}_{s}=\sqrt{nb_{n}\,\int_{\mathbb{R}}K^{2}(x)\,dx}\,\sqrt{\frac{1+\sum_{i=0}^{T-2}\prod_{j=0}^{i}\widehat{a}^{2}_{s-j}(u)}{1-\prod_{j=0}^{T-1}\widehat{a}^{2}_{s-j}(u)}}\Big{(}\widehat{a}_{s}(u)-c_{a}\Big{)},

\widehat{MISE}_{s}(\lambda)=\frac{1}{99}\,\sum_{i=1}^{99}\big{(}\widehat{a}_{s}(u_{i})-a_{s}(u_{i})\big{)}^{2}.

\widehat{MISE}_{s}(\lambda)=\frac{1}{99}\,\sum_{i=1}^{99}\big{(}\widehat{a}_{s}(u_{i})-a_{s}(u_{i})\big{)}^{2}.

λ_{j} = \mbox A r g 0.1 \leq λ \leq 0.8 min s = 1 \sum T M I S E_{s} (λ)

λ_{j} = \mbox A r g 0.1 \leq λ \leq 0.8 min s = 1 \sum T M I S E_{s} (λ)

\overline{M I S E}^{1/2} = \frac{1}{1000} j = 1 \sum 1000 s = 1 \sum T M I S E_{s} (λ_{j}) .

\overline{M I S E}^{1/2} = \frac{1}{1000} j = 1 \sum 1000 s = 1 \sum T M I S E_{s} (λ_{j}) .

v_{t} = α_{t} v_{t - 1} + σ^{2} \leq α^{2} v_{t - 1} + σ^{2} \leq α^{2} s sup v_{s} + σ^{2}, t > 0

v_{t} = α_{t} v_{t - 1} + σ^{2} \leq α^{2} v_{t - 1} + σ^{2} \leq α^{2} s sup v_{s} + σ^{2}, t > 0

s sup v_{s} \leq \frac{σ ^{2} + v _{0}}{1 - α} < \infty.

s sup v_{s} \leq \frac{σ ^{2} + v _{0}}{1 - α} < \infty.

δ_{t}

δ_{t}

\displaystyle\big{|}\delta_{t}\big{|}

\displaystyle\big{|}\alpha_{t}-\alpha_{t-T}\big{|}

\displaystyle\big{|}\alpha_{t}-\alpha_{t-T}\big{|}

\big{|}\delta_{t}\big{|}\leq\frac{C}{1-\alpha}\cdot\frac{1}{n^{\rho\wedge 1}}+\delta_{T+1}\alpha^{t-T+1}.

\big{|}\delta_{t}\big{|}\leq\frac{C}{1-\alpha}\cdot\frac{1}{n^{\rho\wedge 1}}+\delta_{T+1}\alpha^{t-T+1}.

\big{|}\delta_{t}\big{|}\leq C^{\prime}\frac{1}{n^{\rho\wedge 1}},\qquad\forall t\geq c\log n.

\big{|}\delta_{t}\big{|}\leq C^{\prime}\frac{1}{n^{\rho\wedge 1}},\qquad\forall t\geq c\log n.

v_{t}

v_{t}

v_{t}

v_{t}

\displaystyle v_{t}=\sigma^{2}\,\frac{1+\sum_{i=0}^{T-2}\widetilde{\alpha}_{t}\cdots\widetilde{\alpha}_{t-i}}{1-\widetilde{\alpha}_{t}\cdots\widetilde{\alpha}_{t-T+1}}+{\cal O}\big{(}\frac{1}{n}\big{)}=\gamma_{s}^{(2)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)}.

\displaystyle v_{t}=\sigma^{2}\,\frac{1+\sum_{i=0}^{T-2}\widetilde{\alpha}_{t}\cdots\widetilde{\alpha}_{t-i}}{1-\widetilde{\alpha}_{t}\cdots\widetilde{\alpha}_{t-T+1}}+{\cal O}\big{(}\frac{1}{n}\big{)}=\gamma_{s}^{(2)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)}.

w_{t} = E (X_{t}^{4}) = E (A_{t} X_{t - 1} + ξ_{t})^{4} = q_{t} w_{t - 1} + 4 μ_{3} A_{t} E X_{t - 1} + 6 σ^{2} A_{t}^{2} v_{t - 1} + μ_{4} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsControl Systems and Identification · Statistical Methods and Inference

Full text

Non-parametric estimation of time varying AR(1)–processes with local stationarity and periodicity

Jean-Marc Bardetlabel=e1 [

mark][email protected]

Paul Doukhanlabel=e2 [

mark][email protected]

SAMM EA4543, University Panthéon-Sorbonne, 90, rue de Tolbiac, 75634, Paris, France.

AGM-UMR8088, University Cergy-Pontoise, France, and CIMFAV, Valparaiso, Chile.

Some University and Another University

(0)

Abstract

Extending the ideas of [7], this paper aims at providing a kernel based non-parametric estimation of a new class of time varying AR $(1)$ processes $(X_{t})$ , with local stationarity and periodic features (with a known period $T$ ), inducing the definition $X_{t}=a_{t}(t/nT)X_{t-1}+\xi_{t}$ for $t\in\mathbb{N}$ and with $a_{t+T}\equiv a_{t}$ . Central limit theorems are established for kernel estimators $\widehat{a}_{s}(u)$ reaching classical minimax rates and only requiring low order moment conditions of the white noise $(\xi_{t})_{t}$ up to the second order.

62G05,

62M10,

60F05,

Local stationarity,

Nonparametric estimation,

Central limit theorem,

keywords:

[class=AMS]

keywords:

††volume: 0††issue: 0

and

This paper is dedicated to the memory of Jean Bretagnolle

1 Introduction

Since the seminal paper [5], the local-stationarity property provides new models and approaches for introducing non-stationarity in times series. The recently published handbook [7] gives a complete survey about new results obtained since $20$ years on this topics.

An interesting new kind of models is obtained from a natural extension of usual ARMA processes, so called tvARMA( $p,q$ )–processes defined in [8], as:

[TABLE]

where $\alpha_{j}$ and $\beta_{k}$ are bounded functions. This is a special case of locally stationary linear process defined by $X^{(n)}_{t}=\sum_{j=0}^{\infty}\gamma_{j}\Big{(}\frac{t}{n}\Big{)}\,\xi_{t-j}$ . Such models have been studied in many papers, especially concerning the parametric, semi-parametric or non-parametric estimations of functions $\alpha_{j}$ , $\beta_{k}$ or $\gamma_{j}$ , or other functions depending on these functions; see, for instance references [6], [8], [7], or [12], [3], [11], [17] or [2].

For simplicity, we restrict in this first work to time-varying AR $(1)$ –processes $(X^{(n)}_{t})$ including a periodic component:

[TABLE]

where $T\in\mathbb{N}^{*}$ is a fixed and known integer number, and $(\xi_{t})$ a white noise. Note that given the functions $a_{1},\ldots,a_{T}$ , one may even build a periodic sequence $(a_{t})_{t\in\mathbb{Z}}$ through the relation $a_{t+T}=a_{t}$ .

The choice of such extension of the tvAR $(1)$ processes is relative to modelling considerations: for instance, in the climatic framework, [4] considered models of air temperatures where the function of interest writes as the product of a periodic sequence by a locally varying function. This choice provide an interesting extension of more classical periodic models of air temperature such as those proposed in [14].

Other periodic representation for locally stationary processes can also be found in for instance in the paper [19], but the seasonal component is treated as an additive deterministic trend and is not included in the dynamic of the process, which is the case for model (1.2).

We then study non-parametric estimators $\widehat{a}_{s}(u)$ , for $s=1,2,\ldots,T$ , $u\in(0,1)$ from an observed trajectory $(X^{(n)}_{1},\ldots,X^{(n)}_{nT})$ . We consider kernel-based estimators which are naturally induced from covariance relationships satisfied by the process (see Section 2). Central limit theorems are established for these estimators under some regularity conditions on the functions $a_{s}(\cdot)$ for $s=1,2,\ldots,T\,$ . The results are only obtained by assuming second-order moments on the white noise $(\xi_{t})$ . This is a main improvement with respect to usual limit theorems on locally-stationary processes which are obtained with the assumption that any moment exists for $(\xi_{t})$ . This is due to the new ideas developed in our proof which combines a central limit theorem for martingale increment arrays as well as an embedding in an Orlicz space (see details in Section 4).

The obtained convergence rate is optimal with respect to the minimax rate up to a logarithmic term. Simulations based on Monte-Carlo experiments illustrate the accuracy of the estimators. An application to real-life data, i.e. monthly average temperature readings in London from 1659 to 1998, shows the interest of using our new model (1.2).

This paper is also a first step concerning new results for new class of non-stationary processes. Indeed, we can extend the definition (1.2) to processes $(X_{t}^{(n)})$ such as:

[TABLE]

where $(Z_{t})$ is a sequence of i.i.d. random vectors modelling for instance exogenous inputs. This more tough case is deferred to forthcoming papers.

Other time-varying models with an infinite memory may also be processed as GARCH-type models (see for instance [9]). Remark also that [10] introduced INGARCH-models. Those models are GLM models; non-stationary versions of which also may be considered. They will be considered in further works.

The structure of the paper is as follows. In Section 2, we define and study asymptotic properties of non-parametric estimators for the process (1.2). Section 3 provides the results of some Monte-Carlo experiments and real-life data application, while the proofs are reported in Section 4.

2 Asymptotic normality of a non-parametric estimator for periodic tvAR(1) processes

2.1 Definition and first properties of the process

Denote classically $\mathbb{N}=\{0,1,\ldots\}$ and $\mathbb{N}^{*}=\{1,2,\ldots\}$ . Here we consider $T\in\mathbb{N}^{*}$ a fixed and known period. We will write $s\equiv t[T]$ if $t-s$ is a multiple of $T$ .

The paper is dedicated to the simplest case $X=(X^{(n)}_{t})_{1\leq t\leq nT,\,n\in\mathbb{N}}$ , of a $T-$ periodic locally stationary AR $(1)-$ process, defined in (1.2) where $X^{(n)}_{0}=X_{0}$ with $\mathbb{E}(X^{2}_{0})<\infty$ . Here $(\xi_{t})_{t\in\mathbb{N}}$ is a sequence of i.i.d. r.v.s satisfying $\mathbb{E}(\xi_{t})=0$ and $\mbox{Var}\,(\xi_{t})=\sigma^{2}$ for any $t\in\mathbb{N}^{*}$ , with $(\xi_{t})_{t}$ independent of $X_{0}^{(n)}$ .

The functions $(a_{s}(\cdot))_{1\leq s\leq T}$ , $[0,1]\to\mathbb{R}$ are supposed to satisfy some regularity. Hence, we provide the forthcoming definition usually made in a non-parametric framework:

Definition 2.1.

For $\rho>0$ , we denote $\lceil\rho\rceil\in\mathbb{N}$ the largest integer such that $\lceil\rho\rceil<\rho$ . A function $f:x\in\mathbb{R}\mapsto f(x)\in\mathbb{R}$ is said to belong to the class ${\cal C}^{\rho}({\cal V}_{u})$ where ${\cal V}_{u}$ is a neighbourhood of $u\in\mathbb{R}$ , if $f\in{\cal C}^{\lceil\rho\rceil}({\cal V}_{u})$ and if $f^{(\lceil\rho\rceil)}$ is a $\,(\rho-\lceil\rho\rceil)$ -Hölderian function, i.e. there exists $C_{f}\geq 0$ such as

[TABLE]

In case $\rho$ is an integer we simply assume that $f^{(\rho)}$ exists and is a continuous and bounded function on the neighbourhood of $u$ . As a consequence we specify the assumptions on functions $(a_{t})$ using a fixed positive real number $\rho>0$ :

Assumption (A $(\rho)$ ): The functions $\{a_{t}(\cdot);\,{t\in\mathbb{N}}\}$ are such as:

(Periodicity) There exists $T\in\mathbb{N}^{*}$ such that $a_{t}(v)=a_{t+T}(v)$ for any $(t,v)\in\mathbb{Z}\times[0,1]$ . 2. 2.

(Contractivity) There exists $\alpha=\sup_{\{t\in\mathbb{Z},\,v\in[0,1]\}}|a_{t}(v)|<1$ . 3. 3.

(Regularity) For any $t\in\mathbb{Z}$ , assume that $a_{t}\in{\cal C}^{\rho}$ .

Remark 2.1.

Quote that $T=1$ corresponds to a non-periodic case and $(X^{(n)}_{t})$ is then a usual tvAR(1) process defined in (1.1).

First it is clear that the conditions on functions $(a_{s})$ ensure the existence of a causal linear process $(X_{t}^{(n)})_{1\leq t\leq nT}$ for any $n\in\mathbb{N}$ satisfying (1.2). More precisely, we obtain the following moment relationships:

Proposition 2.1.

Let $X=(X^{(n)}_{t})_{1\leq t\leq nT,\,n\in\mathbb{N}^{*}}$ satisfy (1.2) under Assumption (A $(\rho)$ ) with $\rho\geq 1$ . Then for some convenient constant $c>0$ ,

For any $n\in\mathbb{N}^{*}$ and $1\leq t\leq nT$ , $\big{|}\mathbb{E}\big{(}X^{(n)}_{t}\big{)}\big{|}\leq\alpha^{t}\,\big{|}\mathbb{E}(X_{0})\big{|}$ . 2. 2.

Let $s\in\{1,\ldots,T\}$ . There exists functions $\gamma^{(2)}_{s}\in{\cal C}^{\rho}([0,1])$ such as if $t\in\{[c\log n],\ldots,nT\}$ and $t\equiv s~{}[T]$ :

[TABLE] 3. 3.

Assume $\mathbb{E}(\xi_{0}^{4})=\mu_{4}<\infty$ and $\mathbb{E}(\xi_{0}^{3})=0$ (this holds e.g. if $\xi_{0}$ admits a symmetric distribution).

For $s\in\{1,\ldots,T\}$ , there exist functions $\gamma^{(4)}_{s}\in{\cal C}^{\rho}([0,1])$ such as, for $t\in\{[c\log n],\ldots,nT\}$ with $t\equiv s~{}[T]$ ,

[TABLE]

Moreover, for any $(t,t^{\prime})\in\{[c\log n],\ldots,nT\}^{2}$ with $t>t^{\prime}$ ,

[TABLE]

We will now assume $X_{0}=0$ .

In addition of the previous proposition, another relation can be easily established. Indeed, for $t\in\{1,2,\ldots,nT\}$ , with $s=t~{}[T]$ , by multiplying (1.2) by $X_{t-1}^{(n)}$ and taking the expectation:

[TABLE]

The relation (2.4) is at the origin of the definition of the following non-parametric estimators of the functions $a_{s}(\cdot)$ .

2.2 Asymptotic normality of the estimator

Assume that the sample $(X_{1},\ldots,X_{nT})$ is observed for some $n\geq 1$ ; this condition entails a reasonable loss of at most $T$ data and allows us for a more comprehensive study.

For each $s\in\{1,\ldots,T\}$ , we define $I_{n,s}=\big{\{}s,s+T,\ldots,s+(n-1)T\big{\}}$ , a set with $\#I_{n,s}=n$ . Now (2.4) writes:

[TABLE]

A convolution kernel $K:\mathbb{R}\to\mathbb{R}$ will be required in the sequel and it satisfies one of both the following assumptions:

Assumption $(K)$ : Let $K:\mathbb{R}\to\mathbb{R}$ be a Borel bounded function such that:

•

$\displaystyle\int_{\mathbb{R}}K(t)dt=1$ and $K(-x)=K(x)$ for any $x\in\mathbb{R}$ ;

•

there exists $\beta>0$ such as $\lim_{|t|\to+\infty}e^{\beta\,|t|}K(t)=0$ .

Assumption $(\widetilde{K})$ : Let $K:\mathbb{R}\to\mathbb{R}$ be a Borel bounded function such that:

•

$\displaystyle\int_{\mathbb{R}}K(t)dt=1$ and $K(-x)=K(x)$ for any $x\in\mathbb{R}$ ;

•

there exists some $B>0$ such as $K(t)=0$ , if $|t|>B$ .

Typical examples of kernel functions are $K(t)=(2\pi)^{-1/2}e^{-t^{2}/2}$ and $K(t)=\frac{1}{2}\,\mbox{\hskip 1.99997ptI1}_{[-1,1]}(t)$ satisfying respectively Assumptions $(K)$ and $(\widetilde{K})$ . Note also the $K\geq 0$ would exclude dealing with a regularity $\rho>2$ .

For $r\geq 1$ , we also specify another condition satisfied by such a function:

Assumption ker $(r)$ : Let $K:\mathbb{R}\to\mathbb{R}$ be a Borel bounded function such that:

•

$\int_{\mathbb{R}}(|x|^{r}+1)\,|K(x)|\,dx<\infty$ and $\int_{\mathbb{R}}x^{p}K(x)\,dx=0$ , if $p\in\{1,2,\ldots,\lceil r\rceil-1\}$ ;

•

$\|K\|_{\infty}=\sup_{x\in\mathbb{R}}|K(x)|<\infty$ and $\mbox{Lip}\,(K)=\sup_{x\neq y}\frac{|K(x)-K(y)|}{|x-y|}<\infty$ .

Assume that a sequence of positive bandwidths $(b_{n})_{n\in\mathbb{N}}$ is chosen in such a way that

[TABLE]

Now, keeping in mind the expression (2.4) and following the same ideas as with Nararaya-Watson estimator (see [18] and [22]), for $s\in\{1,\ldots,T\}$ and $u\in(0,1)$ , we set

[TABLE]

Since extremities are omitted we avoid the corresponding edge effects due to the fact that at the extremities, summations are not considered over a symmetric interval of times containing $nu$ . The case $u=0$ does not make any contribution while the case $u=1$ corresponds with simple periodic behaviours and such results should be found in [14].

Using essentially a martingale central limit theorem (the steps of the proofs are precisely detailed in Section 4), we obtain:

Theorem 2.1.

Let $0<\rho\leq 2$ and Assumption (A $(\rho)$ ), let $K$ satisfy Assumption $(K)$ or $(\widetilde{K})$ as well as Assumption ker $(\rho\vee 1)$ . Then, for a sequence $(b_{n})_{n\in\mathbb{N}}$ of positive real numbers such as $\lim_{n\to\infty}b_{n}\,n^{\frac{1}{1+2(\rho\wedge 1)}}=0$ ,

[TABLE]

for any $u\in(0,1)$ , $s\in\{1,\ldots,T\},$ with $\displaystyle\gamma^{(2)}_{s}(u)=\sigma^{2}\,\frac{1+\sum_{i=0}^{T-2}\beta_{s,i}(u)}{1-\beta_{s,T-1}(u)}.$

Note that for $\rho\leq 1$ the classical optimal semi-parametric minimax rate is reached.

This is not the case if $\rho\in(1,2]$ . In that case, another moment condition is needed in order to improve the convergence rate of $\widehat{a}_{s}(u)$ .

Theorem 2.2.

*Let $1\leq\rho\leq 2$ and Assumption (A $(\rho)$ ), let $K$ satisfy Assumption $(K)$ or $(\widetilde{K})$ as well as Assumption ker $(\rho)$ . Moreover, suppose that $\mathbb{E}|\xi_{0}|^{\beta}<\infty$ with $\displaystyle\beta=4-\frac{2\rho}{5\rho-4}\in\Big{[}2,\frac{10}{3}\Big{]}$ (Note that $\beta=2$ if $\rho=2$ ) and that $\xi_{0}$ admits a symmetric distribution. Then (2.8) holds for a sequence $(b_{n})_{n\in\mathbb{N}}$ of positive real numbers such as $b_{n}\,n^{\frac{1}{2\rho+1}}\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ .

Moreover in case $\rho=2$ and if $b_{n}=c\,n^{-\frac{1}{5}}$ then the central limit still holds but the limit distribution is now non-centred:*

[TABLE]

*with $\displaystyle\mu(u)=\frac{c^{\frac{5}{2}}}{\gamma^{(2)}_{s}(u)}\Big{(}\frac{1}{2}a_{s}^{\prime\prime}(u)\gamma_{s}^{(2)}(u)+a_{s}^{\prime}(u)(\gamma_{s}^{(2)})^{\prime}(u)\Big{)}\int_{\mathbb{R}}z^{2}K(z)\,dz$ . *

Remark 2.2.

*Optimal window widths write as $b_{n}\sim cn^{-\frac{1}{2\rho+1}}$ thus the above result holds with a suboptimal window width. Moreover the symmetry assumption is discussed in Remark 4.2. Now for the case $\rho=2$ in case the derivatives of $a_{s}$ are regular around the point $u$ , then the optimal window width actually may be used and the central limit theorem again holds with a non-centred Gaussian limit.

Quote that the proposed normalisation yields the standard minimax rates $n^{-\frac{\rho}{2\rho+1}}$ , in the case of compactly supported symmetric kernel (a $(\log n)-$ loss is observed for the Gaussian kernel); the obtained rates are in probability and further work is needed to prove that this is the minimax $\mathbb{L}^{2}-$ rate.

Moreover for large $T$ the convergence rate is degraded with a factor $T^{\frac{\rho}{2\rho+1}}$ since the sample size is $N=nT$ and thus $n=N/T$ .*

Remark 2.3.

Of course, if $T=1$ , Theorems 2.1 and 2.2 hold, which provide another minimax estimation of the function $u\mapsto a_{1}(u)$ ( $u\in[0,1]$ ) requiring sharper moment and regularity conditions than the ones proposed in Theorem 4.1 of [8].

Remark 2.4.

*If $T$ is unknown we better consider an $N$ -sample and set $n=[N/T]$ , the proof of previous central limit theorem 2.1 provides an approach for estimating this period $T$ . First fix $T_{\max}\geq 2$ (typically $T_{\max}=12$ for monthly data). Then, for each $1\leq\tau\leq T_{\max}$ , we define an estimator $\widehat{a}_{s}^{(\tau)}(u)$ for any $1\leq s\leq\tau$ and $u\in(0,1)$ . It is clear that when $\tau$ is not a multiple of $T$ , then the sums in (2.7) that are done on the set $I_{n,s}$ , which depends on $\tau$ , is now a sum involving other $a_{k}$ with $k\neq s$ . As a consequence, $\widehat{a}_{s}(u)$ is not a convergent estimator of $a_{s}(u)$ .

Then, using a classical cross-validation, for each $1\leq\tau\leq T_{\max}$ , we compute*

[TABLE]

Finally, define $\widehat{T}$ as the smallest value such as

[TABLE]

Remark 2.5.

The central limit theorem 2.1 naturally provides a test statistics $\widehat{A}_{s}$ for solving the test problem: $H_{0}:$ $a_{s}(u)=c_{a}$ versus $H_{0}:$ $a_{s}(u)\neq c_{a}$ , where $c_{a}\in(0,1)$ . Indeed, from (2.8) and Slutsky Lemma we deduce:

[TABLE]

Then if we consider

[TABLE]

this provides a natural statistics test with usual standard Gaussian quantile as asymptotic threshold.

3 Monte-Carlo experiments and an application to climatic data

3.1 Monte-Carlo experiments

In this section, numerous Monte-Carlo experiments have been made for studying the accuracy of the new non-parametric estimator $\widehat{a}_{s}(\cdot)$ .

Firstly, we considered $3$ typical functions $[0,1]\to[-1,1]$ , $a^{(\rho)}_{s}(u)\in{\cal C}^{\rho}([0,1])$ and such as $\sup_{u\in[0,1],s\in\mathbb{N}}|a^{(\rho)}_{s}(u)|\leq\alpha<1$ :

•

For $\rho=2$ , we choose $\displaystyle a_{s}^{(2)}(u)=0.9\,\cos\big{(}2\pi\frac{ns}{T}\big{)}\cos(3u)$ . Figure 1 exhibits the graph of the function $a_{1}^{(2)}$ and an example of its estimation (for $n=1000$ );

•

For $\rho=1.5$ , we choose $\displaystyle a_{s}^{(1.5)}(u)=0.9\,\cos\big{(}2\pi\frac{ns}{T}\big{)}\frac{\int_{0}^{u}W_{t}(\omega)\,dt}{\sup_{x\in[0,1]}|W_{x}(\omega)|}$ where $(W_{t})_{t\in[0,1]}$ is an observed trajectory of a Wiener Brownian motion;

•

For $\rho=0.8$ , we choose $\displaystyle a_{s}^{(0.8)}(u)=0.9\,\cos\big{(}2\pi\frac{ns}{T}\big{)}\frac{B_{0.8}(\omega,u)}{\sup_{x\in[0,1]}|B_{0.8}(\omega,x)|}$ where $B_{H}(\omega,t))_{t\in[0,1]}$ is an observed trajectory of a fractional Brownian motion with Hurst exponent $H=0.8$ (Figure 2 exhibits the graph of this chosen function $a_{1}^{(0.8)}$ ). It is well known that a trajectory of a fractional Brownian motion with Hurst exponent $H\in(0,1)$ is almost surely $\alpha$ -Höderian for any $\alpha<H$ ;

•

For $\rho=0.5$ , we choose $\displaystyle a_{s}^{(0.5)}(u)=0.9\,\cos\big{(}2\pi\frac{ns}{T}\big{)}\frac{W_{u}(\omega)}{\sup_{x\in[0,1]}|W_{x}(\omega)|}$ where $(W_{t}(\omega))_{t\in[0,1]}$ is an observed trajectory of a Wiener Brownian motion.

We also consider two “typical” kernels:

•

A bounded supported kernel, the well-known Epanechnikov kernel defined by $K_{E}(x)=\frac{3}{4}\,(1-x^{2})\,\mbox{\hskip 1.99997ptI1}_{\{|x|\leq 1\}}$ , which is known to minimize the asymptotic MISE in the kernel density estimation frame;

•

The unbounded supported Gaussian kernel with $K_{G}(x)=\frac{1}{\sqrt{2\pi}}\exp\big{(}-\frac{x^{2}}{2}\big{)}$ .

We considered the cases $n=100,\,200,\,500$ and $1000$ , and we fixed $T=2$ . Finally $1000$ independent replications of $(X^{(n)})$ are generated with two different cases of innovations $(\xi_{t})$ :

•

Firstly, the case where the probability distribution of $\xi_{0}$ is a Gaussian ${\cal N}(0,4)$ distribution, then $\mathbb{E}|\xi_{0}|^{4}<\infty$ and therefore Theorem 2.1 holds for $\rho=0.5$ and Theorem 2.2 holds for $\rho=1.5$ and $\rho=2$ .

•

Secondly, the case where the probability distribution of $\xi_{0}$ is a Student $t(3)$ (with $3$ degrees of freedom) distribution implying $\mathbb{E}|\xi_{0}|^{\beta}<\infty$ for any $\beta<3$ but $\mathbb{E}|\xi_{0}|^{3}=\infty$ . Then if $\rho=0.5$ , Theorem 2.1 holds but if $\rho=1.5$ and $\rho=2$ , Theorem 2.2 does not hold.

Finally, for each $n$ , each functions $a_{s}^{(\rho)}$ and kernel $K$ , and each probability distributions of $\xi_{0}$ , we present the results computed from $1000$ replications and the following methodology:

For each replication $j$ , we defined $b_{n}=n^{-\lambda}$ with $\lambda=0.10,\,0.11,\ldots,0.80$ , $(u_{i})_{1\leq i\leq 99}=0.01,\,0.02,\ldots,0.99$ , $s=1,2,\ldots,T$ , and the estimators $\widehat{a}_{s}(u_{i})$ are computed. 2. 2.

For each replication $j$ and each $\lambda=0.10,0.11,\ldots,0.80$ , an estimator of the $MISE$ is computed:

[TABLE] 3. 3.

For each replication $j$ , we minimised an estimator of the global square root of MISE:

[TABLE] 4. 4.

Then we computed $\overline{\lambda}=\frac{1}{1000}\,\sum_{j=1}^{1000}\widehat{\lambda}_{j}$ over all the replications. 5. 5.

Finally, we computed the estimator of the minimal global square root of MISE,

[TABLE]

As a consequence, $\overline{\lambda}$ and $\overline{MISE}^{1/2}$ are two interesting estimators relative to Theorems 2.1 and 2.2. The first one specifies the link between the choice of an optimal bandwidth $b_{n}$ qnd the regularity $\rho$ of the functions $a_{s}(\cdot)$ . The second one measures the optimal convergence rate of the estimators $\widehat{a}_{s}(\cdot)$ to $a_{s}(\cdot)$ . All the results are printed in Tables 1 and 2.

Moreover, for exhibiting the asymptotic normality of the estimators provided in the central limit theorem (2.8), we draw in Figure the histograms of $\widehat{a}^{(2)}_{s}(u)$ for $u=0.25,\,0.5$ and $0.75$ from $10000$ independent replications for $n=5000$ . We also used a Jarque-Bera test to confirm the Gaussian asymptotic distribution since the p-values of this test are successively: $p-value=0.105$ , $0.927$ and $0.345$ . Hence, the asymptotic normality of the estimator seems to be attested by Monte-Carlo experiments.

Conclusions of the simulations: Firstly, and as it should be deduced from Theorem 2.1 and 2.2, we observed the larger the regularity $\rho$ , the smaller $\overline{\lambda}$ and therefore the larger the optimal bandwidth $\overline{b_{n}}=n^{-\overline{\lambda}}$ , and the faster the convergence rate of $\widehat{a}_{s}$ . Secondly, even if the choice of the optimal bandwidth is significantly different following the choice of the kernel (clearly smaller with the Epanechnikov kernel), the optimal convergence rate is almost the same for both the kernel. Finally, according also with Theorem 2.2, the convergence rate is clearly slower with a heavy tail distribution ( $t(3)$ ) than with a Gaussian distribution, and this phenomenon increases when $\rho$ increases.

3.2 Numerical application on climatic data

We also applied our model and its estimator to an example of real data, specifically the monthly average temperature readings in London from 1659 to 1998, or 340 years. Obviously in such a case one can expect that $T=12$ .

First, we removed an additive seasonal and trend component (estimated by LOESS) from these data and considered the residual data. On these, a global correlogram (see Figure 4) confirms a modelling by a process of type AR( $1$ ) and also the presence of a periodic phenomenon of period $12$ .

As a consequence we may assume that these residual data can be modelled by the model (1.2). We then applied the $\widehat{a}_{s}(u)$ estimator for $u=0.25,\,0.50$ and $0.75$ and $s=1,\ldots,12$ . Figure 5 summarizes these results and shows:

•

The crucial interest of taking a pseudo-periodic model as we defined it in (1.2);

•

The relatively small but not negligible change in the coefficient $a_{t}(t)$ as a function of $t$ .

4 Proofs

We first provide the proof of Proposition 2.1.

Proof of Proposition 2.1.

We have $\mathbb{E}X_{1}^{(n)}=a_{1}\Big{(}\frac{1}{nT}\Big{)}\mathbb{E}(X_{0})$ and $\mathbb{E}X_{t}^{(n)}=a_{t}\Big{(}\frac{t}{nT}\Big{)}\mathbb{E}X_{t-1}^{(n)})$ from the relation (1.2). From Assumption (A $(\rho)$ ) and since $\Big{|}a_{1}\Big{(}\frac{1}{nT}\Big{)}\Big{|}\leq\alpha<1$ , we deduce the first item of Proposition 2.1. 2. 2.

Below, for ease of reading, we will omit the exponent $n$ . Set $v_{t}=\mathbb{E}\big{(}X_{t}^{2}\big{)}$ , and $v=\sup_{s}v_{s}\in[0,+\infty]$ ; also write $\alpha_{t}=a^{2}_{t}\big{(}\frac{t}{nT}\big{)}$ . We have:

[TABLE]

thus

[TABLE]

Moreover, with $\delta_{t}=v_{t}-v_{t-T}$ for any $t>T$ , we have

[TABLE]

from (4.2) and since for some constant $C>0$ ,

[TABLE]

from Assumption (A $(\rho)$ ). As a consequence of (4.3), we also obtain:

[TABLE]

Thus for other constants $C,c>0$ we derive

[TABLE]

From now on, assume that $\rho\geq 1$ .

Now use again the definition (1.2) of the model, and by iterating (4.1), we derive:

[TABLE]

from (4.5).

Hence,

[TABLE]

Now quoting that $\displaystyle\alpha_{t-j}=a_{t-j}^{2}\big{(}\frac{t-j}{nT}\big{)}$ we set $\displaystyle\widetilde{\alpha}_{t-j}=a_{t-j}^{2}\big{(}\frac{t}{nT}\big{)}$ for $1\leq j<T$ , then since $\rho\geq 1$ and from (4.6) we derive

[TABLE]

The conclusion follows. 3. 3.

The proof mimics the case of $\mathbb{E}(X^{2}_{t})$ . Denote $q_{t}=a_{t}^{4}\big{(}\frac{t}{nT}\big{)}$ , and $\mu_{k}=\mathbb{E}(\xi_{0}^{k})$ , for $k=1,2,3,4$ . Then $\mu_{1}=0$ and

[TABLE]

Since $\mu_{3}=0$ , we have:

[TABLE]

with $r(t)=6\,\sigma^{2}\,v_{t}+\mu_{4}-6\sigma^{4}$ and this implies as previously $\sup_{t}w_{t}<\infty$ . We also obtain for constants again denoted $C^{\prime},C^{\prime\prime}>0$ :

[TABLE]

Finally by iterating (4.8), we obtain:

[TABLE]

from (4.9). Hence, always following the previous case

[TABLE]

for $t\geq C^{\prime\prime}\log n$ , and this implies (2.2) from using again the regularity of the functions $(a_{i})_{1\leq i\leq T}$ .

Finally, for any $t>t^{\prime}$ such that $t,t^{\prime}\in\{[c\log n],\ldots,nT\}$ , since $(X_{t})$ is a causal process and by iteration,

[TABLE]

where $s^{\prime}\equiv t^{\prime}~{}[T]$ and $\displaystyle\Big{|}\prod_{i=1}^{t-t^{\prime}}\alpha_{t^{\prime}+i}\Big{|}\leq\alpha^{2|t-t^{\prime}|}$ .

This completes the proof.

Now we establish a technical lemma, which we were not able to find in the past literature (even if variants of this result may be found) and that will be extremely useful in the sequel. For a bounded continuous function $c$ defined on $[0,1]$ , and a kernel function $H$ (see details below), an approximation of integral by appropriate Riemann sums yields (as for [20]’s estimator, see [21] for further developments):

[TABLE]

where $u\in(0,1)$ , $I_{n,s}=\big{\{}s,s+T,\ldots,s+(n-1)T\}$ with $s\in\{1,\ldots,T\}$ and $T\in\mathbb{N}^{*}$ . More precisely we would like to provide expansions of

[TABLE]

Lemma 4.1.

Let $u\in(0,1)$ , $\rho>0$ , $c\in{\cal C}^{\rho}([0,1])$ a bounded function. Let $H$ satisfy ker $(\rho\vee 1)$ . Consider also a sequence of positive real numbers $(b_{n})_{n}$ satisfying $\lim_{n\to\infty}b_{n}=0$ . Then, there exists $C>0$ depending only on $\|H\|_{\infty}$ , $\|c\|_{\infty}$ and $\mbox{Lip}\,(H)$ such that for $n$ large enough

[TABLE]

Finally, if $\rho\in\mathbb{N}^{*}$ we have:

[TABLE]

Proof of Lemma 4.1. In the sequel we will denote $h_{n}(v)=\frac{1}{b_{n}}H\big{(}b_{n}^{-1}(v-u)\big{)}$ for $v\in\mathbb{R}$ . Then $h_{n}$ is a Lipschitz function with $\mbox{Lip}\,h_{n}=\frac{1}{b_{n}^{2}}\,\mbox{Lip}\,H$ .

•

*First assume that the function $c\equiv 1$ is a constant. * Set $v_{i}=i(nT)^{-1}$ for $i\in\mathbb{Z}$ . For $1\leq s\leq T$ , we consider the sets

[TABLE]

and $L_{n,s}=I_{n,s}\setminus K_{n,s}$ . Then, for $n$ large enough,

[TABLE]

But $|h_{n}(v_{s+jT})|\leq\frac{C}{b_{n}}\,\exp\Big{(}-\beta\Big{|}\frac{j/n-u+s/nT}{b_{n}}\Big{|}\Big{)}$ from Assumption $(K)$ and using the usual comparison between sums and integrals for monotonic functions, we obtain:

[TABLE]

Thus

[TABLE]

because since $u\in(0,1)$ , the above indices remain in the index set $[-n,n]$ for $n$ large enough.

Then, if $A_{n}\geq\beta^{-1}\log n$ then $\exp(-\beta\,A_{n})\leq 1/n$ and we deduce (4.11).

•

We now turn to the case of a non-constant function $c$ . First, if $\rho>0$ , for $(u,v)\in(0,1)^{2}$ the Taylor-Lagrange formula implies:

[TABLE]

with $\ell=\lceil\rho\rceil$ and $\lambda\in(0,1)$ . Since $c\in{\cal C}^{\rho}([0,T])$ ,

[TABLE]

Therefore,

[TABLE]

with $|R(u,v)|\leq C_{\rho}\,|u-v|^{\rho}$ . Then for any $u\in(0,1)$ , using Assumption ker $(\rho\vee 1)$ and especially the relation $\int z^{p}H(z)dz=0$ for $p=1,\ldots,\ell$ ,

[TABLE]

with $C^{\prime}>0$ . Here we denote $k_{n}(v)=h_{n}(v)c(v)$ for $v\in[0,1]$ .

Now, if $\rho\in(0,1)$ , we have

[TABLE]

and therefore using the previous results:

[TABLE]

from (4.15) and this implies (4.11) since $nb_{n}\to\infty$ and therefore $n^{-\rho}$ is negligible with respect from $b_{n}^{\rho}$ .

Now, if $\rho\geq 1$ and since $H$ and $c$ are bounded continuous Lipschitz functions, we obtain the inequality

[TABLE]

Then, using the same computations than previously (replace $h_{n}$ by $h_{n}\times c$ ),

[TABLE]

from (4.15) and this completes the first item since $b_{n}$ is supposed to converge to [math]. The proof is now easily completed.

•

Finally, in the case $\rho\in\mathbb{N}^{*}$ , we can use the previous case an a Taylor-Lagrange expansion of the function $c$ , implying $\displaystyle R(u,v)=\frac{c^{(\rho)}(\theta)}{\rho!}\,\big{|}u-v\big{|}^{\rho}$ with $\theta=\lambda u+(1-\lambda)v$ and $\lambda\in[0,1]$ .

Then, using (• ‣ 4) and with $\mu_{u}(z)\in[0,1]$ , and $\displaystyle\zeta_{n}=\int_{\mathbb{R}}h_{n}(v)\,c(v)\,dv-c(u)\int_{\mathbb{R}}h_{n}(v)\,dv$

[TABLE]

from Lebesgue theorem on dominated convergence.

In the sequel we will denote the $\sigma$ -algebra

[TABLE]

Lemma 4.2.

Let $H$ satisfy Assumption ker $(1)$ and $(X^{(n)}_{t})$ be a solution of (1.2) under Assumption (A $(\rho)$ ) with $\rho>0$ . Then for any $u\in(0,1)$ , and $s\in\{1,\ldots,T\}$ ,

[TABLE]

Proof of Lemma 4.2. We use here a limit theorem for $\mathbb{L}^{1}$ -mixingales established in [1]. Indeed, for $u\in(0,1)$ , $s\in\{1,\ldots,T\}$ , let

[TABLE]

Then, set

[TABLE]

we have:

[TABLE]

Therefore, with $({\cal F}_{n,t}^{(s)})$ defined in (4.16),

[TABLE]

But for any $t\in\mathbb{N}$ , we have $|c_{k}(t)|\leq\alpha^{k}$ from Assumption (A $(\rho)$ ). Then,

[TABLE]

Thus, using the notations of Definition 2 in [1], it is easy to derive that $(Z_{n,t})$ is a triangular array such that $\phi_{m}=\alpha^{2mT-2}\|H\|_{1}\to 0$ (as $m\to\infty$ ) since $0\leq\alpha<1$ and:

[TABLE]

As a consequence,

[TABLE]

implies

[TABLE]

Now, we collect the above relations. Lemma 4.1 and Proposition 2.1 with the $\rho-$ regularity of the function $c(v)$ , together conclude the proof.

Lemma 4.3.

Under the conditions of Theorem 2.1, with $(Y_{n,i})_{1\leq i\leq n,\,n\in\mathbb{N}}$ defined in (4.30), for any $\varepsilon>0$ ,

[TABLE]

Proof of Lemma 4.3. Since $\mathbb{E}\xi_{0}^{2}=\sigma^{2}<\infty$ this is easy to exhibit an increasing sequence $(c_{k})_{k}$ with

[TABLE]

Define $g(\cdot)$ as the piecewise affine function such that $g(c_{k})=k$ for $k\in\mathbb{N}$ and $g(0)=0$ . Then the function $\psi$ defined by $\psi(x)=x^{2}g(x)$ for $x\geq 0$ satisfies $\psi(0)=0$ and it is a continuous and non-decreasing function (for almost all $x>0$ , $\psi^{\prime}(x)=x^{2}g^{\prime}(x)+2xg(x)>0$ ) and convex function (indeed, for almost all $x>0$ , $\psi^{\prime\prime}(x)=4xg^{\prime}(x)+2g(x)>0$ ). Hence, we have:

[TABLE]

Therefore,

[TABLE]

The construction of $(c_{k})_{k}$ and the relation $c_{k+1}\geq c^{2}_{k}$ together imply:

[TABLE]

Indeed, this relationship is equivalent to

[TABLE]

But if $0\leq x\leq 1$ and $y\geq x$ , then $xy\leq y$ : therefore $g(xy)\leq g(y)\leq g(x)g(y)$ since $g$ is an increasing function and $g(x)\geq 1$ for any $x\geq 0$ . Moreover, if $1<x\leq y$ , there exists $0\leq k$ and $\lambda\in[0,1]$ such as $y=\lambda c_{k}+(1-\lambda)c_{k+1}$ . But $h:[0,\infty)\to\mathbb{R}^{+}$ defined by $x\mapsto h(x)=g(x^{2})$ is a convex function since $h^{\prime\prime}\geq 0$ a.e. As a consequence,

[TABLE]

from the construction of $(c_{k})$ . Since $g(y)=\lambda g(c_{k})+(1-\lambda)g(c_{k+1})=k+1-\lambda$ because $g$ is a piecewise function, we finally obtain $g(y^{2})\leq g(y)+1$ . We conclude with $g(xy)\leq g(y^{2})$ for any $1\leq x\leq y$ and $g(x)\geq 2$ (since $c_{1}=2$ ).

Hence the function $\psi$ is a Orlicz function and $\|\xi_{0}\|_{\psi}<\infty$ with

[TABLE]

Now Theorem 1.1 in [16] implies:

[TABLE]

Therefore $\|V\|_{\psi}\leq 1+\mathbb{E}\psi(|V|)$ , and $\frac{1}{z}\mathbb{E}\psi(z|V|)\leq 2\|V\|_{\psi}$ for any $z>0$ since from convexity

[TABLE]

and $\psi(0)=0$ .

Then, from the definition of $(X_{t}^{(n)})$ and the triangular inequality

[TABLE]

with $0\leq\alpha<1$ . Since $\|\xi_{s}\|_{\psi}=\|\xi_{0}\|_{\psi}$ for any $s\in\mathbb{N}$ , we finally obtain

[TABLE]

Thus (4.23) implies with the independence of $\xi_{t}$ and $X^{(n)}_{t-1}$ that:

[TABLE]

Now relation (4.26) with $z=1$ entails

[TABLE]

Thus with $t=s+(j-1)T$ we have from (4.26),

[TABLE]

Again using (4.23) and with $K_{t}=K\Big{(}\frac{\frac{t}{nT}-u}{b_{n}}\Big{)}$ ,

[TABLE]

As a consequence, for any $\varepsilon>0$ ,

[TABLE]

if $n$ is large enough, from Lemma 4.1. As a consequence, for any $\varepsilon>0$ , since $g(\varepsilon\,\sqrt{nb_{n}})\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}\infty$ , then $\mathbb{E}\Big{(}\sum_{j=1}^{n}\mathbb{E}\big{(}Y_{n,j}^{2}\mbox{\hskip 1.99997ptI1}_{\{|Y_{n,j}|>\varepsilon\}}|{\cal F}^{(s)}_{j-1}\big{)}\Big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ . Since $Y_{n,j}^{2}\mbox{\hskip 1.99997ptI1}_{\{|Y_{n,j}|>\varepsilon\}}$ is a non-negative triangular array, the proof of Lemma 4.3 is complete.

Proof of Theorem 2.1. Using (1.2), write

[TABLE]

we decompose it as: $\widehat{N}^{(n)}_{s}(u)=\widetilde{N}^{(n)}_{s}(u)+M^{(n)}_{s}(u)$ , with

[TABLE]

Therefore we obtain:

[TABLE]

with

[TABLE]

We are going to derive the consistency of the estimator $\widehat{a}_{s}(u)$ of $a_{s}(u)$ , in two parts.

**1/ **

We first prove that $\sqrt{nb_{n}}{M^{(n)}_{s}(u)}\Big{/}{\widehat{D}^{(n)}_{s}(u)}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\big{(}0,C\big{)}$ for some convenient constant $C>0$ .

Let $s\in\{1,\ldots,T\}$ and $u\in(0,1)$ . For $n\in\mathbb{N}^{*}$ and $j\in\{1,\ldots,n\}$ , we denote

[TABLE]

This is clear that $(Y_{n,j})_{\leq j\leq n,~{}n\in\mathbb{N}^{*}}$ is a triangular array of martingale increments with respect to the $\sigma$ -algebra ${\cal F}^{(s)}_{t}=\sigma\big{(}(\xi_{i})_{i\leq s+(t-1)T}\big{)}$ . Indeed $(X^{(n)}_{t})_{t\geq 0}^{\ }$ is a process, causal with respect to $(\xi_{t})_{t\geq 0}$ . This implies that $\xi_{t}$ is independent of $\displaystyle(X_{i}^{(n)})_{i\leq t-1}$ and that $\mathbb{E}(\xi_{0})=0$ . We are going to use a central limit theorem for triangular arrays of martingale increments, see for example [13] and more recently [15].

Denote

[TABLE]

since $\mathbb{E}(\xi^{2}_{0})=0$ . Using Lemma 4.2, we obtain:

[TABLE]

$\widehat{D}^{(n)}_{s}(u)$ is defined from (4.28) and satisfies

[TABLE]

Moreover, from Lemma 4.3, then for any $\varepsilon>0$ ,

[TABLE]

As a consequence, the conditions of the central limit theorem for triangular arrays of martingale increments, in [15]), are satisfied and this implies that $\displaystyle\frac{\sum_{j=1}^{n}Y_{n,j}}{\sqrt{\sum_{j=1}^{n}\sigma_{n,j}^{2}}}\begin{array}[t]{c}\stackrel{{\scriptstyle{\cal L}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}{\cal N}\big{(}0,1\big{)}$ .

Therefore from Slutsky lemma entails:

[TABLE]

**2/ **

The second term $J_{n}/\widehat{D}^{(n)}_{s}(u)$ in the expansion of $\sqrt{nb_{n}}\big{(}\widehat{a}_{s}(u)-a_{s}(u)\big{)}$ depends on the non-martingale term $J_{n}$ , see (4.29), and the consistent term $\widehat{D}^{(n)}_{s}(u)$ , see (4.28) and (4.36). The asymptotic behavior of this second term can be first obtained following two steps.

**a. **

A first step consists in establishing an expansion of $\mathbb{E}J_{n}$ . Using Proposition 2.1 and with $\gamma_{s}^{(2)}\in{\cal C}^{\rho}([0,1])$ defined in (2.1), we have

[TABLE]

Using twice Lemma 4.1, with firstly $c(x)=\gamma_{s}^{(2)}(x)(a_{s}(x)-a_{s}(u))$ , and secondly $c(x)=(a_{s}(x)-a_{s}(u))$ , we deduce:

[TABLE]

As a consequence, if $b_{n}=o\big{(}n^{-1/(1+2\rho)}\big{)}$ , then $\mathbb{E}J_{n}\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ .

In the case $\rho\in\{1,2\}$ , we also obtain from (4.12) and with $d_{s}(v)=(a_{s}(v)-a_{s}(u))\gamma_{s}^{(2)}(v)\in{\cal C}^{\rho}([0,1])$ ,

[TABLE]

with $\displaystyle B_{s}(u)=\frac{d_{s}^{\prime\prime}(u)}{2}\,\int_{\mathbb{R}}z^{2}K(z)\,dz$ .

**b. **

Now we are going to prove a first consistency result for $J_{n}/\widehat{D}^{(n)}_{s}(u)$ using the Markov Inequality. Indeed,

[TABLE]

Now using Lemma 4.1 with $c(v)=|a_{s}(v)-a_{s}(u)|$ which also belongs in ${\cal C}^{\rho}([0,1])$ (this is clear if $\rho<1$ and, for $\rho=1$ the Lipschitz property of $z\mapsto|z|$ allows to conclude), and $c(u)=0$ , we derive:

[TABLE]

Therefore, if $b_{n}=o\big{(}n^{-\frac{1}{1+2(\rho\wedge 1}}\big{)}$ , then $\mathbb{E}J_{n}\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ and $\mathbb{E}|J_{n}|\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ , implying from Markov Inequality, $J_{n}\begin{array}[t]{c}\stackrel{{\scriptstyle{\mathbb{P}}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ . Finally, since (4.36) establishes the consistency of $\widehat{D}^{(n)}_{s}(u)$ , from Slutsky lemma, we deduce

[TABLE]

As a consequence, the proof of the Theorem results by using the decomposition (4.27), the consistency results (4.37) and (4.43).

Proof of Theorem 2.2. We restrict to the case $\rho\in(1,2]$ .

**a. **

Case $\mathbb{E}\big{(}\xi_{0}^{4}\big{)}<\infty$ .

Denote again $\displaystyle K_{t}=K\Big{(}\frac{\frac{t}{nT}-u}{b_{n}}\Big{)}$ , for $t\in\mathbb{Z}$ . First remark that the symmetry assumption on $\xi_{0}$ ’s distribution implies $\mathbb{E}\big{(}\xi_{0}\big{)}=\mathbb{E}\big{(}\xi_{0}^{3}\big{)}=0$ .

[TABLE]

with $L_{n,s,\alpha}=\big{\{}(t,t^{\prime})\in I^{2}_{n,s},~{}\,|t-t^{\prime}|\leq\frac{\log n}{\log\alpha}\big{\}}$ .

Firstly, consider the first left side term of the last inequality. If $t\in I_{n,s}$ then Proposition 2.1 entails $\mbox{Var}\,(X_{t}^{2})=\gamma_{s}^{(2)}(t/(nT))+{\cal O}(1/n)$ for an adequate function $\gamma_{s}^{(2)}\in{\cal C}^{\rho}([0,1])$ .

Hence we also have $\mbox{Var}\,(X_{t}^{2})=\gamma_{s}^{(2)}(t/(nT))+{\cal O}(\log(n)/n)$ .

Here the fact that $(z\mapsto z^{2})$ is a function in ${\cal C}^{\rho}$ , implies that the function defined from $b(v)=\big{(}a_{s}(v)-a_{s}(u)\big{)}^{2}$ is in ${\cal C}^{\rho}([0,1])$ too, and again $b(u)=0$ and $\int xH^{2}(x)dx=0$ .

Therefore, we use Lemma 4.1 to derive:

[TABLE]

with $\displaystyle g_{j}(x)=\big{(}a_{s}(x)-a_{s}(u)\big{)}^{2}\prod_{i=1}^{j}\big{(}\gamma_{s}^{(4)}(x)$ , since for $n$ large enough the above expression satisfies $\big{|}{\cal O}\big{(}\frac{\log n}{nb_{n}^{2}}\big{)}\big{|}\leq 1$ . Using Lemma 4.1, with functions $H=K^{2}$ and $c=g_{j}$ with $g_{j}\in{\cal C}^{\rho}([0,1])$ (quote that $\max_{i\leq j}\big{(}\|g_{i}\|\vee\mbox{Lip}\,(g_{i})\big{)}={\cal O}(j)$ ), we finally obtain:

[TABLE]

Secondly, from Proposition 2.1, for $t,t^{\prime}\in I_{n,s}^{2}\setminus L_{n,s,\alpha}$ , we have

[TABLE]

Thus,

[TABLE]

from Lemma 4.1. Then, (4.44) and (4.45) provide

[TABLE]

implying $\mbox{Var}\,\big{(}J_{n}\big{)}\begin{array}[t]{c}\stackrel{{\scriptstyle}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ for any $(b_{n})$ such as

[TABLE]

**b. **

Case $\mathbb{E}\big{(}|\xi_{0}|^{\beta}\big{)}<\infty$ , for some $\beta\in[2,4]$ .

From its expression given in (4.29), $J_{n}$ is a quadratic form of $(X_{t})$ and therefore, as $X_{t}$ is a linear process with innovations $(\xi_{t})$ , $J_{n}$ is also a quadratic form of $(\xi_{t})$ . As a consequence, the fourth order moment can be injected such as there exists a sequence $z_{n}\downarrow 0$ (as $n\uparrow\infty$ ) satisfying:

[TABLE]

Now, assume only that $\mathbb{E}(\xi_{0}^{2})<\infty$ . The innovations $(\xi_{t})$ can be truncated at level $M$ , and write

[TABLE]

Note that the symmetry assumption entails $\mathbb{E}(\xi_{j,M})=0$ . Define also Define also

[TABLE]

A consequence of (4.47) is:

[TABLE]

with $h(M)=\mathbb{E}\big{(}|\xi_{0}|^{2}\mbox{\hskip 1.99997ptI1}_{\{|\xi_{0}|>M\}}\big{)}$ which satisfies $\lim_{M\to\infty}h(M)=0$ .

Moreover,

[TABLE]

But

[TABLE]

We first remark from Proposition 2.1 that $\mathbb{E}(X^{(n)}_{j-1})^{2}+\mathbb{E}(X^{(n)}_{j-1,M})^{2}\leq c$ for some constant $c>0$ . Hence, Cauchy-Schwartz Inequality shows that, for each $j$ :

[TABLE]

with $\delta_{j-1,M}=\mathbb{E}\big{(}|X^{(n)}_{j-1}-X^{(n)}_{j-1,M}|^{2}\big{)}$ .

We are going to bound $\delta_{j-1,M}$ . A first simple bound is clearly $\delta_{j-1,M}\leq 2\,c$ and we use it together with (4.50), and Cauchy-Schwartz inequality in order to derive

[TABLE]

since $\delta_{0,M}\leq h(M)\leq H(M)$ .

Now, from (4.51), we obtain for $M$ large enough:

[TABLE]

with $C>0$ and always with $h(M)=\mathbb{E}\big{(}|\xi_{0}|^{2}\mbox{\hskip 1.99997ptI1}_{\{|\xi_{0}|>M\}}\big{)}$ . Now a careful use of (4.42) and (4.49) entails:

[TABLE]

since $x\to|a(x)-a(u)|$ is a ${\cal C}^{1}$ function (in the above defined sense). Finally, using Cauchy-Schwartz inequality in (4.48), we obtain for $M$ large enough,

[TABLE]

assuming $A_{n}/nb_{n}=o(b_{n}^{\rho/2})$ i.e. $(n/A_{n})^{-2/(2+\rho)}=o(b_{n})$ (and note that $-2/(2+\rho)\leq 1/(1+2\rho)$ ).

Now, if $\mathbb{E}\big{(}|\xi_{0}|^{\beta}\big{)}<\infty$ with $\beta\in(2,4]$ , then using Hölder and Markov Inequalities, there exists $C_{\beta}>0$ such as

[TABLE]

Since here $b_{n}=o\big{(}n^{-1/(1+2\rho)}\big{)}$ , does not yields the minimax rates, we deduce that

[TABLE]

Thus, from inequality (4.54), we deduce that the optimal choice is obtained when

[TABLE]

d.

Case $\rho=2$ .

The expression of the non-central limit for the case of optimal window widths and the expansion of the bias (4.41) now the asymptotic expression for (4.43) yields the proposed non-centred Gaussian limit, see Remark 4.1. The same truncation step as above is also needed.

The proof is now complete.

Remark 4.1.

*Using the previous bound (4.38) of $\mathbb{E}J_{n}$ and Bienaymé-Tchebychev inequality, we deduce that if $b_{n}=o\big{(}n^{-1/(1+2\rho)}\big{)}$ then $J_{n}\begin{array}[t]{c}\stackrel{{\scriptstyle{\mathbb{P}}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}0$ .

Moreover, if $\rho=2$ and $b_{n}=c\,n^{-1/5}$ , using the expansion (4.41) of $\mathbb{E}J_{n}$ and again Bienaymé-Chebychev inequality, then $J_{n}\begin{array}[t]{c}\stackrel{{\scriptstyle{\mathbb{P}}}}{{\longrightarrow}}\\ {\scriptstyle n\rightarrow+\infty}\end{array}B_{s}(u)\,c^{5/2}$ .

Therefore with the consistency result (4.36), for any $u\in(0,1)$ and $s\in\{1,\ldots,T\}$ ,*

[TABLE]

Remark 4.2.

For the general case with maybe $\xi_{0}$ non symmetric and $\mathbb{E}\xi_{0}=0$ , the item 3. of Proposition 2.1 needs some improvements. Denote $w^{(k)}_{t}=\mathbb{E}(X^{k}_{t})$ for $k=1,3$ , then $w^{(4)}_{t}=w_{t}$ and $w^{(2)}_{t}=v_{t}$ , then (4.8) turns to be written

[TABLE]

*as previously $\sup_{t}w_{t}<\infty$ .

We need to derive suitable equivalents of $w^{(k)}_{t}$ if $k=1$ . Firstly*

[TABLE]

*and in fact this term is negligible and the proof of Proposition 2.1 and Lemma 3. remains unchanged.

In this case the proof of the above point 2/ c. needs a simple improvement and*

[TABLE]

In this truncated setting, inequality (4.50) writes:

[TABLE]

so that the end of the proof is unchanged by only setting $C={2c\mathbb{E}\xi_{0}^{2}}/{(1-\alpha)}$ .

Remark 4.3.

Secondly, in case we even omit the condition $\mathbb{E}\xi_{0}=0$ one needs to also express an asymptotic expansion for $w^{(3)}_{t}=\mathbb{E}A_{t}^{3}w^{(3)}_{t-1}+3\mathbb{E}A_{t}w^{(1)}_{t-1}\sigma^{2}+\mu_{3}\sim\mathbb{E}A_{t}^{3}w^{(3)}_{t-1}+\mu_{3}$ ; an analogue expansion to Proposition 2.1 and Lemma 3. may thus be derived. Namely $\displaystyle w^{(3)}_{t}=\gamma_{s}^{(3)}(\frac{t}{nT})+{\cal O}\big{(}\frac{1}{n}\big{)},$ , with

[TABLE]

Then the expression of the equivalent of $w_{t}$ is also adequately transformed up to the above relations.

Aknowledgement.

This work has been developed within the “MME-DII centre of excellence” (ANR-11-LABEX-0023-01) and with the help of PAI- CONICYT MEC Nr. 80170072.

The authors thank the referees for their fruitful comments and suggestions, which notably improved the quality of the paper. The second author wishes to thank Rainer Dahlhaus for many interesting discussions. As well, numerous discussions with Karine Bertin were extremely useful.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Andrews, D. Laws of large numbers for dependent non-identically distributed random variables. Econometric Theory 4 , 3 (1988), 458–467.
2[2] Azrak, R. and Mélard, G. Asymptotic properties of quasi-maximum likelihood estimators for ARMA models with time-dependent coefficients. Statistical Inference for Stochastic Processes 9 (2006), 279–330.
3[3] Bibi, A. and Francq, C. Consistent and asymptotically normal estimators for cyclically time-dependent linear models. Annals of the Institute of Statistical Mathematics 55 (2003), 41–68.
4[4] Dacunha-Castelle, D., Huong Hoang, H. T. and Parey, S. Modeling of air temperatures: preprocessing and trends, reduced stationary process, extremes, simulation. Journal de la Société Française de Statistique 156 , 2 (2015), 138–168.
5[5] Dahlhaus, R. On the Kullback-Leibler information divergence of locally stationary processes. Stochastic Processes and Applications 62 (1996), 139–168.
6[6] Dahlhaus, R. Fitting time series models to nonstationary processes. Annals of Statistics 25 (1997), 1–37.
7[7] Dahlhaus, R. Locally Stationary Processes , vol. 30. Time Series Analysis: Methods and Applications, Elsevier, 2012.
8[8] Dahlhaus, R. and Polonik, W. Empirical spectral processes for locally stationary time series. Bernoulli 15 (2009), 1–39.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Non-parametric estimation of time varying AR(1)–processes with local stationarity and periodicity

Abstract

keywords:

keywords:

1 Introduction

2 Asymptotic normality of a non-parametric estimator for periodic tvAR(1) processes

2.1 Definition and first properties of the process

Definition 2.1**.**

Remark 2.1**.**

Proposition 2.1**.**

2.2 Asymptotic normality of the estimator

Theorem 2.1**.**

Theorem 2.2**.**

Remark 2.2**.**

Remark 2.3**.**

Remark 2.4**.**

Remark 2.5**.**

3 Monte-Carlo experiments and an application to climatic data

3.1 Monte-Carlo experiments

3.2 Numerical application on climatic data

4 Proofs

Lemma 4.1**.**

Lemma 4.2**.**

Lemma 4.3**.**

Remark 4.1**.**

Remark 4.2**.**

Remark 4.3**.**

Aknowledgement.

Definition 2.1.

Remark 2.1.

Proposition 2.1.

Theorem 2.1.

Theorem 2.2.

Remark 2.2.

Remark 2.3.

Remark 2.4.

Remark 2.5.

Lemma 4.1.

Lemma 4.2.

Lemma 4.3.

Remark 4.1.

Remark 4.2.

Remark 4.3.