On the Koml\'os, Major and Tusn\'ady strong approximation for some   classes of random iterates

Christophe Cuny (ERIM); J\'er\^ome Dedecker (MAP5); Florence; Merlev\`ede (LAMA)

arXiv:1706.08282·math.PR·June 27, 2017

On the Koml\'os, Major and Tusn\'ady strong approximation for some classes of random iterates

Christophe Cuny (ERIM), J\'er\^ome Dedecker (MAP5), Florence, Merlev\`ede (LAMA)

PDF

Open Access

TL;DR

This paper extends the Komlós, Major and Tusnádý strong approximation results to functions of random iterates within a Markovian framework, providing new dependent conditions for approximation with rate o(n^{1/p}).

Contribution

It adapts existing methods to Markovian settings, introducing natural coupling conditions that broaden applicability to various stochastic models.

Findings

01

Established strong approximation with rate o(n^{1/p}) for functions of random iterates.

02

Provided new dependent conditions based on L-infinity or L-1 coupling.

03

Demonstrated the optimality of the L-1 coupling condition.

Abstract

The famous results of Koml\'os, Major and Tusn\'ady (see [15] and [17]) state that it is possible to approximate almost surely the partial sums of size n of i.i.d. centered random variables in L p (p > 2) by a Wiener process with an error term of order o(n 1/p). Very recently, Berkes, Liu and Wu [3] extended this famous result to partial sums associated with functions of an i.i.d. sequence, provided a condition on a functional dependence measure in L p is satisfied. In this paper, we adapt the method of Berkes, Liu and Wu to partial sums of functions of random iterates. Taking advantage of the Markovian setting, we shall give new dependent conditions, expressed in terms of a natural coupling (in L $\infty$ or in L 1), under which the strong approximation result holds with rate o(n 1/p). As we shall see our conditions are well adapted to a large variety of models, including left random…

Equations562

\int_{G} (lo g N (g))^{p} μ (d g) < \infty,

\int_{G} (lo g N (g))^{p} μ (d g) < \infty,

n \to \infty lim \frac{1}{n} lo g ∥ A_{n} ∥ = λ_{μ} P -a.s.,

n \to \infty lim \frac{1}{n} lo g ∥ A_{n} ∥ = λ_{μ} P -a.s.,

lo g ∥ A_{n} x ∥ .

lo g ∥ A_{n} x ∥ .

X_{n, x} := h (ε_{n}, W_{n - 1, x}), n \geq 1,

X_{n, x} := h (ε_{n}, W_{n - 1, x}), n \geq 1,

h (g, y) = lo g (\frac{∥ g \cdot y ∥}{∥ y ∥}) .

h (g, y) = lo g (\frac{∥ g \cdot y ∥}{∥ y ∥}) .

S_{n, x} = k = 1 \sum n X_{k, x} = lo g ∥ A_{n} x ∥ .

S_{n, x} = k = 1 \sum n X_{k, x} = lo g ∥ A_{n} x ∥ .

lo g ∥ A_{n} x ∥ - n λ_{μ} - i = 1 \sum n N_{i} = o (n^{1/ p} lo g n) a.s.

lo g ∥ A_{n} x ∥ - n λ_{μ} - i = 1 \sum n N_{i} = o (n^{1/ p} lo g n) a.s.

∥ x ∥ = 1, ∥ y ∥ = 1 sup E (∣ X_{k, x} - X_{k, y} ∣) .

∥ x ∥ = 1, ∥ y ∥ = 1 sup E (∣ X_{k, x} - X_{k, y} ∣) .

\iint E (∣ X_{k, x} - X_{k, y} ∣) ν (d x) ν (d y),

\iint E (∣ X_{k, x} - X_{k, y} ∣) ν (d x) ν (d y),

W_{n} = F (ε_{n}, W_{n - 1}),

W_{n} = F (ε_{n}, W_{n - 1}),

X_{n} = h (ε_{n}, W_{n - 1}) .

X_{n} = h (ε_{n}, W_{n - 1}) .

X_{n}^{*} = h (ε_{n}, W_{n - 1}^{*}) \mbox w i t h W_{n}^{*} = F (ε_{n}, W_{n - 1}^{*}) .

X_{n}^{*} = h (ε_{n}, W_{n - 1}^{*}) \mbox w i t h W_{n}^{*} = F (ε_{n}, W_{n - 1}^{*}) .

δ_{\infty} (n) = ∥ E (∣ X_{n} - X_{n}^{*} ∣ ∣ (W_{0}, W_{0}^{*})) ∥_{\infty}, n \geq 1,

δ_{\infty} (n) = ∥ E (∣ X_{n} - X_{n}^{*} ∣ ∣ (W_{0}, W_{0}^{*})) ∥_{\infty}, n \geq 1,

δ_{\infty} (n) \leq c n^{- q} with q > (p - 1) /2,

δ_{\infty} (n) \leq c n^{- q} with q > (p - 1) /2,

n \geq 1 sup ∥ E (X_{n}^{2} ∣ G_{n - 1}) ∥_{\infty} \leq c .

n \geq 1 sup ∥ E (X_{n}^{2} ∣ G_{n - 1}) ∥_{\infty} \leq c .

S_{n} - n E (X_{1}) - i = 1 \sum n N_{i} = o (n^{1/ p}) P -a.s.

S_{n} - n E (X_{1}) - i = 1 \sum n N_{i} = o (n^{1/ p}) P -a.s.

δ (0) = δ (1) = E (∣ X_{1} ∣) and δ (n) = 2^{- 1} k \geq n - 1 sup ∥ X_{k} - X_{k}^{*} ∥_{1}, n \geq 2 .

δ (0) = δ (1) = E (∣ X_{1} ∣) and δ (n) = 2^{- 1} k \geq n - 1 sup ∥ X_{k} - X_{k}^{*} ∥_{1}, n \geq 2 .

δ (x) = δ ([x])

δ (x) = δ ([x])

δ^{- 1} (u) = in f {q \in N : δ (q) \leq u} = n \geq 0 \sum 1_{u < δ (n)} .

δ^{- 1} (u) = in f {q \in N : δ (q) \leq u} = n \geq 0 \sum 1_{u < δ (n)} .

n \geq 1 \sum n^{p - 2} \int_{0}^{δ (n)} Q^{p - 1} \circ H^{- 1} (u) d u < \infty .

n \geq 1 \sum n^{p - 2} \int_{0}^{δ (n)} Q^{p - 1} \circ H^{- 1} (u) d u < \infty .

S_{n} - n E (X_{1}) - i = 1 \sum n N_{i} = o (n^{1/ p}) P -a.s.

S_{n} - n E (X_{1}) - i = 1 \sum n N_{i} = o (n^{1/ p}) P -a.s.

\gamma(x)=H^{-1}(\delta([x]))\mbox{ for any $x\geq 0$ and }\gamma^{-1}(u)=\delta^{-1}\circ H(u)\mbox{ for any $u\in[0,1]$}\,,

\gamma(x)=H^{-1}(\delta([x]))\mbox{ for any $x\geq 0$ and }\gamma^{-1}(u)=\delta^{-1}\circ H(u)\mbox{ for any $u\in[0,1]$}\,,

n \geq 1 \sum n^{p - 2} \int_{0}^{γ (k)} Q^{p} (u) d u < \infty,

n \geq 1 \sum n^{p - 2} \int_{0}^{γ (k)} Q^{p} (u) d u < \infty,

\int_{0}^{1} R^{p - 1} (u) Q (u) d u < \infty where R (u) = γ^{- 1} (u) Q (u),

\int_{0}^{1} R^{p - 1} (u) Q (u) d u < \infty where R (u) = γ^{- 1} (u) Q (u),

\|X_{1}\|_{r}\,\mbox{ for some $r>p$, and }\,\sum_{n\geq 1}n^{(pr-2r+1)/(r-p)}\delta(n)<\infty\,,

\|X_{1}\|_{r}\,\mbox{ for some $r>p$, and }\,\sum_{n\geq 1}n^{(pr-2r+1)/(r-p)}\delta(n)<\infty\,,

T^{*} = in f {k \in N : W_{k} = W_{k}^{*}},

T^{*} = in f {k \in N : W_{k} = W_{k}^{*}},

δ (n) \leq \int_{0}^{P_{ν \otimes ν} (T^{*} \geq n)} Q (u) d u .

δ (n) \leq \int_{0}^{P_{ν \otimes ν} (T^{*} \geq n)} Q (u) d u .

n \geq 0 \sum (n + 1)^{p - 2} \int_{0}^{P_{ν \otimes ν} (T^{*} \geq n)} Q^{p} (u) d u < \infty .

n \geq 0 \sum (n + 1)^{p - 2} \int_{0}^{P_{ν \otimes ν} (T^{*} \geq n)} Q^{p} (u) d u < \infty .

\|X_{1}\|_{r}\,\mbox{ for some $r>p$, and }\,\sum_{n\geq 1}n^{(pr-2r+p)/(r-p)}{\mathbb{P}}_{\nu\otimes\nu}(T^{*}\geq n)<\infty\,,

\|X_{1}\|_{r}\,\mbox{ for some $r>p$, and }\,\sum_{n\geq 1}n^{(pr-2r+p)/(r-p)}{\mathbb{P}}_{\nu\otimes\nu}(T^{*}\geq n)<\infty\,,

n \geq 1 \sum n^{p - 2} P_{ν \otimes ν} (T^{*} \geq n) < \infty .

n \geq 1 \sum n^{p - 2} P_{ν \otimes ν} (T^{*} \geq n) < \infty .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsProbability and Risk Models · Stochastic processes and financial applications · Financial Risk and Volatility Modeling

Full text

On the Komlós, Major and Tusnády strong approximation for some classes of random iterates

Christophe Cuny111Université de la Nouvelle-Calédonie, Equipe ERIM. Email: [email protected], Jérôme Dedecker222Université Paris Descartes, Sorbonne Paris Cité, Laboratoire MAP5 (UMR 8145). Email: [email protected] and Florence Merlevède333Université Paris-Est, LAMA (UMR 8050), UPEM, CNRS, UPEC. Email: [email protected]

Abstract

The famous results of Komlós, Major and Tusnády (see [15] and [17]) state that it is possible to approximate almost surely the partial sums of size $n$ of i.i.d. centered random variables in ${\mathbb{L}}^{p}$ ( $p>2$ ) by a Wiener process with an error term of order $o(n^{1/p})$ . Very recently, Berkes, Liu and Wu [3] extended this famous result to partial sums associated with functions of an i.i.d. sequence, provided a condition on a functional dependence measure in ${\mathbb{L}}_{p}$ is satisfied. In this paper, we adapt the method of Berkes, Liu and Wu to partial sums of functions of random iterates. Taking advantage of the Markovian setting, we shall give new dependent conditions, expressed in terms of a natural coupling (in ${\mathbb{L}}^{\infty}$ or in ${\mathbb{L}}^{1}$ ), under which the strong approximation result holds with rate $o(n^{1/p})$ . As we shall see our conditions are well adapted to a large variety of models, including left random walks on $GL_{d}({\mathbb{R}})$ , contracting iterated random functions, autoregressive Lipschitz processes, and some ergodic Markov chains. We also provide some examples showing that our ${\mathbb{L}}^{1}$ -coupling condition is in some sense optimal.

1 Introduction

In this paper we shall adapt the approach of Berkes-Liu-Wu [3] to certain classes of Markov chains. To motivate this work, let us describe in detail the example of the left random walk on $GL_{d}(\mathbb{R})$ , $d\geq 2$ (the group of invertible $d$ -dimensional real matrices).

Let $(\varepsilon_{n})_{n\geq 1}$ be independent random matrices taking values in $G=GL_{d}(\mathbb{R})$ , with common distribution $\mu$ . Let $\|\cdot\|$ be the euclidean norm on ${\mathbb{R}}^{d}$ . We shall say that $\mu$ has a moment of order $p\geq 1$ if

[TABLE]

where $N(g):=\max(\|g\|,\|g^{-1}\|)$ .

Let $A_{0}={\rm Id}$ and for every $n\geq 1$ , $A_{n}=\varepsilon_{n}\cdots\varepsilon_{1}$ . Recall that if $\mu$ admits a moment of order $1$ then

[TABLE]

where $\lambda_{\mu}:=\lim_{n\to+\infty}n^{-1}\mathbb{E}(\log\|\varepsilon_{n}\cdots\varepsilon_{1}\|)$ is the so-called first Lyapunov exponent (see for instance [14]). For any $x\in S^{d-1}$ , we want to describe as precisely as possible the asymptotic behavior of the quantity

[TABLE]

The left random walk of law $\mu$ started at $x\in S^{d-1}$ is the Markov chain defined by $W_{0,x}:=x$ and $W_{n,x}=\varepsilon_{n}W_{n-1,x}$ for $n\geq 1$ . As usual, to handle the quantity (3), we consider the partial sums associated with the random variables $(X_{n,x})_{n\geq 1}$ given by

[TABLE]

where for every $g\in G$ and every $y\in{\mathbb{R}}^{d}-\{0\}$ ,

[TABLE]

By definition of $h$ , and since $X_{n,x}=h(\varepsilon_{n},A_{n-1}x)$ , we easily see that, for any $x\in S^{d-1}$ ,

[TABLE]

Hence, the asymptotic behavior of (3) can be deduced from the asymptotic behavior of partial sums of functions of the Markov chain $W_{n,x}$ .

This problem can be tackled under some assumptions on $\mu$ (strong irreducibility and proximality, see subsection 3.1 for more details) which implies that the chain $(W_{n})_{n\geq 0}$ admits an unique invariant measure $\nu$ defined on the projective space $X:=P_{d-1}({\mathbb{R}}^{d})$ of ${\mathbb{R}}^{d}-\{0\}$ . Under these assumptions on $\mu$ , and assuming moreover that $\mu$ has a moment of order $p\in(2,4)$ , Cuny-Dedecker-Jan [7] proved the following strong approximation result: there exists $\sigma^{2}\geq 0$ such that, for every (fixed) $x\in S^{d-1}$ , one can redefine $(\log\|A_{n}x\|)_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

If $\mu$ has a moment of order $p=4$ , the same authors showed that this strong approximation holds with a rate of order $O(n^{1/4}\sqrt{\log(n)}(\log\log n)^{1/4})$ .

To prove (5), Cuny-Dedecker-Jan used a martingale approximation (as described for instance in Cuny-Merlevède [9]), together with some appropriate upper bounds on the quantities

[TABLE]

The main drawback of this approach is that it cannot give a better rate than $n^{1/4}$ , because it is based on the Skorokhod representation theorem for martingales.

On another hand, since the stationary Markov chain $W_{n}$ is a function of the starting point $W_{0}$ and of the “innovations” $\varepsilon_{1},\cdots,\varepsilon_{n}$ , one can also apply the approximation results by Berkes-Liu-Wu (in fact, this is not completely immediate because it does not fit exactly into the framework described by these authors, and some extra work is required there). Doing so, one can reach a rate of order $n^{1/p}$ for any $p>2$ , but only by assuming that $\mu$ has a moment of order $q(p)>p$ . More precisely, their functional measure of dependence in ${\mathbb{L}}_{p}$ , say $\delta_{k,p}$ , can be bounded by $\sup_{\|x\|=1,\|y\|=1}\|X_{k,x}-X_{k,y}\|_{p}$ . Hence, applying Proposition 3 in [7], one can see that condition (2.3) in [3] is satisfied provided $\mu$ has at least a moment of order $(5p/2)-1$ . This is somewhat surprising: on the one hand, one can go beyond the rate of order $n^{1/4}$ , and on the other hand we need stronger assumptions than in Cuny-Dedecker-Jan [7] to get the rate $n^{1/p}$ when $p\in(2,4)$ .

This gave us a strong motivation to understand completely the proof by Berkes-Liu-Wu [3], and to see whether it is possible to take advantage of the Markovian setting to get the rate $n^{1/p}$ in (5) under a moment of order $p$ , for any $p>2$ . As we shall see in this paper, the answer is positive.

As already mentioned, in the case of the left random walk on $GL_{d}({\mathbb{R}})$ , one can get a control on the quantities defined in (6). However, in many other cases of random iterates, such a control is not possible, while one can get some upper bounds on

[TABLE]

where $\nu$ is the invariant distribution of the chain $(W_{n})_{n\geq 1}$ .

Consequently, we shall establish two distinct results, with different range of applicability. In Theorem 1, we give a strong approximation result under conditions involving some quantities similar to (6). In Theorem 2 the conditions are expressed in terms of the quantities (7). The second Theorem applies to a large variety of examples, including some well known examples of irreducible and aperiodic Markov Chains with countable or continuous state space. These examples of ergodic Markov chains will allow us to prove that the conditions given in Theorem 2 are in some sense optimal.

In all the paper, we shall use the notation $a_{n}\ll b_{n}$ , which means that there exists a positive constant $C$ not depending on $n$ such that $a_{n}\leq Cb_{n}$ , for all positive integers $n$ .

2 Main results

Let $(\Omega,{\mathcal{A}},{\mathbb{P}})$ be a probability space, and let $(\varepsilon_{i})_{i\geq 1}$ be iid random variables defined on $\Omega$ , with values in a measurable space $G$ and with common distribution $\mu$ . Let $W_{0}$ be a random variable defined on $\Omega$ with values in a measurable space $X$ , independent of $(\varepsilon_{i})_{i\geq 1}$ , and let $F$ be a measurable function from $G\times X$ to $X$ . For any $n\geq 1$ , define

[TABLE]

and assume that $(W_{n},n\geq 1)$ has a stationary distribution $\nu$ . Let now $h$ be a measurable function from $G\times X$ to ${\mathbb{R}}$ and define, for any $n\geq 1$ ,

[TABLE]

Then $(X_{n})_{n\geq 1}$ forms a stationary sequence with stationary distribution, say $\pi$ . Let $({\mathcal{G}}_{i})_{i\in{\mathbb{Z}}}$ be the non-decreasing filtration defined as follows: for any $i<0$ , ${\mathcal{G}}_{i}=\{\emptyset,\Omega\}$ , ${\mathcal{G}}_{0}=\sigma(W_{0})$ and for any $i\geq 1$ , ${\mathcal{G}}_{i}=\sigma(\varepsilon_{i},\ldots,\varepsilon_{1},W_{0})$ . It follows that for any $n\geq 1$ , $X_{n}$ is ${\mathcal{G}}_{n}$ -measurable.

Our first result proves that the strong approximation result holds with rate $n^{1/p}$ when the stationary distribution $\pi$ has a moment of order $p>2$ and we impose that the sequence of coupling coefficients $({\delta}_{\infty}(n))_{n\geq 1}$ defined in (10) decreases arithmetically to zero plus the condition (12). As we shall see in Section 3, these conditions are satisfied for instance for the left random walk on $GL_{d}({\mathbb{R}})$ .

Let $W_{0}$ and $W_{0}^{*}$ be random variables with law $\nu$ , and such that $W_{0}^{*}$ is independent of $(W_{0},(\varepsilon_{i})_{i\geq 1})$ . For any $n\geq 1$ , let

[TABLE]

Define then

[TABLE]

where, above and in all the rest of the paper, the infinite norm is the usual essential supremum norm.

Theorem 1

Let $(X_{n},n\geq 1)$ be the stationary sequence defined by (8) and assume that its stationary distribution $\pi$ has moment of order $p>2$ . Assume in addition that there exists a positive constant $c$ such that for any $n\geq 1$ ,

[TABLE]

where $(\delta_{\infty}(n))_{n\geq 1}$ is defined in (10), and that

[TABLE]

Let $S_{n}=\sum_{k=1}^{n}X_{k}$ . Then $n^{-1}\mathbb{E}\big{(}(S_{n}-n\mathbb{E}(X_{1}))^{2}\big{)}\rightarrow\sigma^{2}$ as $n\rightarrow\infty$ and one can redefine $(X_{n})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

In the rest of this section, we shall give conditions expressed in terms of the quantities $\|X_{n}-X_{n}^{*}\|_{1}$ for the strong approximation (13) to hold. Before stating the result, we need to introduce some notations:

For any $n\geq 0$ , let us define the sequence $(\delta(n))_{n\geq 0}$ as follows

[TABLE]

These quantities are finite if $\pi$ has a moment of order $1$ .

For any $x\geq 0$ , denote by

[TABLE]

and, for any $u\in[0,\mathbb{E}(|X_{1}|)]$ , let

[TABLE]

Denote also by $Q$ the quantile function associated with $|X|$ where $X$ is a random variable with law $\pi$ : it is then the generalized inverse of the tail function $t\mapsto{\mathbb{P}}(|X|>t)=\pi((-\infty,-t[)+\pi(]t,\infty))$ . Let $H$ be the function from $[0,1]$ to ${\mathbb{R}}^{+}$ defined by $H(x)=\int_{0}^{x}Q(u)du$ . We shall assume the following condition

[TABLE]

Theorem 2

Let $(X_{n},n\geq 1)$ be a stationary sequence defined by (8) and assume that its stationary distribution $\pi$ has a moment of order $p>2$ . Assume in addition that condition (14) holds. Let $S_{n}=\sum_{k=1}^{n}X_{k}$ . Then $n^{-1}\mathbb{E}\big{(}(S_{n}-n\mathbb{E}(X_{1}))^{2}\big{)}\rightarrow\sigma^{2}$ as $n\rightarrow\infty$ and one can redefine $(X_{n})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

Remark 3

If we define

[TABLE]

then condition (14) can be rewritten as

[TABLE]

which also reads as

[TABLE]

Remark 4

Sufficient conditions for (14) to hold in terms of moments (or weak moments) of $\pi$ can be given by using Lemma 2 in Dedecker and Doukhan [10]. For instance, if

[TABLE]

then condition (14) is satisfied. Note that in the case where $\|X_{1}\|_{\infty}<\infty$ , condition (14) is equivalent to $\sum_{n\geq 1}n^{p-2}\delta(n)<\infty$ .

If we define the following meeting time

[TABLE]

it follows that, for any $n\geq 2$ ,

[TABLE]

Therefore the following corollary holds.

Corollary 5

Let $(X_{n},n\geq 1)$ be the stationary sequence defined by (8) and assume that its stationary distribution $\pi$ has a moment of order $p>2$ . Assume in addition that

[TABLE]

Then the conclusions of Theorem 2 hold.

According to the computations given in Annex C of Rio [23], if

[TABLE]

then condition (20) is satisfied. In the case where $\|X_{1}\|_{\infty}<\infty$ , condition (20) is equivalent to

[TABLE]

Propositions 15 and 18 in Section 3.3 will show that condition (22) is optimal in some sense.

3 Applications

3.1 Left random

walk on $GL_{d}({\mathbb{R}})$

As in the introduction, let $(\varepsilon_{n})_{n\geq 1}$ be independent random matrices taking values in $G=GL_{d}(\mathbb{R})$ , $d\geq 2$ , with common distribution $\mu$ . let $A_{0}={\rm Id}$ and for every $n\geq 1$ , $A_{n}=\varepsilon_{n}\cdots\varepsilon_{1}$ .

Let $\|\cdot\|$ be the euclidean norm on ${\mathbb{R}}^{d}$ . Recall that $\mu$ has a moment of order $p\geq 1$ if (1) holds. Recall also that if $\mu$ admits a moment of order $1$ then (2) holds, and the quantity $\lambda_{\mu}$ is well defined.

Let $X:=P_{d-1}({\mathbb{R}}^{d})$ be the projective space of ${\mathbb{R}}^{d}-\{0\}$ and write ${\bar{x}}$ as the projection of $x\in{\mathbb{R}}^{d}-\{0\}$ to $X$ . We assume that $\mu$ is strongly irreducible (i.e. that no proper finite union of subspaces of ${\mathbb{R}}^{d}$ are invariant by $\Gamma_{\mu}$ , the closed semi-group generated by the support of $\mu$ ) and proximal (i.e. that there exists a matrix in $\Gamma_{\mu}$ admitting a unique (with multiplicity one) eigenvalue with maximum modulus). Under those assumptions (see e.g. Bougerol-Lacroix [4] or Benoist-Quint [2]) it is well-known that there exists a unique invariant measure $\nu$ on ${\mathcal{B}}(X)$ , meaning that for any continuous and bounded function $f$ from $X$ to $\mathbb{R}$ ,

[TABLE]

The left random walk of law $\mu$ is the process defined by $W_{0}:=\varepsilon_{0}$ and $W_{n}=\varepsilon_{n}W_{n-1}$ for $n\geq 1$ where we assume that $\varepsilon_{0}$ is independent of $(\varepsilon_{n})_{n\geq 1}$ . As explained in the introduction, our aim is to study the partial sums associated with the random sequence $(X_{n})_{n\geq 1}$ given by

[TABLE]

where for every $g\in G$ and every ${\bar{x}}\in X$ ,

[TABLE]

As usual, we shall denote by $X_{n,{\bar{x}}}$ the random variable for which $W_{0}={\bar{x}}$ . We then define $S_{n,{\bar{x}}}=\sum_{k=1}^{n}X_{n,{\bar{x}}}$ and recall that the identity (4) holds: for any $x\in S^{d-1}$ ,

[TABLE]

Applying Theorem 1, the following strong approximation with rate holds.

Corollary 6

Let $\mu$ be a proximal and strongly irreducible probability measure on ${\mathcal{B}}(G)$ . Assume that $\mu$ has a moment of order $p>2$ . Then $n^{-1}\mathbb{E}_{\nu}\big{(}(S_{n}-n\lambda_{\mu})^{2}\big{)}\rightarrow\sigma^{2}$ as $n\rightarrow\infty$ and for every (fixed) ${\bar{x}}\in X$ , one can redefine $(S_{n,{\bar{x}}})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

Remark 7

It follows from item $c)$ of Theorem 4.11 of Benoist-Quint [2] that $\sigma>0$ if $\mu$ is strongly irreducible and the image of $\Gamma_{\mu}$ in $PGL_{d}({\mathbb{R}})$ is unbounded.

Proof of Corollary 6. Using the same arguments as in Cuny-Dedecker-Jan [7] (see the proof of their Theorem 1), we infer that it suffices to prove the result on stationary regime. More precisely, it suffices to prove that one can redefine $(S_{n})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

Note also that the fact that $n^{-1}\mathbb{E}_{\nu}\big{(}(S_{n}-n\lambda_{\mu})^{2}\big{)}\rightarrow\sigma^{2}$ as $n\rightarrow\infty$ comes from Theorem 2 (ii) in [7]. Now the strong invariance principle (23) is a direct application of Theorem 1. To see this, note first that the following estimate is valid (see Proposition 3 in [7]):

[TABLE]

Since $\big{(}\sup_{{\bar{x}},{\bar{y}}\in X}\mathbb{E}\big{(}\big{|}X_{k,{{\bar{x}}}}-X_{k,{{\bar{y}}}}\big{|}\big{)}_{k\geq 1}$ is non increasing, $\sup_{{\bar{x}},{\bar{y}}\in X}\mathbb{E}\big{(}\big{|}X_{k,{{\bar{x}}}}-X_{k,{{\bar{y}}}}\big{|}\big{)}\ll k^{-(p-1)}$ . Hence condition (11) holds with $q=p-1$ . To end the proof it suffices to notice that condition (12) also holds since, for any $k\geq 1$ ,

[TABLE]

$\square$

3.2 Contracting iterated random functions

3.2.1 Uniform contraction

Assume that there is a distance $d$ on $X$ , and that there exist $\kappa>0$ and $\rho\in(0,1)$ such that, for any $n\geq 1$ ,

[TABLE]

where $W_{n}^{*}$ is defined in (9). Note that condition (24) holds if the chain is “one step contracting” in the following sense

[TABLE]

Let us now define a class of observables from $G\times X$ to ${\mathbb{R}}$ for which one can easily compute the coefficient $\delta_{\infty}(n)$ . Let $\eta$ be a measurable function from $G$ to ${\mathbb{R}}^{+}$ such that ${\mathbb{E}}(\eta(\varepsilon_{0}))<\infty$ , and let $c$ be a concave non-decreasing function from ${\mathbb{R}}^{+}$ to ${\mathbb{R}}^{+}$ such that $c(0)=0$ .

One says that $h:G\times X\rightarrow{\mathbb{R}}$ belongs to the class ${\mathcal{L}}(\eta,c)$ if,

[TABLE]

Lemma 8

Assume that the stationary Markov chain $(W_{n})_{n\geq 0}$ satisfies the contraction condition (24), and let $(X_{n})_{n\geq 1}$ be defined by (8) for some $h\in{\mathcal{L}}(\eta,c)$ . Then, there exists a constant $A>0$ such that, for any $n\geq 1$ ,

[TABLE]

Proof. Let $A={\mathbb{E}}(\eta(\varepsilon_{0}))$ . Since $h$ belongs to ${\mathcal{L}}(\eta,c)$ , and since $c$ is concave,

[TABLE]

Hence, since $c$ is non-decreasing and $(W_{n})_{n\geq 0}$ satisfies (24),

[TABLE]

$\square$

Applying Theorem 1, the following result holds:

Corollary 9

Assume that the stationary Markov chain $(W_{n})_{n\geq 0}$ satisfies the contraction condition (24), and let $(X_{n})_{n\geq 1}$ be defined by (8) for some $h\in{\mathcal{L}}(\eta,c)$ . Assume moreover that ${\mathbb{E}}(\eta(\varepsilon_{1})^{p})<\infty$ for some $p>2$ , and that there exists $x_{0}\in X$ such that $\|c(d(W_{0},x_{0}))\|_{\infty}<\infty$ and ${\mathbb{E}}(|h(\varepsilon_{1},x_{0})|^{p})<\infty$ . If $c(\kappa\rho^{n})=O(n^{-q})$ for some $q>(p-1)/2$ , then the conclusion of Theorem 1 holds.

Remark 10

Note that Corollary 9 applies to a large class of continuous observales (as functions of $x$ ), including all Hölder observables (case where $c(x)=x^{\alpha}$ for some $\alpha\in(0,1)$ ). More precisely it applies to any concave non-decreasing function $c$ such that $c(x)\leq C|\ln(x)|^{-\gamma}$ in a neighborhood of [math], for some $\gamma>(p-1)/2$ .

Proof of Corollary 9. Applying Lemma 8, we infer that $\delta_{\infty}(n)=O(n^{-q})$ for some $q>(p-1)/2$ . Hence, if one can prove that

[TABLE]

for some finite constant $M$ , the result will follow directly from Theorem 1. To prove (25), we note that

[TABLE]

For the first term on the right-hand side of (26), we use the fact that $h\in{\mathcal{L}}(\eta,c)$ , which gives

[TABLE]

Under the assumptions of Corollary 9, it follows from (26) and (27) that the upper bound (25) holds. $\square$

3.2.2 ${\mathbb{L}}^{1}$ -contraction

Assume that there is a distance $d$ on $X$ , and that there exist $\kappa>0$ and $\rho\in(0,1)$ such that, for any $n\geq 1$ ,

[TABLE]

where $W_{n}^{*}$ is defined in (9). Note that condition (28) holds if the chain is “one step contracting” in the following sense:

[TABLE]

and

[TABLE]

Note also that, under the two conditions above, there exists an unique stationary distribution $\nu$ (see Theorem 2 of [25]).

Let us now define a class of observables from $G\times X$ to ${\mathbb{R}}$ for which one can easily compute the coefficients $\delta(n)$ . Let $c$ be a concave non-decreasing function from ${\mathbb{R}}^{+}$ to ${\mathbb{R}}^{+}$ such that $c(0)=0$ .

One says that $h:G\times X\rightarrow{\mathbb{R}}$ belongs to the class ${\mathcal{L}}(c)$ if,

[TABLE]

Lemma 11

Assume that the stationary Markov chain $(W_{n})_{n\geq 0}$ satisfies the contraction condition (28), and let $(X_{n})_{n\geq 1}$ be defined by (8) for some $h\in{\mathcal{L}}(c)$ . Then, for $n\geq 2$ ,

[TABLE]

Proof. Let $k\geq n\geq 2$ . Since $h$ belongs to ${\mathcal{L}}(c)$ , and since $c$ is concave,

[TABLE]

Hence, since $c$ is non-decreasing and $(W_{n})_{n\geq 0}$ satisfies (28),

[TABLE]

The result follows from the definition of $\delta(n)$ and the fact that $c$ is non-decreasing. $\square$

Recall that the function $Q$ and $H$ related to the tail function $t\mapsto{\mathbb{P}}(|X_{1}|>t)$ have been defined in Section 2. Combining Theorem 2 and Lemma 11, the following result holds:

Corollary 12

Assume that the stationary Markov chain $(W_{n})_{n\geq 0}$ satisfies the contraction condition (28), and let $(X_{n})_{n\geq 1}$ be defined by (8) for some $h\in{\mathcal{L}}(c)$ . Assume moreover that

[TABLE]

Then the conclusion of Theorem 2 holds.

Remark 13

From Remark 4, it follows that (29) holds as soon as

[TABLE]

The condition (30) is equivalent to the following integral condition on the function $c$

[TABLE]

3.3 Ergodic Markov chains

3.3.1 A discrete ergodic Markov chain example

Let $(\varepsilon_{i})_{i\in{\mathbb{Z}}}$ be a sequence of iid real-valued random variables distributed as $\varepsilon$ with

[TABLE]

Let $W_{0}$ be a random variable with values in ${\mathbb{N}}$ independent of $(\varepsilon_{i})_{i\in{\mathbb{Z}}}$ , and define for any $k\geq 1$ ,

[TABLE]

Hence $(W_{k},k\in{\mathbb{N}})$ is a Markov chain with state space ${\mathbb{N}}$ , initial distribution ${\mathcal{L}}(W_{0})$ and transition probabilities satisfying

[TABLE]

Assume that $p_{1}>0$ and $p_{n_{j}}>0$ along $n_{j}\rightarrow\infty$ . Then the chain $\{W_{k};k\geq 0\}$ is irreducible and aperiodic. Moreover, the stationary distribution exists if and only if $\mathbb{E}(\varepsilon)<\infty$ and is given by

[TABLE]

Corollary 14

Let $p>2$ and $f$ be a function from ${\mathbb{N}}$ to ${\mathbb{R}}$ such that $\nu(|f|^{r})<\infty$ with $r>p$ . Assume that

[TABLE]

*Then condition (21) is satisfied and the conclusions of Theorem 2 hold for $X_{n}=f(W_{n})$ where $(W_{n})_{n\geq 0}$ is the Markov chain defined by (31) with ${\mathcal{L}}(W_{0})=\nu$ . *

For bounded observables (case $r=\infty$ ), condition (32) reads as $\sum_{n\geq 1}n^{p}p_{n}<\infty$ . As we shall see in the proof of the next proposition (see (39)), $\sum_{n\geq 1}n^{p}p_{n}<\infty$ is equivalent to $\sum_{n\geq 1}n^{p-2}{\mathbb{P}}_{\nu\otimes\nu}(T^{*}\geq n)<\infty$ , where $T^{*}$ is the meeting time defined in (19). The next proposition shows that this latter condition is in some sense optimal.

Proposition 15

Let $p>2$ and $(W_{k})_{k\geq 0}$ be the Markov chain described above with $p_{k}:=1/(\zeta(p+1)k^{p+1})$ , $k\in{\mathbb{N}^{*}}$ , where $\zeta(p+1)=\sum_{k\geq 1}k^{-(p+1)}$ . Then

[TABLE]

Moreover, for any stationary and Gaussian centered sequence $(g_{k})_{k\in{\mathbb{Z}}}$ with convergent series of covariances,

[TABLE]

Proof of Corollary 14. Define

[TABLE]

By definition, $T^{*}\leq T_{0}^{*}$ . Hence for any $n\in{\mathbb{N}}$ ,

[TABLE]

Next, it is easy to see that for any $n\in{\mathbb{N}}$ ,

[TABLE]

with

[TABLE]

where $(W_{k}^{\prime},k\in{\mathbb{N}})$ is the Markov chain defined as follows: Let $(\varepsilon^{\prime}_{k})_{k\in{\mathbb{Z}}}$ be an independent copy of $(\varepsilon_{k})_{k\in{\mathbb{Z}}}$ and independent of $W_{0}$ . Let $W^{\prime}_{0}$ be independent of $(W_{0},(\varepsilon_{k})_{k\in{\mathbb{Z}}},(\varepsilon^{\prime}_{k})_{k\in{\mathbb{Z}}})$ and, for any $k\geq 1$ , set

[TABLE]

According to Lindvall [16], if $\mathbb{E}_{\nu}(\psi(\tau))<\infty$ where $\tau=\inf\{k\geq 1\,:\,W_{k}=0\}$ and $\psi$ is a non-decreasing function from ${\mathbb{N}}$ to $[2,\infty[$ such that $((\log(\psi(n))/n)_{n}$ is non-increasing and converges to [math], then $\mathbb{E}_{\nu\otimes\nu}(\psi(T^{\prime}_{0}))<\infty$ . Note now that

[TABLE]

Hence under (32), $\mathbb{E}_{\nu}(\psi_{r,p}(\tau))<\infty$ with $\psi_{r,p}(x)=x^{r(p-1)/(r-p)}$ . It follows that $\mathbb{E}_{\nu}(\psi_{r,p}(T^{\prime}_{0}))<\infty$ which in turn implies that $\mathbb{E}_{\nu}(\psi_{r,p}(T^{*}))<\infty$ by taking into account (35) and (36). Therefore condition (21) is satisfied and Corollary 5 applies. $\square$

Proof of Proposition 15. Note first that the following coupling inequality holds: for any $n\geq 1$ ,

[TABLE]

where $\|\mu\|_{v}$ denotes the total variation norm of a signed measure $\mu$ and $P$ is the transition function of the Markov chain $(W_{k})_{k\in{\mathbb{N}}}$ . But for any $n\geq 1$ , $\beta(n)\geq 2\alpha(n)$ where $(\alpha(n))_{n\geq 1}$ is the sequence of strong mixing coefficients of the chain which starts from the stationary distribution. As quoted in Chapter 30 of Bradley [6],

[TABLE]

It follows that for any $s\geq 0$ ,

[TABLE]

which together with the arguments developed in the proof of Corollary 14 show that

[TABLE]

This proves the first part of (33). To prove its second part, it suffices to use again the arguments developed in the proof of Corollary 14 and to notice that, for $p_{k}:=1/(\zeta(p+1)k^{p+1})$ , $k\in{\mathbb{N}^{*}}$ , the upper bound (37) entails that $\mathbb{E}_{\nu}(\psi_{p}(\tau))<\infty$ with $\psi_{p}(x)=\frac{x^{p-1}}{(\log(1+x))^{1+\varepsilon}}$ where $\varepsilon>0$ . This ends the proof of (33).

To prove the second part of the proposition, we shall use similar arguments as those developed in the proof of Theorem 2.2 in Dedecker-Merlevède-Rio [11] and adopt the following notations: the regeneration times $(T_{k})_{k\geq 0}$ of the Markov chain $(W_{k})_{k\geq 0}$ are defined by induction as follows: $R_{0}=\inf\{n>0\,:\,W_{n}=0\}$ and $R_{k}=\inf\{n>R_{k-1}\,:\,W_{n}=0\}$ . Let $\tau_{k}=R_{k+1}-R_{k}$ for $k\geq 0$ . Note that $(\tau_{k})_{k\geq 0}$ are iid and that their common law is the law of $R_{0}$ when the chain starts at zero. Note that

[TABLE]

Since the regeneration times $\tau_{k}$ are independent, by the converse Borel-Cantelli lemma, it follows that

[TABLE]

Now we take

[TABLE]

$f$ is obviously a bounded function and $\nu(g)=0$ . Note that, for any $\ell\geq 0$ ,

[TABLE]

Since $R_{n}/n$ converges to ${\mathbb{E}}(\tau_{0})$ almost surely, it follows that, for some positive constant $c$ depending on ${\mathbb{E}}(\tau_{0})$ ,

[TABLE]

Consider now a stationary and Gaussian centered sequence $(g_{k})_{k\in{\mathbb{Z}}}$ with convergent series of covariances. If follows from both the Borel-Cantelli lemma and the usual tail inequality for Gaussian random variables that, for any positive $\theta$ ,

[TABLE]

Taking $\theta=\nu_{0}/4$ in the above inequality and using (40), we then infer that

[TABLE]

which implies (34). $\square$

3.3.2 An example of ergodic Markov chain with continuous state space

In this section, we consider an homogenous Markov chain with state space $[0,1]$ and transition probability kernel $P(x,\cdot)$ given by

[TABLE]

where $\delta_{x}$ denotes the Dirac measure at point $x$ and

[TABLE]

Note that the chain is irreducible and aperiodic and admits a unique invariant probability measure $\nu$ given by

[TABLE]

As in Section 9.3 in Rio [23], we now construct a stationary Markov chain $(W_{n})_{n\in{\mathbb{N}}}$ with initial law $\nu$ and transition probability measure $P(x,\cdot)$ . Let $\xi_{0}$ be a random variable with law $\nu$ . We assume that the underlying probability space is rich enough to contain a sequence $(\varepsilon_{i})_{i\in{\mathbb{Z}}}:=(U_{i},V_{i})_{i\in{\mathbb{Z}}}$ of independent random variables with uniform law over $[0,1]^{2}$ , and that this random sequence is independent of $\xi_{0}$ . The stationary Markov chain $(W_{n})_{n\in{\mathbb{N}}}$ is then constructed via the following recursive equation: $W_{0}=\xi_{0}$ and, for any $k\geq 1$ ,

[TABLE]

where $F_{\pi}^{-1}$ is the inverse of the cumulative function of $\pi$ . It is easy to see that $(W_{n})_{n\in{\mathbb{N}}}$ is a Markov chain with initial distribution $\nu$ and transition probability kernel given by (41).

Corollary 16

Let $p>2$ and $(W_{k})_{k\in{\mathbb{N}}}$ be the stationary Markov chain defined by (42) with $a>p-1$ . Then condition (22) is satisfied and the conclusions of Theorem 2 hold for $X_{n}=f(W_{n})$ , for any bounded function $f$ defined on $[0,1]$ .

The proof of this corollary is a direct application of Corollary 5 by taking into account the following lemma whose proof is postponed to the Appendix (see Section 5.2).

Lemma 17

For any $a>1$ there exist positive constants $c(a)$ and $C(a)$ depending only on $a$ such that for any $n\geq 1$ ,

[TABLE]

where $T^{*}$ is the meeting time defined in (19).

In addition, this lemma together with Theorem 2.2 in Dedecker-Merlevède-Rio [11] proves the sharpness of condition (22) also in case of Markov chains with continuous state space. This is summarized in the next proposition.

Proposition 18

Let $p>2$ and $(W_{k})_{k\in{\mathbb{N}}}$ be the stationary Markov chain defined by (42) with $a=p-1$ . Then condition (22) fails. In addition, for any map $f$ from $[0,1]$ to ${\mathbb{R}}$ with continuous and strictly positive derivative $f^{\prime}$ on $[0,1]$ , and any stationary and Gaussian centered sequence $(g_{k})_{k\in{\mathbb{Z}}}$ with convergent series of covariances,

[TABLE]

3.4 Lipschitz autoregressive models

We consider the autoregressive Lipschitz model as in Dedecker-Rio [13]. Let $\tau\in[0,1)$ , $C\in(0,1]$ and $f\,:\,{\mathbb{R}}\to{\mathbb{R}}$ a $1$ -Lipschitz function such that

[TABLE]

Let $(\varepsilon_{i})_{i\geq 1}$ be iid real-valued random valued with common law $\mu$ and define for any $n\geq 1$

[TABLE]

Let $S_{n}(h)=\sum_{k=1}^{n}h(W_{i})$ for any measurable function $h$ .

The model above corresponds to the previously considered situation with $G={\mathbb{R}}$ and $F\,:\,{\mathbb{R}}\times{\mathbb{R}}\to{\mathbb{R}}$ given by $F(x,y)=x+f(y)$ , for every $x,y\in{\mathbb{R}}$ .

Let $S\geq 1$ and assume that $\mu$ admits a moment of order $S$ . It follows from Dedecker-Rio [13] that there exists a unique invariant probability $\nu$ on ${\mathbb{R}}$ , such that

[TABLE]

The following strong approximation with rates holds.

Corollary 19

Let $\tau\in(0,1)$ and assume that $\mu$ admits a moment of order $S=p+\tau p$ for some $p>2$ . Let $(W_{n})_{n\geq 0}$ be defined by (45) with ${\mathcal{L}}(W_{0})=\nu$ . Then, for any Lipschitz function $h$ such that $\nu(h)=0$ , $n^{-1}{\rm Var}(S_{n}(h))\rightarrow\sigma^{2}(h)$ as $n\rightarrow\infty$ and one can redefine $(W_{n})_{n\geq 0}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2}(h))$ , such that,

[TABLE]

Proof. The result comes from an application of Theorem 2 by taking into account Remark 4. As already mentionned, $\nu$ admits a moment of order $S-\tau=p+(p-1)\tau$ . Hence, one can prove that condition (18) holds with $r=p+\tau(p-1)$ , by using the last statement of the following lemma (taking $\gamma=(pr-2r+1)/(r-p)=-2+(S-1)/\tau$ ).

Lemma 20

Let $\gamma>-1$ and $t>0$ . Assume that $S\geq t+(\gamma+2)\tau$ . Then

[TABLE]

In particular, for any Lipschitz function $h$ , if $S\geq 1+(\gamma+2)\tau$ then $\sum_{n\geq 1}n^{\gamma}\delta(n)<\infty$ .

The proof of the lemma above is postponed to the Appendix (see Section 5.3).

4 Proofs of Theorems 1 and 2

The proofs of Theorems 1 and 2 follow the scheme of proof of Theorem 2.1 in Berkes-Liu-Wu [3] by applying the following general Proposition 21, which comes from a careful analysis of the proof of their strong approximation result. To state this general proposition several preliminary notations are needed.

A Preliminary result. For Proposition 21 below, we consider $(X_{k})_{k\geq 1}$ a strictly stationary sequence of real-valued random variables in ${\mathbb{L}}^{p}$ ( $p>2$ ) and $(\varepsilon_{i})_{i\geq 0}$ a sequence of iid random variables. Let $(M_{k})_{k\geq 1}$ be a sequence of positive real numbers and define

[TABLE]

Then, define

[TABLE]

Let now $(m_{k})_{k\geq 1}$ be a non-decreasing sequence of positive integers such that $m_{k}=o(3^{k})$ , as $k\rightarrow\infty$ , and define

[TABLE]

Finally set $k_{0}:=\inf\{k\geq 1\,:\,m_{k}\leq 2^{-1}3^{k-2}\}$ and define

[TABLE]

The general proposition coming from a careful analysis of the proof of Theorem 2.1 in Berkes-Liu- Wu [3] reads as follows

Proposition 21 (Berkes-Liu-Wu [3])

Let $p>2$ . Assume that we can find a sequence of positive real numbers $(M_{k})_{k\geq 1}$ a non-decreasing sequence of positive integers $(m_{k})_{k\geq 1}$ such that $m_{k}=o(3^{2k/p}k^{-1})$ , as $k\rightarrow\infty$ , in such a way that the following conditions are satisfied:

[TABLE]

there exists $\alpha\geq 1$ such that

[TABLE]

and there exists $r\in]2,\infty[$ such that

[TABLE]

Assume in addition that

[TABLE]

*and *

[TABLE]

Then, one can redefine $(X_{n})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exist iid random variables $(N_{i})_{i\geq 1}$ with common distribution ${\mathcal{N}}(0,\sigma^{2})$ , such that,

[TABLE]

Note that (54) implies that $n^{-1}{\rm Var}(S_{n})$ converges to $\sigma^{2}$ (which is therefore non-negative). Let us now briefly explain how the proposition follows from the work of Berkes-Liu-Wu [3].

Condition (51) together with condition (52) prove that it is enough to show (56) with

[TABLE]

instead of $S_{n}-n\mathbb{E}(X_{1})$ , where, for $n\geq 2$ , $h_{n}:=\lceil(\log n)/(\log 3)\rceil$ (so that $h_{n}$ is the unique integer such that $3^{h_{n}-1}<n\leq 3^{h_{n}}$ ). Next, condition (53) allows first to show that the proof of the proposition is reduced to prove (56) with $S_{n}^{\diamond}$ replacing $S_{n}-n\mathbb{E}(X_{1})$ where

[TABLE]

with $B_{k,j}=0$ if $k<k_{0}$ and for $k\geq k_{0}$ ,

[TABLE]

A careful analysis of the steps 3.2 and 3.3 of the proof of Theorem 2.1 in Berkes-Liu-Wu [3] reveals that condition (53) is also sufficient to apply Theorem 1 in Sakhanenko [24] (at different steps of their proof) and this leads to the following strong approximation result: one can redefine $(X_{n})_{n\geq 1}$ without changing its distribution on a (richer) probability space on which there exists a standard Brownian motion $B=\{B(t),t\in{\mathbb{R}}^{+}\}$ such that,

[TABLE]

where

[TABLE]

The last step 3.4 of their proof then consists in showing that one can construct another standard Brownian motion $W=\{W(t),t\in{\mathbb{R}}^{+}\}$ (depending on $B$ ) such that

[TABLE]

This step is achieved provided that we can prove that $\nu_{k}\rightarrow\sigma^{2}$ , $m_{k}=o(3^{2k/p}k^{-1})$ , as $k\rightarrow\infty$ , and condition (55) holds.

Some preliminary considerations. The following considerations allowing to extend the stationary sequence $(X_{n})_{n\geq 1}$ defined by (8) to a stationary sequence on ${\mathbb{Z}}$ will be useful.

For any $n\geq 1$ , let $V_{n}=(\varepsilon_{n},W_{n-1})$ . Hence $(X_{n})_{n\geq 1}$ is a functional of the Markov chain $(V_{n})_{n\geq 1}$ with state space $G\times X$ and stationary distribution $\mu\otimes\nu$ . The Markov chain $(V_{n})_{n\geq 1}$ being stationary, by Kolmogorov’s theorem, there exists a probability ${\hat{\mathbb{P}}}$ on the measurable space $({\hat{\Omega}},{\hat{\mathcal{F}}})=((G\times X)^{\mathbb{Z}},({\mathcal{B}}(G)\times{\mathcal{B}}(X))^{\mathbb{Z}})$ invariant by the shift ${\hat{\eta}}$ on ${\hat{\Omega}}$ and such that the law of the coordinate process $({\hat{V}}_{n}=({\hat{\varepsilon}}_{n},{\hat{W}}_{n-1}))_{n\in{\mathbb{Z}}}$ (with values in $G\times X$ ) under ${\hat{\mathbb{P}}}$ is the same as the one of $(V_{n})_{n\geq 1}$ under ${\mathbb{P}}_{\nu}$ . Hence, if we define for any integer $n$ , ${\hat{X}}_{n}:=h({\hat{V}}_{0})\circ{\hat{\eta}}^{n}$ , it follows that $({\hat{X}}_{n})_{n\in{\mathbb{Z}}}$ forms a stationary sequence with stationary distribution $\pi$ , whose law under ${\hat{\mathbb{P}}}$ is the same as the one of $(X_{n})_{n\geq 1}$ under ${\mathbb{P}}_{\nu}$ . To prove the theorem, it suffices then to prove that it holds for the extended sequence $({\hat{X}}_{n})_{n\in{\mathbb{Z}}}$ which is a stationary sequence adapted to the stationary filtration $({\widehat{\mathcal{F}}}_{n})_{n\in{\mathbb{Z}}}$ where ${\widehat{\mathcal{F}}}_{n}=\sigma({\hat{V}}_{k},k\leq n)$ To avoid additional notations, in the rest of the proof we write $(X_{n})_{n\in{\mathbb{Z}}}$ for $({\hat{X}}_{n})_{n\in{\mathbb{Z}}}$ , $(V_{n})_{n\in{\mathbb{Z}}}$ for $({\hat{V}}_{n})_{n\in{\mathbb{Z}}}$ and $({\mathcal{F}}_{n})_{n\in{\mathbb{Z}}}$ for $({\widehat{\mathcal{F}}}_{n})_{n\in{\mathbb{Z}}}$ .

4.1 Proof of Theorem 1

By the reverse martingale convergence theorem and stationarity, $\|\mathbb{E}(X_{n}|{\mathcal{F}}_{0})-\mathbb{E}(X_{n})\|_{2}$ is decreasing to $\|\mathbb{E}(X_{0}|{\mathcal{F}}_{-\infty})-\mathbb{E}(X_{0})\|_{2}$ , as $n\rightarrow\infty$ . Hence, by condition (11), $\mathbb{E}(X_{0}|{\mathcal{F}}_{-\infty})=\mathbb{E}(X_{0})$ a.s. Applying Lemma 22 of the Appendix and taking into account condition (11), we get (since $q>1/2$ ),

[TABLE]

This proves that the series $\sigma^{2}={\rm Var}(X_{1})+2\sum_{i\geq 1}{\rm Cov}(X_{1},X_{i+1})$ converge absolutely and condition (54) of Proposition 21 holds.

Assume first that $\sigma^{2}=0$ . To prove that $S_{n}-n\mathbb{E}(X_{1})=o(n^{1/p})$ a.s., we shall use Theorem 4.7 in Cuny-Merlevède [8]. Hence, it suffices to prove that

[TABLE]

With this aim, we start by noticing that by condition (11),

[TABLE]

Theorem 2.3 in [8] then asserts that there exists a stationary sequence $(D_{k})_{k\in{\mathbb{Z}}}$ of martingale differences in ${\mathbb{L}}^{p}$ , adapted to $({\mathcal{F}}_{k})_{k\in{\mathbb{Z}}}$ and such that $n^{-1/2}\|S_{n}-n\mathbb{E}(X_{1})-\sum_{k=1}^{n}D_{k}\|_{p}\rightarrow 0$ , as $n\rightarrow\infty$ . Together with the fact that $\lim_{n\rightarrow\infty}n^{-1}{\rm Var}(S_{n})=\sigma^{2}=0$ , it follows that $D_{k}=0$ a.s, for any $k$ . Therefore, the upper bound (4) in [8] and condition (11) entail that

[TABLE]

which proves (57) since $q+2/p^{2}-1>(2p^{2})^{-1}(p^{3}-3p^{2}+4)=(2p^{2})^{-1}(p-2)^{2}(p+1)>0$ . The theorem is then proved in the case where $\sigma^{2}=0$ .

Assume from now that $\sigma^{2}>0$ . We choose

[TABLE]

Note that the sequence $(m_{k})_{k\geq 0}$ satisfies $m_{k}=o(3^{2k/p}k^{-1})$ , as $k\rightarrow\infty$ . We prove below that conditions (51), (52), (53) and (55) of Proposition 21 are satisfied with the above choices of $(M_{k})_{k\geq 0}$ and $(m_{k})_{k\geq 0}$ .

Since the $X_{i}$ ’s are in ${\mathbb{L}}^{p}$ , it is easy to see that with the choice of $M_{k}$ , condition (51) is satisfied (it suffices to write that $\mathbb{E}(|g_{k}(X_{1})|)\leq\mathbb{E}(|X_{1}|{\mathbf{1}}_{|X_{1}|>M_{k}})$ and to use Fubini’s Theorem). Next, for $k\geq k_{0}$ , Lemma 24 of the Appendix combined with condition (11) implies that

[TABLE]

Therefore,

[TABLE]

since $2q(1-\varepsilon)>p-1$ . Condition (52) is then satisfied with $\alpha=1$ . We prove now that we can find a real number $r\in]2,\infty[$ such that (53) holds. Let $r\geq 2$ ,

[TABLE]

and

[TABLE]

With these notations, we have

[TABLE]

By Rosenthal’s inequality for martingales,

[TABLE]

Note that

[TABLE]

where ${\mathcal{H}}_{k,i}=\sigma(\varepsilon_{i+3^{k-1}},\ldots,\varepsilon_{i+3^{k-1}-m_{k}})$ . Here, recall the following well known fact: if $Y$ is an integrable random variable, and ${\mathcal{G}}_{1}$ and ${\mathcal{G}}_{2}$ are two $\sigma$ -algebras such that $\sigma(Y)\vee{\mathcal{G}}_{1}$ is independent of ${\mathcal{G}}_{2}$ , then

[TABLE]

Applying (59) with ${\mathcal{G}}_{1}=\sigma(\varepsilon_{i+3^{k-1}-1},\ldots,\varepsilon_{i+3^{k-1}-m_{k}})$ , ${\mathcal{G}}_{2}={\mathcal{G}}_{k,i-m_{k}-1}$ and $Y=\mathbb{E}(X^{2}_{i+3^{k-1}}|{\mathcal{H}}_{k,i})$ , we get

[TABLE]

Hence, by assumption (12),

[TABLE]

On another hand, by stationarity,

[TABLE]

So, overall,

[TABLE]

We handle now the second term in the right-hand side of (58). We apply Proposition 23 of the Appendix with $\alpha=r$ , $r=r_{k}$ where $r_{k}$ is the unique positive integer such that $2^{r_{k}-1}\leq 3m_{k}<2^{r_{k}}$ ,

[TABLE]

and

[TABLE]

We then get

[TABLE]

where $T_{\ell}=\sum_{i=1}^{\ell}\mathbb{E}(Y_{k,i}|{\mathcal{G}}_{k,i-1})$ . By fact (59), we note that, for any $i\geq 1$ ,

[TABLE]

Therefore, by condition (12),

[TABLE]

Next, since ${\mathcal{F}}_{i}=\{\emptyset,\Omega\}$ for $i\leq 0$ and the $Z_{i}$ ’s are centered , for any $\ell\geq 0$ ,

[TABLE]

Moreover, for any $m\geq 2$ and any $\ell\geq 0$ ,

[TABLE]

But, for any $m\geq 2$ , any $\ell\geq 0$ and any $i\geq(m-1)2^{\ell}+1$ ,

[TABLE]

Hence, if $2^{\ell}\geq m_{k}$ ,

[TABLE]

and if $2^{\ell}\leq m_{k}-1$ , by using (59),

[TABLE]

But, by using stationarity, the Markov property and the fact that $\varphi_{k}$ is 1-Lipschitz,

[TABLE]

Hence, for any $m\geq 2$ , any $\ell\geq 0$ and any $i\geq(m-1)2^{\ell}+1$ ,

[TABLE]

Since $q>1/2$ , the above considerations imply that

[TABLE]

Combined with (61) and (62), the upper bound above implies that

[TABLE]

Hence, starting from (58) and taking into account (60) and (64), we get that for any $r\geq 2$ ,

[TABLE]

This implies that (53) holds with $r>\max\big{\{}2,\varepsilon^{-1}\big{(}p-2(1-\varepsilon)\big{)}\big{\}}$ .

To end the proof it remains to prove condition (55). Note first that since $\sigma^{2}$ is assumed to be strictly positive, we have

[TABLE]

and therefore condition (55) reads as

[TABLE]

To verify condition (65), let us define, for $i\geq 0$ ,

[TABLE]

Using stationarity, we have

[TABLE]

Therefore

[TABLE]

We first prove that

[TABLE]

With this aim we use the arguments developed in [3] to get their inequality (3.56). Hence, we start by noting that since $\varphi_{k}$ is $1$ -Lipschitz, $\big{(}\|\mathbb{E}(\varphi_{k}(X_{n})|{\mathcal{F}}_{0})-\mathbb{E}(\varphi_{k}(X_{n})\|_{2})\big{)}_{n\geq 0}$ is a decreasing sequence such that $\|\mathbb{E}(\varphi_{k}(X_{n})|{\mathcal{F}}_{0})-\mathbb{E}(\varphi_{k}(X_{n})\|_{2}\leq\delta_{\infty}(n)$ . Hence, by the same arguments as those developed in the first lines of the proof of Theorem 1, we infer that, under condition (11), there exists a constant $C$ not depending on $k$ such that $\sum_{\ell\in{\mathbb{Z}}}|{\hat{c}}_{k,\ell}|\leq C$ . Therefore, $\lim_{j\rightarrow\infty}j^{-1}\mathbb{E}({W}^{2}_{k,j})={\hat{c}}_{k,0}+2\sum_{\ell\geq 1}{\hat{c}}_{k,\ell}$ . On another hand, the following convergence clearly holds: $\lim_{j\rightarrow\infty}j^{-1}\mathbb{E}({\widetilde{W}}^{2}_{k,j})=\nu_{k}$ . In addition, for all $j\geq 1$ ,

[TABLE]

The above considerations imply

[TABLE]

To take care of $\|{\widetilde{W}}_{k,j}-W_{k,j}\|_{2}$ , we apply Proposition 23 of the Appendix with, this time, $\alpha=2$ , $r=r_{j}$ where $r_{j}$ is the unique positive integer such that $2^{r_{j}-1}\leq j<2^{r_{j}}$ ,

[TABLE]

and

[TABLE]

Hence

[TABLE]

where $T_{\ell}=\sum_{i=1}^{\ell}(X_{k,i+3^{k-1}}-{\tilde{X}}_{k,i+3^{k-1}})$ . Lemma 24 of the Appendix combined with condition (11) implies that

[TABLE]

Next, since ${\mathcal{F}}_{i}=\{\emptyset,\Omega\}$ for $i\leq 0$ , for any $\ell\geq 0$ ,

[TABLE]

Moreover, by (63), we infer that for any $m\geq 2$ , any $\ell\geq 0$ and any $i\geq(m-1)2^{\ell}+1$ ,

[TABLE]

On another hand, Lemma 24 of the Appendix combined with condition (11) implies that

[TABLE]

Hence, for any $m\geq 2$ , any $\ell\geq 0$ and any $i\geq(m-1)2^{\ell}+1$ , we also have

[TABLE]

The considerations above imply that, for any $m\geq 2$ , any $\ell\geq 0$ and any $i\geq(m-1)2^{\ell}+1$ ,

[TABLE]

Hence, since $q>1/2$ ,

[TABLE]

Starting from (69) and considering the upper bounds (70) and (71), we get

[TABLE]

Hence starting from (68) and taking into account (72) together with the fact that $3^{k/(2p)}m_{k}^{-q/2}\leq 2^{q/2}$ , the upper bound (67) follows.

Let now $c_{i}={\rm Cov}(X_{0},X_{i})$ and note that (see Relation (3.54) in [3], where the same truncation level is used)

[TABLE]

Let

[TABLE]

Since $\sigma^{2}=c_{0}+2\sum_{i\geq 1}c_{i}$ , it follows that

[TABLE]

But

[TABLE]

Set $g_{k}(x)=x-\varphi_{k}(x)$ and note that, by the reverse martingale convergence theorem and condition (11), $\mathbb{E}(g_{k}(X_{0})|{\mathcal{F}}_{-\infty})=\mathbb{E}(g_{k}(X_{0}))$ a.s. and $\mathbb{E}(X_{0}|{\mathcal{F}}_{-\infty})=\mathbb{E}(X_{0})$ a.s. Hence, applying Lemma 22 of the Appendix and taking into account condition (11), we get

[TABLE]

where $P_{0}(\cdot)=\mathbb{E}(\cdot|{\mathcal{F}}_{0})-\mathbb{E}(\cdot|{\mathcal{F}}_{-1})$ . But, by Lemma 22 of the Appendix,

[TABLE]

Note now that, since $q>1/2$ ,

[TABLE]

So, overall,

[TABLE]

Next, we note that

[TABLE]

and that, for $j\geq 1$ , by condition (11),

[TABLE]

Hence, since $q>1/2$ , we infer that

[TABLE]

We handle now the series

[TABLE]

Applying again Lemma 22 of the Appendix, we first write that

[TABLE]

By condition (11) and since $q>1/2$ ,

[TABLE]

So, taking into account (75) and the fact that $q>1/2$ ,

[TABLE]

Considering the upper bounds (73), (76) and (77), we then derive

[TABLE]

which combined with (67) gives

[TABLE]

Let us verify that (65) holds, namely:

[TABLE]

The choice of $\ell_{k}$ implies that $\ell_{k}3^{-k(p-2)/p}=3^{-k(p-2)/(2p)}(\log k)^{-1/2}$ and $\ell_{k}^{1/2-q}=o((\log k)^{-1/2})$ (since $q>1/2$ ). Moreover, when $q>1$ , we clearly have $3^{k(p-2)/(2p)}\ell_{k}^{1-2q}=o((\log k)^{-1/2})$ and $3^{k(p-2)/(2p)}3^{-k(p-1)/(2p)}\ell_{k}^{(1-q)/2}=o((\log k)^{-1/2})$ . It is also clear that $3^{k(p-2)/(2p)}k3^{-k(p-1)/(2p)}=o((\log k)^{-1/2})$ . Next, since $q>(p-1)/2$ ,

[TABLE]

proving (since $p>2$ ) that $3^{k(p-2)/(2p)}3^{-k(p-1)(2q-1)/(2pq)}{\bf 1}_{q<1}=o((\log k)^{-1/2})$ . Also, since $p>2$ ,

[TABLE]

which proves that $3^{k(p-2)/(2p)}\ell_{k}^{1-3q/2}3^{-k(p-1)/(2p)}{\bf 1}_{(p-1)/2<q<1}=o((\log k)^{-1/2})$ . Next, we note that

[TABLE]

since $\varepsilon<1-\frac{p-1}{2q}$ .

Now, if $q=1$ then $p<3$ (since $q>(p-1)/2$ ). Hence since $\varepsilon<1/2$ , we get that $3^{k(p-2)/(2p)}m_{k}^{-1/2}(\log m_{k}){\bf 1}_{q=1}=o((\log k)^{-1/2})$ . Finally, using again that $q>(p-1)/2$ and that $\varepsilon<1/2$ , we derive that $3^{k(p-2)/(2p)}m_{k}^{-q+1/2}{\bf 1}_{q<1}=o((\log k)^{-1/2})$ . This ends the proof of (65) and then of the theorem. $\square$

4.2 Proof of Theorem 2

By Remark 3, we know that condition (14) is equivalent to (17), namely:

[TABLE]

where, for any $u\in[0,1]$ , $\gamma^{-1}(u)=\delta^{-1}\circ H(u)$ and $R(u)=\gamma^{-1}(u)Q(u)$ .

Notice first that, by Proposition 1 in Dedecker-Doukhan [10],

[TABLE]

by condition (17). Hence the series $\sigma^{2}={\rm Var}(X_{1})+2\sum_{i\geq 1}{\rm Cov}(X_{1},X_{i+1})$ converge absolutely and condition (54) of Proposition 21 holds.

Assume first that $\sigma^{2}>0$ . To prove the theorem, we shall verify that the other conditions of Proposition 21 are satisfied and with this aim we need to define suitable sequences $(m_{k})$ and $(M_{k})$ . Since we have ${\rm Var}(S_{n})/n\rightarrow\sigma^{2}>0$ , it follows that ${\rm Var}(S_{n})\rightarrow\infty$ . Hence ${\mathbb{P}}(|X_{1}|>0)>0$ since otherwise we would have $X_{1}=0$ a.s. and then $S_{n}=0$ a.s. for all $n\geq 1$ , contradicting the fact that ${\rm Var}(S_{n})\rightarrow\infty$ . Let $u_{1}=(1/2){\mathbb{P}}(|X_{1}|>0)$ (hence $u_{1}>0$ ) and define

[TABLE]

Obviously $K_{0}<\infty$ since $u_{1}>0$ which implies that $Q(u_{1})<\infty$ and $\gamma^{-1}(u_{1})<\infty$ . Next, for any $k\geq K_{0}$ , let

[TABLE]

and $M_{k}=1$ for $0\leq k<K_{0}$ . Since $u_{1}<{\mathbb{P}}(|X_{1}|>0)$ , it follows that $Q(u_{1})>0$ and therefore since $Q$ is non-increasing and $v_{k}\leq u_{1}$ , $M_{k}\geq Q(u_{1})>0$ , for $k\geq K_{0}$ . Let now, for any $k\geq K_{0}$ ,

[TABLE]

and $m_{k}=1$ for any $1\leq k<K_{0}$ . Since $v_{k}$ is assumed to be strictly less than $1$ (since $v_{k}\leq u_{1}\leq 1/2$ ), $m_{k}\geq 1$ (indeed $\gamma(0)=H^{-1}(\mathbb{E}(|X_{1}|))=1$ ). In addition, since $R$ is right continuous and non-increasing, $u<R^{-1}(x)\iff R(u)>x$ . Hence, $R(R^{-1}(u))\leq u$ for all $u\in[0,1]$ , implying that

[TABLE]

Therefore, for any $k\geq K_{0}$ , since $M_{k}\geq Q(u_{1})>0$ ,

[TABLE]

which proves that $m_{k}=o(3^{2k/p}k^{-1})$ , as $k\rightarrow\infty$ .

To prove now that the conditions (51), (52), (53) and (55) of Proposition 21 are satisfied, we first notice the following useful facts:

[TABLE]

Let us start by proving that condition (51) holds. By using (79), we get

[TABLE]

But

[TABLE]

by condition (17) (which is equivalent to condition (14)). Hence condition (51) is satisfied. Next we note that by Lemma 24 of the Appendix,

[TABLE]

Therefore, by using (80),

[TABLE]

Hence, condition (52) is satisfied with $\alpha=1$ . We prove now that we can find a real number $r\in]2,\infty[$ such that (53) holds. With this aim we start by noticing that, for any $r\geq 1$ , by Lemma 24 of the Appendix,

[TABLE]

Hence, since $m_{k}M_{k}\leq 3^{k/p}$ , for any $r\geq 1$ ,

[TABLE]

which is finite by taking into account (81). Hence to prove that condition (53) holds, it suffices to prove that we can find a real number $r\in]2,\infty[$ such that

[TABLE]

To prove (82), we apply the Rosenthal inequality for $\tau$ -dependent sequences as given in Corollary 1 in Dedecker-Prieur [12]. Let us first recall the definition of the $\tau$ -dependence coefficients: for any random variable $Y$ with values in ${\mathbb{R}}^{\ell}$ and any $\sigma$ -algebra ${\mathcal{F}}$ ,

[TABLE]

where, for any integer $\ell\geq 1$ , ${\Lambda_{1}}({\mathbb{R}}^{\ell})$ is the set of $1$ -Lipschitz function from ${\mathbb{R}}^{\ell}$ to ${\mathbb{R}}$ with respect to the norm $|x-y|_{1}\leq\sum_{k=1}^{\ell}|x_{i}-y_{i}|$ . Taking ${\mathcal{F}}_{p}=\sigma(X_{i},i\leq p)$ , the coefficients $\tau(i)$ of the sequence $(\varphi_{k}(X_{i}))_{i\in{\mathbb{Z}}}$ are then defined by: for any $i\geq 0$ ,

[TABLE]

In the stationary case, Corollary 1 in Dedecker-Prieur [12] implies that, for any $r>2$ ,

[TABLE]

where ${\tau}^{-1}$ is the generalized inverse of the function $\tau$ defined by $\tau(x)=\tau([x])$ .

To compare the coefficients $\tau(i)$ with the coefficients $\delta(i)$ , we consider $(W_{0}^{\prime},(\varepsilon_{j}^{\prime})_{j\geq 1})$ an independent copy of $(W_{0},(\varepsilon_{j})_{j\geq 1})$ and define $W_{1}^{\prime}=F(\varepsilon^{\prime}_{1},W_{0}^{\prime})$ and $W_{m}^{\prime}=F(\varepsilon_{m},W_{m-1}^{\prime})$ for any $m\geq 2$ . Note that for any $j\geq 2$ , by using the relation (97) of the Appendix, we have

[TABLE]

Define now, for any $j\geq 2$ ,

[TABLE]

Clearly for any $2\leq j_{1}<\ldots<j_{\ell}$ , $(\varphi_{k}(X^{\prime}_{j_{1}}),\ldots,\varphi_{k}(X^{\prime}_{j_{\ell}}))$ is distributed as $(\varphi_{k}(X_{j_{1}}),\ldots,\varphi_{k}(X_{j_{\ell}}))$ and is independent of $(\varepsilon_{0},W_{-1})$ . Hence, by stationarity and Lemma 3 in Dedecker-Prieur [12],

[TABLE]

where the second inequality comes from the fact that $f\in{\Lambda_{1}}({\mathbb{R}}^{\ell})$ and $\varphi_{k}$ is $1$ -Lipschitz. Therefore, since $\delta$ is non-increasing, for any $i\geq 2$ ,

[TABLE]

Moreover, for any $i\in\{0,1\}$ , we obviously get that $\tau(i)\leq 2\mathbb{E}(|X_{1}|)=2\delta(0)$ . It follows that for any $x\geq 0$ ,

[TABLE]

Therefore, since both $\tau$ and $\delta$ are non-increasing,

[TABLE]

In addition, since $\varphi_{k}$ is $1$ -Lipschitz and such that $\varphi_{k}(0)=0$ ,

[TABLE]

since $H$ is non-decreasing. Therefore, using additionally the fact that $u<v\iff Q_{|\varphi_{k}(X_{1})|}(v)<Q_{|\varphi_{k}(X_{1})|}(u)$ , we get

[TABLE]

and then, since $\gamma^{-1}(u)=\delta^{-1}\circ H(u)$ ,

[TABLE]

Recall now that $m_{k}=\gamma^{-1}(v_{k})$ , therefore since $\gamma^{-1}$ is non-increasing,

[TABLE]

Using also the fact that $Q_{|\varphi_{k}(X_{1})|}(x)=Q(x\vee v_{k})$ , we get

[TABLE]

Using the fact that $m_{k}Q(v_{k})\leq 3^{k/p}$ and (80), we get that, for any $r>2$ ,

[TABLE]

On another hand, for any $r>p$ ,

[TABLE]

by condition (17) (which is equivalent to condition (14)). Finally using again that $m_{k}Q(v_{k})\leq 3^{k/p}$ , we derive that, for any $r>2(p-1)$ ,

[TABLE]

since condition (17) obviously implies that $\int_{0}^{1}\gamma^{-1}(u)Q^{1+2/r}(u)du<\infty$ . So, overall, (82) holds provided we select $r>2(p-1)$ .

To end the proof it remains to show that condition (55) holds. With this aim, we start by recalling the equation (66), namely:

[TABLE]

where, for $i\geq 0$ ,

[TABLE]

But, by using Lemma 24 of the Appendix, we have, for any $i\geq 0$ ,

[TABLE]

Hence, since $m_{k}Q(v_{k})\leq 3^{k/p}$ ,

[TABLE]

by condition (17) (which is equivalent to condition (14)). Taking into account (86) together with the fact that $\sigma^{2}=\sum_{k\in{\mathbb{Z}}}{\rm cov}(X_{0},X_{k})$ , we get

[TABLE]

Next, by using Proposition 1 in Dedecker-Doukhan [10], we derive

[TABLE]

But, since $m_{k}=\gamma^{-1}(v_{k})$ , note that

[TABLE]

Hence

[TABLE]

by condition (17). On another hand, by using inequality (1.11a) in [23] and (79), we derive that, for any $i\geq 0$ ,

[TABLE]

Hence, by taking into account (88),

[TABLE]

So, by the computations in (89),

[TABLE]

Hence, starting from (87) and taking into account (89) and (90), it follows that

[TABLE]

implying, since $p>2$ , that

[TABLE]

This proves that (65) holds and then that (55) is satisfied since $\sigma^{2}>0$ . The proof is complete for the case $\sigma^{2}>0$ .

Assume now that $\sigma^{2}=0$ . Let $M$ be a positive real number. According to inequality (5.42) in Merlevède-Rio [21], for any positive integer $n$ , any real number $\lambda$ , and any positive integer $q\leq n$ and such that $qM\leq\lambda$ , we have

[TABLE]

Choose now $u=R^{-1}(\lambda)$ , $q=\gamma^{-1}(u)\wedge n$ and $M=Q(u)$ . Since $R$ is right continuous, we have $R(u)\leq\lambda$ , hence $qM\leq R(u)\leq\lambda$ . Note also that

[TABLE]

In addition,

[TABLE]

Since $\gamma(q)\leq u$ , it follows that

[TABLE]

Starting from (91) and taking into account the considerations above, we get that, for any $\lambda>0$ ,

[TABLE]

Hence, for any $\varepsilon>0$ , selecting $\lambda=\varepsilon n^{1/p}$ , we derive

[TABLE]

The second series in the right-hand side is finite under condition condition (17) (which is equivalent to condition (14)). Hence, if we can prove that

[TABLE]

then we will get that, for any $\varepsilon>0$ ,

[TABLE]

which will imply $S_{n}-n\mathbb{E}(X_{1})=o(n^{1/p})$ a.s. and therefore the proof of the theorem will be complete. In the case where $p\geq 3$ , (93) is almost immediate. To see this, we first note that condition (17) implies $\sum_{i\geq 1}i|{\rm Cov}(X_{0},X_{i})|<\infty$ . Indeed, by Proposition 1 in Dedecker-Doukhan [10],

[TABLE]

which is finite under condition (17). Therefore, by Lemma 1 in Bradley [5], ${\rm Var}(S_{n})$ is bounded which obviously entails (93). To handle the case where $p\in]2,3[$ , we first note that, by inequality (4.84) in [19],

[TABLE]

But, $\|\mathbb{E}(X_{k}|V_{0})-\mathbb{E}(X_{k})\|_{1}\leq 2\delta(k)$ . Hence

[TABLE]

Hence condition (14) entails

[TABLE]

which implies (since $p>2$ ) that

[TABLE]

We use now the same arguments as developed at the beginning of the proof of Theorem 1. The fact that the series in (94) converge implies that there exists a stationary sequence $(D_{k})_{k\in{\mathbb{Z}}}$ of martingale differences in ${\mathbb{L}}^{2}$ , adapted to $({\mathcal{F}}_{k})_{k\in{\mathbb{Z}}}$ and such that

[TABLE]

Together with the fact that $\lim_{n\rightarrow\infty}n^{-1}{\rm Var}(S_{n})=\sigma^{2}=0$ , it follows that $D_{k}=0$ a.s, for any $k$ . Hence, using the upper bound (4) in Cuny-Merlevède [8] (see also Proposition 1 in [18]), it follows that, for any $p\in]2,3[$ ,

[TABLE]

Therefore, for any $p\in]2,3[$ ,

[TABLE]

which is finite since $p+2/p-3=p^{-1}(p-1)(p-2)>0$ . This ends the proof of the theorem. $\square$

5 Appendix

5.1 Some technical results

In this section, we collect some technical results that are useful for the proofs of Theorems 1 and 2.

Lemma 22

Let $(Y_{k})_{k\in{\mathbb{Z}}}$ be a stationary sequence of real-valued random variables adapted to an increasing and stationary filtration $({\mathcal{F}}_{k})_{k\in{\mathbb{Z}}}$ . Let $f$ and $g$ be two functions in ${\mathbb{L}}^{2}({\mathbb{R}},P_{Y_{0}})$ such that $\mathbb{E}(f(Y_{0})|{\mathcal{F}}_{-\infty})=\mathbb{E}(f(Y_{0}))$ a.s. and $\mathbb{E}(g(Y_{0})|{\mathcal{F}}_{-\infty})=\mathbb{E}(g(Y_{0}))$ a.s. Then, for any positive integer $L$ ,

[TABLE]

and

[TABLE]

where $P_{j}(\cdot)=\mathbb{E}(\cdot|{\mathcal{F}}_{j})-\mathbb{E}(\cdot|{\mathcal{F}}_{j-1})$ .

Proof. Since $\mathbb{E}(f(Y_{0})|{\mathcal{F}}_{-\infty})=\mathbb{E}(f(Y_{0}))$ a.s. and $\mathbb{E}(g(Y_{0})|{\mathcal{F}}_{-\infty})=\mathbb{E}(g(Y_{0}))$ a.s., we first write

[TABLE]

Hence, by orthogonality, for any $i\geq 0$ ,

[TABLE]

and then, by Cauchy-Schwarz’s inequality and stationarity,

[TABLE]

But, for any $m\geq 1$ , by Cauchy-Schwarz’s inequality,

[TABLE]

giving

[TABLE]

Since $(\|\mathbb{E}(g(Y_{k})|{\mathcal{F}}_{0})-\mathbb{E}(g(Y_{k}))\|_{2})_{k\geq 0}$ is non-increasing, we get that for any $m\geq 1$ ,

[TABLE]

which combined with (95) gives the first inequality of the lemma. To prove the second one, it suffices to write that $\sum_{i=0}^{L}\|P_{0}(g(Y_{i}))\|_{2}=\sum_{i=0}^{L}(i+1)^{-1}\|P_{0}(g(Y_{i}))\|_{2}\big{(}\sum_{k=1}^{i+1}1\big{)}$ and to use Cauchy-Schwarz’s inequality as in (96). $\square$

The following proposition is a non stationary version of the Peligrad-Utev-Wu [22] inequality. As in [22], the proof can be done by induction (a complete proof appears in Section 3.2.1 of [20]).

Proposition 23

Let $\alpha\geq 2$ and $(Z_{k})_{k\in{\mathbb{Z}}}$ be a sequence of real-valued random variables in ${\mathbb{L}}^{\alpha}$ and adapted to a non-decreasing filtration $(\mathcal{F}_{k})_{k\in{\mathbb{Z}}}$ . Then, for any $n\geq 1$ ,

[TABLE]

where $S_{k}=\sum_{i=1}^{k}Z_{i}$ , $c_{\alpha}=\frac{\alpha}{(\alpha-1)^{1/2}}$ if $\alpha>2$ , $c_{2}=1$ and $r$ is the unique positive integer such that $2^{r-1}\leq n<2^{r}$ .

Lemma 24

For any $q\in[1,p)$ , for any $k\geq 1$ and any $j\geq m_{k}+1$ ,

[TABLE]

where $X_{k,j}$ and ${\tilde{X}}_{k,j}$ are defined in (48) and (49) respectively.

Proof. Let $(W_{0}^{\prime},(\varepsilon_{j}^{\prime})_{j\geq 1})$ be an independent copy of $(W_{0},(\varepsilon_{j})_{j\geq 1})$ and define $W_{j}^{\prime}=F(\varepsilon^{\prime}_{j},W^{\prime}_{j-1})$ , $j\geq 1$ . For $\ell\geq 1$ , let $F_{\ell}$ be the function from $G^{\ell}\times X$ to $X$ defined in an iterative way as follows

[TABLE]

Note that for any integer $\ell$ such that $1\leq\ell\leq j-1$ ,

[TABLE]

Hence, for any $j\geq m_{k}+1$ ,

[TABLE]

On another hand, for any $j\geq 1$ ,

[TABLE]

Hence, for any $j\geq m_{k}+1$ ,

[TABLE]

where the second inequality comes from the fact that $\varphi_{k}$ is $1$ -Lipschitz. By stationarity, it follows that

[TABLE]

Hence, if we define $(X^{*}_{n})_{n\geq 1}$ by

[TABLE]

with $W^{*}_{0}$ independent of $(W_{0},(\varepsilon_{k})_{k\geq 1})$ and such that $W^{*}_{0}=^{\mathcal{L}}W_{0}$ , we get that for any $j\geq m_{k}+1$ ,

[TABLE]

But,

[TABLE]

which combined with (98) gives the lemma. $\square$

5.2 Proof of Lemma 17

The first inequality in (43) comes from the coupling inequality (38) and the fact that $\liminf_{n\rightarrow\infty}n^{a}\beta(n)>0$ (see Theorem 9.4 in Rio [23]). We prove now the second inequality in (43).

Let $W_{n,x}$ be the chain starting at $x$ . Note first that for any any $x,y\in[0,1]$ ,

[TABLE]

But

[TABLE]

For $j>i$ , define ${\mathcal{W}}_{i,j}:=\bigcap_{k=i}^{j}\{W_{k,x}\neq W_{k,y}\}$ , ${\mathcal{E}}_{i,j}(x):=\bigcap_{k=i}^{j}\{W_{k,x}=F_{\pi}^{-1}(V_{i})\}$ , and note that

[TABLE]

So, overall, setting $w_{i}(x,y):={\mathbb{P}}_{x,y}(T^{*}>i,\,W_{i,x}=F_{\pi}^{-1}(V_{i}))$ ,

[TABLE]

Using the fact that for any $b>-1$ ,

[TABLE]

we get that

[TABLE]

By easy computations (that are left to the reader), we infer that Lemma 17 will hold provided one can prove that:

Lemma 25

For any $a>1$ , there exists a positive constant $\kappa(a)$ depending only on $a$ such that for any $n\geq 1$ ,

[TABLE]

Obviously, inequality (101) holds for any positive integer $n\leq\kappa(a)$ . It is then enough to prove it for $n>\kappa(a)$ . Let us do it by recurrence. Hence we assume that for any $k\leq n-1$ , $\displaystyle\nu\otimes\nu(w_{k})\leq\kappa(a)k^{-a}$ and we want to prove it at step $n$ . With this aim, we argue as above and infer that

[TABLE]

Hence,

[TABLE]

Using the recurrence assumption, it follows that

[TABLE]

Then, taking into account (99), we infer that

[TABLE]

So, overall, since $n\geq\kappa(a)$ , we get

[TABLE]

where

[TABLE]

So choosing $\kappa(a)$ large enough so that $\rho(a)\leq 1$ (which is always possible since $a^{-1}<1$ ), inequality (101) is proved at step $n$ which ends the recurrence. $\square$

5.3 Proof of Lemma 20

We start by recalling the inequality line 5 page 27 of Dedecker-Rio [13], which holds for every $x,y\in{\mathbb{R}}$ , every $n\geq 1$ and any $t>0$ :

[TABLE]

where $\alpha(u)=1-\frac{C}{(1+u)^{\tau}}$ , for every $u\geq 0$ , $\Sigma_{0}=0$ and $\Sigma_{n}=|\varepsilon_{1}|+\cdots|\varepsilon_{n}|$ , for every $n\geq 1$ .

Denote $\upsilon:=\mathbb{E}(|\varepsilon_{1}|)$ and let $0<\eta\leq 1/\tau-1$ . Notice that $\alpha$ is non-decreasing and bounded by $1$ . Hence, for any $n\geq 1$ , using that $n\leq n^{1/\tau-\eta}$ , we get

[TABLE]

By Theorems 3 and 4 in Baum and Katz [1], since $\mu$ has a moment of order $S$ ,

[TABLE]

provided that $\gamma\leq S(1/\tau-\eta)-2$ . Since $S/\tau-2\geq t/\tau+\gamma$ , the latter holds as soon as $\eta\leq t/(S\tau)$ . Hence, we choose $\eta=\min(t/(S\tau),1/\tau-1)$ . On another hand,

[TABLE]

Finally,

[TABLE]

where $D$ is a constant depending on $\gamma$ , $t$ and $C$ . Starting from (102) and taking into account (103), (104) and (105) together with the fact that, by (46), $\nu$ has a moment of order $S-\tau$ and that $S-\tau\geq\tau(\gamma+1)+t$ , we get the first part of the lemma.

To prove the last statement, it suffices to notice that for any Lipschitz function $h$ with Lipschitz coefficient equal to $C$ , we have, for any $n\geq 2$ ,

[TABLE]

Next simple arguments entail that, for any $n\geq 2$ ,

[TABLE]

$\square$

Acknowledgement. The second author is very thankful to the laboratories MAP5 and LAMA for their invitations that made possible the present collaboration.

Bibliography25

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Baum, L. and Katz, M. Convergence rates in the law of large numbers. Bull. Amer. Math. Soc. 69 (1963), 771-772.
2[2] Y. Benoist and J.-F. Quint, Central limit theorem for linear groups , Ann. Probab. 44 (2016), no. 2, 1308-1340.
3[3] Berkes, I., Liu, W. and Wu, W. B. Komlós-Major-Tusnády approximation under dependence. Ann. Probab. 42 (2014), no. 2, 794-817.
4[4] P. Bougerol and J. Lacroix, Products of random matrices with applications to Schrödinger operators. Progress in Probability and Statistics, 8. Birkhäuser Boston, Inc., Boston, MA, 1985
5[5] Bradley, R. C. On quantiles and the central limit question for strongly mixing sequences. J. Theor. Probab. 10 (1997), 507-555.
6[6] Bradley, R. C. Introduction to strong mixing conditions . Vol. 1,2,3. Kendrick Press, Heber City, UT, 2007.
7[7] Cuny, C., Dedecker, J. and Jan, C. Limit theorems for the left random walk on G L d ( ℝ ) 𝐺 subscript 𝐿 𝑑 ℝ GL_{d}({\mathbb{R}}) . (2017). hal-01283929 . To appear in Ann. Inst. H. Poincaré Probab. Statist.
8[8] Cuny, C. and Merlevède, F. On martingale approximations and the quenched weak invariance principle. Ann. Probab. 42 (2014), no. 2, 760-793.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On the Komlós, Major and Tusnády strong approximation for some classes of random iterates

Abstract

1 Introduction

2 Main results

Theorem 1

Theorem 2

Remark 3

Remark 4

Corollary 5

3 Applications

3.1 Left random

Corollary 6

Remark 7

3.2 Contracting iterated random functions

3.2.1 Uniform contraction

Lemma 8

Corollary 9

Remark 10

3.2.2 L1{\mathbb{L}}^{1}L1-contraction

Lemma 11

Corollary 12

Remark 13

3.3 Ergodic Markov chains

3.3.1 A discrete ergodic Markov chain example

Corollary 14

Proposition 15

3.3.2 An example of ergodic Markov chain with continuous state space

Corollary 16

Lemma 17

Proposition 18

3.4 Lipschitz autoregressive models

Corollary 19

Lemma 20

4 Proofs of Theorems 1 and 2

Proposition 21** (Berkes-Liu-Wu [3])**

4.1 Proof of Theorem 1

4.2 Proof of Theorem 2

5 Appendix

5.1 Some technical results

Lemma 22

Proposition 23

Lemma 24

5.2 Proof of Lemma 17

Lemma 25

5.3 Proof of Lemma 20

3.2.2 ${\mathbb{L}}^{1}$ -contraction

Proposition 21 (Berkes-Liu-Wu [3])