Sunklodas' approach to normal approximation for time-dependent dynamical systems

Juho Leppänen; Mikko Stenlund

arXiv:1906.03217·math.DS·October 28, 2020

TL;DR

This paper develops a normal approximation method for sums in time-dependent dynamical systems, providing explicit error bounds and applying Stein's method to systems like expanding maps and intermittent systems.

Contribution

It introduces a new normal approximation approach for time-dependent systems with explicit error rates and constants, extending Stein's method to this context.

Findings

01

Error in approximation decays at rates $O(N^{-1/2})$ or $O(N^{-1/2} \log N)$

02

Conditions depend on the normalizing sequence $b(N)$ and metric used

03

Applications include expanding maps and intermittent systems

Abstract

We consider time-dependent dynamical systems arising as sequential compositions of self-maps of a probability space. We establish conditions under which the Birkhoff sums for multivariate observations, given a centering and a general normalizing sequence $b(N)$ of invertible square matrices, are approximated by a normal distribution with respect to a metric of regular test functions. Depending on the metric and the normalizing sequence $b(N)$ , the conditions imply that the error in the approximation decays either at the rate $O(N^{-1/2})$ or the rate $O(N^{-1/2}\log N)$ , under the additional assumption that $\|b(N)^{-1}\|\lesssim N^{-1/2}$ . The error comes with a multiplicative constant whose exact value can be computed directly from the conditions. The proof is based on an observation due to Sunklodas regarding Stein's method of normal approximation. We give applications to…

Equations

$$W=W(N)=\sum_{n=1}^{N-1}b^{-1}\bigl(f\circ{\mathcal{T}}_{n}-\mu(f\circ{\mathcal{T}}_{n})\bigr)$$

$$\sup_{h\in{\mathcal{H}}}|\mu[h(W)-\Phi_{\Sigma}(h)]|,$$

$$\operatorname{tr}\Sigma D^{2}A(w)-w^{T}\nabla A(w)=h(w)-\Phi_{\Sigma}(h)\qquad(w\in{\mathbb{R}}^{d})$$

$$\text{(1)}\leq\sup_{A\in{\mathcal{A}}}|\mu[\operatorname{tr}\Sigma D^{2}A(W)-W^{T}\nabla A(W)]|.$$

$$W^{n,K}=\sum_{0\leq i\leq N-1:\,|i-n|>K}b^{-1}\bigl(f\circ{\mathcal{T}}_{i}-\mu(f\circ{\mathcal{T}}_{i})\bigr)$$

$$\|D^{k}A\|_{\infty}=\max\{\|\partial_{1}^{t_{1}}\cdots\partial_{d}^{t_{d}}A_{\alpha}\|_{\infty}:t_{1}+\cdots+t_{d}=k,\ 1\leq\alpha\leq d'\}.$$

$$\|A\|_{s}=\sup\{\|Ax\|:\|x\|=1\},$$

$$f^{i}=g^{i}\circ T_{i}\circ\cdots\circ T_{1}\quad\text{and}\quad\bar{f}^{i}=f^{i}-\mu(f^{i}).$$

$$W=W(N)=\sum_{i=0}^{N-1}b^{-1}\bar{f}^{i}.$$

$$\bar{f}^{n,k}=\sum_{0\leq i<N:\,|i-n|=k}\bar{f}^{i}.$$

$$\Sigma=\operatorname{Cov}_{\mu}(W)=\mu(W\otimes W).$$

$$G_{h}(x,y)=b^{-1}\bigl[D^{2}h(sb^{-1}(x+ty)+z)-D^{2}h(sb^{-1}x+z)\bigr]b^{-1},$$

$$\|F\|_{\infty}=\sup\{\|F(x,y)\|_{s}:(x,y)\in{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\}$$

$$\|\nabla F\|_{\infty}=\max_{1\leq i\leq 2d}\sup\{\|\partial_{i}F(x,y)\|_{s}:(x,y)\in{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\},$$

$$|\mu(\bar{f}_{\alpha}^{n}\bar{f}_{\beta}^{m})|\leq C_{1}\rho(|n-m|).$$

$$\Bigl|\mu\Bigl[(\bar{f}^{n})^{T}G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{2}\bigl(\|G_{h}\|_{\infty}+\|\nabla G_{h}\|_{\infty}\bigr)\rho(m).$$

$$\Bigl|\mu\Bigl[(\bar{f}^{n})^{T}\,\overline{G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)}\,\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{3}\bigl(\|G_{h}\|_{\infty}+\|\nabla G_{h}\|_{\infty}\bigr)\rho(k-m).$$

$$|\mu[h(W)]-\Phi_{\Sigma}(h)|\leq C_{*}\|D^{3}h\|_{\infty}N\|b^{-1}\|_{s}^{3}\sum_{m=1}^{N-1}m\rho(m),$$

$$C_{*}=M^{3}d^{4}\,10\,(C_{1}+C_{2}+C_{3}+4).$$

$$\bar{f}^{n}\quad\text{and}\quad G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}$$

$$G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\quad\text{and}\quad\bar{f}^{n}\otimes\bar{f}^{n,m}$$

$$d_{\mathscr{W}}(Y_{1},Y_{2})=\sup_{h\in\mathscr{W}}|\mu(h(Y_{1}))-\mu(h(Y_{2}))|,$$

$$\mathscr{W}=\{h:{\mathbb{R}}^{d}\to{\mathbb{R}}:|h(x)-h(y)|\leq\|x-y\|\}$$

$$\operatorname{Lip}(G)=\sup_{x\neq y}\frac{|G(x)-G(y)|}{\|x-y\|}.$$

$$\Bigl|\mu\Bigl[\bar{f}^{n}\,G\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{2}\bigl(\|G\|_{\infty}+\operatorname{Lip}(G)\bigr)\rho(m).$$

$$\Bigl|\mu\Bigl[\bar{f}^{n}\bar{f}^{n,m}\,\overline{G\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)}\Bigr]\Bigr|\leq C_{3}\bigl(\|G\|_{\infty}+\operatorname{Lip}(G)\bigr)\rho(k-m).$$

$$d_{\mathscr{W}}(W,Z)\leq C_{*}Nb^{-3}\sum_{m=1}^{N-1}m\rho(m),$$

$$C_{*}=96M^{3}(C_{1}+C_{2}+C_{3}+1),$$

$$b=\Bigl[\operatorname{Var}_{\mu}\Bigl(\sum_{i=0}^{N-1}\bar{f}^{i}\Bigr)\Bigr]^{1/2}.$$

$$d_{\mathscr{W}}\Bigl(c^{-1}\sum_{i=0}^{N-1}\bar{f}^{i},\,c^{-1}bZ\Bigr)\leq C_{*}Nc^{-1}b^{-2}\sum_{m=1}^{N-1}m\rho(m).$$

Full text

Sunklodas’ approach to normal approximation for time-dependent dynamical systems

Juho Leppänen

LPSM, Laboratoire de Probabilités, Statistique et Modélisation, Sorbonne Université, 4 Place Jussieu, 75252 Paris, France

and

Mikko Stenlund

Department of Mathematics and Statistics, P.O. Box 68, Fin-00014 University of Helsinki, Finland.

http://www.helsinki.fi/~stenlund/

Abstract.

We consider time-dependent dynamical systems arising as sequential compositions of self-maps of a probability space. We establish conditions under which the Birkhoff sums for multivariate observations, given a centering and a general normalizing sequence $b(N)$ of invertible square matrices, are approximated by a normal distribution with respect to a metric of regular test functions. Depending on the metric and the normalizing sequence $b(N)$ , the conditions imply that the error in the approximation decays either at the rate $O(N^{-1/2})$ or the rate $O(N^{-1/2}\log N)$ , under the additional assumption that $\|b(N)^{-1}\|\lesssim N^{-1/2}$ . The error comes with a multiplicative constant whose exact value can be computed directly from the conditions. The proof is based on an observation due to Sunklodas regarding Stein’s method of normal approximation. We give applications to one-dimensional random piecewise expanding maps and to sequential, random, and quasistatic intermittent systems.

Key words and phrases:

Stein’s method, multivariate normal approximation, time-dependent dynamical system, intermittency

2010 Mathematics Subject Classification. 60F05; 37A05, 37A50, 37C60

Acknowledgements

JL was supported by DOMAST (University of Helsinki) and by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 787304). He would like to thank Viviane Baladi for helpful discussions. MS was supported by Emil Aaltosen Säätiö and the Jane and Aatos Erkko Foundation.

1. Introduction

In this note we revisit the topic of statistical limit laws by Stein’s method for dynamical systems, studied previously in [30, 28, 29, 35, 19, 10, 46, 20, 24, 26]. We consider discrete time-dependent dynamical systems described by sequential compositions ${\mathcal{T}}_{n}=T_{n}\circ\cdots\circ T_{1}$ , where each $T_{i}:X\to X$ is a transformation of a probability space $(X,{\mathcal{B}},\mu)$ . The measure $\mu$ is not assumed to be invariant under any of the maps $T_{i}$ . Given a bounded observable $f:X\to{\mathbb{R}}^{d}$ and a sequence $b=b(N)\in{\mathbb{R}}^{d\times d}$ of invertible matrices, we are interested in approximating the law of the sums

$$W=W(N)=\sum_{n=1}^{N-1}b^{-1}\bigl(f\circ{\mathcal{T}}_{n}-\mu(f\circ{\mathcal{T}}_{n})\bigr)$$

by a multivariate normal distribution. More precisely, we want to identify conditions that cover a wide range of chaotic time-dependent systems and imply a good upper bound on

$$\sup_{h\in{\mathcal{H}}}|\mu[h(W)-\Phi_{\Sigma}(h)]|,\tag{1}$$

where ${\mathcal{H}}$ is a class of regular test functions $h:{\mathbb{R}}^{d}\to{\mathbb{R}}$ , and $\Phi_{\Sigma}(h)$ denotes the expectation of $h$ with respect to the multivariate normal distribution ${\mathcal{N}}(0,\Sigma)$ with covariance matrix $\Sigma=\text{Cov}_{\mu}(W)=\mu(W\otimes W)$ .

Since its introduction in [50], Stein’s method has seen extensive development in the literature of probability theory. In the present context of dynamical systems, the simple basic idea of the method can be described as follows. If for each test function $h\in{\mathcal{H}}$ the solution $A:{\mathbb{R}}^{d}\to{\mathbb{R}}$ to the differential equation (called a Stein equation)

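$$\operatorname{tr}\Sigma D^{2}A(w)-w^{T}\nabla A(w)=h(w)-\Phi_{\Sigma}(h)\qquad(w\in{\mathbb{R}}^{d})\tag{2}$$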

lies in another class of functions ${\mathcal{A}}$ , then it follows that

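$$\text{(1)}\leq\sup_{A\in{\mathcal{A}}}|\mu[\operatorname{tr}\Sigma D^{2}A(W)-W^{T}\nabla A(W)]|.\tag{3}$$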

In this way the original problem of approximating the law of $W$ by ${\mathcal{N}}(0,\Sigma)$ is reduced to bounding the right hand side of (3), which interestingly only depends on the law of $W$ and the class ${\mathcal{A}}$ . It was observed in [30, 28] that, when $b(N)=\sqrt{N}I_{d\times d}$ , Taylor expanding $\nabla A(W)$ about the punctured sums

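$$W^{n,K}=\sum_{0\leq i\leq N-1:\,|i-n|>K}b^{-1}\bigl(f\circ{\mathcal{T}}_{i}-\mu(f\circ{\mathcal{T}}_{i})\bigr)$$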

with a suitably chosen $K=K(N)\gg 1$ leads to certain correlation-decay conditions for an upper bound on $|\mu[\operatorname{tr}\Sigma D^{2}A(W)-W^{T}\nabla A(W)]|$ . Such an approach calls for bounds on partial derivatives of $A$ , which are known to follow from bounds on partial derivatives of $h$ . In [30, 28], ${\mathcal{H}}$ was taken to be the class of three times differentiable functions with bounded derivatives in the case of a general $d>1$ , and the class of Lipschitz continuous functions in the case $d=1$ .
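The reduction described above rests on the fact that the left-hand side of the Stein equation has zero mean under ${\mathcal{N}}(0,\Sigma)$ . The following is a minimal numerical sketch of that fact for $d=1$ , not code from the paper: the helper names `gaussian_expectation` and `stein_operator`, the test function $A(w)=e^{w/2}$ , and the use of the trapezoid rule are all illustrative assumptions.

```python
import math


def gaussian_expectation(func, variance, half_width=12.0, steps=20_000):
    """Trapezoid-rule approximation of E[func(Z)] for Z ~ N(0, variance)."""
    sigma = math.sqrt(variance)
    lo = -half_width * sigma
    dx = 2.0 * half_width * sigma / steps
    norm = math.sqrt(2.0 * math.pi * variance)
    total = 0.0
    for j in range(steps + 1):
        w = lo + j * dx
        weight = 0.5 if j in (0, steps) else 1.0  # endpoint weights of the rule
        total += weight * func(w) * math.exp(-w * w / (2.0 * variance)) / norm
    return total * dx


def stein_operator(a_prime, a_second, sigma2):
    """Return w -> sigma2 * A''(w) - w * A'(w), the d = 1 Stein operator."""
    return lambda w: sigma2 * a_second(w) - w * a_prime(w)


# Hypothetical test function A(w) = exp(w / 2), given through its derivatives.
a_prime = lambda w: 0.5 * math.exp(0.5 * w)
a_second = lambda w: 0.25 * math.exp(0.5 * w)

# Operator variance matches the law of Z: the expectation vanishes.
matched = gaussian_expectation(stein_operator(a_prime, a_second, 2.0), 2.0)
# Operator variance 2 but Z ~ N(0, 1): the expectation is exp(1/8) / 4, not zero.
mismatched = gaussian_expectation(stein_operator(a_prime, a_second, 2.0), 1.0)
```

In this toy setting the size of the Stein expression measures how far the law of the input is from ${\mathcal{N}}(0,\sigma^{2})$ , which is the quantity that the right hand side of (3) controls uniformly over the class ${\mathcal{A}}$ .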

The approach described above was applied in [30] to stationary Sinai billiards and in [28] to time-dependent smooth uniformly expanding circle maps. Both systems are (the latter in a certain sense) exponentially mixing, which is essentially the reason why replacing $W$ with $W^{n,K}$ in the application of Stein’s method causes only a small error. Indeed, upper bounds of order $O(N^{-1/2}\log N)$ on (1) for sufficiently regular observables could be obtained this way. While such a “fixed gap” approach works also for polynomially mixing systems, it yields a larger error depending on the rate of mixing. This can be seen from the results of [29], where time-dependent systems in the spirit of [1, 43] described by sequential compositions $T_{\alpha_{n}}\circ\cdots\circ T_{\alpha_{1}}$ of polynomially mixing intermittent maps $T_{\alpha_{n}}:[0,1]\to[0,1]$ with parameters $0\leq\alpha_{n}\leq\beta_{*}<1/3$ were considered. Under the condition that $\Sigma=\text{Cov}_{\mu}(W)$ is positive definite, an upper bound of order $O(N^{\beta_{*}-1/2}(\log N)^{1/\beta_{*}})$ was obtained for Lipschitz continuous observables. The result was used to establish central limit theorems for quasistatic and random compositions of intermittent maps.

The purpose of the present note is to describe an adaptation of Stein’s method that is more suitable than those of [30, 28] for normal approximation of polynomially mixing systems, and to investigate some of its implications. The starting point is a decomposition of $\mu[\operatorname{tr}\Sigma D^{2}A(W)-W^{T}\nabla A(W)]$ due to Sunklodas [56], which allows us to identify correlation-decay conditions that imply a rate of decay for (1) depending on the “growth of $b(N)$ ”. In the case of a general $b(N)$ such that $\|b(N)^{-1}\|\lesssim N^{-1/2}$ , the conditions yield the rate $O(N^{-1/2})$ for a class of smooth test functions ${\mathcal{H}}$ , and in the special self-norming case $b(N)=[\text{Cov}_{\mu}(\sum_{n<N}(f\circ{\mathcal{T}}_{n}-\mu(f\circ{\mathcal{T}}_{n})))]^{1/2}$ the rate $O(N^{-1/2}\log N)$ for Lipschitz continuous test functions. A key ingredient in the proof of the latter estimate is a recent result due to Gallouët–Mijoule–Swan [16] concerning the regularity of solutions to the Stein equation. As applications we establish rates of convergence in the central limit theorem for the random piecewise expanding model studied by Dragičević et al. in [12] and for sequential, random, and quasistatic intermittent systems. The results for intermittent systems notably improve those of [29].

Statistical properties of time-dependent dynamical systems have been studied in several previous works including [54, 2, 15, 33, 57, 34, 53]. Central limit theorems were shown by Bakhtin [3, 4], Conze–Raugi [8], and more recently by Nándori–Szász–Varjú [41] and Nicol–Török–Vaienti [43]. Heinrich [27] showed a Berry-Esseen type upper bound for sequences of uniformly expanding interval maps admitting a Markov partition. Haydn–Nicol–Török–Vaienti [25] established almost sure invariance principles (ASIP) for piecewise-expanding and other related models, also in higher dimension. ASIPs were obtained also by Castro–Rodrigues–Varandas [6] for convergent sequences of Anosov diffeomorphisms and expanding maps on compact Riemannian manifolds. Recently Su [55] proved a vector valued ASIP for a general class of polynomially mixing time-dependent systems. Among its many implications is a self-norming CLT for the sequential intermittent system with $\beta_{*}<1/2$ , under a (polynomial) variance growth condition. Finally, Hafouta [23] showed several limit theorems, including a Berry-Esseen theorem and a local limit theorem, for sequential compositions of maps belonging to a certain class of distance expanding maps of a compact metric space.

Notation.

For a function $A:\,{\mathbb{R}}^{d}\to{\mathbb{R}}$ , we write $D^{k}A$ for the $k$ th derivative of $A$ , and also denote $\nabla A=D^{1}A$ . We define

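$$\|D^{k}A\|_{\infty}=\max\{\|\partial_{1}^{t_{1}}\cdots\partial_{d}^{t_{d}}A_{\alpha}\|_{\infty}:t_{1}+\cdots+t_{d}=k,\ 1\leq\alpha\leq d'\}.$$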

The spectral norm of a matrix $A\in{\mathbb{R}}^{d\times d}$ is denoted by

$$\|A\|_{s}=\sup\{\|Ax\|:\|x\|=1\},$$

where $\|\cdot\|$ is the Euclidean norm of ${\mathbb{R}}^{d}$ . We use $B_{d}(x,r)$ to denote the open ball in ${\mathbb{R}}^{d}$ with center $x$ and radius $r>0$ .

Given a measure space $(X,{\mathcal{B}},\mu)$ and a $\mu$ -integrable function $f:X\to{\mathbb{R}}^{d}$ we set $\mu(f)=\int f\,d\mu$ . The components of $f$ are denoted by $f_{\alpha}$ , where $\alpha\in\{1,\ldots,d\}$ . The Lebesgue measure is denoted by $m$ .

For two vectors $v,w\in{\mathbb{R}}^{d}$ we set $v\otimes w=[v_{\alpha}w_{\beta}]_{\alpha,\beta}$ .

We denote by $C$ a generic positive constant whose value might change from one line to the next. We use $C(a_{1},\ldots,a_{n})$ to denote a positive constant that depends only on the parameters $a_{1},\ldots,a_{n}$ .

Structure of the paper

In Section 2 we present our main results concerning normal approximation of abstract discrete time-dependent dynamical systems. Sections 3 and 4 contain applications to one-dimensional dynamics. The model of Section 3 is a random dynamical system of piecewise smooth uniformly expanding maps, while in Section 4 we consider sequential, quasistatic, and random intermittent systems. Finally, in Section 5 we prove the main results.

2. Main results

Consider a sequence $(T_{n})_{n\geq 1}$ of measurable maps $T_{n}:X\to X$ of a probability space $(X,{\mathcal{B}},\mu)$ . For each $i\geq 0$ let $g^{i}:X\to{\mathbb{R}}^{d}$ be a bounded measurable function and define

$$f^{i}=g^{i}\circ T_{i}\circ\cdots\circ T_{1}\quad\text{and}\quad\bar{f}^{i}=f^{i}-\mu(f^{i}).$$

Given $N\geq 1$ and an invertible matrix $b=b(N)\in{\mathbb{R}}^{d\times d}$ , we write

$$W=W(N)=\sum_{i=0}^{N-1}b^{-1}\bar{f}^{i}.$$

Given also $n,k\geq 0$ , we write

$$\bar{f}^{n,k}=\sum_{0\leq i<N:\,|i-n|=k}\bar{f}^{i}.$$

The covariance matrix of $W$ is denoted by

$$\Sigma=\operatorname{Cov}_{\mu}(W)=\mu(W\otimes W).$$
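To make the setup concrete, here is a small Monte Carlo sketch of the normalized sum $W$ for $d=1$ . It is an illustration under stated assumptions rather than the paper's construction: the maps $T_{i}$ are taken to be intermittent maps of Liverani–Saussol–Vaienti type with parameters `alphas`, the measure $\mu$ is taken to be Lebesgue measure on $[0,1]$ , a single observable `g` is used for every $g^{i}$ , and both the centering $\mu(f^{i})$ and the normalization $b$ are replaced by sample estimates. The names `lsv_map` and `normalized_sums` are hypothetical.

```python
import math
import random


def lsv_map(x, alpha):
    """Intermittent map of Liverani-Saussol-Vaienti type on [0, 1] (assumed example)."""
    if x < 0.5:
        return x * (1.0 + (2.0 * x) ** alpha)
    return 2.0 * x - 1.0


def normalized_sums(alphas, g, n_samples, seed=0):
    """Monte Carlo samples of W = b^{-1} * sum_{i<N} (f^i - mu(f^i)) for d = 1.

    f^0 = g and f^i = g o T_i o ... o T_1 with T_i = lsv_map(., alphas[i - 1]),
    so N = len(alphas) + 1. mu(f^i) and b are estimated from the sample.
    """
    rng = random.Random(seed)
    n_terms = len(alphas) + 1
    rows = []
    for _ in range(n_samples):
        x = rng.random()            # initial point drawn from mu (Lebesgue)
        row = [g(x)]                # f^0(x) = g(x)
        for alpha in alphas:
            x = lsv_map(x, alpha)   # apply the next map in the composition
            row.append(g(x))
        rows.append(row)
    # Center each f^i by its sample mean, then sum over i for every sample.
    means = [sum(row[i] for row in rows) / n_samples for i in range(n_terms)]
    sums = [sum(row[i] - means[i] for i in range(n_terms)) for row in rows]
    # Self-normalization: b is the sample standard deviation of the centered sum.
    b = math.sqrt(sum(s * s for s in sums) / n_samples)
    return [s / b for s in sums]
```

A call such as `normalized_sums([0.1, 0.3] * 24 + [0.1], lambda x: math.cos(2 * math.pi * x), 500)` returns 500 samples of $W$ with $N=50$ ; by construction they have sample mean zero and unit second moment, and a histogram of them can be compared with the standard normal density.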

2.1. General normalization

First we consider a general invertible $b=b(N)$ and give conditions that imply an upper bound on the distance between the law of $W$ and the normal distribution ${\mathcal{N}}(0,\Sigma)$ with respect to a smooth metric.

Suppose that $\|g^{i}\|_{\infty}=\sup_{x\in X}\|g^{i}(x)\|\leq M$ for all $0\leq i\leq N-1$ . Then, given a smooth test function $h:{\mathbb{R}}^{d}\to{\mathbb{R}}$ and $(s,t,z)\in[0,1]^{2}\times{\mathbb{R}}^{d}$ we define the matrix-valued function $G_{h}=G_{h}^{(s,t,z)}:{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\to{\mathbb{R}}^{d\times d}$ by

$$G_{h}(x,y)=b^{-1}\bigl[D^{2}h(sb^{-1}(x+ty)+z)-D^{2}h(sb^{-1}x+z)\bigr]b^{-1},$$

where $D^{2}h(x)=[\partial_{\alpha}\partial_{\beta}h(x)]_{\alpha,\beta}$ . For a differentiable function $F:{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\to{\mathbb{R}}^{d\times d}$ we set

$$\|F\|_{\infty}=\sup\{\|F(x,y)\|_{s}:(x,y)\in{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\}$$

and

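$$\|\nabla F\|_{\infty}=\max_{1\leq i\leq 2d}\sup\{\|\partial_{i}F(x,y)\|_{s}:(x,y)\in{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\},$$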

where $\partial_{i}F(x,y)=[\partial_{i}F_{\alpha,\beta}(x,y)]_{\alpha,\beta}$ .

Here is the first main result:

Theorem 2.1.

Fix $N\geq 1$ and let $h:\,{\mathbb{R}}^{d}\to{\mathbb{R}}$ be three times differentiable with $\|D^{p}h\|_{\infty}<\infty$ for $1\leq p\leq 3$ . Suppose $M=\max_{i<N}\|g^{i}\|_{\infty}<\infty$ , and that there exist a function $\rho:{\mathbb{N}}\to{\mathbb{R}}_{+}$ with $\lim_{n\to\infty}\rho(n)=0$ and constants $C_{i}>0$ , $1\leq i\leq 3$ , such that the following conditions hold for all $0\leq n,m\leq N-1$ :

(A1)

For all $\alpha,\beta\in\{1,\ldots,d\}$ ,

$$|\mu(\bar{f}_{\alpha}^{n}\bar{f}_{\beta}^{m})|\leq C_{1}\rho(|n-m|).$$

(A2)

Whenever $(s,t,z)\in[0,1]^{2}\times{\mathbb{R}}^{d}$ and $m\leq k\leq N-1$ ,

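$$\Bigl|\mu\Bigl[(\bar{f}^{n})^{T}G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{2}\bigl(\|G_{h}\|_{\infty}+\|\nabla G_{h}\|_{\infty}\bigr)\rho(m).$$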

(A3)

Whenever $(s,t,z)\in[0,1]^{2}\times{\mathbb{R}}^{d}$ and $2m\leq k\leq N-1$ ,

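$$\Bigl|\mu\Bigl[(\bar{f}^{n})^{T}\,\overline{G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)}\,\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{3}\bigl(\|G_{h}\|_{\infty}+\|\nabla G_{h}\|_{\infty}\bigr)\rho(k-m).$$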

(A4)

The matrix $\Sigma=\mu(W\otimes W)$ is positive definite.

Then

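$$|\mu[h(W)]-\Phi_{\Sigma}(h)|\leq C_{*}\|D^{3}h\|_{\infty}N\|b^{-1}\|_{s}^{3}\sum_{m=1}^{N-1}m\rho(m),\tag{4}$$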

where

$$C_{*}=M^{3}d^{4}\,10\,(C_{1}+C_{2}+C_{3}+4).$$

Here $\Phi_{\Sigma}(h)$ denotes the expectation of $h$ with respect to ${\mathcal{N}}(0,\Sigma)$ .

Remark 2.2.

Theorem 2.1, as well as Theorems 2.3 and 2.6 given below, continue to hold if $f^{i}$ are replaced with general random vectors.

We postpone proving Theorem 2.1 and other results in this section until Section 5. Due to the smooth metric, the constant $C_{*}$ in the upper bound (4) is independent of the covariance matrix $\Sigma$ . Note that under the additional assumptions $\sum_{m=1}^{\infty}m\rho(m)<\infty$ and $\|b^{-1}\|_{s}\lesssim N^{-1/2}$ we obtain $|\mu[h(W)]-\Phi_{\Sigma}(h)|=O(N^{-1/2})$ as $N\to\infty$ , which is the optimal rate in this generality. Conditions (A1)-(A3) are designed for time-dependent systems with sufficiently good (polynomial) mixing properties. Condition (A1) requires the decay of non-stationary correlations at the rate $\rho$ . Condition (A2) requires that, for large $m$ , the random vectors

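$$\bar{f}^{n}\quad\text{and}\quad G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}$$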

are componentwise nearly uncorrelated. This is reasonable because the function on the right depends on $\bar{f}^{i}$ with $|i-n|\geq m$ only. The function $G_{h}$ is differentiable and its $C^{1}$ norm appears as a factor in the upper bound. Condition (A3) is similar in spirit to condition (A2), for it requires

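$$G_{h}\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\quad\text{and}\quad\bar{f}^{n}\otimes\bar{f}^{n,m}$$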

to be nearly componentwise uncorrelated, which is again reasonable when $k\gg m$ .
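The rate remark following Theorem 2.1 can be checked numerically. The sketch below evaluates the right hand side of the bound (4) in the scalar case; the function name `theorem_2_1_bound`, the default values `c_star=1.0` and `d3h_norm=1.0`, and the choice $\rho(m)=m^{-3}$ with $b=\sqrt{N}$ are illustrative assumptions, chosen so that $\sum_{m}m\rho(m)<\infty$ and $\|b^{-1}\|_{s}=N^{-1/2}$ .

```python
import math


def theorem_2_1_bound(n_terms, inv_b_norm, rho, c_star=1.0, d3h_norm=1.0):
    """Evaluate C_* * ||D^3 h|| * N * ||b^{-1}||_s^3 * sum_{m=1}^{N-1} m * rho(m)."""
    weighted_tail = sum(m * rho(m) for m in range(1, n_terms))
    return c_star * d3h_norm * n_terms * inv_b_norm ** 3 * weighted_tail


def rho_cubic(m):
    """Hypothetical polynomial correlation decay rho(m) = m^{-3}."""
    return float(m) ** -3


# With b = sqrt(N), the bound times sqrt(N) equals sum_{m<N} m^{-2} <= pi^2 / 6.
bound_small = theorem_2_1_bound(100, 100 ** -0.5, rho_cubic)
bound_large = theorem_2_1_bound(10_000, 10_000 ** -0.5, rho_cubic)
```

Increasing $N$ by a factor of 100 shrinks the bound by a factor close to 10, in line with the $O(N^{-1/2})$ rate.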

Recall that the Wasserstein distance between two random vectors $Y_{1}$ and $Y_{2}$ is defined by

$$d_{\mathscr{W}}(Y_{1},Y_{2})=\sup_{h\in\mathscr{W}}|\mu(h(Y_{1}))-\mu(h(Y_{2}))|,$$

where

$$\mathscr{W}=\{h:{\mathbb{R}}^{d}\to{\mathbb{R}}:|h(x)-h(y)|\leq\|x-y\|\}$$

is the class of all $1$ -Lipschitz functions. When $d=1$ we obtain a result similar to Theorem 2.1 for the Wasserstein distance. The relaxed smoothness of $h$ comes at the expense that conditions (A2) and (A3) have to be verified for a whole class of regular functions.
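For $d=1$ the Wasserstein distance between two empirical laws with the same number of atoms has a closed form: it equals the mean absolute difference of the sorted samples. This is a standard fact about one-dimensional optimal transport and is not taken from the paper. A minimal sketch, with the hypothetical helper name `empirical_wasserstein_1d`:

```python
def empirical_wasserstein_1d(xs, ys):
    """Wasserstein-1 distance between the empirical laws of two equal-size samples."""
    if len(xs) != len(ys):
        raise ValueError("samples must have the same size")
    # In one dimension the optimal coupling matches order statistics.
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


# A shift by 0.5 moves every atom by 0.5, so the distance is exactly 0.5.
shifted = empirical_wasserstein_1d([0.0, 1.0, 3.0], [3.5, 0.5, 1.5])
# Scaling both samples by 2 doubles the distance.
scaled = empirical_wasserstein_1d([0.0, 2.0, 6.0], [7.0, 1.0, 3.0])
```

Such an estimator can be used to compare simulated samples of $W$ with samples of a standard normal variable, although the sampling error of the estimate itself then has to be accounted for.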

For a function $G:{\mathbb{R}}^{d}\to{\mathbb{R}}$ we denote

$$\operatorname{Lip}(G)=\sup_{x\neq y}\frac{|G(x)-G(y)|}{\|x-y\|}.$$

Theorem 2.3.

Let $d=1$ and fix $N\geq 1$ . Take $b=\textnormal{Var}_{\mu}(\sum_{i<N}\bar{f}^{i})^{1/2}$ . Suppose that $M=\max_{i<N}\|g^{i}\|_{\infty}<\infty$ , that $b>0$ , and that there exist constants $C_{i}>0$ , $1\leq i\leq 3$ , and a function $\rho:{\mathbb{N}}\to{\mathbb{R}}_{+}$ with $\lim_{n\to\infty}\rho(n)=0$ such that the following conditions hold for all $0\leq n,m\leq N-1$ :

(B1)

$|\mu(\bar{f}^{n}\bar{f}^{m})|\leq C_{1}\rho(|n-m|)$ .

(B2)

Whenever $m\leq k\leq N-1$ and $G:{\mathbb{R}}\times B_{1}(0,4M+1)\to{\mathbb{R}}$ is a bounded Lipschitz continuous function,

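$$\Bigl|\mu\Bigl[\bar{f}^{n}\,G\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)\bar{f}^{n,m}\Bigr]\Bigr|\leq C_{2}\bigl(\|G\|_{\infty}+\operatorname{Lip}(G)\bigr)\rho(m).$$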

(B3)

Whenever $2m\leq k\leq N-1$ and $G:{\mathbb{R}}\times B_{1}(0,4M+1)\to{\mathbb{R}}$ is a bounded Lipschitz continuous function,

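$$\Bigl|\mu\Bigl[\bar{f}^{n}\bar{f}^{n,m}\,\overline{G\Bigl(\sum_{0\leq i\leq N-1:\,|i-n|>k}\bar{f}^{i},\,\bar{f}^{n,k}\Bigr)}\Bigr]\Bigr|\leq C_{3}\bigl(\|G\|_{\infty}+\operatorname{Lip}(G)\bigr)\rho(k-m).$$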

Then

$$d_{\mathscr{W}}(W,Z)\leq C_{*}Nb^{-3}\sum_{m=1}^{N-1}m\rho(m),\tag{5}$$

where

$$C_{*}=96M^{3}(C_{1}+C_{2}+C_{3}+1),$$

and $Z\sim{\mathcal{N}}(0,1)$ is a random variable with standard normal distribution.

The following easy observation allows for normalizing constants other than

$$b=\Bigl[\operatorname{Var}_{\mu}\Bigl(\sum_{i=0}^{N-1}\bar{f}^{i}\Bigr)\Bigr]^{1/2}.$$

Lemma 2.4.

Suppose (5) of Theorem 2.3 holds. Then, for any $c>0$ ,

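$$d_{\mathscr{W}}\Bigl(c^{-1}\sum_{i=0}^{N-1}\bar{f}^{i},\,c^{-1}bZ\Bigr)\leq C_{*}Nc^{-1}b^{-2}\sum_{m=1}^{N-1}m\rho(m).\tag{6}$$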

Proof.

For any random variables $X,Y$ and any $a>0$ , the Wasserstein metric satisfies

[TABLE]

∎
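The argument can be written out in two lines. The following worked derivation is a sketch that assumes the scaling property invoked in the proof is the usual positive homogeneity of the Wasserstein metric, and it uses $W=b^{-1}\sum_{i<N}\bar{f}^{i}$ together with the bound (5):

```latex
% Homogeneity: h is 1-Lipschitz if and only if w -> a^{-1} h(a w) is 1-Lipschitz, hence
d_{\mathscr{W}}(aX, aY) = a \, d_{\mathscr{W}}(X, Y), \qquad a > 0.

% Apply this with a = c^{-1} b, X = W, Y = Z, and then use (5):
d_{\mathscr{W}}\Bigl(c^{-1}\sum_{i=0}^{N-1}\bar{f}^{i},\; c^{-1} b Z\Bigr)
  = c^{-1} b \, d_{\mathscr{W}}(W, Z)
  \leq c^{-1} b \cdot C_{*} N b^{-3} \sum_{m=1}^{N-1} m \rho(m)
  = C_{*} N c^{-1} b^{-2} \sum_{m=1}^{N-1} m \rho(m).
```

The factor $b^{-2}$ on the right is the source of the dependence on $\textnormal{Var}_{\mu}(\sum_{i<N}\bar{f}^{i})$ that persists for every choice of $c$ .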

Remark 2.5.

There is a notable difference between the upper bounds (4) and (6): unlike (4), (6) always depends on $\textnormal{Var}_{\mu}(\sum_{i<N}\bar{f}^{i})$ in addition to the normalizing constant $c$ . This difference is due to the choice of metric.

2.2. Self-normalization

We now assume that $\operatorname{Cov}_{\mu}(\sum_{i=0}^{N-1}\bar{f}^{i})$ is positive definite and set $b=\operatorname{Cov}_{\mu}(\sum_{i=0}^{N-1}\bar{f}^{i})^{1/2}$ so that $\Sigma=\mu(W\otimes W)=I_{d\times d}$ . In this case we establish an upper bound on the distance between the law of $W$ and a standard normal random vector $Z\sim{\mathcal{N}}(0,I_{d\times d})$ with respect to the Wasserstein metric. Unlike Theorem 2.3, the result applies for a general $d\geq 1$ . We denote by $\lambda_{\min}$ the least eigenvalue of $\operatorname{Cov}_{\mu}(\sum_{i=0}^{N-1}\bar{f}^{i})$ .

Theorem 2.6.

Let $N\geq 1$ . Suppose that $\max_{0\leq i<N}\|g^{i}\|_{\infty}\leq M$ where $M\geq 1$ , that $\lambda_{\min}>1$ , and that there exist a non-increasing function $\rho:{\mathbb{N}}\to{\mathbb{R}}_{+}$ with $\lim_{n\to\infty}\rho(n)=0$ and constants $C_{i}>0$ , $1\leq i\leq 3$ , such that the following conditions hold for all $0\leq n,m\leq N-1$ :

(C1)

For all $\alpha,\beta\in\{1,\ldots,d\}$ ,

[TABLE]

(C2)

Whenever $m\leq k\leq N-1$ and $G:{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\to{\mathbb{R}}^{d\times d}$ is a bounded $C^{1}$ -function with bounded gradient,

[TABLE]

(C3)

Whenever $2m\leq k\leq N-1$ and $G:{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\to{\mathbb{R}}^{d\times d}$ is a bounded $C^{1}$ -function with bounded gradient,

[TABLE]

Then

[TABLE]

where

[TABLE]

Remark 2.7.

If in addition $\sum_{m=1}^{\infty}(1+\log(\rho(m)^{-1}))m\rho(m)<\infty$ and $\lambda_{\min}\gtrsim N$ hold, then we obtain the rate $d_{\mathscr{W}}(W,Z)=O(N^{-1/2}\log N)$ , as $N\to\infty$ .

2.3. Pène’s CLT for stationary dynamics

The theorems given above apply in the stationary case where $T_{n}=T$ preserves the measure $\mu$ for all $n\geq 1$ . In this case the problem of normal approximation has been studied in several important articles including [22, 9, 14, 44, 13, 32], using different methods, conditions, and metrics. In the multidimensional case $d>1$ , Pène [45] formulated a correlation-decay condition for stationary processes, based on the inductive proof of Rio [48]. Let $S_{n}=\sum_{i=0}^{n-1}f^{i}$ , where $f^{i}=f\circ T^{i}$ , $f:X\to{\mathbb{R}}^{d}$ is bounded and $\mu(f)=0$ . In this context of measure preserving transformations, Pène’s condition can be stated as follows:

(D)

There exist $r\in\mathbb{Z}_{+}$ , $C>0$ , $M\geq\max\{1,\|f\|_{\infty}\}$ , and a sequence of real numbers $(\varphi_{p,l})$ with $|\varphi_{p,l}|\leq 1$ and $\sum_{p=1}^{\infty}p\max_{0\leq l\leq\lfloor p/(r+1)\rfloor}\varphi_{p,l}<\infty$ , such that for any integers $a,b,c\geq 0$ satisfying $1\leq a+b+c\leq 3$ , for any integers $i,j,k,p,q,l$ with $0\leq i\leq j\leq k\leq k+p\leq k+p+q\leq k+p+l$ , for any $\alpha,\beta,\gamma\in\{1,\ldots,d\}$ , and for any bounded differentiable function $F:\,{\mathbb{R}}^{d}\times([-M,M]^{d})^{3}\to{\mathbb{R}}$ with bounded gradient,

[TABLE]

Condition (D) is satisfied by chaotic dynamical systems such as Sinai billiards. It was shown in [45] that condition (D) implies the existence of the limit $\Sigma_{0}:=\lim_{n\to\infty}n^{-1}\mu(S_{n}\otimes S_{n})$ and, whenever $\Sigma_{0}$ is nonnull, the existence of a constant $B>0$ such that

[TABLE]

where $U$ is a Gaussian random variable with expectation $0$ and covariance matrix $\Sigma_{0}$ .

Compared to Theorem 2.1, (7) gives an upper bound of the same order $O(N^{-1/2})$ for stationary systems whose correlations decay at a rate which has a finite first moment, for test functions that are only assumed to be Lipschitz continuous. On the other hand, Theorem 2.1 is more general in that it applies for rather arbitrary matrix-valued normalizing sequences $b(N)$ . Furthermore, the constant $C_{*}$ in (4) is more explicit than the one in (7) in terms of its dependence on $d$ , $f$ and the underlying dynamical system. The same can be said about the constant $C_{*}$ in Theorem 2.6, which gives an upper bound for the same metric as (7) but with a slightly weaker rate of convergence due to the logarithmic factor. Note that, similarly to conditions (B2)-(B3) and (C2)-(C3), condition (D) has to be verified for a whole class of regular functions $F$ .

3. Application I: Random $1D$ piecewise expanding maps

In this section we apply Theorem 2.3 to estimate the rate of convergence in the quenched CLT for a class of piecewise expanding random dynamical systems. Namely, we consider the setup studied by Dragičević et al. in [12]. Below we recall the definitions and results from [12] that are needed to understand the application given in this section.

Set $(X,{\mathcal{B}})=([0,1],\text{Borel}([0,1]))$ and for a function $g:X\to{\mathbb{R}}$ define its total variation by

[TABLE]

Moreover, define

[TABLE]

The Banach space $BV$ consists of all functions $g$ with $V(g)<\infty$ and is equipped with the norm $\|\cdot\|_{BV}$ .

Let us denote by ${\mathcal{E}}$ the collection of all maps $T:X\to X$ for which there exists a finite partition ${\mathcal{A}}(T)$ of $X$ into subintervals such that for every $I\in{\mathcal{A}}(T)$ :

(1)

$T\upharpoonright I$ extends to a $C^{2}$ map in a neighborhood of $I$ ;

(2)

$\delta(T):=\inf|T^{\prime}|>1$ .

The map $T$ is monotone on each element $I\in{\mathcal{A}}(T)$ . From now on we take ${\mathcal{A}}(T)$ to be the minimal such partition and set $N(T)=|{\mathcal{A}}(T)|$ .

Let $(\Omega,{\mathcal{F}},{\mathbb{P}})$ be a probability space and let $\tau:\Omega\to\Omega$ be an invertible ${\mathbb{P}}$ -preserving transformation. We consider a map $\omega\mapsto T_{\omega}$ from $\Omega$ into ${\mathcal{E}}$ . Random compositions of maps are denoted by

[TABLE]

and

[TABLE]

where ${\mathcal{L}}_{\omega}:L^{1}(m)\to L^{1}(m)$ is the transfer operator associated to $T_{\omega}$ :

[TABLE]

Conditions (H):

(i)

$\tau:\Omega\to\Omega$ is invertible, ${\mathbb{P}}$ -preserving, and ergodic.

(ii)

The map $(\omega,x)\mapsto({\mathcal{L}}_{\omega}H(\omega,\cdot))(x)$ is measurable for every measurable function $H:\Omega\times X\to{\mathbb{R}}$ such that $H(\omega,\cdot)\in L^{1}(X,m)$ .

(iii)

$N:=\sup_{\omega\in\Omega}N(T_{\omega})<\infty$ ; $\delta:=\inf_{\omega\in\Omega}\delta(T_{\omega})>1$ ; $D:=\sup_{\omega\in\Omega}|T^{\prime\prime}_{\omega}|<\infty$ .

(iv)

There is $r\geq 1$ such that $\delta^{r}>2$ and $\text{ess inf}_{\omega\in\Omega}\min_{J\in{\mathcal{A}}(T_{\omega}^{r})}m(J)>0$ .

(v)

For every subinterval $J\subset X$ there is $k=k(J)\geq 1$ such that $T^{k}_{\omega}(J)=X$ holds for almost every $\omega\in\Omega$ .

Remark 3.1.

It was shown in [12] that conditions (H) imply several nice properties for the transfer operators ${\mathcal{L}}_{\omega}$ , including a Lasota-Yorke inequality and exponential decay in the BV-norm. The authors used such properties to establish an almost sure invariance principle.

Lemma 3.2 (See Proposition 1 in [12]).

Assume conditions (H). Then there exists a unique measurable and non-negative function $h:\Omega\times X\to{\mathbb{R}}$ such that $h_{\omega}:=h(\omega,\cdot)\in BV$ , $m(h_{\omega})=1$ and ${\mathcal{L}}_{\omega}(h_{\omega})=h_{\tau(\omega)}$ for almost every $\omega\in\Omega$ . Moreover, $\textnormal{ess sup}_{\omega\in\Omega}\|h_{\omega}\|_{BV}<\infty$ .

3.1. Statement of result

Let $f:X\to{\mathbb{R}}$ be a bounded measurable function and set

[TABLE]

where $d\mu_{\omega}=h_{\omega}\,dm$ and $h$ is the function from Lemma 3.2. Set

[TABLE]

where $b$ is the square root of ${\mu_{\omega}}[(\sum_{n=0}^{N-1}\widetilde{f}_{\tau^{n}(\omega)}\circ T^{n}_{\omega})^{2}]$ . We denote by $\varphi:\Omega\times X\to\Omega\times X$ the skew product $\varphi(\omega,x)=(\tau(\omega),T_{\omega}(x))$ , which preserves the measure $\mu$ on $\Omega\times X$ defined by

[TABLE]

Theorem 3.3.

Consider a family of piecewise expanding maps $(T_{\omega})_{\omega\in\Omega}$ such that conditions (H) hold. Fix $N\geq 1$ and suppose $f$ is Lipschitz continuous such that $\widetilde{f}$ can not be written as $g-g\circ\varphi$ for any $g\in L^{2}(\Omega\times X,\mu)$ . Then there is $C_{*}>0$ independent of $N$ such that

[TABLE]

holds for almost every $\omega\in\Omega$ . Here $Z\sim{\mathcal{N}}(0,1)$ is a random variable with standard normal distribution.

Remark 3.4.

The proof of Theorem 3.3 is based on Theorem 2.3. Theorem 2.1 or 2.6 could be used instead to obtain similar central limit theorems for multivariate observables $f:X\to{\mathbb{R}}^{d}$ .

3.2. A functional correlation bound

Conditions (B2) and (B3) of Theorem 2.3 will be verified by applying the auxiliary result given below, which facilitates bounding integrals of the form $\int F\circ(T_{\omega}^{m})_{0\leq m<k}\,d\mu$ , where $F:[0,1]^{k}\to{\mathbb{R}}$ is not necessarily a product of one-dimensional observables. Such functional correlation bounds were established for stationary Sinai billiards in [39] and for time-dependent intermittent maps in [36].

For a function $F:[0,1]^{k}\to{\mathbb{R}}$ , $\theta\in(0,1]$ , and $1\leq\beta\leq k$ we denote

[TABLE]

where $x(a/\beta)\in[0,1]^{k}$ is obtained from $x$ by replacing the $\beta$ th coordinate with $a\in[0,1]$ . We say that $F$ is $\theta$ -Hölder continuous in the coordinate $\beta$ if $[F]_{\theta,\beta}<\infty$ .
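The coordinatewise Hölder seminorm can be probed numerically. The sketch below assumes the standard form $[F]_{\theta,\beta}=\sup_{x}\sup_{a\neq a^{\prime}}|F(x(a/\beta))-F(x(a^{\prime}/\beta))|/|a-a^{\prime}|^{\theta}$ ; the function name `holder_seminorm_lower_bound` and the random-sampling strategy are illustrative choices, not part of the paper. Sampling only yields a lower bound for the supremum.

```python
import random

def holder_seminorm_lower_bound(F, k, beta, theta, samples=2000, seed=0):
    """Monte Carlo lower bound for the (assumed) seminorm [F]_{theta,beta}.

    F     : function taking a list of k numbers in [0, 1]
    beta  : 1-based index of the coordinate that is varied
    theta : Hoelder exponent in (0, 1]
    """
    rng = random.Random(seed)
    best = 0.0
    for _ in range(samples):
        x = [rng.random() for _ in range(k)]
        a, a2 = rng.random(), rng.random()
        if a == a2:
            continue  # the quotient is undefined when the two points coincide
        xa = list(x)
        xa[beta - 1] = a   # x(a / beta): replace the beta-th coordinate by a
        xb = list(x)
        xb[beta - 1] = a2  # x(a' / beta)
        best = max(best, abs(F(xa) - F(xb)) / abs(a - a2) ** theta)
    return best

# Illustrative observable: F(x1, x2) = x1 * x2 is Lipschitz in x1 with constant at most 1.
estimate = holder_seminorm_lower_bound(lambda x: x[0] * x[1], k=2, beta=1, theta=1.0)
```

For the product observable the quotient equals $|x_{2}|\leq 1$ , so the estimate stays below one.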

Proposition 3.5.

Let $k\geq 2$ . Consider integers $0\leq n_{1}\leq\ldots\leq n_{k}$ blocked according to a set of indices $0=\ell_{0}<\ell_{1}<\ldots<\ell_{p}<\ell_{p+1}=k$ , where we assume that $n_{\ell_{i}+1}<\ldots<n_{\ell_{i+1}}$ holds for all $0\leq i\leq p$ . Suppose $(T_{\omega})_{\omega\in\Omega}\subset{\mathcal{E}}$ is a family of maps such that conditions (H) hold, and that $F_{\omega}:\,[0,1]^{k}\to{\mathbb{R}}$ is a function with $\textnormal{ess sup}_{\omega\in\Omega}\|F_{\omega}\|_{\infty}<\infty$ and

[TABLE]

Denote by $H_{\omega}(x_{1},\ldots,x_{p+1})$ the function

[TABLE]

Then, for any probability measures $\mu_{1},\ldots,\mu_{p+1}$ whose densities belong to $BV$ , and for almost every $\omega\in\Omega$ ,

[TABLE]

where $0<\gamma<1$ , and $C=C(p,(T_{\omega})_{\omega\in\Omega},\theta)>0$ .

Remark 3.6.

The upper bound (9) is independent of $k$ .

The proof for Proposition 3.5 is based on two auxiliary results. The first result is an immediate consequence of Corollary 8 in [2] due to Aimino and Rousseau, who considered sequential (non-random) compositions of piecewise-expanding maps. The second result is Lemma 2 in the paper [12] by Dragičević et al.

Lemma 3.7.

Suppose conditions (H) hold. There is $C>0$ such that for almost every $\omega\in\Omega$ ,

[TABLE]

where $V_{I}(f)$ denotes the total variation of $f$ over the subinterval $I\subset[0,1]$ .

Proof.

As is explained on p. 2252 of [12], condition (iv) implies that there exists $\alpha^{r}\in(0,1)$ and $K^{r}>0$ such that, for almost every $\omega\in\Omega$ ,

[TABLE]

holds for all $\phi\in BV$ and $\ell\geq 1$ . It suffices to fix $\Omega^{*}\subset\Omega$ with ${\mathbb{P}}(\Omega^{*})=1$ such that (11) holds for all $\omega\in\Omega^{*}$ . Then the proof of Corollary 8 in [2] shows that (10) holds for all $\omega\in\Omega^{*}$ . ∎

Lemma 3.8 (See Lemma 2 in [12]).

Assume conditions (H). There is $K>0$ and $\eta\in(0,1)$ such that, for almost every $\omega\in\Omega$ ,

[TABLE]

holds for all $n\geq 0$ and $\phi\in BV$ with $m(\phi)=0$ .

Proof for Proposition 3.5.

The proof proceeds by induction on $p$ . First let $p=1$ and denote $\ell_{1}=\ell$ . Then the function $H_{\omega}(x,y)$ in Proposition 3.5 becomes

[TABLE]

where $n_{1}<\ldots<n_{\ell}\leq n_{\ell+1}<\ldots<n_{k}$ . Set $n_{*}=n_{\ell}+\lfloor(n_{\ell+1}-n_{\ell})/2\rfloor$ , where $\lfloor x\rfloor$ denotes the greatest non-negative integer $n$ with $n\leq x$ . Then,

[TABLE]

Claim. If $a,b\in J\in{\mathcal{A}}(T_{\omega}^{n_{*}})$ , then almost surely

[TABLE]

where $L_{1}=\text{ess sup}_{\omega\in\Omega}\max_{1\leq\alpha\leq\ell}[F_{\omega}]_{\theta,\alpha}$ , $\kappa=(\delta^{\theta})^{-1/2}\in(0,1)$ , and $C=C(\kappa)>0$ . We recall that by definition $\delta=\inf_{\omega\in\Omega}\delta(T_{\omega})>1$ .

Proof for Claim.

Since $F_{\omega}$ is $\theta$ -Hölder continuous for a.e. $\omega\in\Omega$ in the first $\ell$ coordinates,

[TABLE]

holds for a.e. $\omega\in\Omega$ . Consequently,

[TABLE]

For each $1\leq\alpha\leq\ell$ , $T_{\tau^{n_{\alpha}}\omega}^{n_{*}-n_{\alpha}}$ maps $T_{\omega}^{n_{\alpha}}(J)$ diffeomorphically onto $T^{n_{*}}_{\omega}(J)$ , which implies the upper bound $m(T^{n_{\alpha}}_{\omega}J)\leq(\delta^{n_{*}-n_{\alpha}})^{-1}m(T^{n_{*}}_{\omega}J)\leq\delta^{-n_{*}+n_{\alpha}}$ . That is, for a.e. $\omega\in\Omega$ ,

[TABLE]

This proves the claim. ∎
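The origin of the constant $\kappa=(\delta^{\theta})^{-1/2}$ can be traced by a short computation. This is a sketch, assuming the bound in the claim arises by summing the $\theta$ -Hölder increments $m(T^{n_{\alpha}}_{\omega}J)^{\theta}\leq\delta^{-\theta(n_{*}-n_{\alpha})}$ over $1\leq\alpha\leq\ell$ . Since $n_{1}<\ldots<n_{\ell}$ are strictly increasing integers, $n_{*}-n_{\alpha}\geq(n_{*}-n_{\ell})+(\ell-\alpha)$ , and since $\lfloor g/2\rfloor\geq(g-1)/2$ for every integer $g\geq 0$ ,

```latex
\sum_{\alpha=1}^{\ell} \delta^{-\theta(n_{*}-n_{\alpha})}
\le \delta^{-\theta(n_{*}-n_{\ell})} \sum_{j=0}^{\infty} \delta^{-\theta j}
= \frac{\delta^{-\theta \lfloor (n_{\ell+1}-n_{\ell})/2 \rfloor}}{1-\delta^{-\theta}}
\le \frac{\delta^{\theta/2}}{1-\delta^{-\theta}} \, \kappa^{\,n_{\ell+1}-n_{\ell}},
\qquad \kappa = (\delta^{\theta})^{-1/2}.
```

The prefactor depends only on $\delta^{\theta}$ , which is consistent with a constant of the form $C=C(\kappa)$ .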

We fix a point $c_{J}\in J$ for each $J\in{\mathcal{A}}(T^{n_{*}}_{\omega})$ . Then (12) implies for a.e. $\omega\in\Omega$ the upper bound

[TABLE]

Let $h_{1}\in BV$ denote the density of $\mu_{1}$ , and let $h_{2}\in BV$ denote the density of $\mu_{2}$ . Moreover, let $\widetilde{H}_{\omega}(c_{J},x)$ be the function that satisfies $\widetilde{H}_{\omega}(c_{J},T^{n_{\ell+1}}_{\omega}(x))=H_{\omega}(c_{J},x)$ . Fix $J\in{\mathcal{A}}(T^{n_{*}}_{\omega})$ . Then, for a.e. $\omega\in\Omega$ ,

[TABLE]

Let $x\in[0,1]$ . Since $J\in{\mathcal{A}}(T^{n_{*}}_{\omega})$ , either $x\in T^{n_{*}}_{\omega}(J)$ and

[TABLE]

or ${\mathcal{L}}_{\omega}^{n_{*}}(1_{J}h_{2})x=0$ . It follows easily from this and the strict monotonicity of $T^{n_{*}}_{\omega}\upharpoonright J$ that

[TABLE]

where

[TABLE]

We conclude that

[TABLE]

On the other hand there is $C>0$ such that, for any $\phi\in BV$ , $\sup_{k\geq 0}V({\mathcal{L}}^{k}_{\omega}(\phi))\leq C\|\phi\|_{BV}$ holds for almost every $\omega\in\Omega$ . This follows from (11) together with the fact that $\|{\mathcal{L}}_{\omega}(\phi)\|_{BV}\leq C\|\phi\|_{BV}$ for almost every $\omega\in\Omega$ ; see p. 2257 of [12]. In particular,

[TABLE]

Next we combine Lemma 3.8, (14) and (15) to obtain

[TABLE]

for a.e. $\omega\in\Omega$ , where $\eta_{1}\in(0,1)$ . Then, by Lemma 3.7,

[TABLE]

for a.e. $\omega\in\Omega$ . Taking $\gamma=\max\{\eta_{1},\kappa\}$ completes the proof for the case $p=1$ .

Suppose that we have shown (9) for $p-1$ , and fix integers $0=\ell_{0}<\ell_{1}<\ldots<\ell_{p}<\ell_{p+1}=k$ as in the proposition. Recall that $H_{\omega}(x_{1},\ldots,x_{p+1})$ denotes the function

[TABLE]

From the case $p=1$ we know that, for a.e. $\omega\in\Omega$ ,

[TABLE]

where $h_{i}$ is the density of $\mu_{i}$ .

Next for each $x_{p+1}\in[0,1]$ , we apply the induction hypothesis to the function

[TABLE]

This implies for a.e. $\omega\in\Omega$ the upper bound

[TABLE]

for all $x_{p+1}\in[0,1]$ . Now, to complete the proof for Proposition 3.5, it suffices to combine (16) and (17).

∎

3.3. Proof for Theorem 3.3

It was shown in [12] that there exists a non-random $\sigma^{2}\geq 0$ such that

[TABLE]

for almost every $\omega\in\Omega$ . Moreover, $\sigma^{2}=0$ if and only if there exists $g\in L^{2}(\Omega\times X,\mu)$ such that $\widetilde{f}=g-g\circ\varphi$ . Hence, under our assumption there exists $C>0$ and $n_{0}\geq 1$ such that, for a.e. $\omega\in\Omega$ ,

[TABLE]

holds for all $n\geq n_{0}$ .

Next we show that, with $\mu_{\omega}$ as the initial measure, conditions (B1)-(B3) hold with $\rho(m)=\gamma^{m}$ for a.e. $\omega\in\Omega$ , where $\gamma\in(0,1)$ is the same as in Proposition 3.5. To this end recall that, by Lemma 3.2, the density $h_{\omega}$ of $\mu_{\omega}$ lies in $BV$ for a.e. $\omega\in\Omega$ .

(B1): For brevity, we introduce the notation $\widetilde{f}_{\omega}^{n}=f\circ T_{\omega}^{n}-\mu_{\omega}(f\circ T_{\omega}^{n})$ . Taking $k=2$ , $p=1$ , $F_{\omega}(x,y)=f(x)f(y)$ , and $\mu_{1}=\mu_{\omega}=\mu_{2}$ in Proposition 3.5 yields the upper bound

[TABLE]

for a.e. $\omega\in\Omega$ .

(B2): Let $m\leq k\leq N-1$ and let $G:{\mathbb{R}}\times B_{1}(0,4\|f\|_{\infty}+1)\to{\mathbb{R}}$ be a bounded Lipschitz continuous function. We define $F_{\omega}(x_{0},\ldots,x_{n-k},x_{n-m},x_{n},x_{n+m},x_{n+k},\ldots,x_{N-1})$ by the formula

[TABLE]

where $\psi_{i,\omega}(x)=f(x)-\mu_{\omega}(f\circ T_{\omega}^{i})$ and the summations are over $i$ . Then

[TABLE]

which is the integral we need to control. It is easy to verify that $F_{\omega}$ is Lipschitz continuous with

[TABLE]

and

[TABLE]

where ${\mathcal{I}}=\{0\leq i\leq N-1\>:\>|i-n|\geq k\}\cup\{0\leq i\leq N-1\>:\>|i-n|=m\}\cup\{n\}$ is an indexing for the arguments of $F$ . Observe that, since $\mu_{\omega}(\widetilde{f}_{\omega}^{n})=0$ ,

[TABLE]

where ${\mathcal{T}}_{\omega}^{\leq n-k}x=(T^{0}_{\omega}x,\ldots,T^{n-k}_{\omega}x)$ and ${\mathcal{T}}_{\omega}^{\geq n+k}z=(T^{n+k}_{\omega}z,\ldots,T^{N-1}_{\omega}z)$ . It follows by Proposition 3.5 applied with $F_{\omega}$ and $p=2$ that, for a.e. $\omega\in\Omega$ ,

[TABLE]

(B3): This is obtained in the same way as condition (B2). Namely, whenever $2m\leq k\leq N-1$ , applying Proposition 3.5 with $p=2$ and the function

[TABLE]

where

[TABLE]

implies for a.e. $\omega\in\Omega$ the upper bound

[TABLE]

Since $\sum_{m=1}^{\infty}m\gamma^{m}<\infty$ , Theorem 3.3 now follows by Theorem 2.3.
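The summability used in the last step follows from differentiating the geometric series, which also gives the value of the sum explicitly:

```latex
\sum_{m=1}^{\infty} m\gamma^{m}
= \gamma \frac{d}{d\gamma}\sum_{m=0}^{\infty}\gamma^{m}
= \gamma \frac{d}{d\gamma}\,\frac{1}{1-\gamma}
= \frac{\gamma}{(1-\gamma)^{2}} < \infty,
\qquad 0<\gamma<1 .
```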

Remark 3.9.

Another example of a random dynamical system that satisfies the conditions of Theorem 2.3 is the Sinai billiard of [51], in which a scatterer configuration on the torus is randomly updated between consecutive collisions. The key technical lemmas necessary for obtaining an analog of Proposition 3.5 were proven in [51, 53], including a statistical memory loss starting from an initial measure supported on a single homogeneous local unstable manifold (Lemma 12 of [51]), and a tail estimate on the prevalence of short local unstable manifolds (Lemma 13 of [51]). The application would imply a rate of convergence in the annealed CLT but we will not treat it here.

4. Application II: intermittent maps

Following [40] we define for each $\alpha\in(0,1)$ the map $T_{\alpha}:[0,1]\to[0,1]$ by

[TABLE]
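For numerical experiments, the following minimal sketch may be useful. It assumes that $T_{\alpha}$ has the standard Liverani–Saussol–Vaienti form, $T_{\alpha}(x)=x(1+2^{\alpha}x^{\alpha})$ for $x\in[0,1/2)$ and $T_{\alpha}(x)=2x-1$ for $x\in[1/2,1]$ ; the names `lsv_map` and `compose` are hypothetical helpers introduced only for illustration.

```python
def lsv_map(alpha, x):
    """Assumed LSV form: x * (1 + (2x)^alpha) on [0, 1/2), and 2x - 1 on [1/2, 1]."""
    if x < 0.5:
        return x * (1.0 + (2.0 * x) ** alpha)
    return 2.0 * x - 1.0

def compose(alphas, x):
    """Return the orbit x, T_{a_1} x, T_{a_2} T_{a_1} x, ... for a parameter sequence."""
    orbit = [x]
    for a in alphas:
        x = lsv_map(a, x)
        orbit.append(x)
    return orbit

# Three steps of a time-dependent composition with varying parameters.
orbit = compose([0.1, 0.2, 0.3], 0.3)
```

Both branches map into $[0,1]$ , so the orbit of any initial point stays in the unit interval for every parameter sequence.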

Associated to each map $T_{\alpha}$ is its transfer operator ${\mathcal{L}}_{\alpha}:L^{1}(m)\to L^{1}(m)$ defined by

[TABLE]

We denote by $d\hat{\mu}_{\alpha}=\hat{h}_{\alpha}dm$ the invariant absolutely continuous probability measure associated to $T_{\alpha}$ . It follows from [40] that the density $\hat{h}_{\alpha}$ belongs to the convex cone of functions

[TABLE]

We recall from [40, 1] that

[TABLE]

and that

[TABLE]

4.1. Sequential compositions

First we consider sequential compositions

[TABLE]

of intermittent maps with parameters $0<\alpha_{n}\leq\beta_{*}<1$ . The notation below is adapted from Section 2.2: $\mu$ is a Borel probability measure on $[0,1]$ ; $g^{n}:[0,1]\to{\mathbb{R}}^{d}$ is a bounded observable for all $n\geq 1$ ;

[TABLE]

For a Lipschitz continuous function $g:[0,1]\to{\mathbb{R}}^{d}$ we set $\|g\|_{\text{Lip}}=\|g\|_{\infty}+\text{Lip}(g)$ , where

[TABLE]

and

[TABLE]

Theorem 4.1.

Let $N\geq 1$ and let $\mu$ be a measure whose density lies in the cone ${\mathcal{C}}_{*}(\beta_{*})$ . Suppose that $g^{n}:[0,1]\to{\mathbb{R}}^{d}$ are Lipschitz continuous with $\sup_{n<N}\|g^{n}\|_{\textnormal{Lip}}+1\leq L$ and that $\lambda_{\min}>1$ . Denote by $Z\sim{\mathcal{N}}(0,I_{d\times d})$ a standard normal random vector.

(1)

If $\beta_{*}<1/3$ , then there is $C_{*}=C_{*}(L,d,\beta_{*})>0$ such that

[TABLE]

In particular, if $\lambda_{\min}\gg N^{2/3}(\log N)^{2/3}$ , then $W\stackrel{{\scriptstyle d}}{{\to}}Z$ as $N\to\infty$ .

(2)

If $1/3\leq\beta_{*}<2/5$ , then for any $\delta>0$ there is $C_{*}=C_{*}(L,d,\beta_{*},\delta)>0$ such that

[TABLE]

In particular, if $\lambda_{\min}\gg N^{8/3+2\delta/3-2/3\beta_{*}}$ , then $W\stackrel{{\scriptstyle d}}{{\to}}Z$ as $N\to\infty$ .
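The growth exponent required of $\lambda_{\min}$ in item (2) can be tabulated with exact rational arithmetic. The sketch below only evaluates the exponent $8/3+2\delta/3-2/(3\beta_{*})$ stated above; the function name `threshold_exponent` is an illustrative choice.

```python
from fractions import Fraction

def threshold_exponent(beta_star, delta=Fraction(0)):
    """Exponent e in the condition lambda_min >> N^e of item (2):
    e = 8/3 + 2*delta/3 - 2/(3*beta_star)."""
    return Fraction(8, 3) + Fraction(2, 3) * delta - Fraction(2, 3) / beta_star

# Endpoints of the parameter range of item (2), with delta -> 0.
at_one_third = threshold_exponent(Fraction(1, 3))   # Fraction(2, 3)
at_two_fifths = threshold_exponent(Fraction(2, 5))  # Fraction(1, 1)
```

At $\beta_{*}=1/3$ the exponent tends to $2/3$ as $\delta\to 0$ , matching item (1) up to the logarithmic factor, while at $\beta_{*}=2/5$ it tends to $1$ . This is consistent with the restriction $\beta_{*}<2/5$ when $\lambda_{\min}$ grows linearly in $N$ .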

Remark 4.2.

A couple of remarks are in order:

(i)

The proof is based on Theorem 2.6. In the special case $d=1$ let us denote $S=\sum_{i=0}^{N-1}\bar{f}^{i}$ and $\sigma^{2}=\mu(S^{2})$ . Assuming $\beta_{*}<1/3$ , the sharper upper bound

[TABLE]

is obtained by applying Theorem 2.3 instead of Theorem 2.6, provided that $\sigma^{2}>0$ . Consequently, by Lemma 2.4, for any $c>0$ ,

[TABLE]

Without any assumption on $\sigma^{2}$ we still obtain the weaker bound

[TABLE]

This follows easily by combining (20) with the fact that, for any random variables $X$ and $Y$ with finite variances $\sigma_{X}^{2}$ and $\sigma_{Y}^{2}$ , respectively, the Wasserstein metric satisfies $d_{\mathscr{W}}(X,Y)\leq\sigma_{X}+\sigma_{Y}$ (see e.g. [28] for the last statement).

(ii)

In the stationary case of a single intermittent map $T_{\alpha}$ preserving the measure $\hat{\mu}_{\alpha}$ , a Berry-Esseen theorem for univariate Hölder continuous observables was shown by Gouëzel [22]. Gouëzel’s result establishes the rate $O(N^{-1/2})$ with respect to the Kolmogorov metric for parameters $\alpha<1/3$ . For parameters $1/3\leq\alpha<1/2$ , Gouëzel obtains a rate depending on the behavior of $f(x)$ around the fixed point $x=0$ . For multivariate Lipschitz continuous observables, the rate $O(N^{-1/2})$ in the CLT with respect to the Wasserstein metric was shown for parameters $\alpha<1/3$ in [36] by an application of Pène’s theorem [45]. The upper bound (19) can be viewed as an extension of this result for parameters $1/3\leq\alpha<2/5$ . Pène’s condition (see Section 2.3) does not hold for parameters $\alpha\geq 1/3$ because correlations do not decay at a rate which has a finite first moment.

Proof for Theorem 4.1 .

Set $\rho(n)=n^{1-1/\beta_{*}}(\log n)^{1/\beta_{*}}$ for $n\geq 2$ and $\rho(0)=\rho(1)=1$ . We show that conditions (C1)-(C3) of Theorem 2.6 hold with $\rho$ using Theorem 1.1 in [36].

(C1): Let $\alpha,\beta\in\{1,\ldots,d\}$ and $0\leq n,m\leq N-1$ . Applying Theorem 1.1 in [36] with $k=2$ , $p=1$ , $F(x,y,z)=g_{\alpha}^{n}(y)g_{\beta}^{m}(z)$ , and $\mu_{1}=\mu$ yields the upper bound

[TABLE]

where $C=C(\beta_{*})>0$ .

(C2): Let $0\leq n,m\leq N-1$ , $m\leq k\leq N-1$ , and $G:{\mathbb{R}}^{d}\times B_{d}(0,4L+1)\to{\mathbb{R}}^{d\times d}$ be a bounded $C^{1}$ -function with bounded gradient. We define

[TABLE]

by the formula

[TABLE]

where $\psi^{i}(x)=g^{i}(x)-\mu(g^{i}\circ\widetilde{T}_{i})$ and the summations are over $i$ . Then,

[TABLE]

which is the integral we need to control. It is easy to verify that

[TABLE]

and

[TABLE]

Here

[TABLE]

$[F]_{1,\beta}$ is defined by (8), and ${\mathcal{I}}=\{0\leq i\leq N-1\>:\>|i-n|\geq k\}\cup\{0\leq i\leq N-1\>:\>|i-n|=m\}\cup\{n\}$ is an indexing for the arguments of $F$ . Theorem 1.1 in [36] together with (21) and (22) implies the upper bound

[TABLE]

(C3): This is shown in the same way as condition (C2). Namely Theorem 1.1 in [36] is applied with the function

[TABLE]

where

[TABLE]

We leave the details to the reader.

If $\beta_{*}<1/3$ , it follows by the foregoing that conditions (C1)-(C3) hold also with $\rho(n)=n^{-\kappa}$ for some $\kappa>2$ . In particular $\sum_{m=1}^{\infty}(1+\log(\rho(m)^{-1}))m\rho(m)<\infty$ , so that item (1) of Theorem 4.1 follows by Theorem 2.6. If instead $1/3\leq\beta_{*}<2/5$ we obtain conditions (C1)-(C3) with $\rho(n)=n^{1-1/\beta_{*}+\delta}$ for any $\delta>0$ . Then $\sum_{m=1}^{N-1}(1+\log(\rho(m)^{-1}))m\rho(m)\leq C(\beta_{*},\delta,\delta^{\prime})N^{3-1/\beta_{*}+\delta+\delta^{\prime}}$ holds for arbitrarily small $\delta^{\prime}>0$ and item (2) of Theorem 4.1 follows again by Theorem 2.6. ∎
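The growth rate of the weighted sum in the case $1/3\leq\beta_{*}<2/5$ can be checked directly. The following is a sketch in which $c=1/\beta_{*}-1-\delta$ and $s=2-1/\beta_{*}+\delta+\delta^{\prime}$ are shorthand introduced here only for brevity, and $\delta$ is assumed small enough that $c>0$ . It uses $\rho(m)=m^{-c}$ , the elementary inequality $\log m\leq m^{\delta^{\prime}}/\delta^{\prime}$ , and a comparison of $\sum_{m<N}m^{s}$ with an integral:

```latex
\sum_{m=1}^{N-1}\bigl(1+\log(\rho(m)^{-1})\bigr)\,m\,\rho(m)
= \sum_{m=1}^{N-1}(1+c\log m)\,m^{1-c}
\le \Bigl(1+\frac{c}{\delta'}\Bigr)\sum_{m=1}^{N-1} m^{s}
\le \Bigl(1+\frac{c}{\delta'}\Bigr)\max\Bigl\{1,\frac{1}{s+1}\Bigr\}\,N^{s+1},
\qquad s = 2-\frac{1}{\beta_{*}}+\delta+\delta' > -1 .
```

Here $s>-1$ because $\beta_{*}\geq 1/3$ , and $s+1=3-1/\beta_{*}+\delta+\delta^{\prime}$ is the exponent appearing in the bound above.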

In the remainder of this section we look at situations where we have control on the limiting behavior of $\operatorname{Cov}_{\mu}(\sum_{i=0}^{N-1}\bar{f}^{i})$ .

4.2. Quasistatic dynamics

We apply Theorem 4.1 to a model described by time-dependent (non-random) compositions of slowly transforming intermittent maps. More precisely we consider the following subclass of quasistatic dynamical systems (QDS); for background and earlier results on quasistatic systems we refer the reader to [37, 52, 38, 11, 28, 29].

Definition 4.3 (Intermittent QDS).

Let $\mathbf{T}=\{T_{\alpha_{n,k}}\>:\>0\leq k\leq n,\ n\geq 1\}$ be a triangular array of intermittent maps with parameters $\alpha_{n,k}\in[0,1)$ . If there is a piecewise continuous curve $\gamma:[0,1]\to[0,1)$ satisfying

[TABLE]

for all $t$ , we say that $(\mathbf{T},\gamma)$ is an intermittent QDS.

Given an intermittent QDS $(\mathbf{T},\gamma)$ , we define the functions $S_{n}:[0,1]\times[0,1]\to{\mathbb{R}}$ by

[TABLE]

where

[TABLE]

$f_{n,0}=f$ , and $f:[0,1]\to{\mathbb{R}}^{d}$ is a bounded function. We fix an initial distribution $\mu$ of $x\in[0,1]$ and for each $t\in[0,1]$ view the $S_{n}(t)=S_{n}(\cdot,t)$ as random vectors. The problem is now to approximate the law of the fluctuations

[TABLE]

by ${\mathcal{N}}(0,I_{d\times d})$ , where $\bar{S}_{n}(x,t)=S_{n}(x,t)-\mu(S_{n}(x,t))$ and $b=b(n,t)=\text{Cov}_{\mu}(\bar{S}_{n}(\cdot,t))^{1/2}$ .

Theorem 4.4.

Let $f:[0,1]\to{\mathbb{R}}^{d}$ be a Lipschitz continuous function and $\mu$ be such that its density lies in ${\mathcal{C}}_{*}(\beta_{*})$ . Suppose that the limiting curve $\gamma$ is Hölder-continuous, that for some $\eta\in(0,1]$ we have

[TABLE]

and that there exists $t_{0}\in(0,1]$ such that $f$ is not a co-boundary for $T_{\gamma_{t_{0}}}$ in any direction (i.e. there does not exist a unit vector $v\in{\mathbb{R}}^{d}$ , a constant $c\in{\mathbb{R}}^{d}$ , and a function $\psi\in L^{2}(\hat{\mu}_{\gamma_{t_{0}}})$ such that $v^{T}f=c+\psi-\psi\circ T_{\gamma_{t_{0}}}$ ).

(1)

If $\gamma([0,1])\subset[0,\beta_{*}]$ and $\beta_{*}<1/3$ , then there exists $C_{*}=C_{*}(t_{0},d,f,\gamma)$ such that for all $t\geq t_{0}$ and $n\geq 2$ ,

[TABLE]

(2)

If $\gamma([0,1])\subset[0,\beta_{*}]$ and $1/3\leq\beta_{*}<2/5$ , then for any $\delta>0$ there exists $C_{*}=C_{*}(t_{0},d,f,\delta,\gamma)$ such that for all $t\geq t_{0}$ and $n\geq 1$ ,

[TABLE]

Proof.

Set $\xi_{n}(x,t)=n^{-\frac{1}{2}}bW_{n}(x,t)$ . By Lemma 4.4 in [29], uniformly in $t\in[0,1]$ ,

[TABLE]

where

[TABLE]

and $\hat{f}_{t}=f-\hat{\mu}_{\gamma_{t}}(f)$ . By Theorem 2.11 in the same article the limit covariance $\Sigma_{t}(f):=\int_{0}^{t}\hat{\Sigma}_{s}(f)\,ds$ is positive definite for all $t\geq t_{0}$ (this is where the co-boundary condition on $f$ is needed). In particular, $\lambda_{\min}(\Sigma_{t}(f))>0$ , where $\lambda_{\min}(A)$ denotes the least eigenvalue of the matrix $A\in{\mathbb{R}}^{d\times d}$ . It follows by the same argument as on p. 20 of [29] that there exist $n_{0}$ and $C>0$ such that $\lambda_{\min}(\text{Cov}_{\mu}(\xi_{n}(t)))\geq C$ holds for all $t\geq t_{0}$ and all $n\geq n_{0}$ . In other words,

[TABLE]

Next we show the wanted upper bound on $d_{\mathscr{W}}(W_{n}(t),Z)$ by controlling separately the following three terms:

[TABLE]

where $b(s)=\text{Cov}_{\mu}(\bar{S}_{n}(s))^{1/2}$ .

Note that

[TABLE]

It follows immediately by (23) and Theorem 4.1 that for all $n\geq n_{0}$ and $t\geq t_{0}$ ,

[TABLE]

where $\delta>0$ can be made arbitrarily small.

In the remainder of this proof we assume that $\beta_{*}<1/2$ and $\gamma([0,1])\subset[0,\beta_{*}]$ . Whenever $t\geq t_{0}$ and $n\geq n_{0}$ ,

[TABLE]

Since the density of $\mu$ belongs to ${\mathcal{C}}_{*}(\beta_{*})$ , it follows by Lemma 3.3 in [37] that

[TABLE]

where $C=C(d,\|f\|_{\text{Lip}},\gamma,\beta_{*})>0$ is a constant independent of $t$ . Moreover (see Lemma 5.4),

[TABLE]

and

[TABLE]

That is,

[TABLE]

For brevity denote $\Sigma_{n,s}=\text{Cov}_{\mu}(\bar{S}_{n}(s))$ . Then we have for $t\geq t_{0}$ and $n\geq n_{0}$ the upper bound (see [49] for the first inequality)

[TABLE]

To bound the remaining spectral norm we fix $\alpha,\beta\in\{1,\ldots,d\}$ , and denote $\varphi=f_{\alpha}$ and $\psi=f_{\beta}$ . For a real-valued function $g:[0,1]\to{\mathbb{R}}$ and integers $0\leq k\leq n$ we denote $\bar{g}_{n,k}=g\circ T_{n,k}-\mu(g\circ T_{n,k})$ . Whenever $n\geq 2/t_{0}$ , we can use Theorem 1.1 in [37] to find $\kappa>1$ such that

[TABLE]

Hence, the upper bound $\|\Sigma_{n,\lceil nt\rceil/n}-\Sigma_{n,t}\|_{s}\leq dC\|f\|_{\text{Lip}}^{2}$ follows by Lemma 5.4. We have shown that the second of the three terms is bounded by $Cn^{-1/2}$ whenever $t\geq t_{0}$ and $n\geq n_{0}$ .

Finally, by (23) and Lemma 5.4,

[TABLE]

whenever $t\geq t_{0}$ and $n\geq n_{0}$ . Now to finish the proof for Theorem 4.4 it suffices to combine the foregoing upper bounds on the first of the three terms, (25), and (26).

∎

4.3. Rate in the quenched CLT

We consider a sequence $(T_{\omega_{i}})_{i\geq 1}$ of intermittent maps with parameters $(\omega_{i})_{i\geq 1}$ drawn randomly from the probability space $(\Omega,{\mathcal{F}},{\mathbb{P}})=([0,\beta_{*}]^{{\mathbb{Z}}_{+}},{\mathcal{E}}^{{\mathbb{Z}}_{+}},{\mathbb{P}})$ , where ${\mathcal{E}}$ is the Borel $\sigma$ -algebra of $[0,\beta_{*}]$ and ${\mathbb{Z}}_{+}=\{1,2,\dots\}$ . Let $\tau:\Omega\to\Omega$ denote the shift $(\tau(\omega))_{i}=\omega_{i+1}$ .

Conditions (RDS):

(i)

The shift $\tau:\Omega\to\Omega:(\tau(\omega))_{i}=\omega_{i+1}$ preserves ${\mathbb{P}}$ .

(ii)

There is $C>0$ and $\gamma>0$ such that, for all $n\geq 1$ ,

[TABLE]

where ${\mathcal{F}}_{1}^{i}$ is the sigma-algebra generated by the projections $\pi_{1},\ldots,\pi_{i}$ , $\pi_{k}(\omega)=\omega_{k}$ , and ${\mathcal{F}}_{i+n}^{\infty}$ is generated by $\pi_{i+n},\pi_{i+n+1},\dots$ .

We set

[TABLE]

where $f:[0,1]\to{\mathbb{R}}^{d}$ is a bounded observable with $d\geq 1$ , and $\varphi(n,\omega)=T_{\omega_{n}}\circ\cdots\circ T_{\omega_{1}}$ . That is, we take

[TABLE]

as the normalizing matrix.

Theorem 4.5.

Suppose that $\beta_{*}<1/3$ , that $f:[0,1]\to{\mathbb{R}}^{d}$ is Lipschitz continuous, and that $\mu$ is a measure whose density belongs to ${\mathcal{C}}_{*}(\beta_{*})$ . Assume conditions (RDS). Then,

[TABLE]

is well-defined and positive semi-definite. Moreover, $\Sigma$ is positive definite if and only if

[TABLE]

holds for all $v\neq 0$ . Fix an arbitrarily small $\delta>0$ . If $\Sigma$ is positive definite, then there is $\Omega^{*}\subset\Omega$ with ${\mathbb{P}}(\Omega^{*})=1$ such that for any three times differentiable function $h:\,{\mathbb{R}}^{d}\to{\mathbb{R}}$ with $\max_{1\leq k\leq 3}\|D^{k}h\|_{\infty}<\infty$ , any $\omega\in\Omega^{*}$ , and any $N\geq 2$ ,

[TABLE]

where $C_{*}=C_{*}((T_{\omega})_{\omega\in\Omega},d,f)>0$ and

[TABLE]

Remark 4.6.

Nicol–Török–Vaienti [43], Su [55], and Nicol–Pereira–Török [42] proved CLTs without rates of convergence for random dynamical systems of intermittent maps with parameters $\omega_{i}\leq\beta_{*}<1/2$ . Theorem 4.5 gives a better rate of convergence than the following upper bound established in [29] for univariate $f:[0,1]\to{\mathbb{R}}$ :

[TABLE]

Here $Z\sim{\mathcal{N}}(0,1)$ and $\sigma^{2}=\Sigma$ . The proof below can be modified to obtain the upper bound $d_{\mathscr{W}}(W,\sigma Z)\leq C_{*}\theta(N)$ for univariate $f:[0,1]\to{\mathbb{R}}$ .

Proof.

Given any vector $v\in{\mathbb{R}}^{d}$ denote

[TABLE]

where $\Sigma_{N}=\text{Cov}_{\mu}(W\otimes W)$ . In other words, $\ell_{n}(v)$ is the variance of

[TABLE]

Let $v\in{\mathbb{R}}^{d}$ . By Theorem 2.6 in [29], $\lim_{n\to\infty}\ell_{n}(v)=v^{T}\Sigma v$ exists and $v^{T}\Sigma v>0$ if and only if

[TABLE]

Hence (27) is equivalent to the positive definiteness of $\Sigma$ . The proof of Theorem 2.6 in [29] also shows that, for almost every $\omega\in\Omega$ ,

[TABLE]

Hence, by Lemma 4.4 in [31], for almost every $\omega\in\Omega$ ,

[TABLE]

From now on we assume that $\Sigma$ is positive definite. We split $|\mu[h(W)]-\Phi_{\Sigma}(h)|$ into two terms:

[TABLE]

and

[TABLE]

It follows by (29) that there is $N_{0}$ such that $\Sigma_{N}$ is positive definite for $N\geq N_{0}$ and a.e. $\omega\in\Omega$ . Then, for a.e. $\omega\in\Omega$ and all $N\geq N_{0}$ , the upper bound

[TABLE]

holds for some $C=C(\beta_{*},d,\|f\|_{\text{Lip}})>0$ . The proof for (32) is almost verbatim the same as the proof for Theorem 4.1: Theorem 2.1 is applied with $b=\sqrt{N}I_{d\times d}$ after verifying conditions (A1)-(A3) using Theorem 1.1 in [37]. We will not repeat the argument here.

Finally, it is easy to show that, for some absolute constant $C>0$ ,

[TABLE]

Hence, for $N\geq N_{0}$ and a.e. $\omega\in\Omega$ (see [49] for the first inequality),

[TABLE]

The obtained upper bound combined with (29) finishes the proof for Theorem 4.5.

∎

5. Proofs for main results

5.1. On the regularity of solutions to Stein equation

Let the matrix $\Sigma\in{\mathbb{R}}^{d\times d}$ be symmetric and positive definite. Denote respectively by $\phi_{\Sigma}$ and $\Phi_{\Sigma}$ the density and expected value of the $d$ -dimensional normal distribution with mean $0$ and covariance matrix $\Sigma$ . Given a test function $h:{\mathbb{R}}^{d}\to{\mathbb{R}}$ , define

[TABLE]

Then, we have the following result for smooth test functions $h$ ; see [5, 21, 18, 17]:

Lemma 5.1.

Let $h:\,{\mathbb{R}}^{d}\to{\mathbb{R}}$ be three times differentiable with $\|D^{k}h\|_{\infty}<\infty$ for $1\leq k\leq 3$ . Then, $A\in C^{3}({\mathbb{R}}^{d},{\mathbb{R}})$ , and $A$ solves the Stein equation (2). Moreover, the partial derivatives of $A$ satisfy the bounds

[TABLE]

whenever $t_{1}+\cdots+t_{d}=k$ , $1\leq k\leq 3$ .

Note that the bounds on the partial derivatives of $A$ are independent of the covariance matrix $\Sigma$ .

Recently Gallouët–Mijoule–Swan [16] obtained notable improvements on the regularity of solutions to Stein’s equation in the case $\Sigma=I_{d\times d}$ , for test functions $h$ that are assumed to be Hölder continuous:

Lemma 5.2 (See Proposition 2.2 in [16] ).

Set $\Sigma=I_{d\times d}$ and let $h:{\mathbb{R}}^{d}\to{\mathbb{R}}$ be $\eta$ -Hölder continuous with some $\eta\in(0,1]$ . Then the function $A:{\mathbb{R}}^{d}\to{\mathbb{R}}$ defined by (33) solves the Stein equation (2). Moreover, $A\in C^{2}({\mathbb{R}}^{d},{\mathbb{R}})$ and its second derivative satisfies the following bound:

[TABLE]

where

[TABLE]

and

[TABLE]

We will apply the result with $\eta=1$ . In this case the result is known to be optimal in terms of regularity of $D^{2}A$ . More precisely it was shown in [16] that, when $d=2$ and $h(x,y)=\max\{0,\min\{x,y\}\}$ (an example considered first by Raič in [47]),

[TABLE]

5.2. Sunklodas’ decomposition

Set $Y^{i}=b^{-1}\bar{f}^{i}$ so that

[TABLE]

Next, we define punctured modifications of the sum $W$ , namely

[TABLE]

where

[TABLE]

Moreover, set

[TABLE]

Note that

[TABLE]

as well as

[TABLE]

for any $n$ and $m$ .

The proofs for the main results are based on the following decomposition, which is a multivariate version of Proposition 4 in [56] due to Sunklodas.

Proposition 5.3.

Suppose $A\in C^{2}({\mathbb{R}}^{d},{\mathbb{R}})$ . Denote

[TABLE]

and

[TABLE]

Then

[TABLE]

where

[TABLE]

Proof.

For any $n$ , by (35),

[TABLE]

By (36),

[TABLE]

Since $\mu[(Y^{n})^{T}\nabla A(0)]=0$ , it follows by the above identities that

[TABLE]

Note that

[TABLE]

so what remains of $\mu[\textnormal{tr}\Sigma D^{2}A(W)-W^{T}\nabla A(W)]$ after subtracting $E_{1}$ and $E_{2}$ is

[TABLE]

where

[TABLE]

Next note that

[TABLE]

Since $\mu[(Y^{n})^{T}\overline{D^{2}A(0)}Y^{n,m}]=0$ , this yields

[TABLE]

Finally, since

[TABLE]

we have

[TABLE]

This completes the proof for Proposition 5.3. ∎

5.3. Proof for Theorem 2.1

We gather in the following lemma some useful basic inequalities involving the spectral norm.

Lemma 5.4.

For all $A,B\in{\mathbb{R}}^{d\times d}$ , $x\in{\mathbb{R}}^{d}$ , and $\alpha,\beta\in\{1,\ldots,d\}$ :

(i)

$\|Ax\|\leq\|A\|_{s}\|x\|$ ;

(ii)

$\|AB\|_{s}\leq\|A\|_{s}\|B\|_{s}$ ;

(iii)

$|A_{\alpha\beta}|\leq\|A\|_{s}\leq\left(\max_{1\leq j\leq d}\sum_{i=1}^{d}|A_{ij}|\right)^{\frac{1}{2}}\left(\max_{1\leq i\leq d}\sum_{j=1}^{d}|A_{ij}|\right)^{\frac{1}{2}}$ ;

(iv)

$|\textnormal{tr}A|\leq d\|A\|_{s}$ ;

(v)

$\|A\|_{s}=\sqrt{\lambda_{\max}(A^{T}A)}\leq\sqrt{\textnormal{tr}\,A^{T}A}$ , where $\lambda_{\max}(A^{T}A)$ denotes the largest eigenvalue of the positive-semidefinite matrix $A^{T}A$ .
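Items (iii)-(v) of the lemma are easy to sanity-check numerically. The following sketch does so for $2\times 2$ matrices only, where the largest eigenvalue of $A^{T}A$ has a closed form; the helper names `spectral_norm_2x2` and `check_lemma_5_4` are hypothetical and chosen for illustration.

```python
import math

def spectral_norm_2x2(A):
    """Spectral norm of a real 2x2 matrix via the largest eigenvalue of A^T A."""
    (a, b), (c, d) = A
    # Entries of the symmetric matrix A^T A = [[p, q], [q, r]].
    p = a * a + c * c
    q = a * b + c * d
    r = b * b + d * d
    trace, det = p + r, p * r - q * q
    lam_max = (trace + math.sqrt(max(trace * trace - 4.0 * det, 0.0))) / 2.0
    return math.sqrt(lam_max)

def check_lemma_5_4(A, tol=1e-12):
    """Return booleans for items (iii), (iv), (v) evaluated on a 2x2 matrix A."""
    dim = 2
    s = spectral_norm_2x2(A)
    max_entry = max(abs(x) for row in A for x in row)
    max_col = max(sum(abs(A[i][j]) for i in range(dim)) for j in range(dim))
    max_row = max(sum(abs(A[i][j]) for j in range(dim)) for i in range(dim))
    frobenius = math.sqrt(sum(x * x for row in A for x in row))
    item_iii = max_entry <= s + tol and s <= math.sqrt(max_col * max_row) + tol
    item_iv = abs(A[0][0] + A[1][1]) <= dim * s + tol
    item_v = s <= frobenius + tol
    return item_iii, item_iv, item_v

result = check_lemma_5_4([[1.0, 2.0], [3.0, 4.0]])
```

For the matrix with rows $(1,2)$ and $(3,4)$ the spectral norm is approximately $5.465$ , which lies between the largest entry $4$ and the bound $\sqrt{6\cdot 7}\approx 6.48$ of item (iii).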

Lemma 5.5.

Let $h:\,{\mathbb{R}}^{d}\to{\mathbb{R}}$ be three times differentiable with $\|D^{k}h\|_{\infty}<\infty$ for $1\leq k\leq 3$ and let $A$ be the function (33) that solves Stein’s equation. Define $\delta^{n,k}(u)$ as in Proposition 5.3. Then, conditions (A2) and (A3) imply that, for all $0\leq n,m\leq N-1$ , the following two conditions hold:

(A2’)

Whenever $u\in[0,1]$ and $m\leq k\leq N-1$ ,

[TABLE]

(A3’)

Whenever $2m\leq k\leq N-1$ ,

[TABLE]

Proof.

We denote

[TABLE]

where $(s,t,z)\in[0,1]^{2}\times{\mathbb{R}}^{d}$ . Then,

[TABLE]

which together with Lemma 5.4 implies

[TABLE]

Hence,

[TABLE]

Similarly we see that, for all $1\leq i\leq 2d$ ,

[TABLE]

(A2’): Suppose that $m\leq k\leq N-1$ . Recall from Lemma 5.1 that

[TABLE]

solves the Stein equation (2). Since $h$ is three times differentiable with $\|D^{k}h\|_{\infty}<\infty$ for $1\leq k\leq 3$ , we can use dominated convergence to compute

[TABLE]

Recall that, for a function $F:{\mathbb{R}}^{d}\times B_{d}(0,4M+1)\to{\mathbb{R}}^{d\times d}$ , we denote

[TABLE]

and

[TABLE]

By Fubini’s theorem,

[TABLE]

so that an application of condition (A2) combined with (38) and (39) yields

[TABLE]

which proves condition (A2’). The proof for condition (A3’) is essentially the same which is why we omit it. ∎

We now proceed to show Theorem 2.1. Combining Lemma 5.1 with Proposition 5.3 yields

[TABLE]

where $A$ is given by (33) and $E_{i}$ are as in Proposition 5.3. We bound each term $E_{i}$ separately, using conditions (A1), (A2’) and (A3’).

By condition (A2’),

[TABLE]

Moreover,

[TABLE]

where (37) was used in the third inequality.

For $E_{3}$ first note that

[TABLE]

so that Lemma 5.4 and condition (A1) can be used to obtain

[TABLE]

Combining (40) with an application of condition (A2’) yields

[TABLE]

Condition (A3’) is used to bound $E_{4}$ and $E_{5}$ :

[TABLE]

and

[TABLE]

Again by (40),

[TABLE]

Finally,

[TABLE]

Gathering the foregoing upper bounds we obtain

[TABLE]

The proof for Theorem 2.1 is complete.

5.4. Proof for Theorem 2.3

Since the proof for Theorem 2.3 is very similar to the proof for Theorem 2.1, we omit most of the details and only give an outline, emphasizing differences between the two proofs.

Now $b^{2}=\text{Var}_{\mu}(\sum_{i<N}\bar{f}^{i})>0$ so that $\text{Var}_{\mu}(W)=\mu(W^{2})=1$ . Then the univariate Stein equation is defined by

[TABLE]

where $w\in{\mathbb{R}}$ . Note that the order of (41) is one smaller than the order of the multivariate Stein equation (2). We have the following result regarding the regularity of $A$ :

Lemma 5.6 (See [7]).

Whenever $h:{\mathbb{R}}\to{\mathbb{R}}$ is Lipschitz continuous with $\textnormal{Lip}(h)\leq 1$ the solution $A:{\mathbb{R}}\to{\mathbb{R}}$ to (41) belongs to the class $\mathscr{F}_{1}$ consisting of all differentiable functions with an absolutely continuous derivative, satisfying the bounds

[TABLE]

The lemma implies that

[TABLE]

where $Z\sim{\mathcal{N}}(0,1)$ . Next $\mu[A^{\prime}(W)-WA(W)]$ is decomposed precisely as in Proposition 4 of [56]. The decomposition is the same as that given in Proposition 5.3 except that $\delta^{n,m}(u)$ there is replaced with

[TABLE]

Then

[TABLE]

where

[TABLE]

By Lemma 5.6

[TABLE]

and

[TABLE]

Hence, conditions (B2) and (B3) can be applied with $G_{u}\upharpoonright({\mathbb{R}}\times B_{1}(0,4M+1))$ as in the proof for Theorem 2.1. Using also condition (B1) we obtain bounds on each of the terms $E_{i}$ appearing in the univariate version of Proposition 5.3, which then lead to the upper bound (5).
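The expression $\mu[A^{\prime}(W)-WA(W)]$ measures how far $W$ is from normal because the corresponding expectation vanishes for a standard normal variable: ${\mathbb{E}}[A^{\prime}(Z)-ZA(Z)]=0$ for $Z\sim{\mathcal{N}}(0,1)$ and sufficiently regular $A$ . The sketch below checks this identity numerically with a trapezoidal rule; the test function $A=\sin$ , the truncation to $[-10,10]$ , and the helper names are illustrative assumptions.

```python
import math

def gaussian_expectation(g, half_width=10.0, steps=20000):
    """Approximate E[g(Z)], Z ~ N(0,1), by the composite trapezoidal rule
    on the truncated interval [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        w = -half_width + i * h
        weight = 0.5 if i in (0, steps) else 1.0  # endpoint weights of the rule
        total += weight * g(w) * math.exp(-0.5 * w * w) / math.sqrt(2.0 * math.pi)
    return total * h

def stein_operator(A, dA):
    """Return the function w -> A'(w) - w * A(w)."""
    return lambda w: dA(w) - w * A(w)

# Illustrative choice A = sin with derivative A' = cos.
residual = gaussian_expectation(stein_operator(math.sin, math.cos))
```

For this choice both ${\mathbb{E}}[\cos Z]$ and ${\mathbb{E}}[Z\sin Z]$ equal $e^{-1/2}$ , so the residual vanishes up to quadrature error.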

5.5. Proof for Theorem 2.6

From now on we assume that $\operatorname{Cov}_{\mu}\left(\sum_{i=0}^{N-1}\bar{f}^{i}\right)$ is positive definite and take

[TABLE]

in which case $\Sigma=\mu(W\otimes W)=I_{d\times d}$ . By Lemma 5.4,

[TABLE]

where we recall that $\lambda_{\min}$ is the least eigenvalue of $\operatorname{Cov}_{\mu}(\sum_{i=0}^{N-1}\bar{f}^{i})$ .

By Lemma 5.2,

[TABLE]

where $Z\sim{\mathcal{N}}(0,I_{d\times d})$ and ${\mathcal{A}}$ denotes the class of all $C^{2}$ functions satisfying (34). The proof then proceeds as follows. First we decompose $\mu[\operatorname{tr}D^{2}A(W)-W^{T}\nabla A(W)]=\sum_{i=1}^{7}E_{i}$ using Proposition 5.3, which reduces the proof to bounding each term $E_{i}$ for functions $A\in{\mathcal{A}}$ . For example, to obtain an upper bound on $E_{1}$ we have to control the integral

[TABLE]

where we recall that $\delta^{n,m}(u)=D^{2}A(W^{n,m}+u\,Y^{n,m})-D^{2}A(W^{n,m})$ . To this end we will describe a class ${\mathcal{G}}$ of regular functions $G:{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d\times d}$ such that

[TABLE]

The integral on the right is bounded by condition (C2), provided that $G$ is a $C^{1}$ -function. This might not be the case, since functions in ${\mathcal{G}}$ will have the same regularity as the second derivatives of functions in ${\mathcal{A}}$ , which according to Lemma 5.2 is Lipschitz up to a logarithmic factor. But we can approximate such functions by $C^{\infty}$ -functions, which in combination with condition (C2) then leads to an upper bound on (42) and consequently on $E_{1}$ . The other terms $E_{i}$ will be treated similarly. We now proceed to detail the foregoing argument.

We denote by ${\mathcal{G}}$ the collection of all functions $G:{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d\times d}$ that satisfy the following upper bounds:

[TABLE]

where $K=2C_{\#}+4\sqrt{d}M+2$ and $C_{\#}$ is the constant from Lemma 5.2 with $\eta=1$.

Lemma 5.7.

Assume $\lambda_{\min}>1$ . Then, given any $A\in{\mathcal{A}}$ and $0\leq n,m\leq N-1$ , there is a function $G_{u}:{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}^{d\times d}$ satisfying

[TABLE]

where $\delta^{n,m}(u)$ is defined as in Proposition 5.3, such that

[TABLE]

Proof.

It is easy to see that (43) holds with $G_{u}(x,y)$ defined as

[TABLE]

We show that $G\in{\mathcal{G}}$ and leave the similar verification of $G_{1}\in{\mathcal{G}}$ to the reader.

Observe that, by Lemma 5.4 and (34),

[TABLE]

holds for all $x\in{\mathbb{R}}^{d}$ , $y\in{\mathbb{R}}^{d}\setminus\{0\}$ , and $u\in(0,1]$ . Then assume (as we may) that $y\neq 0$ . We use (44) and $\|b^{-1}\|_{s}=\lambda_{\min}^{-1/2}<1$ to obtain

[TABLE]

Since $1\leq\|b\|_{s}\|b^{-1}\|_{s}$ ,

[TABLE]

where we used Lemma 5.4. Hence,

[TABLE]

Next let $a,a^{\prime},y\in{\mathbb{R}}^{d}$ . Then,

[TABLE]

where (44) was used in the second inequality, and (45) in the third inequality.

Finally, for all $a,a^{\prime},x\in{\mathbb{R}}^{d}$ ,

[TABLE]

where (45) was used in the second-to-last inequality. This completes the proof for $G\in{\mathcal{G}}$ . ∎

The following lemma is established by a standard approximation argument. See Appendix A for the proof.

Lemma 5.8.

Conditions (C2) and (C3) imply that, for all $0\leq n,m\leq N-1$ , the following two conditions hold:

(C2’)

Whenever $m\leq k\leq N-1$ and $G\in{\mathcal{G}}$ ,

[TABLE]

where

[TABLE]

(C3’)

Whenever $2m\leq k\leq N-1$ and $G\in{\mathcal{G}}$ ,

[TABLE]

where

[TABLE]

We proceed to bound the terms $E_{i}$ in Proposition 5.3 using conditions (C1), (C2’) and (C3’). Let $G_{u}$ be a function as in Lemma 5.7 and set $G=\int_{0}^{1}G_{u}\,du$ . Then for $E_{1}$ we have by condition (C2’) the upper bound

[TABLE]

Since $G\in{\mathcal{G}}$ ,

[TABLE]

For $E_{3}$ we note that

[TABLE]

Hence, by Lemma 5.4 and condition (C1),

[TABLE]

Combining (5.5) with condition (C2’) implies the upper bound

[TABLE]

Next condition (C3’) is used to bound $E_{4}$ and $E_{5}$ :

[TABLE]

and

[TABLE]

Again by (5.5),

[TABLE]

Finally,

[TABLE]

Recall that, by Lemma 5.2,

[TABLE]

Hence, Proposition 5.3 together with the above bounds implies

[TABLE]

The proof for Theorem 2.6 is complete.

Appendix A Proof for Lemma 5.8

Let us define the mollifier $\eta:{\mathbb{R}}^{d}\to{\mathbb{R}}$ by $\eta(x)=c\varphi(1-\|x\|^{2})$ where

[TABLE]

and $c>0$ is such that $\int_{{\mathbb{R}}^{d}}\eta(x)\,dx=1$ . Then

[TABLE]

Let $G\in{\mathcal{G}}$ . We approximate the components of $G$ by convolutions $G^{\varepsilon}_{\alpha,\beta}:{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}\to{\mathbb{R}}$ ,

[TABLE]

where $x=(x_{1},x_{2})\in{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}$ , $j_{\varepsilon}(x)=\varepsilon^{-2d}j(x/\varepsilon)$ , and $j(x)=\eta(x_{1})\eta(x_{2})$ .
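The smoothing step can be illustrated in one dimension. The sketch below assumes the standard bump profile $\varphi(t)=e^{-1/t}$ for $t>0$ and $\varphi(t)=0$ otherwise (the display defining $\varphi$ is not reproduced here, so this choice is an assumption), replaces the integral by a Riemann sum, and lets numerical normalization of the weights play the role of the constant $c$. It mollifies the Lipschitz function $g(x)=|x|$, which is not $C^{1}$ at the origin, and records the uniform distance between $g$ and its mollification.

```python
import numpy as np

def phi(t):
    """Assumed bump profile: exp(-1/t) for t > 0 and 0 otherwise."""
    t = np.asarray(t, dtype=float)
    out = np.zeros_like(t)
    pos = t > 0
    out[pos] = np.exp(-1.0 / t[pos])
    return out

def mollify(g, x, eps, n_nodes=2001):
    """Approximate the convolution (g * eta_eps)(x) in one dimension by a Riemann sum."""
    y = np.linspace(-eps, eps, n_nodes)       # support of eta_eps
    weights = phi(1.0 - (y / eps) ** 2)       # unnormalized values of eta(y / eps)
    weights /= weights.sum()                  # normalization, in place of the constant c
    return np.array([np.sum(weights * g(xi - y)) for xi in np.atleast_1d(x)])

eps = 0.1
x = np.linspace(-1.0, 1.0, 401)
g = np.abs                                    # Lipschitz constant 1, not C^1 at 0

g_eps = mollify(g, x, eps)
sup_error = float(np.max(np.abs(g_eps - g(x))))
```

Since the weights are nonnegative, sum to one, and are supported in $[-\varepsilon,\varepsilon]$, the uniform error is at most $\varepsilon$ times the Lipschitz constant of $g$. This is the one-dimensional analogue of the trade-off behind the choice of $\varepsilon$ at the end of this appendix: a smaller $\varepsilon$ gives a better approximation but larger derivatives of the mollified function.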

For all $\alpha,\beta\in\{1,\ldots,d\}$ , $x=(x_{1},x_{2})\in{\mathbb{R}}^{d}\times{\mathbb{R}}^{d}$ , and $\varepsilon\in(0,1)$ :

[TABLE]

Lemma 5.4 was used in the second inequality and $G\in{\mathcal{G}}$ in the third inequality. It follows by Lemma 5.4 that

[TABLE]

Since $G\in{\mathcal{G}}$ ,

[TABLE]

so that Lemma 5.4 implies

[TABLE]

Since

[TABLE]

we have

[TABLE]

An easy computation shows that $|\partial_{i}j(x)|\leq 12c^{2}$ . Using this, $G\in{\mathcal{G}}$ , and (47) we obtain for all $\varepsilon\in(0,1)$ the upper bound

[TABLE]

Hence, by Lemma 5.4,

[TABLE]

where $\partial_{i}G^{\varepsilon}(x)=[\partial_{i}G^{\varepsilon}_{\alpha,\beta}(x)]_{\alpha,\beta}$ .

We combine (48)-(50) with condition (C2) to obtain

[TABLE]

Choosing $\varepsilon=\tfrac{1}{2}\tfrac{\rho(m)}{\rho(0)}<1$ implies condition (C2’). The proof for condition (C3’) is omitted as it is almost verbatim the same.

Bibliography

[1] Romain Aimino, Huyi Hu, Matthew Nicol, Andrei Török, and Sandro Vaienti. Polynomial loss of memory for maps of the interval with a neutral fixed point. Discrete Contin. Dyn. Syst., 35(3):793–806, 2015. URL: http://dx.doi.org/10.3934/dcds.2015.35.793.
[2] Romain Aimino and Jérôme Rousseau. Concentration inequalities for sequential dynamical systems of the unit interval. Ergodic Theory Dynam. Systems, 36(8):2384–2407, 2016. URL: http://dx.doi.org/10.1017/etds.2015.19.
[3] V. I. Bakhtin. Random processes generated by a hyperbolic sequence of mappings. I. Izv. Ross. Akad. Nauk Ser. Mat., 58(2):40–72, 1994. URL: https://doi.org/10.1070/IM1995v044n02ABEH001596.
[4] V. I. Bakhtin. Random processes generated by a hyperbolic sequence of mappings. II. Izv. Ross. Akad. Nauk Ser. Mat., 58(3):184–195, 1994. doi:10.1070/IM1995v044n03ABEH001616.
[5] Andrew Barbour. Stein’s method for diffusion approximations. Probab. Theory Related Fields, 84(3):297–322, 1990. doi:10.1007/BF01197887.
[6] A. Castro, F.B. Rodrigues, and P. Varandas. Stability and limit theorems for sequences of uniformly hyperbolic dynamics. 2017. Preprint. arXiv:1709.01652.
[7] Louis H. Y. Chen, Larry Goldstein, and Qi-Man Shao. Normal approximation by Stein’s method. Probability and its Applications (New York). Springer, Heidelberg, 2011. doi:10.1007/978-3-642-15007-4.
[8] Jean-Pierre Conze and Albert Raugi. Limit theorems for sequential expanding dynamical systems on $[0,1]$. In Ergodic theory and related fields, volume 430 of Contemp. Math., pages 89–121. Amer. Math. Soc., Providence, RI, 2007. doi:10.1090/conm/430/08253.
