Sunklodas' approach to normal approximation for time-dependent dynamical systems
Juho Lepp\"anen, Mikko Stenlund

TL;DR
This paper develops a normal approximation method for sums in time-dependent dynamical systems, providing explicit error bounds and applying Stein's method to systems like expanding maps and intermittent systems.
Contribution
It introduces a new normal approximation approach for time-dependent systems with explicit error rates and constants, extending Stein's method to this context.
Findings
Error in approximation decays at rates $O(N^{-1/2})$ or $O(N^{-1/2} \log N)$
Conditions depend on the normalizing sequence $b(N)$ and metric used
Applications include expanding maps and intermittent systems
Abstract
We consider time-dependent dynamical systems arising as sequential compositions of self-maps of a probability space. We establish conditions under which the Birkhoff sums for multivariate observations, given a centering and a general normalizing sequence of invertible square matrices, are approximated by a normal distribution with respect to a metric of regular test functions. Depending on the metric and the normalizing sequence , the conditions imply that the error in the approximation decays either at the rate or the rate , under the additional assumption that . The error comes with a multiplicative constant whose exact value can be computed directly from the conditions. The proof is based on an observation due to Sunklodas regarding Stein's method of normal approximation. We give applications to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Sunklodas’ approach to normal approximation for time-dependent dynamical systems
Juho Leppänen
LPSM, Laboratoire de Probabilités, Statistique et Modélisation, Sorbonne Université, 4 Place Jussieu, 75252 Paris, France
and
Mikko Stenlund
Department of Mathematics and Statistics, P.O. Box 68, Fin-00014 University of Helsinki, Finland.
[email protected] http://www.helsinki.fi/ stenlund/
Abstract.
We consider time-dependent dynamical systems arising as sequential compositions of self-maps of a probability space. We establish conditions under which the Birkhoff sums for multivariate observations, given a centering and a general normalizing sequence of invertible square matrices, are approximated by a normal distribution with respect to a metric of regular test functions. Depending on the metric and the normalizing sequence , the conditions imply that the error in the approximation decays either at the rate or the rate , under the additional assumption that . The error comes with a multiplicative constant whose exact value can be computed directly from the conditions. The proof is based on an observation due to Sunklodas regarding Stein’s method of normal approximation. We give applications to one-dimensional random piecewise expanding maps and to sequential, random, and quasistatic intermittent systems.
Key words and phrases:
Stein’s method, multivariate normal approximation, time-dependent dynamical system, intermittency
2010 Mathematics Subject Classification. 60F05; 37A05, 37A50, 37C60
Acknowledgements
JL was supported by DOMAST (University of Helsinki) and by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 787304). He would like to thank Viviane Baladi for helpful discussions. MS was supported by Emil Aaltosen Säätiö, and the Jane and Aatos Erkko Foundation
1. Introduction
In this note we revisit the topic of statistical limit laws by Stein’s method for dynamical systems, studied previously in [30, 28, 29, 35, 19, 10, 46, 20, 24, 26]. We consider discrete time-dependent dynamical systems described by sequential compositions , where each is a transformation of a probability space . The measure is not assumed to be invariant under any of the maps . Given a bounded obsevable and a sequence of invertible matrices, we are interested in approximating the law of the sums
[TABLE]
by a multivariate normal distribution. More precisely, we want to identify conditions that cover a wide range of chaotic time-dependent systems and imply a good upper bound on
[TABLE]
where is a class of regular test functions , and denotes the expectation of with respect to the multivariate normal distribution with covariance matrix .
Since its introduction in [50], Stein’s method has seen extensive development in the literature of probability theory. In the present context of dynamical systems, the simple basic idea of the method can be described as follows. If for each test function the solution to the differential equation (called a Stein equation)
[TABLE]
lies in another class of functions , then it follows that
[TABLE]
In this way the original problem of approximating the law of by is reduced to bounding the right hand side of (3), which interestingly only depends on the law of and the class . It was observed in [30, 28] that, when , Taylor expanding about the punctured sums
[TABLE]
with a suitably chosen leads to certain correlation-decay conditions for an upper bound on . Such an approach calls for bounds on partial derivatives of , which are known to follow from bounds on partial derivatives of . In [30, 28], was taken to be the class of three times differentiable functions with bounded derivatives in the case of a general , and the class of Lipschitz continuous functions in the case .
The approach described above was applied in [30] to stationary Sinai billiards and in [28] to time-dependent smooth uniformly expanding circle maps. Both systems are (the latter in a certain sense) exponentially mixing, which is essentially the reason why replacing with in the application of Stein’s method causes only a small error. Indeed, upper bounds of order on (1) for sufficiently regular observables could be obtained this way. While such a “fixed gap” approach works also for polynomially mixing systems, it yields a larger error depending on the rate of mixing. This can be seen from the results of [29], where time-dependent systems in the spirit of [1, 43] described by sequential compositions of polynomially mixing intermittent maps with parameters were considered. Under the condition that is positive definite, an upper bound of order was obtained for Lipschitz continuous observables. The result was used to establish central limit theorems for quasistatic and random compositions of intermittent maps.
The purpose of the present note is to describe an adaptation of Stein’s method that is more suitable than those of [30, 28] for normal approximation of polynomially mixing systems, and investigate some of its implications. The starting point is a decomposition of due to Sunklodas [56], which allows to identify correlation-decay conditions that imply a rate of decay for depending on the “growth of ”. In the case of a general such that , the conditions yield the rate for a class of smooth test functions , and in the special self-norming case the rate for Lipschitz continuous test functions. A key ingredient in the proof of the latter estimate is a recent result due to Gallouët–Mijoule–Swan [16] concerning the regularity of solutions to Stein equation. As applications we establish rates of convergence in the central limit theorem for the random piecewise expanding model studied by Dragičević et al. in [12] and for sequential, random, and quasistatic intermittent systems. The results for intermittent systems notably improve those of [29].
Statistical properties of time-dependent dynamical systems have been studied in several previous works including [54, 2, 15, 33, 57, 34, 53]. Central limit theorems were shown by Bakhtin [3, 4], Conze–Raugi [8], and more recently by Nándori–Szász–Varjú [41] and Nicol– Török–Vaienti [43]. Heinrich [27] showed a Berry-Esseen type upper bound for sequences of uniformly expanding interval maps admitting a Markov partition. Haydn–Nicol–Török–Vaienti [25] established almost sure invariance principles (ASIP) for piecewise-expanding and other related models, also in higher dimension. ASIPs were obtained also by Castro–Rodrigues–Varandas [6] for convergent sequences of Anosov diffeomorphisms and expanding maps on compact Riemannian manifolds. Recently Su [55] proved a vector valued ASIP for a general class of polynomially mixing time-dependent systems. Among its many implications is a self-norming CLT for the sequential intermittent system with , under a (polynomial) variance growth condition. Finally, Hafouta [23] showed several limit theorems, including a Berry-Esseen theorem and a local limit theorem, for sequential compositions of maps belonging to a certain class of distance expanding maps of a compact metric space.
Notation.
For a function , we write for the th derivative of , and also denote . We define
[TABLE]
The spectral norm of a matrix is denoted by
[TABLE]
where is the Euclidean norm of . We use to denote the open ball in with center and radius .
Given a measure space and a -integrable function we set . The components of are denoted by , where . The Lebesgue measure is denoted by .
For two vectors we set .
We denote by a generic positive constant whose value might change from one line to the next. We use to denote a positive constant that depends only on the parameters .
Structure of the paper
In Section 2 we present our main results concerning normal approximation of abstract discrete time-dependent dynamical systems. Sections 3 and 4 contain applications to one-dimensional dynamics. The model of Section 3 is a random dynamical system of piecewise smooth uniformly expanding maps, while in Section 4 we consider sequential, quasistatic, and random intermittent systems. Finally, in Section 5 we prove the main results.
2. Main results
Consider a sequence of measurable maps of a probability space . For each let be a bounded measurable function and define
[TABLE]
Given and an invertible matrix , we write
[TABLE]
Given also , we write
[TABLE]
The covariance matrix of is denoted by
[TABLE]
2.1. General normalization
First we consider a general invertible and give conditions that imply an upper bound on the distance between the law of and the normal distribution with respect to a smooth metric.
Suppose that for all . Then, given a smooth test function and we define the matrix-valued function by
[TABLE]
where . For a differentiable function we set
[TABLE]
and
[TABLE]
where .
Here is the first main result:
Theorem 2.1**.**
Fix and let be three times differentiable with for . Suppose , and that there exist a function with and constants , , such that the following conditions hold for all :
- (A1)
For all ,
[TABLE]
- (A2)
Whenever and ,
[TABLE]
- (A3)
Whenever and ,
[TABLE]
- (A4)
The matrix is positive definite.
Then
[TABLE]
where
[TABLE]
Here denotes the expectation of with respect to .
Remark 2.2**.**
Theorem 2.1, as well as Theorems 2.3 and 2.6 given below, continue to hold if are replaced with general random vectors.
We postpone proving Theorem 2.1 and other results in this section until Section 5. Due to the smooth metric, the constant in the upper bound (4) is independent of the covariance matrix . Note that under the additional assumptions and we obtain as , which is the optimal rate in this generality. Conditions (A1)-(A3) are designed for time-dependent systems with sufficiently good (polynomial) mixing properties. Condition (A1) requires the decay of non-stationary correlations at the rate . Condition (A2) requires that, for large , the random vectors
[TABLE]
are componentwise nearly uncorrelated. This is reasonable because the function on the right depends on with only. The function is differentiable and its norm appears as a factor in the upper bound. Condition (A3) is similar in spirit to condition (A2), for it requires
[TABLE]
to be nearly componentwise uncorrelated, which is again reasonable when .
Recall that the Wasserstein distance between two random vectors and is defined by
[TABLE]
where
[TABLE]
is the class of all -Lipschitz functions. When we obtain a result similar to Theorem 2.1 for the Wasserstein distance. The relaxed smoothness of comes with the expense that conditions (A2) and (A3) have to be verified for a whole class of regular functions.
For a function we denote
[TABLE]
Theorem 2.3**.**
Let and fix . Take . Suppose that , that , and that there exist constants , , and a function with such that the following conditions hold for all :
- (B1)
.
- (B2)
Whenever and is a bounded Lipschitz continuous function,
[TABLE]
- (B3)
Whenever and is a bounded Lipschitz continuous function,
[TABLE]
Then
[TABLE]
where
[TABLE]
and is a random variable with standard normal distribution.
The following easy observation allows for normalizing constants other than
[TABLE]
Lemma 2.4**.**
Suppose (5) of Theorem 2.3 holds. Then, for any ,
[TABLE]
Proof.
For any random variables and any , the Wasserstein metric satisfies
[TABLE]
∎
Remark 2.5**.**
There is a notable difference between the upper bounds (4) and (6): unlike (4), (6) always depends on in addition to the normalizing constant . This difference is due to the choice of metric.
2.2. Self-normalization
We now assume that is positive definite and set so that . In this case we establish an upper bound on the distance between the law of and a standard normal random vector with respect to the Wasserstein metric. Unlike Theorem 2.3, the result applies for a general . We denote by the least eigenvalue of .
Theorem 2.6**.**
Let . Suppose that where , that , and that there exist a non-increasing function with with and constants , , such that the following conditions hold for all :
- (C1)
For all ,
[TABLE]
- (C2)
Whenever and is a bounded -function with bounded gradient,
[TABLE]
- (C3)
Whenever and is a bounded -function with bounded gradient,
[TABLE]
Then
[TABLE]
where
[TABLE]
Remark 2.7**.**
If in addition and hold, then we obtain the rate , as .
2.3. Pène’s CLT for stationary dynamics
The theorems given above apply in the stationary case where preserves the measure for all . In this case the problem of normal approximation has been studied in several important articles including [22, 9, 14, 44, 13, 32], using different methods, conditions, and metrics. In the multidimensional case , Pène [45] formulated a correlation-decay condition for stationary processes, based on the inductive proof of Rio [48]. Let , where , is bounded and . In this context of measure preserving transformations, Pène’s condition can be stated as follows:
- (D)
There exist , , , and a sequence of real numbers with and , such that for any integers satisfying , for any integers with , for any , and for any bounded differentiable function with bounded gradient,
[TABLE]
Condition (D) is satisfied by chaotic dynamical systems such as Sinai billiards. It was shown in [45] that condition (D) implies the existence of the limit and, whenever is nonnull, the existence of a constant such that
[TABLE]
where is a Gaussian random variable with expectation [math] and covariance matrix .
Compared to Theorem 2.1, (7) gives an upper bound of the same order for stationary systems whose correlations decay at a rate which has a finite first moment, for test functions that are only assumed to be Lipschitz continuous. On the other hand, Theorem 2.1 is more general in that it applies for rather arbitrary matrix-valued normalizing sequences . Furthermore, the constant in (4) is more explicit than the one in (7) in terms of its dependence on , and the underlying dynamical system. The same can be said about the constant in Theorem 2.6, which gives an upper bound for the same metric as (7) but with a slightly weaker rate of convergence due to the logarithmic factor. Note that, similarly to conditions (B2)-(B3) and (C2)-(C3), condition (D) has to be verified for a whole class of regular functions .
3. Application I: Random piecewise expanding maps
In this section we apply Theorem 2.3 to estimate the rate of convergence in the quenched CLT for a class of piecewise expanding random dynamical systems. Namely we consider the setup studied by Dragičević et al. in [12]. Below we recall some definitions and results from [12] as they are necessary for understanding the application given in this section.
Set and for a function define its total variation by
[TABLE]
Moreover, define
[TABLE]
The Banach space consists of all functions with and is equipped with the norm .
Let us denote by the collection of all maps for which there exists a finite partition of into subintervals such that for every :
- (1)
extends to a map in a neighborhood of ;
- (2)
.
The map is monotonous on each element . From now on we take to be the minimal such partition and set .
Let be a probability space and let be an invertible -preserving transformation. We consider a map from into . Random compositions of maps are denoted by
[TABLE]
and
[TABLE]
where is the transfer operator associated to :
[TABLE]
Conditions (H):
- (i)
is invertible, -preserving, and ergodic.
- (ii)
The map is measurable for every measurable function such that .
- (iii)
; ; .
- (iv)
There is such that and .
- (v)
For every subinterval there is such that holds for almost every .
Remark 3.1**.**
It was shown in [12] that conditions (H) imply several nice properties for the transfer operators , including a Lasota-Yorke inequality and exponential decay in the BV-norm. The authors used such properties to establish an almost sure invariance principle.
Lemma 3.2** (See Proposition 1 in [12]).**
Assume conditions (H). Then there exists a unique measurable and non-negative function such that , and for almost every . Moreover, .
3.1. Statement of result
Let be a bounded measurable function and set
[TABLE]
where and is the function from Lemma 3.2. Set
[TABLE]
where is the square root of . We denote by the skew product , which preserves the measure on defined by
[TABLE]
Theorem 3.3**.**
Consider a family of piecewise expanding maps such that conditions (H) hold. Fix and suppose is Lipschitz continuous such that can not be written as for any . Then there is independent of such that
[TABLE]
holds for almost every . Here is a random variable with standard normal distribution.
Remark 3.4**.**
The proof of Theorem 3.3 is based on Theorem 2.3. Theorem 2.1 or 2.6 could be used instead to obtain similar central limit theorems for multivariate observables .
3.2. A functional correlation bound
Conditions (B2) and (B3) of Theorem 2.3 will be verified by applying the auxiliary result given below, which facilitates bounding integrals of the form , where is not necessarily a product of one-dimensional observables. Such functional correlation bounds were established for stationary Sinai billiards in [39] and for time-dependent intermittent maps in [36].
For a function , , and we denote
[TABLE]
where is obtained from by replacing the th coordinate with . We say that is -Hölder continuous in the coordinate if .
Proposition 3.5**.**
Let . Consider integers blocked according to a set of indices , where we assume that hold for all . Suppose is a family of maps such that conditions (H) hold, and that is a function with and
[TABLE]
Denote by the function
[TABLE]
Then, for any probability measures whose densities belong to , and for almost every ,
[TABLE]
where , and .
Remark 3.6**.**
The upper bound (9) is independent of .
The proof for Proposition 3.5 is based on two auxiliary results. The first result is an immediate consequence of Corollary 8 in [2] due to Aimino and Rousseau, who considered sequential (non-random) compositions of piecewise-expanding maps. The second result is Lemma 2 in the paper [12] by Dragičević et al.
Lemma 3.7**.**
Suppose conditions (H) hold. There is such that for almost every ,
[TABLE]
where denotes the total variation of over the subinterval .
Proof.
As is explained on p. 2252 of [12], condition (iv) implies that there exists and such that, for almost every ,
[TABLE]
holds for all and . It suffices to fix with such that (11) holds for all . Then the proof of Corollary 8 in [2] shows that (10) holds for all . ∎
Lemma 3.8** (See Lemma 2 in [12]).**
Assume conditions (H). There is and such that, for almost every ,
[TABLE]
holds for all and with .
Proof for Proposition 3.5.
The proof proceeds by induction on . First let and denote . Then the function in Proposition 3.5 becomes
[TABLE]
where . Set 111We denote by the greatest non-negative integer with .. Then,
[TABLE]
Claim. If , then almost surely
[TABLE]
where , , and . We recall that by definition .
Proof for Claim.
Since is -Hölder continuous for a.e. in the first coordinates,
[TABLE]
holds for a.e. . Consequently,
[TABLE]
For each , maps diffeomorphically onto , which implies the upper bound . That is, for a.e. ,
[TABLE]
This proves the claim. ∎
We fix a point for each . Then (12) implies for a.e. the upper bound
[TABLE]
Let denote the density of , and let denote the density of . Moreover, let be the function that satisfies . Fix . Then, for a.e. ,
[TABLE]
Let . Since , either and
[TABLE]
or . It follows easily from this and the strict monotonicity of that
[TABLE]
where
[TABLE]
We conclude that
[TABLE]
On the other hand there is such that, for any , holds for almost every . This follows from (11) together with the fact that for almost every ; see p. 2257 of [12]. In particular,
[TABLE]
Next we combine Lemma 3.8, (14) and (15) to obtain
[TABLE]
for a.e. , where . Then, by Lemma 3.7,
[TABLE]
for a.e. . Taking completes the proof for the case .
Suppose that we have shown (9) for , and fix integers as in the proposition. Recall that denotes the function
[TABLE]
From the case we know that, for a.e. ,
[TABLE]
where is the density of .
Next for each , we apply the induction hypothesis to the function
[TABLE]
This implies for a.e. the upper bound
[TABLE]
for all . Now, to complete the proof for Proposition 3.5, it suffices to combine (16) and (17).
∎
3.3. Proof for Theorem 3.3
It was shown in [12] that there exists a non-random such that
[TABLE]
for almost every . Moreover, if and only if there exists such that . Hence, under our assumption there exists and such that, for a.e. ,
[TABLE]
holds for all .
Next we show that, with as the initial measure, conditions (B1)-(B3) hold with for a.e. , where is the same as in Proposition 3.5. To this end recall that, by Lemma 3.2, the density of lies in for a.e. .
(B1): For brevity, we introduce the notation . Taking , , , and in Proposition 3.5 yields the upper bound
[TABLE]
for a.e. .
(B2): Let and let be a bounded Lipschitz continuous function. We define by the formula
[TABLE]
where and the summations are over . Then
[TABLE]
which is the integral we need to control. It is easy to verify that is Lipschitz continuous with
[TABLE]
and
[TABLE]
where is an indexing for the arguments of . Observe that, since ,
[TABLE]
where and . It follows by Proposition 3.5 applied with and that, for a.e. ,
[TABLE]
(B3): This is obtained in the same way as condition (B2). Namely, whenever , applying Proposition 3.5 with and the function
[TABLE]
where
[TABLE]
implies for a.e. the upper bound
[TABLE]
Since , Theorem 3.3 now follows by Theorem 2.3.
Remark 3.9**.**
Another example of a random dynamical system that satisfies the conditions of Theorem 2.3 is the Sinai Billiard of [51], in which a scatterer configuration on the torus is randomly updated between consecutive collisions. The key technical lemmas necessary for obtaining an analog of Proposition 3.5 were proven in [51, 53], including a statistical memory loss starting from an initial measure supported on a single homogeneous local unstable manifold (Lemma 12 of [51]), and a tail estimate on the prevalence of short local unstable manifolds (Lemma 13 of [51]). The application would imply a rate of convergence in the annealed CLT but we will not treat it here.
4. Application II: intermittent maps
Following [40] we define for each the map by
[TABLE]
Associated to each map is its transfer operator defined by
[TABLE]
We denote by the invariant absolutely continuous probability measure associated to . It follows from [40] that the density belongs to the convex cone of functions
[TABLE]
[TABLE]
and that
[TABLE]
4.1. Sequential compositions
First we consider sequential compositions
[TABLE]
of intermittent maps with parameters . The notation below is adapted from Section 2.2: is a Borel probability measure on ; is a bounded observable for all ;
[TABLE]
For a Lipschitz continuous function we set , where
[TABLE]
and
[TABLE]
Theorem 4.1**.**
Let and let be a measure whose density lies in the cone . Suppose that are Lipschitz continuous with and that . Denote by a standard normal random vector.
- (1)
If , then there is such that
[TABLE]
In particular, if , then as .
- (2)
If , then for any there is such that
[TABLE]
In particular, if , then as .
Remark 4.2**.**
A couple of remarks are in order:
- (i)
The proof is based on Theorem 2.6. In the special case let us denote and . Assuming , the sharper upper bound
[TABLE]
is obtained by applying Theorem 2.3 instead of Theorem 2.6, provided that . Consequently, by Lemma 2.4, for any ,
[TABLE]
Without any assumption on we still obtain the weaker bound
[TABLE]
This follows easily by combining (20) with the fact that, for any random variables and with finite variances and , respectively, the Wasserstein metric satisfies (see e.g. **[28]** for the last statement).
- (ii)
In the stationary case of a single intermittent map preserving the measure , a Berry-Esseen theorem for univariate Hölder continuous observables was shown by Gouëzel **[22]**. Gouëzel’s result establishes the rate with respect to the Kolmogorov metric for parameters . For parameters , Gouëzel obtains a rate depending on the behavior of around the fixed point . For multivariate Lipschitz continuous observables, the rate in the CLT with respect to the Wasserstein metric was shown for parameters in **[36]** by an application of Pène’s theorem **[45]**. The upper bound (19) can be viewed as an extension of this result for parameters . Pène’s condition (see Section 2.3) does not hold for parameters because correlations do not decay at a rate which has a finite first moment.
Proof for Theorem 4.1 .
Set for and . We show that conditions (C1)-(C3) of Theorem 2.6 hold with using Theorem 1.1 in [36].
(C1): Let and . Applying Theorem 1.1 in [36] with , , , and yields the upper bound
[TABLE]
where .
(C2): Let , , and be a bounded -function with bounded gradient. We define
[TABLE]
by the formula
[TABLE]
where and the summations are over . Then,
[TABLE]
which is the integral we need to control. It is easy to verify that
[TABLE]
and
[TABLE]
Here
[TABLE]
is defined by (8), and is an indexing for the arguments of . Theorem 1.1 in [36] together with (21) and (22) implies the upper bound
[TABLE]
(C3): This is shown in the same way as condition (C2). Namely Theorem 1.1 in [36] is applied with the function
[TABLE]
where
[TABLE]
We leave the details to the reader.
If , it follows by the foregoing that conditions (C1)-(C3) hold also with for some . In particular , so that item (1) of Theorem 4.1 follows by Theorem 2.6. If instead we obtain conditions (C1)-(C3) with for any . Then holds for arbitrarily small and item (2) of Theorem 4.1 follows again by Theorem 2.6. ∎
In the remainder of this section we look at situations where we have control on the limiting behavior of .
4.2. Quasistatic dynamics
We apply Theorem 4.1 to a model described by time-dependent (non-random) compositions of slowly transforming intermittent maps. More precisely we consider the following subclass of quasistatic dynamical systems (QDS); for background and earlier results on quasistatic systems we refer the reader to [37, 52, 38, 11, 28, 29].
Definition 4.3** (Intermittent QDS).**
Let be a triangular array of intermittent maps with parameters . If there is a piecewise continuous curve satisfying
[TABLE]
for all , we say that is an intermittent QDS.
Given an intermittent QDS , we define the functions by
[TABLE]
where
[TABLE]
, and is a bounded function. We fix an initial distribution of and for each view the as random vectors. The problem is now to approximate the law of the fluctuations
[TABLE]
by , where and .
Theorem 4.4**.**
Let be a Lipschitz continuous function and be such that its density lies in . Suppose that the limiting curve is Hölder-continuous, that for some we have
[TABLE]
and that there exists such that is not a co-boundary for in any direction222i.e. there does not exist a unit vector , a constant , and a function such that ..
- (1)
If and , then there exists such that for all and ,
[TABLE]
- (2)
If and , then for any there exists such that for all and ,
[TABLE]
Proof.
Set . By Lemma 4.4 in [29], uniformly in ,
[TABLE]
where
[TABLE]
and . By theorem 2.11 in the same article the limit covariance is positive definite for all (this is where the co-boundary condition on is needed). In particular, , where denotes the least eigenvalue of the matrix . It follows by the same argument as in p. 20 of [29] that there exists and such that holds for all and all . In other words,
[TABLE]
Next we show the wanted upper bound on by controlling separately the following three terms:
[TABLE]
[TABLE]
[TABLE]
where .
Note that
[TABLE]
It follows immediately by (23) and Theorem 4.1 that for all and ,
[TABLE]
where can be made arbitrarily small.
In the remainder of this proof we assume that and . Whenever and ,
[TABLE]
Since the density of belongs to , it follows by Lemma 3.3 in [37] that
[TABLE]
where is a constant independent of . Moreover (see Lemma 5.4),
[TABLE]
and
[TABLE]
That is,
[TABLE]
For brevity denote . Then we have for and the upper bound (see [49] for the first inequality)
[TABLE]
To bound the remaining spectral norm we fix , denote and . For a a real-valued function and integers we denote . Whenever , we can use Theorem 1.1 in [37] to find such that
[TABLE]
Hence, the upper bound follows by Lemma 5.4. We have shown that whenever and .
Finally, by (23) and Lemma 5.4,
[TABLE]
whenever and . Now to finish the proof for Theorem 4.4 it suffices to combine the foregoing upper bounds on , (25), and (26).
∎
4.3. Rate in the quenched CLT
We consider a sequence of intermittent maps with parameters drawn randomly from the probability space , where is the Borel -algebra of and . Let denote the shift .
Conditions (RDS):
- (i)
The shift preserves .
- (ii)
There is and such that, for all ,
[TABLE]
where is the sigma-algebra generated by the projections , , and is generated by .
We set
[TABLE]
where is a bounded observable with , and . That is, we take
[TABLE]
as the normalizing matrix.
Theorem 4.5**.**
Suppose that , that is Lipschitz continuous, and that is a measure whose density belongs to . Assume conditions (RDS). Then,
[TABLE]
is well-defined and positive semi-definite. Moreover, is positive definite if and only if
[TABLE]
holds for all . Fix an arbitrarily small . If is positive definite, then there is with such that for any three times differentiable function with , any , and any ,
[TABLE]
where and
[TABLE]
Remark 4.6**.**
Nicol–Török–Vaienti [43], Su [55], and Nicol–Pereira–Török [42] proved CLTs without rates of convergence for random dynamical systems of intermittent maps with parameters . Theorem 4.5 gives a better rate of convergence than the following upper bound established in [29] for univariate :
[TABLE]
Here and . The proof below can be modified to obtain the upper bound for univariate .
Proof.
Given any vector denote
[TABLE]
where . In other words, is the variance of
[TABLE]
Let . By Theorem 2.6 in [29], exists and if and only if
[TABLE]
Hence (27) is equivalent to the positive definiteness of . The proof of Theorem 2.6 in [29] also shows that, for almost every ,
[TABLE]
Hence, by Lemma 4.4 in [31], for almost every ,
[TABLE]
From now on we assume that is positive definite. We split into two terms:
[TABLE]
and
[TABLE]
It follows by (29) that there is such that is positive definite for and a.e. . Then, for all and , the upper bound
[TABLE]
holds for some . The proof for (32) is almost verbatim the same as the proof for Theorem 4.1: Theorem 2.1 is applied with after verifying conditions (A1)-(A3) using Theorem 1.1 in [37]. We will not repeat the argument here.
Finally, it is easy to show that, for some absolute constant ,
[TABLE]
Hence, for and a.e. (see [49] for the first inequality),
[TABLE]
The obtained upper bound combined with (29) finishes the proof for Theorem 4.5.
∎
5. Proofs for main results
5.1. On the regularity of solutions to Stein equation
Let the matrix be symmetric and positive definite. Denote respectively by and the density and expected value of the -dimensional normal distribution with mean [math] and covariance matrix . Given a test function , define
[TABLE]
Then, we have the following result for smooth test functions ; see [5, 21, 18, 17]:
Lemma 5.1**.**
Let be three times differentiable with for . Then, , and solves the Stein equation (2). Moreover, the partial derivatives of satisfy the bounds
[TABLE]
whenever , .
Note that the bounds on the partial derivatives of are independent of the covariance matrix .
Recently Gallouët–Mijoule–Swan [16] obtained notable improvements on the regularity of solutions to Stein’s equation in the case , for test functions that are assumed to be Hölder continuous:
Lemma 5.2** (See Proposition 2.2 in [16] ).**
Set and let be -Hölder continuous with some . Then the function defined by (33) solves the Stein equation (2). Moreover, and its second derivative satisfies the following bound:
[TABLE]
where
[TABLE]
and
[TABLE]
We will apply the result with . In this case the result is known to be optimal in terms of regularity of . More precisely it was shown in [16] that, when and (an example considered first by Raič in [47]),
[TABLE]
5.2. Sunklodas’ decomposition.
Set so that
[TABLE]
Next, we define punctured modifications of the sum , namely
[TABLE]
where
[TABLE]
Moreover, set
[TABLE]
Note that
[TABLE]
as well as
[TABLE]
for any and .
The proofs for the main results are based on the following decomposition, which is a multivariate version of Proposition 4 in [56] due to Sunklodas.
Proposition 5.3**.**
Suppose . Denote
[TABLE]
and
[TABLE]
Then
[TABLE]
where
[TABLE]
Proof.
For any , by (35),
[TABLE]
By (36),
[TABLE]
Since , it follows by the above identities that
[TABLE]
Note that
[TABLE]
so what remains of after subtracting and is
[TABLE]
where
[TABLE]
Next note that
[TABLE]
Since , this yields
[TABLE]
Finally, since
[TABLE]
we have
[TABLE]
This completes the proof for Proposition 5.3. ∎
5.3. Proof for Theorem 2.1
We gather in the following lemma some useful basic inequalities involving the spectral norm.
Lemma 5.4**.**
For all , , and :
- (i)
;
- (ii)
;
- (iii)
;
- (iv)
;
- (v)
, where denotes the largest eigenvalue of the positive-semidefinite matrix .
Lemma 5.5**.**
Let be three times differentiable with for and let be the function (33) that solves Stein’s equation. Define as in Proposition 5.3. Then, conditions (A2) and (A3) imply that, for all , the following two conditions hold:
- (A2’)
Whenever and ,
[TABLE]
- (A3’)
Whenever ,
[TABLE]
Proof.
We denote
[TABLE]
where . Then,
[TABLE]
which together with Lemma 5.4 implies
[TABLE]
Hence,
[TABLE]
Similarly we see that, for all ,
[TABLE]
For (A1’) Suppose that . Recall from Lemma 5.1 that
[TABLE]
solves the Stein equation (2). Since is three times differentiable with for , we can use dominated convergence to compute
[TABLE]
Recall that, for a function , we denote
[TABLE]
and
[TABLE]
By Fubini’s theorem,
[TABLE]
so that an application of condition (A2) combined with (38) and (39) yields
[TABLE]
which proves condition (A2’). The proof for condition (A3’) is essentially the same which is why we omit it. ∎
We now proceed to show Theorem 2.1. Combining Lemma 5.1 with Proposition 5.3 yields
[TABLE]
where is given by (33) and are as in Proposition 5.3. We bound each term separately, using conditions (A1), (A2’) and (A3’).
By condition (A2’),
[TABLE]
Moreover,
[TABLE]
where (37) was used in the third inequality.
For first note that
[TABLE]
so that Lemma 5.4 and condition (A1) can be used to obtain
[TABLE]
Combinining (40) with an application of condition (A2’) yields
[TABLE]
Condition (A3’) is used to bound and :
[TABLE]
and
[TABLE]
Again by (40),
[TABLE]
Finally,
[TABLE]
Gathering the foregoing upper bounds we obtain
[TABLE]
The proof for Theorem 2.1 is complete.
5.4. Proof for Theorem 2.3
Since the proof for Theorem 2.3 is very similar to the proof for Theorem 2.1, we omit most of the details and only give an outline, emphasizing differences between the two proofs.
Now so that . Then the univariate Stein equation is defined by
[TABLE]
where . Note that the order of (41) is one smaller than the order of the multivariate Stein equation (2). We have the following result regarding the regularity of :
Lemma 5.6** (See [7]).**
Whenever is Lipschitz continuous with the solution to (41) belongs to the class consisting of all differentiable functions with an absolutely continuous derivative, satisfying the bounds
[TABLE]
The lemma implies that
[TABLE]
where . Next is decomposed precisely as in Proposition 4 of [56]. The decomposition is the same as that given in Proposition 5.3 except that there is replaced with
[TABLE]
Then
[TABLE]
where
[TABLE]
By Lemma 5.6
[TABLE]
and
[TABLE]
Hence, conditions (B2) and (B3) can be applied with as in the proof for Theorem 2.1. Using also condition (B1) we obtain bounds to each of the terms appearing in the univariate version of Proposition 5.3, which then lead to the upper bound (5).
5.5. Proof for Theorem 2.6
From now on we assume that is positive definite and take
[TABLE]
in which case . By Lemma 5.4,
[TABLE]
where we recall that is the least eigenvalue of .
By Lemma 5.2,
[TABLE]
where and denotes the class of all functions satisfying (34). The proof then proceeds as follows. First we decompose using Proposition 5.3, which reduces the proof to bounding each term for functions . For example, to obtain an upper bound on we have to control the integral
[TABLE]
where we recall that . To this end we will describe a class of regular functions such that
[TABLE]
The integral on the right is bounded by condition (C2), provided that is a -function. This might not be the case, since functions in will have the same regularity as the second derivatives of functions in , which according to Lemma 5.2 is Lipschitz up to a logarithmic factor. But we can approximate such functions by -functions, which in combination with condition (C2) then leads to an upper bound on (42) and consequently on . The other terms will be treated similarly. We now proceed to detail the foregoing argument.
We denote by the collection of all functions that satisfy the following upper bounds:
[TABLE]
[TABLE]
[TABLE]
where and is the constant from Lemma 5.2 with .
Lemma 5.7**.**
Assume . Then, given any and , there is a function satisfying
[TABLE]
where is defined as in Proposition 5.3, such that
[TABLE]
Proof.
It is easy to see that (43) holds with defined as
[TABLE]
We show that and leave the similar verification of to the reader.
Observe that, by Lemma 5.4 and (34),
[TABLE]
holds for all , , and . Then assume (as we may) that . We use (44) and to obtain
[TABLE]
Since ,
[TABLE]
where we used Lemma 5.4. Hence,
[TABLE]
Next let . Then,
[TABLE]
where (44) was used in the second inequality, and (45) in the third inequality.
Finally, for all ,
[TABLE]
where (45) was used in the second last inequality. This completes the proof for . ∎
The following lemma is established by a standard approximation argument. See Appendix A for the proof.
Lemma 5.8**.**
Conditions (C2) and (C3) imply that, for all , the following two conditions hold:
- (C2’)
Whenever and ,
[TABLE]
where
[TABLE]
- (C3’)
Whenever and ,
[TABLE]
where
[TABLE]
We proceed to bound the terms in Proposition 5.3 using conditions (C1), (C2’) and (C3’). Let be a function as in Lemma 5.7 and set . Then for we have by condition (C2’) the upper bound
[TABLE]
Since ,
[TABLE]
For we note that
[TABLE]
Hence, by Lemma 5.4 and condition (C1),
[TABLE]
Combining (5.5) with condition (C2’) implies the upper bound
[TABLE]
Next condition (C3’) is used to bound and :
[TABLE]
and
[TABLE]
Again by (5.5),
[TABLE]
Finally,
[TABLE]
Recall that, by Lemma 5.2,
[TABLE]
Hence, Proposition 5.3 together with the above bounds implies
[TABLE]
The proof for Theorem 2.6 is complete.
Appendix A Proof for Lemma 5.8
Let us define the mollifier by where
[TABLE]
and is such that . Then
[TABLE]
Let . We approximate the components of by convolutions ,
[TABLE]
where , , and .
For all , , and :
[TABLE]
Lemma 5.4 was used in the second inequality and in the third inequality. It follows by Lemma 5.4 that
[TABLE]
Since ,
[TABLE]
so that Lemma 5.4 implies
[TABLE]
Since
[TABLE]
we have
[TABLE]
An easy computation shows that . Using this, , and (47) we obtain for all the upper bound
[TABLE]
Hence, by Lemma 5.4,
[TABLE]
where .
We combine (48)-(50) with condition (C2) to obtain
[TABLE]
Choosing implies condition (C2’). The proof for condition (C3’) is omitted as it is almost verbatim the same.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Romain Aimino, Huyi Hu, Matthew Nicol, Andrei Török, and Sandro Vaienti. Polynomial loss of memory for maps of the interval with a neutral fixed point. Discrete Contin. Dyn. Syst. , 35(3):793–806, 2015. URL: http://dx.doi.org/10.3934/dcds.2015.35.793 . · doi ↗
- 2[2] Romain Aimino and Jérôme Rousseau. Concentration inequalities for sequential dynamical systems of the unit interval. Ergodic Theory Dynam. Systems , 36(8):2384–2407, 2016. URL: http://dx.doi.org/10.1017/etds.2015.19 . · doi ↗
- 3[3] V. I. Bakhtin. Random processes generated by a hyperbolic sequence of mappings. I. Izv. Ross. Akad. Nauk Ser. Mat. , 58(2):40–72, 1994. URL: https://doi.org/10.1070/IM 1995 v 044n 02ABEH 001596 . · doi ↗
- 4[4] V. I. Bakhtin. Random processes generated by a hyperbolic sequence of mappings. II. Izv. Ross. Akad. Nauk Ser. Mat. , 58(3):184–195, 1994. doi:10.1070/IM 1995 v 044n 03ABEH 001616 . · doi ↗
- 5[5] Andrew Barbour. Stein’s method for diffusion approximations. Probab. Theory Related Fields , 84(3):297–322, 1990. doi:10.1007/BF 01197887 . · doi ↗
- 6[6] A. Castro, F.B. Rodrigues, and P. Varandas. Stability and limit theorems for sequences of uniformly hyperbolic dynamics. 2017. Preprint. ar Xiv:1709.01652 .
- 7[7] Louis H. Y. Chen, Larry Goldstein, and Qi-Man Shao. Normal approximation by Stein’s method . Probability and its Applications (New York). Springer, Heidelberg, 2011. doi:10.1007/978-3-642-15007-4 . · doi ↗
- 8[8] Jean-Pierre Conze and Albert Raugi. Limit theorems for sequential expanding dynamical systems on [ 0 , 1 ] 0 1 [0,1] . In Ergodic theory and related fields , volume 430 of Contemp. Math. , pages 89–121. Amer. Math. Soc., Providence, RI, 2007. doi:10.1090/conm/430/08253 . · doi ↗
