A note on the finite-dimensional distributions of dispersing billiard processes
Juho Lepp\"anen, Mikko Stenlund

TL;DR
This paper establishes a correlation bound for dispersing billiard processes with random initial conditions, enabling the derivation of limit theorems and aiding the study of more general functionals.
Contribution
It introduces a functional correlation bound for finite-dimensional distributions of dispersing billiards, facilitating analysis of their limit behaviors.
Findings
Correlation bound implies several limit theorems
Tool for studying general functionals of billiard processes
Enhances understanding of dispersing billiard dynamics
Abstract
In this short note we consider the finite-dimensional distributions of sets of states generated by dispersing billiards with a random initial condition. We establish a functional correlation bound on the distance between the finite-dimensional distributions and corresponding product distributions. We demonstrate the usefulness of the bound by showing that it implies several limit theorems. The purpose of this note is to provide a tool facilitating the study of more general functionals of the billiard process.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A note on the finite-dimensional distributions of dispersing billiard processes
Juho Leppänen
Department of Mathematics and Statistics, P.O. Box 68, Fin-00014 University of Helsinki, Finland.
and
Mikko Stenlund
Department of Mathematics and Statistics, P.O. Box 35, Fin-40014 University of Jyväskylä, Finland.
Abstract.
In this short note we consider the finite-dimensional distributions of sets of states generated by dispersing billiards with a random initial condition. We establish a functional correlation bound on the distance between the finite-dimensional distributions and corresponding product distributions. We demonstrate the usefulness of the bound by showing that it implies several limit theorems. The purpose of this note is to provide a tool facilitating the study of more general functionals of the billiard process.
Key words and phrases:
Dispersing billiards, finite-dimensional distributions, weak convergence, decorrelation
Acknowledgements
This work was supported by the Jane and Aatos Erkko Foundation, and by Emil Aaltosen Säätiö.
1. Introduction
In this note we revisit the two-dimensional dispersing Sinai billiard with finite horizon. To specify the model, we consider the torus with a finite collection of scatterers, i.e., closed convex sets , having boundaries with strictly positive curvatures. A particle moves linearly in the domain , with unit speed, up to elastic collisions with the boundaries of the scatterers. The scatterers are disjoint and positioned so that the free path length of the particle is bounded. Note that the precise dynamics of the system is fully determined by the geometry of the domain .
A standard discrete-time representation of the dynamics is obtained by keeping track of the collisions only, which leads to the so-called collision map as follows: Topologically, is the disjoint union of cylinders , homeomorphic to . A general point consists of a pair , where represents the position of the particle on the boundary during a collision, and represents its direction immediately after the collision relative to the normal of the boundary. Then is defined as the corresponding pair after the next collision. Since the continuous-time system is Hamiltonian, preserving phase-space volume, the collision map preserves a corresponding Borel probability measure, namely , on . Reversing the velocity of the particle, one moreover verifies that is invertible.
Given an initial state , the billiard dynamics generates the sequence of states . If is chosen at random, according to the invariant measure , the sequence is a stationary random process, which we call the billiard process. We can equally well define the two-sided billiard process , and everything below extends readily to that setup, but let us proceed with the one-sided case. Of course, knowledge of the value of for some fully determines the value of for all . Yet, the same is in general not true of, say, and where are “observables”. For instance, if and are Hölder continuous, then an exponential covariance bound
[TABLE]
holds. Above, and depend on the Hölder classes of and on system constants111In this paper system constants are quantities which only depend on the geometry of the domain .. Obviously, more general bounds exist, but (1) is sufficient for the ongoing illustrative discussion. The covariance bound (1), and other similar results, are consequences of the chaotic nature of the billiard dynamics. Colloquially, we may regard the observations as weakly dependent random variables, which in one form or another is at the heart of proving probabilistic limit results for functionals of the billiard process, say concerning the asymptotic behavior of the Birkhoff sums
[TABLE]
in the limit .
To proceed, we recall that a finite-dimensional distribution of the billiard process is the joint distribution of a subsequence corresponding to a finite index set . From here on, we will without loss of generality always assume that the indices in such an index set are in increasing order, . The probability measure on is characterized by the identity
[TABLE]
for bounded measurable functions . Of course, stationarity of the billiard process means that for all translates of , which is clearly true by the invariance of .
For example, in terms of finite-dimensional distributions, (1) reads
[TABLE]
where . In this weak sense, we may informally write
[TABLE]
when is large.
Convention. In the rest of the note we will consider unions
[TABLE]
of increasing nonempty index sets , where . We will always assume they are disjoint and ordered, in the sense that the gap between and satisfies
[TABLE]
for all . We shall henceforth write
[TABLE]
to express these conventions succinctly.
Being still informal, higher order correlation bounds indicate that, when each is large,
[TABLE]
in some weak sense. The purpose of this brief note is to make this interpretation precise. As an aside, it provides a unified perspective on several limit theorems that we treat as examples: We will obtain an estimate on the difference between and in an appropriately general sense of practical use, which is then shown to imply all the limit theorems. Let us immediately be clear that the latter limit theorems, per se, have been proved elsewhere, in the references cited (although we do obtain some minor improvements). Thus a side goal here is to shed additional light on why those theorems are true, in a mathematically rigorous way. The main result is the aforementioned estimate, which we call the “functional correlation bound”. We expect it to be of much broader use, as it is directly applicable to studying other kinds of functionals of the billiard process than the examples included here. In short, we view the functional correlation bound as a tool which helps put the vague statement in (2) onto a solid footing, in reasonable generality, so that it can be used effectively in technical proofs in the theory of dispersing billiards.
Structure of this note. In Section 2 we state two theorems on functional correlation decay. In Section 3 we give examples of using them for deducing limit theorems, and in Section 4 we prove them.
2. Results
Before stating the results, we recall a few standard facts from the theory of dispersing billiards. The reader is referred to the book [3] for more details.
In the disjoint union , the cylinders are further divided into horizontal strips , , called homogeneity strips. (Here the numbers are such that , which facilitates controlling distortions of the map within each strip.) We consider the totality of the homogeneity strips the connected components of . For a pair of points , we say that their trajectories separate when and are in different components for the first time ; this is called the future separation time, which we denote by . We define if the trajectories never separate. The past separation time is the analogous notion for the inverse map .
A local stable manifold of a point is a maximal curve such that is completely contained in a component of , for all . That is, given , there exist such that . It can be shown that almost every point has a nontrivial local stable manifold, and that the length of decreases exponentially as . Given two points , we either have (meaning ) or . Note that, in the first case, for all . The family of all local stable manifolds is uncountable, and forms a measurable partition of . Local unstable manifolds have identical properties in terms of the inverse map . In particular, they form a measurable partition of . Moreover, if , then for all .
We also recall the notion of dynamical Hölder continuity. The following definition is from [15]. It is a small variation of the one in [2], but in the current form it enjoys the property of being dynamically closed, which is used in the proofs; see Lemma 4.5.
Definition 2.1**.**
A function is dynamically Hölder continuous on local unstable manifolds with rate and constant if
[TABLE]
holds whenever and belong to the same local unstable manifold. In this case we write . Likewise, is dynamically Hölder continuous on local stable manifolds if
[TABLE]
holds whenever and belong to the same local stable manifold. In this case we write .
For instance, if is Hölder continuous with exponent and constant , then , where , and is determined by and system constants.
Finally, we introduce the class of admissible test functions :
Definition 2.2**.**
Given increasingly ordered index sets , , we say that a bounded function is -admissible, if it is separately dynamically Hölder continuous in the sense that
[TABLE]
Here is our first functional correlation bound, concerning the case :
Theorem 2.3**.**
There exist system constants and such that the following holds. Let , and let be -admissible:
[TABLE]
Then
[TABLE]
*Here is the gap between and . *
The second functional correlation bound extends the first one to . While it is entirely possible to formulate the result for general admissible test functions , the resulting bound has a cumbersome expression. For aesthetic reasons alone, we restrict to functions admissible with the same parameters, leaving generalizations to the reader. By “the same parameters” we mean that and in Definition 2.2.
Theorem 2.4**.**
Let , , and let be -admissible, with the same parameters and . Then
[TABLE]
Here is the gap between and ,
[TABLE]
and
[TABLE]
The system constants and are the same as in Theorem 2.3.
A result in the spirit of Theorem 2.4 was recently proved by Leppänen [7], for a class of one-dimensional, non-uniformly expanding dynamical systems.
In fact, the inductive proof of Theorem 2.4 shows that the special case and the general case are equivalent. This hinges on the dynamical closedness of the function classes and ; see Lemma 4.5.
At first, the theorems may seem like inconsequential extensions of correlation bounds such as the one displayed in (1). But they do allow for estimating integrals of functionals of the billiards process, , beyond the scope of simple correlation bounds. Just to give a simple example, consider a situation of the following kind:
Example 2.5**.**
Let be Lipschitz continuous with constant in each variable, and let the index sets be as above. Define the sums
[TABLE]
where the functions are bounded, and dynamically Hölder continuous with the same parameters and . Let us consider the intergral
[TABLE]
We would like to argue that, when each is large, the sums in the argument of are weakly dependent, so must be close to
[TABLE]
where the sums are literally independent due to the product measure. Theorem 2.4 helps make such an argument rigorous: Let be the function
[TABLE]
Then is -admissible: for all , and
[TABLE]
In both cases, we immediately arrive at the quantitative estimate
[TABLE]
For instance, Example 2.5 applies to “interlaced” covariances of the form
[TABLE]
where the argument of the Lipschitz function (respectively, ) involves with and odd (respectively, even). Here is even for convenience. This is so, because both terms in the difference can be compared with the integral with respect to the product measure .
In the special case of singleton index sets, , Example 2.5 yields a bound on
[TABLE]
Such bounds are relevant, e.g., for multiple recurrence problems.
3. More examples
In this section we give applications of Theorem 2.4 to limit results. We reiterate that the sole purpose of this section is to illustrate the usefulness of the theorem: it allows to check the conditions of various limit theorems with great ease. The verified conditions actually amount to very simple special cases of Theorem 2.4. We thus believe the result to be a tool of much broader use in analyzing dispersing billiard dynamics.
Below, the various constants in the results concerning billiards will be the same as in Theorem 2.4.
3.1. Multiple correlation bounds
Theorem 3.1**.**
Let and , and define
[TABLE]
Suppose and . Then
[TABLE]
for all , where
[TABLE]
It was shown in [2] that such a multiple correlation bound suffices for the central limit theorem to hold: If is bounded and , then
[TABLE]
converges weakly, as , to the normal distribution with zero mean and variance
[TABLE]
See also [10] for a closely related result. To be technically accurate, [2] dealt with a smaller class of observables, as did [10]. In [15] it was shown that, for the present classes , the multiple correlation bound is equivalent to the pair correlation bound corresponding to the special case ; and consequently that the pair correlation bound alone is enough for the central limit theorem.
Proof of Theorem 3.1.
Define and . Then
[TABLE]
where is the function
[TABLE]
Set Since and , we have
[TABLE]
Hence, is -admissible with the same parameters and . By Theorem 2.4,
[TABLE]
which implies the desired bound. ∎
3.2. Multivariate normal approximation by Stein’s method
In this section and the next, we show that Theorem 2.4 implies not only normal convergence but also estimates on the speed of convergence. In particular, we treat the case of multivariate normal distributions arising from vector-valued observables.
Let be a general transformation preserving a probability measure . We introduce the following notations: Given an observable , we write
[TABLE]
for all , denoting the coordinate functions of by , . We set
[TABLE]
for all . For , we introduce the time window
[TABLE]
around , and define
[TABLE]
for . Thus, is a modification of where the times are omitted in the sum. Finally, stands for the expectation of a function with respect to the centered multivariate normal distribution with covariance matrix . We write , , etc., for the norms of tensor fields.
The following theorem was proved in [5], where an adaptation of Stein’s method [14] to the study of dynamical systems was developed:
Theorem 3.2**.**
Let be a bounded measurable function with . Suppose is three times differentiable with for . Fix integer . Suppose there exists such that the following conditions are satisfied:
- (A1)
There exist constants and such that
[TABLE]
hold whenever ; ; and .
- (A2)
There exists a constant such that
[TABLE]
holds for all , , , and .
- (A3)
* is not a coboundary in any direction.222This is a standard condition, requiring that, given , the scalar function cannot be written in the form for any function .*
Then
[TABLE]
is a well-defined, symmetric, positive-definite, matrix; and
[TABLE]
where
[TABLE]
Returning to billiards, we now prove the following:
Theorem 3.3**.**
Assume is bounded, , and there exist constants and such that for all . Then, for all , condition (A1) is satisfied with
[TABLE]
and condition (A2) is satisfied with
[TABLE]
A similar result (with different constants) was recently proved in [5], but there a direct scheme for checking (A2) was implemented. Here we illustrate that (A1) and (A2) — as well as the bound in (4) — are immediate consequences of Theorem 2.4.
Proof of Theorem 3.3.
That (A1) holds with the given expressions of and is immediate; see Theorem 3.1. Condition (A2) follows by applying Theorem 2.4 to the function
[TABLE]
and two index sets , where either and (case ), or and (case ). The function belongs to
[TABLE]
and for other indices , the function belongs to
[TABLE]
Hence, we see that is -admissible with the same parameters
[TABLE]
By Theorem 2.4, (A2) is satisfied with the value of given. ∎
3.3. Multivariate normal approximation by Pène’s method
In [11], Pène introduced a method of multivariate normal approximation based on the work of Rio [13]; see also [9, 10] for earlier, related, results by the same author. The theorem below is a special case of Pène’s theorem applied to a map preserving a probability measure . We write
[TABLE]
Otherwise the notation is the same as in the previous section.
Theorem 3.4**.**
Let be a bounded measurable function with . Suppose that there exist , , and a sequence of non-negative real numbers such that the following conditions hold:
- (B1)
* and . *
- (B2)
For any integers satisfying ; for any integers with ; for any ; and for any bounded differentiable function with bounded gradient,
[TABLE]
Then the limit
[TABLE]
exists. If , then the sequence is bounded in . Otherwise there exists such that for any Lipschitz continuous function ,
[TABLE]
for all .
We proceed to the case of billiards:
Theorem 3.5**.**
Assume is bounded, , and there exist constants and such that for all . Then conditions (B1) and (B2) are satisfied with and
[TABLE]
The result is due to Pène [11], assuming piecewise Hölder continuous observables. The above version covers also dynamically Hölder continuous observables. But again, our intention here is to underline that the conditions of Pène’s theorem are immediate consequences of Theorem 2.4.
Proof of Theorem 3.5.
Obviously satisfies (B1). To establish condition (B2), define
[TABLE]
Then , and is -admissible, where and : For the indices , since are integers with , a simple computation shows that the function belongs to
[TABLE]
Moreover, for , the function is in
[TABLE]
Consequently, is -admissible with the same parameters
[TABLE]
Theorem 2.4 applied to and now yields the estimate
[TABLE]
Hence, (B2) holds with the value of given. ∎
3.4. Vector-valued almost sure invariance principle by Gouëzel’s method
In this section we present an application of Theorem 2.4 to multivariate almost sure limits.
The following theorem is due to Gouëzel [4].
Theorem 3.6**.**
Let be a bounded measurable function with . Given integers , , , , and vectors , set
[TABLE]
for brevity. Now, suppose there exist constants , , and such that
[TABLE]
holds for all choices of the numbers , , , , and all vectors satisfying . Then
- (1)
Equation (3) yields a well-defined, symmetric, positive-semidefinite, matrix . 2. (2)
*The matrix satisfies (5). * 3. (3)
* converges in distribution to .* 4. (4)
Given any , there exists a probability space together with two -valued processes and such that
- (a)
* and have the same distribution.* 2. (b)
The random vectors are independent. 3. (c)
Almost surely, .
Such a theorem has a multitude of interesting consequences, including the central limit theorem (CLT), weak invariance principle, almost sure CLT, law of the iterated logaritm (LIL), Strassen’s functional LIL, an upper and lower class refinement of the LIL, and an upper and lower class refinement of Chung’s LIL. We refer the reader to [17, 1, 12, 6, 8] for more details concerning the implications.
We proceed to check condition (6) in the case of billiards. This was done by direct means in [16]. To our knowledge, the resulting vector-valued almost sure invariance principle comes with the smallest error and covers the broadest class of observables to date. Here we show condition (6) to be an immediate consequence of Theorem 2.4.
Theorem 3.7**.**
Assume is bounded, , and there exist constants and such that for all . Given ,
[TABLE]
holds for all choices of the numbers , , , , and all vectors satisfying .
Proof.
Let and . Define the function
[TABLE]
Then is -admissible with the same parameters and . Indeed, for all indices ,
[TABLE]
To see this, recall that for all . Thus, if say and ,
[TABLE]
The other indices and local stable manifolds are treated similarly. Theorem 2.4 now yields
[TABLE]
Since , the proof is complete. ∎
4. Proofs of Theorems 2.3 and 2.4
We begin by recalling three facts from the theory of billiards, which are necessary for the proofs of the theorems. We refer the reader to the standard textbook [3] for more details.
Lemma 4.1**.**
The space is a standard probability space, and the family of local unstable manifolds is a measurable partition of it. Here is an uncountable index set. Thus, the measure admits a disintegration
[TABLE]
where the is a system of conditional probability measures of on , with almost surely, and is a factor probability measure on .
Lemma 4.2**.**
There exist system constants , and such that the following holds. Suppose . Then,
[TABLE]
for all and . Here stands for the length of the local unstable manifold .
Lemma 4.3**.**
There exists a system constant such that
[TABLE]
Moreover,
[TABLE]
for all .
Next, let us recall a simple lemma.
Lemma 4.4**.**
Let be a function, with arbitrary. Then the identity
[TABLE]
holds for all . Here we have denoted and , with the agreement that .
Proof.
The claim is tautological for . For , the induction step
[TABLE]
proves the lemma. ∎
The next lemma is a reflection of the fact that and are dynamically closed, as mentioned earlier; see also [15].
Lemma 4.5**.**
Let be dynamically Hölder continuous in the sense that
[TABLE]
Let and . Then
[TABLE]
for all and
[TABLE]
for all .
Proof.
Denote . Let and belong to the same local unstable manifold, . Recalling , we have , and . Thus, identity (7) yields
[TABLE]
proving the first claim. A corresponding result holds for the inverse map , which is equivalent to the second claim. ∎
We are now ready to prove the first theorem, concerning .
Proof of Theorem 2.3.
[TABLE]
Since the billiard process is stationary, the finite-dimensional distributions are the same for the translated index sets
[TABLE]
where
[TABLE]
and is a number to be determined later. For the moment it suffices to assume that , meaning .
For brevity, define
[TABLE]
Of course, we then have
[TABLE]
and
[TABLE]
Since , Lemma 4.5 implies that
[TABLE]
so
[TABLE]
Inserting this estimate into the identity above, we obtain
[TABLE]
after an application of Fubini’s theorem.
Let us denote
[TABLE]
where is the constant appearing in Lemma 4.2. Note that, by Lemma 4.3,
[TABLE]
Since , Lemma 4.5 implies that
[TABLE]
Therefore, by Lemma 4.2,
[TABLE]
if . On the other hand, if , we have the trivial bound
[TABLE]
This yields
[TABLE]
Integrating the expression inside the absolute value with respect to , we obtain
[TABLE]
Hence,
[TABLE]
Recalling , Fubini’s theorem gives
[TABLE]
where we note that
[TABLE]
with . Stationarity guarantees that
[TABLE]
Inserting into the upper bound above, we see that
[TABLE]
whenever . Finally, let be the smallest integer . Then and . This yields the final estimate
[TABLE]
Defining the system constant , we arrive at the claimed bound. ∎
We proceed to the proof of the second theorem, concerning .
Proof of Theorem 2.4.
The proof is based on induction with respect to .
Case : The assumption is that is -admissible with the same parameters . Therefore, Theorem 2.3 yields
[TABLE]
Defining the system constants and , we obtain
[TABLE]
as claimed.
Case : We are now assuming that is -admissible with the same parameters . In particular, is then -admissible. Hence, the preceding case implies
[TABLE]
Suppose that
[TABLE]
for all -admissible functions with the same parameters . It now suffices to just observe that is -admissible in its first arguments. More precisely, given , the function
[TABLE]
is -admissible, bounded by . Hence,
[TABLE]
This finishes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Patrick Billingsley. Convergence of probability measures . Wiley Series in Probability and Statistics: Probability and Statistics. John Wiley & Sons Inc., New York, second edition, 1999. A Wiley-Interscience Publication. doi:10.1002/9780470316962 . · doi ↗
- 2[2] Nikolai Chernov. Advanced statistical properties of dispersing billiards. J. Stat. Phys. , 122(6):1061–1094, 2006. doi:10.1007/s 10955-006-9036-8 . · doi ↗
- 3[3] Nikolai Chernov and Roberto Markarian. Chaotic billiards , volume 127 of Mathematical Surveys and Monographs . American Mathematical Society, Providence, RI, 2006.
- 4[4] Sébastien Gouëzel. Almost sure invariance principle for dynamical systems by spectral methods. Ann. Probab. , 38(4):1639–1671, 2010. doi:10.1214/10-AOP 525 . · doi ↗
- 5[5] Olli Hella, Juho Leppänen, and Mikko Stenlund. Stein’s method for dynamical systems. 2016. Preprint. Available from: https://arxiv.org/abs/1701.02966 .
- 6[6] Michael T. Lacey and Walter Philipp. A note on the almost sure central limit theorem. Statist. Probab. Lett. , 9(3):201–205, 1990. doi:10.1016/0167-7152(90)90056-D . · doi ↗
- 7[7] Juho Leppänen. Functional correlation decay and multivariate normal approximation for non-uniformly expanding maps. 2017. Preprint. Available from: https://arxiv.org/abs/1702.00699 .
- 8[8] Ian Melbourne and Matthew Nicol. A vector-valued almost sure invariance principle for hyperbolic dynamical systems. Ann. Probab. , 37(2):478–505, 2009. doi:10.1214/08-AOP 410 . · doi ↗
