Approximating integrals with respect to stationary probability measures of iterated function systems
Italo Cipriano, Natalia Jurga

TL;DR
This paper introduces an efficient algorithm for approximating integrals with respect to stationary measures of iterated function systems, enabling better estimation of various mathematical properties like Hausdorff moments and Lyapunov exponents.
Contribution
The paper presents a novel algorithm for fast approximation of integrals related to stationary measures of iterated function systems under specific conditions.
Findings
Effective approximation of Hausdorff moments
Accurate estimation of Wasserstein distances
Reliable computation of Lyapunov exponents
Abstract
We study fast approximation of integrals with respect to stationary probability measures associated to iterated functions systems on the unit interval. We provide an algorithm for approximating the integrals under certain conditions on the iterated function system and on the function that is being integrated. We apply this technique to estimate Hausdorff moments, Wasserstein distances and Lyapunov exponents of stationary probability measures.
| Order | Approx. moment value () | Actual moment value | Error |
|---|---|---|---|
| 0 | 1 | 1 | 0 |
| 1 | 2/3 | 0 | |
| 2 | |||
| 3 | |||
| 4 | |||
| 5 | |||
| 6 | |||
| 7 | |||
| 8 | |||
| 9 | |||
| 10 |
| Order | Approximate moment value () |
|---|---|
| 0 | |
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | |
| 7 | |
| 8 | |
| 9 | |
| 10 |
| Iteration | Approx. Wasserstein dsitance | Error |
|---|---|---|
| 8 | ||
| 9 | ||
| 10 | ||
| 11 | ||
| 12 | ||
| 13 | ||
| 14 | ||
| 15 | ||
| 16 |
| Iter. | Approx. Wasserstein distance |
|---|---|
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
| 15 |
| Iter. | Approx. Lyapunov exponent |
|---|---|
| 8 | |
| 9 | |
| 10 | |
| 11 | |
| 12 | |
| 13 | |
| 14 | |
| 15 | |
| 16 | |
| 17 | |
| 18 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Approximating integrals with respect to stationary probability measures of iterated function systems
Italo Cipriano
Facultad de Matemáticas, Pontificia Universidad Católica de Chile (PUC), Avenida Vicuña Mackenna 4860, Santiago, Chile
[email protected] http://www.icipriano.org/ and
Natalia Jurga
Department of Mathematics, University of Surrey, Guildford, GU27XH, UK
(Date: 17th March 2024)
Abstract.
We study fast approximation of integrals with respect to stationary probability measures associated to iterated functions systems on the unit interval. We provide an algorithm for approximating the integrals under certain conditions on the iterated function system and on the function that is being integrated. We apply this technique to estimate Hausdorff moments, Wasserstein distances and Lyapunov exponents of stationary probability measures.
1. Introduction
Let for some and let be an iterated function system consisting of Lipschitz contractions . It is well-known that there exists a unique, non-empty and compact set such that
[TABLE]
This set is called the attractor of Given a probability vector (meaning that each and ), there exists a unique probability measure such that
[TABLE]
for every continuous function This probability measure is called the stationary probability measure associated to and Its support is the attractor of
Iterated function systems were first studied by Hutchinson [17] who proved the existence and uniqueness of the attractor and stationary probability measure. Since the late eighties, the problem of algorithmically estimating stationary probability measures has been a topic of active research, see [24, 2, 12, 20, 13].
In these notes we focus on the related problem of algorithmically estimating the integrals of functions with respect to stationary measures. In many areas of analysis, dynamical systems and probability theory, important quantities can be expressed in terms of such integrals, see section 6. For example, in fractal geometry, the Hausdorff dimension of the attractor can be bounded or sometimes explicitly calculated in terms of an expression involving the Lyapunov exponent of particular stationary measures, see section 6.3. On the other hand, in dynamical systems, equilibrium measures often take the form of a stationary probability measure, and therefore it is a natural problem to study the approximation of integrals with respect to them. Apart from some special cases, for instance whenever the stationary measure is absolutely continuous with an explicit known density, it is impossible to analytically calculate these integrals, and so one must turn to approximating them. We use an operator theoretic approach similar to [21, Theorem 1], where Jenkinson and Pollicott constructed an algorithm yielding accelerated approximations of integrals of functions with respect to Lebesgue measure. Indeed, our main Theorem 2.3 can be seen as an analogue of their result. In particular we provide
- •
a sufficient condition on the iterated function system and
- •
a class of functions
such that the integral
[TABLE]
can be approximated efficiently, where denotes any stationary probability measure associated to . In particular, in Theorem 2.3 we provide an algorithm which produces approximations to such that the approximations approach the integral at a super-exponential rate. We demonstrate the performance of this algorithm by using it to estimate Hausdorff moments, Wasserstein distances and Lyapunov exponents.
2. Main results
We introduce the following contraction condition motivated by the definition introduced in [3, Definition 2.2].
Definition 2.1**.**
A family of maps , is called complex contracting if there exists a non-empty, bounded, connected and open set such that and
- (1)
each extends holomorphically to 2. (2)
* is continuous on the boundary of * 3. (3)
**
The significance of the above definition is that if is a complex contracting family of maps then there exists some complex domain such that , each extends holomorphically to and , see [3, Lemma 2.4]. In this case we say that is an admissable domain for . The existence of such a domain is what is actually required in order to construct our algorithm, however in many cases it is easier to check that a family is complex contracting than to check that such a domain exists.
Definition 2.2**.**
Given a complex contracting iterated function system on the unit interval, we define the class of functions
[TABLE]
where the union is taken over all admissable domains for
In particular if is complex contracting, then all sufficiently small Euclidean -neighbourhoods of are admissable, see [3, Remark 2.5]. Therefore any function which is bounded and holomorphic on some Euclidean -neighbourhood of belongs to .
We introduce some notation. Let and for Define . For and we define
[TABLE]
Define by
[TABLE]
For each let denote the periodic point and let denote the fixed point of . Finally, we say that an iterated function system is non-overlapping if are pairwise disjoint.
The following is our main result.
Theorem 2.3**.**
Let be a complex contracting iterated function system on the unit interval and a probability vector. Given define
[TABLE]
and
[TABLE]
Also define
[TABLE]
and
[TABLE]
Then for
[TABLE]
and the stationary probability measure associated to and
[TABLE]
for some constants which are independent of
Note that while our approximations are non-effective, in some special cases the constants appearing in (18) can be made explicit. For example, when each takes the form of a linear fractional transformation, that is, for some constants (which in particular includes the important class of self-similar iterated function systems), the constants and can be bounded similarly to [19, 22]. Also note that since the time taken to process steps of the algorithm is exponential in , the error decreases super-polynomially fast in time.
Finally, we observe that by using stationarity of the measure , Theorem 2.3 can also be used to obtain approximations of integrals of some piecewise analytic functions. In particular, suppose that for some , are pairwise disjoint and let be a function (not necessarily continuous) such that each composition function belongs to for every By continuous continuation there is a continuous function with for every Thus by stationarity,
[TABLE]
so since , Theorem 2.3 can be applied to approximate each integral .
3. Preliminaries
3.1. Trace class operators, determinants and approximation numbers
Given a compact operator on a Hilbert space , its th approximation number is defined as
[TABLE]
A bounded linear operator on a complex separable Hilbert space is called trace-class if Given a trace-class operator , the trace is defined as
[TABLE]
where is any orthonormal basis and is the inner product for the Hilbert space .
Given a compact operator , we denote by the monotone decreasing sequence of non-zero eigenvalues of , listed with algebraic multiplicity. If is trace-class then it is compact and its sequence of eigenvalues is absolutely summable.
For a trace-class operator , the Fredholm determinant of can be defined as
[TABLE]
which is an entire function of [5, Theorem 3.3], so in particular there exist such that
[TABLE]
Note that by (3) the roots of are precisely the reciprocals of the eigenvalues of , and the degree of each zero is given by the multiplicity of the corresponding eigenvalue. Moreover, each coefficient can be expressed in terms of the traces of for :
[TABLE]
for a proof see for instance [19, Proposition 3.2].
On the other hand, by finding the coefficient of in (3) we see that
[TABLE]
therefore
[TABLE]
3.2. Bergman space
We will use results of Bandtlow and Jenkinson [3, 4] on operators acting on the Bergman space. Let be any connected, non-empty open subset of . We define the Bergman space by
[TABLE]
where denotes 2-dimensional Lebesgue measure, normalised so that the unit ball has unit mass. is a Hilbert space with inner product
[TABLE]
Consider the operator
[TABLE]
where are holomorphic functions on that satisfy and are holomorphic bounded functions on . Using the work of Bandtlow and Jenkinson [3, 4] we have the following result.
Proposition 3.1**.**
The operator preserves the Bergman space . Moreover
- (1)
* is a trace-class operator and there exist constants which are independent of and such that its eigenvalues satisfy*
[TABLE]
where , 2. (2)
the trace of is given by
[TABLE]
where is the unique fixed point of and
Proof.
The first part is a special case of [4, Theorems 5.9 and 5.13]. The second part follows by [3, Theorem 4.2]. ∎
3.3. Analytic perturbation theory
We say that a bounded linear operator on a Banach space has spectral gap if where is a rank one projection (so and ), is a bounded operator with spectral radius and . does not need to be compact in order to have a spectral gap, however if the operator is compact and has a simple leading eigenvalue111Throughout the paper we say that an eigenvalue is simple if it is algebraically simple, that is, the eigenvalue has a one-dimensional generalised eigenspace. and no other eigenvalues with the same absolute value, it has a spectral gap.
We can use the standard techniques of perturbation theory [18] to relate to the spectral properties of an appropriate operator. The following perturbation theorem is presented in a more general form in [16, Theorem 3.8].
Theorem 3.2** (Analytic perturbation theorem).**
Let be a family of bounded linear operators on a Banach space such that is holomorphic and has spectral gap. Then there exists an open neighbourhood of 0 for which has spectral gap for all . Moreover there exist which are holomorphic families on such that:
- (a)
, 2. (b)
** 3. (c)
* is a bounded rank one projection and has the form*
[TABLE]
for some small circle around which separates it from the rest of the spectrum of , 4. (d)
* for some which is independent of .*
4. Transfer operator
Let and satisfy the hypothesis of theorem 2.3. By assumption, there exists an admissable domain for such that has a bounded and holomorphic extension to . Abusing notation slightly, we also denote this extension by . Fix a probability vector , and . Define the operator as
[TABLE]
Then by Proposition 3.1, . Observe that the iterates of are given by
[TABLE]
The following proposition summarises some of the spectral properties of .
Proposition 4.1**.**
* is a trace-class operator with decreasing sequence of eigenvalues . Moreover*
- (1)
There exist constants which are independent of such that the eigenvalues admit the bound
[TABLE]
for any with , 2. (2)
* is a simple eigenvalue of of maximum modulus,* 3. (3)
* is analytic in in a neighbourhood of 0 and* 4. (4)
\int g\textup{d}\mu=\frac{\textup{d}}{\textup{d}s}\lambda_{1}(s)\biggr{|}_{s=0}.
Proof.
The fact that is trace-class and the uniform bounds on follow from Proposition 3.1.
Next we prove (2). Clearly 1 is an eigenvalue of with eigenfunction . To see that it is geometrically simple, suppose that is a fixed point of and . We will show that must be a constant function. First, observe that
[TABLE]
where the right hand side is finite because is a compact subset of . Therefore,
[TABLE]
for some . By the maximum-modulus principle, is constant on . Using the same argument, we can establish that 1 is the only eigenvalue of modulus 1.
Therefore it remains to show that is an algebraically simple eigenvalue. We need to show that is one dimensional (so only consists of the constant functions). For a contradiction suppose that there exists for which but . So in particular for some constant . In particular, since and therefore by replacing by we obtain that , that is, . By induction we see that
[TABLE]
On the other hand, define
[TABLE]
and define . Since is a nested sequence of closed subsets of , is a compact subset of . For any ,
[TABLE]
By (6), implying that
[TABLE]
which is clearly a contradiction since is bounded on .
To see (3), since is compact, is a simple eigenvalue of maximum modulus and is clearly an analytic family of operators in , we can apply the analytic perturbation theorem 3.2 to deduce that is analytic in a neighbourhood of 0.
Finally to prove (4) put
[TABLE]
as in (c) of Theorem 3.2. (a)-(c) of Theorem 3.2 imply that the image of is an eigenspace for the eigenvalue and that is an eigenfunction for the eigenvalue . Note that . Since is holomorphic it immediately follows that is also holomorphic. We write f_{0}=\frac{\textup{d}}{\textup{d}s}h_{s}\bigr{|}_{s=0}\in A^{2}(D).
Fix some . Observe that for each and ,
[TABLE]
Denote . Differentiating at we obtain
[TABLE]
where we used that and . Therefore since
[TABLE]
The fact that follows because are uniformly contracting and has bounded derivative on , therefore for any one can choose sufficiently large so that for , and all ,
[TABLE]
∎
5. Determinants and the algorithm
Since is a trace-class operator for each , we can define its determinant function . The following proposition summarises its properties.
Proposition 5.1**.**
For all , is an entire function of and can be written in the form
[TABLE]
for where for all and for is defined as
[TABLE]
Moreover, the trace of is given by
[TABLE]
Proof.
(9) follows from (4). To see (10) notice that by Proposition 3.1(2),
[TABLE]
Since for any , is a fixed point of it follows that and therefore (10) follows. ∎
Proposition 5.2**.**
Let as before. Then there exist constants such that for all , . Moreover,
[TABLE]
Proof.
For the first part, note that by (5) and Proposition 4.1(1),
[TABLE]
where the last line follows by repeated geometric summation. The upper bound on follows (for new constants and ). By the Cauchy integral formula,
[TABLE]
therefore
[TABLE]
from which we obtain the bound on .
Let be the neighbourhood of 0 on which is analytic. Observe that since the zeroes of the determinant are the reciprocals of the eigenvalues of ,
[TABLE]
Since uniformly on we can apply the Cauchy integral formula to deduce that the partial sums converge uniformly on compact subsets of as . Therefore we can differentiate (12) and take derivatives inside the summation to obtain
[TABLE]
Since is a simple zero of , it follows that and so by rearranging (13) we obtain
[TABLE]
which completes the proof. ∎
We define the th approximation
[TABLE]
Define
[TABLE]
and
[TABLE]
We are now ready to prove the main theorem.
Proof of Theorem 2.3. By (9), (14) and the definitions of it follows that is given by theorem 2.3. By the bounds on and given in Proposition 5.2 and (11) we obtain the desired bound on the error . ∎
6. Applications
In this section we demonstrate Theorem 2.3 by estimating moments and Lyapunov exponents of stationary probability measures and the first Wasserstein distance between stationary probability measures.
6.1. Moments of stationary probability measures
The -th moment of a positive Borel measure on is defined by
[TABLE]
Moments are particularly important in analysis, probability and statistics. In analysis, the classical moment problem [1] is connected with characterising the image of the map
[TABLE]
which is connected to the problem of the extension of positive functionals. In probability and statistics, the method of moments [6] is useful for proving limit theorems and estimating distributions of samples.
Since moments of stationary probability measures can be computed analytically in some special cases (see Lemma 6.2), we can demonstrate the efficiency of the algorithm by using it to approximate moments.
Corollary 6.1**.**
Let and satisfy the assumptions of Theorem 2.3. Then, for each fixed , Theorem 2.3 provides an algorithm with a super-exponential rate of convergence (18) that gives approximations to for
In the setting where is a non-overlapping iterated function system of similarities it is possible to obtain the following analytic formula for the moments.
Lemma 6.2**.**
Let with such that for every and a probability vector.
Then and for every
[TABLE]
Proof.
Directly from (17) and definition of ∎
We can use Theorem 2.3 to compute approximate values for the moments and compare these with the exact values given by Lemma 6.2.
Example 6.3**.**
Let and In table 1 we compare the approximate values for the moments with the exact values given by the formula.
On the other hand, when is made up of non-affine contractions, there is no analytic formula available. In the following example we approximate the moments of a stationary probability measure in this setting.
Example 6.4**.**
Let and We can compute the approximate values for the moments. The results are in Table 2, where all of the digits are stable.222We say that a digit is stable if, from empirical observations, it appears to have converged to a stable value
6.2. First Wasserstein distance between stationary probability measures
Fraser [11] initiated the study of the interaction between fractal geometry and optimal transportation by proposing the problem of computing the Wasserstein distances between stationary probability measures. The Wasserstein distance is a metrization of the weak- topology of the space of probability Borel measures on a Polish space. For the Wasserstein distance of order between two probability measures and is defined by
[TABLE]
In general, under no assumption on the iterated functions system and associated stationary probability measure, there is no hope of finding an analytic formula. Under some very restrictive assumptions it has been proved in [11, 8] that can be obtained explicitly. Using results announced in [7] and Theorem 2.3 we can provide an estimate of when and are stationary probability measures. The following result was obtained in [7].
Theorem 6.5**.**
Let be a non-overlapping iterated function system of differentiable Lipschitz contractions on the unit interval. Additionally, suppose that for each and that is a pair of probability vectors with the property that
[TABLE]
does not change sign for . Then
[TABLE]
Using the above result and Theorem 2.3, we obtain the following.
Corollary 6.6**.**
Let and satisfy the hypothesis of Theorem 6.5 and Theorem 2.3. Then Theorem 2.3 provides an algorithm that gives approximations to and which converges at a super-exponential rate (18).
Let satisfy the assumptions of Corollary 6.6 and additionally suppose that with for each . In [7] it was proved that
[TABLE]
We can compare the approximate values for the Wasserstein distances given by Theorem 6.6 with the exact value given by the equation (15).
Example 6.7**.**
Let and We compare the approximate value of the Wasserstein distance for with the exact value The results are in table 3.
On the other hand, when the contractions in are non-affine, an analytic formula is not available.
Example 6.8**.**
Let and We can compute the approximate values for the Wasserstein distance between the stationary probability measures associated to and The results are in Table 4.
6.3. Lyapunov exponents
Let be an iterated function system and be the stationary probability measure associated to The Lyapunov exponent of with respect to can be defined by
[TABLE]
The Lyapunov exponent of describes the typical rate at which distances are contracted under the action of the maps in the iterated function system, from the point of view of the measure . Therefore, Lyapunov exponents play an important role in fractal geometry; for example the Hausdorff dimension of is given in terms of its Lyapunov exponent, which in turn can provide bounds or even precise values for the Hausdorff dimension of the attractor , see [9]. When is non-overlapping, the Lyapunov exponent of has an equivalent interpretation as the -typical rate of expansion of the dynamical system whose branches are given by the inverse images of the maps in , and therefore Lyapunov exponents also play an important role in ergodic theory, particularly in thermodynamic formalism. We can use Theorem 2.3 to approximate the Lyapunov exponent of with respect to .
Corollary 6.9**.**
Let a probability vector and let be a complex contracting iterated function system on the unit interval such that:
[TABLE]
Then Theorem 2.3 provides an algorithm that gives approximations to the Lyapunov exponent of with respect to which converges at the super-exponential rate (18).
Proof.
Follows by applying Theorem 2.3 to the function which belongs to by (16) and the fact that is complex contracting. ∎
Example 6.10**.**
Let and We compute the approximate values to the Lyapunov exponent of with respect to . The results are in Table 5.
7. Further remarks
7.1. Implementation
The algorithm was implemented in Python using the free library mpmath 333Python library for real and complex floating-point arithmetic with arbitrary precision.. To estimate the fixed points with arbitrary precision the command
mpmath.findroot
was used and to estimate with arbitrary precision the command
mpmath.diff
was used.
7.2. Other applications
Our methods can be used to estimate other functions in probability theory, such as pointwise estimates of the Fourier-Stieltjes transform of stationary probability measures and coefficients of the Fourier series of singular measures arising as stationary probability measures.
7.3. Extension to non constant weight functions
Let be a complex contracting iterated function as before. We can extend our results to certain probability measures associated to non-constant weight functions.
In particular, suppose there exists an admissable domain for and bounded holomorphic functions , with the property that
- (1)
for all , 2. (2)
for all .
Then we can obtain an analogue of Theorem 2.3 for the unique probability measure supported on the attractor of with the property that
[TABLE]
for every continuous function . This probability measure is called the stationary probability measure associated to and , see [10]. It generalises the stationary probability measure considered for constant weight functions.
By following the proof of Theorem 2.3 with the operator
[TABLE]
one can obtain the following analogue of Theorem 2.3.
Theorem 7.1**.**
Let , and be as above. Given define
[TABLE]
and
[TABLE]
where
[TABLE]
Define , and as in Theorem 2.3. We have
[TABLE]
for some constants which are independent of
Acknowledgements.
IC would like to express his gratitude to Felipe Serrano and Franziska Schlösser for suggestions around the effective implementation of the algorithms. IC was partially supported by CONICYT PIA ACT172001. NJ was financially supported by the Leverhulme Trust (Research Project Grant number RPG-2016-194). Both authors would like to thank Ian Morris for helpful discussions and suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Akhiezer, N. I. (1965). The classical moment problem and some related questions in analysis. Translated by N. Kemmer. Hafner Publishing Co., New York.
- 2[2] Blank, M. (2001). Perron-Frobenius spectrum for random maps and its approximation, Mosc. Math. J. 1 31544 74, 4481-4495.
- 3[3] Bandtlow, O. F. and Jenkinson, O. (2008). On the Ruelle eigenvalue sequence. Ergodic Theory and Dynamical Systems, 28(6), 1701-1711.
- 4[4] Bandtlow, Oscar F., and Oliver Jenkinson. ”Explicit eigenvalue estimates for transfer operators acting on spaces of holomorphic functions.” Advances in Mathematics 218.3 (2008): 902-925.
- 5[5] Simon, B. (2010). Trace ideals and their applications (No. 120). American Mathematical Soc.
- 6[6] Diaconis, P.(1987). Application of the method of moments in probability and statistics. In Moments in mathematics (San Antonio, Tex., 1987), volume 37 of Proc. Sympos. Appl. Math., pages 125-142. Amer. Math. Soc., Providence, RI.
- 7[7] Cipriano, I. (2018). The Wasserstein distance between stationary measures, ar Xiv:1611.00092 v 3.
- 8[8] Cipriano, I. and Pollicott, M. (2018). Stationary measures associated to analytic iterated function schemes, Math. Nachr. , 291:7, 1049-1054.
