A simultaneous version of Host's equidistribution Theorem
Amir Algom

TL;DR
This paper extends Host's equidistribution theorem to a simultaneous setting, proving that under certain independence conditions, the joint orbit of a point under two multiplicative maps equidistributes for the product measure.
Contribution
It introduces a simultaneous version of Host's theorem, showing joint equidistribution for pairs under independent multiplicative maps, generalizing previous results.
Findings
Joint orbit of (x,x) equidistributes for Lebesgue times μ
Results hold when m and p are independent and m>p
Extends to pairs (m,n) with n independent of p
Abstract
Let be a probability measure on that is ergodic under the map, with positive entropy. In 1995, Host showed that if then almost every point is normal in base . In 2001, Lindenstrauss showed that the conclusion holds under the weaker assumption that does not divide any power of . In 2015, Hochman and Shmerkin showed that this holds in the "correct" generality, i.e. if and are independent. We prove a simultaneous version of this result: for typical , if are independent, we show that the orbit of under equidistributes for the product of the Lebesgue measure with . We also show that if and is independent of as well, then the orbit of under equidistributes for the Lebesgue measure.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A simultaneous version of Host’s equidistribution Theorem
Amir Algom
Abstract
Let††Supported by ERC grant 306494 and ISF grant 1702/17. be a probability measure on that is ergodic under the map, with positive entropy. In 1995, Host [14] showed that if ††2010 Mathematics Subject Classification 11K16, 11A63, 28A80, 28D05. then almost every point is normal in base . In 2001, Lindenstrauss [19] showed that the conclusion holds under the weaker assumption that does not divide any power of . In 2015, Hochman and Shmerkin [13] showed that this holds in the ”correct” generality, i.e. if and are independent. We prove a simultaneous version of this result: for typical , if are independent, we show that the orbit of under equidistributes for the product of the Lebesgue measure with . We also show that if and is independent of as well, then the orbit of under equidistributes for the Lebesgue measure.
1 Introduction
1.1 Background and main results
Let be an integer greater or equal to . Let be the -fold map of the unit interval,
[TABLE]
Let be an integer independent of , that is, . Henceforth, we will write to indicate that and are independent. In 1967 Furstenberg [7] famously proved that if a closed subset of is jointly invariant under and , then it is either finite or the entire space . A well known Conjecture of Furstenberg about a measure theoretic analouge of this result, is that the only continuous probability measure jointly invariant under and , and ergodic under the action generated by these maps, is the Lebesgue measure. The best results towards this Conjecture, due to Rudolph [26] for such that and later to Johnson [17] for , is that it holds if in addition the measure has positive entropy with respect to the action generated by (see also the earlier results of Lyons [21]).
In 1995 Host proved the following pointwise strengthening of Rudolph’s Theorem. Recall that a number is said to be normal in base if the sequence equidistributes for the Lebesgue measure on . Equivalently, the sequence of digits in the base expansion of has the same limiting statistics as an IID sequence of digits with uniform marginals.
Theorem**.**
(Host, [14]) Let be integers such that . Let be invariant ergodic measure with positive entropy. Then almost every is normal in base .
Host’s theorem can be shown to imply Rudolph’s Theorem, but is more constructive in the sense that it proves that a large collection of measures satisfy a certain regularity property. Host’s Theorem is also closely related to classical results of Cassels [3] and Schmidt [27] from around 1960, that proved a similar result for certain Cantor-Lebesgue type measures. This was later generalized by Feldman and Smorodinsky [6] to all non-degenerate Cantor-Lebesgue measures (in fact, weakly Bernoulli) with respect to any base (though applying to a less general class of measures, the latter results nonetheless hold for any independent integers ). We remark that the works of Meiri [23] and of Hochman and Shmerkin [13] contain excellent expositions on Host’s Theorem, and on the results of Cassels and Schmidt and some of the research that followed, respectively.
The assumption made on the integers in Host’s Theorem, however, is stronger than it ”should” be. Namely, it is stronger than assuming that . In 2001, Lindenstrauss [19] showed that the conclusion of Host’s Theorem holds under the weaker assumption that does not divide any power of . Finally, in 2015, Hochman and Shmerkin [13] proved that Host’s Theorem holds in the ”correct” generality, i.e. when .
Now, let be a measure as in Host’s Theorem, with . Then, on the one hand, by the results of Hochman and Shmerkin, for almost every , its orbit under equidistributes for the Lebesgue measure. On the other hand, for almost every , its orbit under equidistributes for (this is just the ergodic Theorem). The main result of this paper is that this holds simultaneously.
Theorem 1.1**.**
Let be a invariant ergodic measure with . Let be integers such that .
If then
[TABLE]
where the convergence is in the weak- topology, and is the Lebesgue measure on .* 2. 2.
If then
[TABLE]
Several remarks are in order. First, the assumption that has positive dimension and the assumption that it has positive entropy are equivalent, so there is no discrepancy between the assumptions in Host’s Theorem and those in Theorem 1.1 (see Section 2.1 for a discussion on the dimension theory of measures). Secondly, in the second part of the Theorem we do not need that and are independent, only that . In addition, we can prove a version of Theorem 1.1 where the initial point is replaced with for that are non singular affine maps of that satisfy some extra mild conditions. This is explained in Section 6.
Theorem 1.1 can also be considered as part of the following general framework. Let , and let be an invariant probability measure. The idea is to study the orbits for typical . In our situation,
[TABLE]
and the measure , where is the map .
Problems around this framework were studied by several authors. Notable related examples are the works of Meiri and Peres [24], and the subsequnet work of Host [15]. Meiri and Peres prove a Theorem similar to ours, with the following differences:
- •
They work with two general diagonal endomorphisms and , but they require that the corresponding diagonal entries and be larger than and co-prime.
- •
They allow for more general measures then the one dimensional measures that we work with (in Theorem 1.1 we work with measures on the diagonal of ).
Host in turn has some requirements on and the measure that are more general than ours, but also requires that and be co-prime, and that for every the characteristic polynomial of is irreducible over (clearly this is not the case here). The results of both Host, and Meiri and Peres, extend to any dimensional torus.
Our proof of Theorem 1.1 is inspired by the work of Hochman and Shmerkin [13]. In particular, the scenery of the measure at typical points plays a pivotal role in our work. We devote the next Section to defining this scenery and some related notions, and to formulating the main technical tool used to prove Theorem 1.1.
1.2 On sceneries of measures and the proof of Theorem 1.1
We first recall some notions that were defined in ([13], Section 1.2). However, we remark that these notions and ideas have a long history, going back varisouly to Furstenberg ([8], [9]), Zähle [31], Bedford and Fisher [1], Mörters and Preiss [25], and Gavish [10]. See ([13], Section 1.2) and [11] for some further discussions and comparisons.
For a compact metric space let denote the space of probability measures on . Let
[TABLE]
For and we define the scaled measure by
[TABLE]
For we similarly define the translated measure by
[TABLE]
The scaling flow is the Borel flow acting on . The scenery of at is the orbit of under , that is, the one parameter family of measures for . Thus, the scenery of the measure at some point is what one sees as one ”zooms” into the measure.
Notice that . As is standard in this context, we shall refer to elements of as distributions, and to elements of as measures. A measure generates a distribution at if the scenery at equidistributes for in , i.e. if
[TABLE]
and generates if it generates at almost every .
If generates , then is supported on and is -invariant ([11], Theorem 1.7). We say that is trivial if it is the distribution supported on - a fixed point of . To an -invariant distribution we associate its pure point spectrum . This set consists of all the for which there exists a non-zero measurable function such that for every , on a set of full measure. The existence of such an eigenfunction indicates that some non-trivial feature of the measures of repeats periodically under magnification by .
Finally, we say that a measure is pointwise generic under for a measure if almost every equidistributes for under , that is,
[TABLE]
We are now ready to state our second main result, which is the technical tool that shall be employed to prove Theorem 1.1.
Theorem 1.2**.**
Let and let be integers, such that:
The measure generates a non-trivial -ergodic distribution . 2. 2.
The pure point spectrum does not contain a non-zero integer multiple of . 3. 3.
The measure is pointwise generic under for an ergodic and continuous measure .
Then
[TABLE]
Notice that under assumption (3) of Theorem 1.2, the measure is invariant, so its ergodicity is with respect to this map. Theorem 1.2 together with the machinery developted by Hochman and Shmerkin in ([13], Section 8) imply Theorem 1.1: this is explained in Section 5.
We end this introduction with a brief overview of the proof of Theorem 1.2. First, we note that if we only assume (1) and (2) in Theorem 1.2, then
[TABLE]
according to the main result of Hochman and Shmerkin [13]. This is proved by roughly following three steps: first, using the spectral condition, they show that any accumulation point of measures as in (2) can be represented as an integral over measures that are closely related to those drawn according to . They proceed to use this representation to show that
[TABLE]
They conclude by showing that the only invariant measure satisfying the latter property is the Lebesgue measure.
Our strategy is to first show that a invariant measure , that projects to and to in the first and second coordinate respectively, must be if it satisfies the following condition:
[TABLE]
for almost every , where is the conditional measure of on the fiber , and . This is Claim 3.4 in Section 3. Afterwards, we show that this property holds for all the accumulation points of the measures from (1). This is done via a corresponding integral representation, see Claim 4.1 in Section 4.
Notation We shall use the letter to indicate both the Lebesgue measure on and the Lebesgue measure on . Which is meant will be clear from context. Also, whenever we have a finite product space, we denote by the projection to the -th coordinate.
Acknowledgements I would like to thank Mike Hochman for suggesting the problem studied in this paper to me, and for his various insightful comments on earlier versions of this manuscript. I would also like to thank Shai Evra for some helpful conversations related to this paper.
2 Preliminaries
2.1 Dimension theory of measures, and their Fourier coefficients
For a Borel set in some metric space , we denote by its Hausdorff dimension, and by its packing dimension (see Falconer’s book [5] for an exposition on these concepts). Now, let . The (lower) Hausdorff dimension of the measure is defined as
[TABLE]
and the upper Hausdorff dimension of the measure is defined as
[TABLE]
The (upper) packing dimension of the measure is defined as
[TABLE]
An alternative characterization of the dimension of that we shall often use is given in terms of their local dimensions: For every we define the local (pointwise) dimension of at as
[TABLE]
where denotes the closed ball or radius about . The Hausdorff dimension of is equal to
[TABLE]
and the upper Hausdorff dimension of is equal to
[TABLE]
see e.g. [4]. If exists as a limit at almost every point, and is constant almost surely, we shall say that the measure is exact dimensional. In this case, most metric definitions of the dimension of coincide (e.g. lower and upper Hausdorff dimension and packing dimension).
Let us now collect some known facts regarding dimension theory of measures:
Proposition 2.1**.**
Let and suppose that there is a distribution such that
[TABLE]
Then
[TABLE]
If where and is any probability vector, then
[TABLE] 2. 2.
Let be a Lipschitz map between complete metric spaces. Then for any ,
[TABLE]
with an equality if is locally bi-Lipschitz. 3. 3.
Let be exact dimensional, and be a measure supported on finitely many atoms. Then , and moreover, is exact dimensional.
The next Lemma is essentially Lemma 3.5 in [13], with a minor modification which follows e.g. from Lemma 6.13 in [11].
Lemma 2.2**.**
([13], Lemma 3.5) Let .
Suppose that for almost every , (where is the conditional measure on the fiber ). Then . 2. 2.
For an upper bound, we have .
We end this section with a brief discussion of the Fourier coefficients of measures on . These are defined as follows. First, given we define for any the corresponding Fourier coefficient by
[TABLE]
The following relations are easily verified for two measures :
[TABLE]
[TABLE]
The following Lemma is standard:
Lemma 2.3**.**
([22], Section 3.10 ) Let . If for all then .
Finally, let and let be the Cantor-Lebesgue measure corresponding to the non-degenerate probability vector . That is, is the distribution of the Random sum , where are IID random variables with . It is a well known fact that for every ,
[TABLE]
2.2 Dimension theory of invariant measures
2.2.1 Some notions from ergodic theory
In this paper, a dynamical system is a quadruple , where is a compact metric space, is the Borel sigma algebra, and is a measure preserving map, i.e. is Borel measurable and . Since we always work with the Borel sigma-algebra, we shall usually just write . For example one may consider , the Borel map for some , and some Cantor-Lebesgue measure with respect to base .
A dynamical system is ergodic if and only if the only invariant sets are trivial. That is, if satisfies then or . A dynamical system is called weakly mixing if for any ergodic dynamical system , the product system is also ergodic. In particular, weakly mixing systems are ergodic. Moreover, If both and are weakly mixing, then their product system is also weakly mixing. A class of examples of weakly mixing systems is given where is a Cantor-Lebesgue measure with respect to base .
We will have occasion to use the ergodic decomposition Theorem: Let be a dynamical system. Then there is a map , denoted by , such that:
The map is measurable with respect to the sub-sigma algebra of invariant sets. 2. 2.
3. 3.
For almost every , is invariant and ergodic. The measure is called the ergodic component of .
Finally, we shall say that a point is generic with respect to if
[TABLE]
in the weak-* topology. By the ergodic Theorem, if is ergodic then a.e. is generic for .
2.2.2 Dimension theory of invariant measures
Recall that in general, if is a invariant measure, we may define its entropy with respect to , a quantity that we shall denote by . As there is an abundance of excellent texts on entropy theory (e.g. [30]), we omit a discussion on entropy here. We now restrict our attention to dynamical systems of the form or , where we always assume that . In the one dimensional case, the dimension of may be computed via the entropies of its ergodic components:
Theorem 2.4**.**
([20], Theorem 9.1) Let be a invariant and ergodic measure. Then is exact dimensional and
[TABLE]
In general, if is a invariant measure with ergodic decomposition , then
[TABLE]
and
[TABLE]
The situation for dynamical systems of the form is more complicated. This may be attributed to the fact that the map is not conformal. There is, however, a way to compute the dimension of in this situation via entropy theory, using a suitable version of the Ledrappier-Young formula . This was first done by Kenyon and Peres in [18] for ergodic measures. The general case may be treated using similar methods, as observed by Meiri and Peres in ([24], Lemma 3.1).
Theorem 2.5**.**
[24]** Let be a invariant measure. Then for almost every the local dimension exists as a limit and
[TABLE]
where and denote the corresponding ergodic components of , and of , respectively.
Finally, we will require the following result of Meiri, Lindenstrauss and Peres from [20]:
Theorem 2.6**.**
[20]** Let be a invariant weakly mixing measure, such that . Let denote the convolution of with itself -times. Then
[TABLE]
We remark that we have only cited a special case of this result. Indeed, Meiri, Lindenstrauss and Peres deal with the growth of the entropy of more general convolutions of ergodic measures. We refer the reader to [20] for the full statement.
2.3 Relating the distribution of orbits to the measure
Let be a compact metric space, a Borel measurable map, and let . Following Hochman and Shmerkin [13], we shall say that is pointwise generic for under if almost every equidistributes for under , that is,
[TABLE]
This notion is closely related to the main results of this paper. Indeed, let , for and , and be the pushforward of a invariant ergodic positive dimensional measure to the diagonal of . Then Theorem 1.1 part (1) for example may be stated as ” is pointwise generic for under ”.
In [13], the authors obtain a criteria for this to occur, one that shall play a central role in this paper as well. We now recall its formulation. Let be a finite partition of , and for every let . Let denote the coarsest common refinement of . Now, if the smallest sigma algebra that contains for all is the Borel sigma algebra, we say that is a generator for . We say that is a topological generator if as . A topological generator is clearly a generator.
Let us give two examples of topological generators that shall be used in this paper: for every let be the -adic partition of (and of ), that is,
[TABLE]
Then, under the map , we see that
[TABLE]
It is thus easy to see that is a generator for . Similarly, if then the partition of is a generator under .
Finally, in general, for every and , let denote the unique element of that contains . Given and such that , let
[TABLE]
which is well defined almost surely.
Theorem 2.7**.**
([13], Theorem 2.1) Let be a Borel measurable map of a compact metric space, and a generating partition. Then for almost every , if equidistributes for along some , and if for all , then
[TABLE]
A crucial ingredient in our application of Theorem 2.7 is the following Claim. Let , and define for every
[TABLE]
Also, recall that the density of a sequence (if it exists) is the limit of the sequence as . If the limit does not exist, the corresponding is called the upper density of .
Claim 2.8**.**
Suppose that is a measure that is pointwise generic under for a continuous measure . Then for almost every , if and represents the times when , then the density of is zero.
Proof.
Choose , and if let be the sequence as in the statement of the Claim. Let . We will show that the upper density of is at most . First, since is a continuous measure, there exists some such that , where is the ball about [math] in . By our assumption that is pointwise generic under for , and since is a continuous measure,
[TABLE]
has density .
Now, let us decompose our sequence
[TABLE]
Then the upper density of is at most . We now show that the density of the sequence is [math]. In fact, we will show that this is a finite sequence.
Indeed, let . We claim that . Assume towards a contradiction that there exists some such that for some . Then there is a unique -adic number (an endpoint of an cell) such that . Write for some integer . Then we have
[TABLE]
which implies that , by the choice of . Thus, , contradicting the choice of the sequence . Thus, , which is sufficient for us. ∎
We will also require the following Lemma.
Lemma 2.9**.**
Let be such that it equidistributes for a continuous measure under . Let be some interval. Let be the sequence of times when but . Then the density of the sequence is [math].
Proof.
Let . Since is continuous, there exists some such that
[TABLE]
Let
[TABLE]
Then by our assumption on , the density of is at most . However, the sequence , apart from maybe finitely many indices. It follows that the upper density of is at most the density of , and therefore is at most . This proves the Lemma. ∎
2.4 Ergodic fractal distributions
Recall the definitions introduced in Section 1.2. In this Section we discuss some other related results of [13] that we shall require. First, we cite a result about the implication of not having some element in the pure point spectrum of a distribution generated by a measure.
Proposition 2.10**.**
([13], Section 4) Suppose that generates an -ergodic distribution and that no non-zero integer multiple of is in . Then is -generated by at almost every , i.e. the sequence equidistributes for .
The next result says that distributions that are generated by a given measure have some additional invariance properties:
Theorem 2.11**.**
([13], Theorem 4.7) Suppose that generates an -invariant distribution . Then is supported on and satisfies the -quasi-Palm property: for every Borel set , if and only if for every , almost every measure satisfies that for almost every such that .
We shall refer henceforth to -ergodic distributions supported on that satisfy the conclusion of Theorem 2.11 as EFD’s (Ergodic Fractal Distributions), a term coined by Hochman in [11]. The next Proposition says that typical measures with respect to a non-trivial EFD have positive dimension (recall the definition of non-triviality in this situation from Section 1.2):
Proposition 2.12**.**
([13], Proposition 4.12) Let be an EFD. Then there exists some such that almost every has . If is non-trivial then .
We will also need to know that -typical measures are not ”one sided at small scales”
Proposition 2.13**.**
([13], Proposition 4.13) Let be an EFD. For every , for almost every , we have , where ranges over closed intervals of length containing [math].
The next Proposition follows from the -invariance of EFD’s, and from a Theorem of Hunt and Kaloshin [16]:
Proposition 2.14**.**
([13], Lemma 5.8) Let be a non trivial EFD such that typical measures have dimension . Let be such that . Then for almost every .
Finally, the next Proposition shows that ergodic invariant measures of positive dimension generate non-trivial EFD’s:
Theorem 2.15**.**
[12]** Let be a invariant ergodic measure with . Then generates a non-trivial ergodic distribution (which is an EFD by Theorem 2.11).
Let . We remark that while non-degenerate Cantor-Lebesgue measures with respect to base do generate EFD’s such that for every non zero integer , this is not true in general. Thus, in order to deduce Theorem 1.1 from Theorem 1.2, we shall require some additional machinery developed by Hochman and Shmerkin in [13] for a similar purpose. This is discussed in Section 5.
3 Some properties of (times m, times n) invariant measures
Throughout this section we fix integers . We begin with an elementary Lemma from entropy theory. Recall that we denote the coordinate projections by .
Lemma 3.1**.**
Let be a invariant measure such that . If
[TABLE]
then .
Proof.
Let be the invariant sigma algebra that corresponds to the second coordinate of . Then, by the Abramov-Rokhlin Lemma (see [2] for the non-invertible case),
[TABLE]
Combining this with our condition, we see that
[TABLE]
Recall that the partition is a generating partition of (see Section 2.3). Then it follows from Fekete’s Lemma and the Kolmogorov-Sinai Theorem that
[TABLE]
As is also an upper bound for the sequence , we find that for every ,
[TABLE]
So, by the formula for conditional entropy as average of the conditional measures ,
[TABLE]
where the partition in the last term on the RHS should be understood as the corresponding partition on the fiber . We also have almost surely, since has atoms. Therefore,
[TABLE]
almost surely. Such an equality is possible only if is the uniform measure on . It follows that almost surely the measure is the uniform measure on for every . By the Kolmogorov consistency Theorem, almost surely. Since , this proves the result. ∎
Claim 3.2**.**
Let be a invariant measure such that is exact dimensional. If then .
Proof.
By equation (3), and by Theorem 2.5
[TABLE]
(recall that and denote the corresponding ergodic components of and of , respectively).
Now, by Theorem 2.4, and since has exact dimension
[TABLE]
So, for almost every we have
[TABLE]
Combining (11) with (10), we find that
[TABLE]
Therefore, by (12), the formula for entropy as an average over ergodic components, the Abramov-Rokhlin Lemma, and the formula for entropy as the average of conditional measures (as in Lemma 3.1), we have
[TABLE]
where be the invariant sigma algebra that corresponds to the second coordinate of . Thus, we have that almost surely,
[TABLE]
Now, (13) and the Abramov-Rokhlin Lemma imply that almost surely
[TABLE]
By (13) and the formula for entropy and convex combinations,
[TABLE]
Since almost surely, we must have almost surely. By this equality and (14) we see that for almost every ,
[TABLE]
[TABLE]
By Lemma 3.1, almost every ergodic component equals . Thus,
[TABLE]
∎
Next, we make a short digression to discuss the relation between the conditional measures of a convolution of measures, and the conditional measures of the individual measures convolved, in some special cases. In the following, the convolution of the two measures on the unit square is understood to take place in . For a measure , Let be the disintegration of with respect to the projection .
Claim 3.3**.**
Let be two measure such that , where the measure is a convex combination of finitely many atomic measures. Then for almost every , the conditional measure with respect to the projection is a finite convex combination of measures of the form , where is an atom of and is a conditional measure of with respect to the projection .
Proof.
If for some then the result is straightforward. For the general case, notice that if and then by the linearity of both convolution and of taking product measures
[TABLE]
In general, if is a convex combination of probability measures and is some sigma algebra, then the following holds almost surely for every :
[TABLE]
We remark that in the above equation, the Radon-Nikodym derivatives in fact stand for the Radon-Nikodym derivatives when the measures are restricted to the sigma-algebra , i.e. . However, we suppress this in our notation. So, for the Borel sigma algebra on the -axis, for every and for almost every
[TABLE]
It follows that almost surely,
[TABLE]
∎
The following Claim, which forms the main result of this section, is also the key for our argument.
Claim 3.4**.**
Let be a invariant measure such that:
We have , where is a continuous ergodic measure, and . 2. 2.
There exists some such that:
For every probability measure with , for almost every , we have .
Then .
Proof.
Suppose towards a contradiction that . Let us first identify with the corresponding measure on (i.e. we project to but we keep the notation ), which cannot be either. Then, by Lemma 2.3, there exists such that
[TABLE]
Now, as and we must have , since if e.g. then, using (6),
[TABLE]
a contradiction. Thus, we may assume both , and since we have by (6).
Now, let be such that . We construct two measures such that:
The measure is a uniform measure on a finite (periodic) orbit such that .
To find such a measure, we take the periodic orbit where and . Define a measure on this orbit. Then
[TABLE]
Now, if then , which can only happen if . However, and . Thus, it is impossible that . 2. 2.
The measure is invariant, and .
To find such a measure, let be the Cantor-Lebesgue measure with respect to base and the non-degenerate probability vector (see the end of Section 2.1). Then is a weakly mixing invariant measure (a Bernoulli measure). By (7),
[TABLE]
By looking at the corresponding power series expansion, we see that for every
[TABLE]
By Proposition 3.1 in Chapter 5 of [29], we conclude that if and only if one of its factors has a zero at . Since , this clearly does not happen, and we conclude that .
Also, notice that , where is the Shannon entropy of the probability vector . Finally, by Theorem 2.6, there exists some such that , where by we mean that we convolve with itself times. Recalling (5), we see that . Thus, we may take . Notice that is invariant, so it is also invariant.
Thus, by (5) and (6), the Fourier coefficients of the measure satisfy
[TABLE]
Therefore, as , we have by Lemma 2.3
[TABLE]
since
[TABLE]
On the other hand, let us now lift all our measures to corresponding measures on and . Since is already defined on the unit square, we take this representative for our lift. Since cannot be atomic we can take our lift as the corresponding measure on , and for the measure we can take essentially the same measure. By Claim 3.3, the conditional measures of with respect to the projection are almost surely finite convex combinations of measures of the form , where are the atoms of the measure , with weights for . So, for almost every ,
[TABLE]
where we have used condition (2) in the statement of the Claim, the lower bound on and Proposition 2.1. Since the opposite inequality is always true, we conclude that
[TABLE]
Since , and by (17), we see via Lemma 2.2 part (1) that
[TABLE]
On the other hand, by part (2) of Lemma 2.2, and since ,
[TABLE]
We conclude that .
Finally, we project to . Since this projection is a local diffeomorphism, it preserves dimension. Thus, the convolved measure , with the ambient group being , has dimension . Moreover, by Theorem 2.4, since is ergodic it is exact dimensional. Since is a discrete measure (supported on two atoms), the convolution remains exact dimensional (Proposition 2.1).
Therefore, we may apply Claim 3.2 for the measure , since this is a invariant measure (as the convolution of such measures), and the assumptions on the dimension of and on are met by the previous paragraph. Thus, we may conclude that . Via (16), this yields our desired contradiction. ∎
4 Proof of Theorem 1.2
Let be as in Theorem 1.2, and let be some accumulation point of the sequence of measures as in (1) (where we pick a typical according to ), along a subsequence . Our goal is to show that , and we shall do this by showing that meets the conditions of Claim 3.4.
By our assumptions and Theorem 1.1 in [13] it follows that and . Thus, satisfies condition (1) in Claim 3.4. Notice that this implies that gives zero mass to the points of discontinuouty of . So, is invariant. For the second condition of Claim 3.4, we require the following analogue of Theorem 5.1 in [13]. Recall that is the EFD generated by (see Section 2.4).
Claim 4.1**.**
(Conditional integral representation) For almost every there is a probability space and measurable functions
[TABLE]
such that:
[TABLE] 2. 2.
Let denote the distribution of random variable as above. Then .
Proof.
We dedicate the first part of the proof to finding a disintegration of according to the measure . To this end, consider the following sequence of distributions , defined by
[TABLE]
Let be some accumulation point of this sequence. Without the loss of generality, let us assume the limit already exists along the sequence . Then we may assume that and , since we are considering a typical point , making use of the fact that is pointwise generic under for , and of the spectral condition on via Proposition 2.10.
Next, we disintegrate the distribution via the projection :
[TABLE]
Applying the map to this disintegration, we see that
[TABLE]
Thus, the family of measures forms our desired disintegration.
Let us study this family of distributions a little further: It is well known (see e.g. [9] or [28]) that for almost every , we may write
[TABLE]
Therefore,
[TABLE]
Finally, we note that for almost every , for every ,
[TABLE]
Therefore, for almost every , for every ,
[TABLE]
We now turn our attention to the main assertions of the Claim. First, we embed (the measure from Theorem 1.2) on the diagonal of the unit square by pushing it forward via the map . We call this new measure . For let denote the partition of given by
[TABLE]
Given a point such that we define a probability measure
[TABLE]
where is a normalizing constant. By applying Claim 2.8, we see that there is a set of density (possibly depending on the we chose according to ), such that for every the measure is an affine image of the measure . Since we are only interested in the limiting behaviour of these measures, we may assume . Also, for all since and are both continuous measures. Thus, by Theorem 2.7
[TABLE]
Now, for almost every the conditional measure can be obtained as the weak-* limit , where for every Borel set and ,
[TABLE]
Fix . By (19) and since is an affine image of the measure for every , the projection of to the -axis (i.e. via ) equals111Recall that by equation (8) in [13] Section 5.2,
for corresponding parameters.
[TABLE]
[TABLE]
where is the unique affine map taking the -axis to the line with slope through the point . Notice that in the first equation above we only take note of the indices such that , and this is justified by Lemma 2.9.
We thus see, as in Theorem 5.1 in [13] and its preceding discussion, that there is a distribution
[TABLE]
such that we have an integral representation (that depends on both and )
[TABLE]
where is the map . Moreover, the distribution of on the measure component is given by
[TABLE]
and by equation (18) and its preceding discussion, this distribution is absolutely continuous with respect to .
Notice that for almost every , is the normalizing constant making a probabilty measure. Also, the map is continuous almost surely. Moreover, by moving to a subsequence, we may assume the weak -* limit exists, call it . For these assertions, we argue, as in ([13], Theorem 5.1), that the distribution of the normalizing constants is tight. Indeed, for measures drawn according to this follows from Proposition 2.13, and in our case the distribution of on the measure component is absolutely continuous with respect to . Finally,
[TABLE]
This completes the proof of part (1). For part (2), it remains to note that by our construction, for almost every the distribution of on the measure component is given by , as in the first part of the proof, by the discussion preceding (18). ∎
Proof of Theorem 1.2 We are now in position to show that satisfies all conditions in Claim 3.4. We have already established that condition (1) holds. As for condition (2), we choose (here we use the assumption that is non trivial, and Proposition 2.12). Let be such that . We now show that for almost every ץ
First, by Lemma 2.14 and Claim 4.1 part (2)
[TABLE]
Therefore, for almost every , for almost every , (since the integrand is always ). Thus, by Claim 4.1 part (1), for almost every ,
[TABLE]
Since is always true, we find that for almost every .
We conclude that satisfies the conditions of Claim 3.4. Therefore, , as desired.
5 Proof of Theorem 1.1
Let be a -invariant and ergodic measure with positive dimension. Then generates an EFD with by Theorem 2.15. Let . The pure point spectrum can contain non zero integer multiplies of only if either (in Theorem 1.1 we assume this is not the case), or if , see [12]. We shall prove Theorem 1.1 by using Theorem 1.2, and following the analysis of Hochman and Shmerkin from ([13], Section 8) in order to relax the spectral condition (i.e. deal with the latter case). We begin by treating the case .
Suppose first that does not contain a non-zero integer multiple of . By the ergodic Theorem, is pointwise generic for . Also, since generates an EFD such that for every , we may apply Theorem 1.2 and obtain
[TABLE]
.
Suppose now that there exists some such that , so is not ergodic by Proposition 4.1 in [13]. By the results discussed in ([13], Sections 8.2 and 8.3) there is a probability space and a measurable family of measures such that:
The measures form a disintegration of , that is, . 2. 2.
For almost every , generates . 3. 3.
For almost every , -generates an ergodic distribution at almost every point (see Proposition 2.10 for the definition of -generation).
Let denote the almost sure dimension of the measures drawn by . Then the following holds222Here, we use the fact that the commutative phase measure from Theorem 8.2 in [13] has dimension 1, as proven in Section 8.3.:
Lemma 5.1**.**
([13], Lemma 8.3) Let be such that . Then for almost every , almost every , and almost every .
Now, we may finish the proof in a similar fashion to the proof of Theorem 1.2. Namely, For almost every and for almost every , let be such that equidistribute for it sub-sequentially under . Then we may assume by the ergodic Theorem, and that -generates , where is typical with respect to Lemma 5.1. Then we have a conditional integral representation as in Claim 4.1, but now we can only disintegrate . Since we have Lemma 5.1 at our disposal (so that an analogue of (21) holds for instead of ), we still have that for every with , for almost every , as the calculation carried out during the last stage of the proof of Theorem 1.2 follows through in this case as well. It follows that . Finally, since this is true for almost every and for almost every , this is also true for almost every (recall that ).
The case when follows by a similar argument, only here for almost every , is pointwise -normal, since this is true for by Theorem 1.10 in [13].
6 Perturbing the initial point
In this Section we prove the following generalization of Theorem 1.1:
Theorem 6.1**.**
Let be a invariant ergodic measure with . Let be integers such that , and let be such that .
If then
[TABLE] 2. 2.
If then
[TABLE]
The proof is similar to the proof of Theorem 1.1. In particular, it relies on the following generalization of Theorem 1.2:
Theorem 6.2**.**
Let be a probability measure, , and be integers, such that:
The measure generates a non-trivial -ergodic distribution . 2. 2.
The pure point spectrum does not contain a non-zero integer multiple of . 3. 3.
The measure is pointwise generic under for an ergodic and continuous measure , and .
Then
[TABLE]
For this to work, we need the following version of Claim 2.8. Let . For every , define
[TABLE]
Claim 6.3**.**
Suppose that is a measure such that is pointwise generic under for a continuous measure . Then for almost every , if and represents the times when , then the density of is zero.
The proof is analogues to that of Claim 2.8.
Proof of Theorem 6.2 The proof follows essentially the same steps as the proof of Theorem 1.2. Let be some accumulation point of the orbit under of , where is drawn according to .
- •
By ([13], Theorem 1.1) we have . By our assumption on , .
- •
A complete analogue of Claim 4.1 holds in this case as well. First, we disintegrate according to , in a similar manner to the first part of the proof of Claim 4.1. Here, we make use of the fact that generates and generates , which follows by ([13], Lemma 4.16).
Secondly, we embed on a line in by pushing it forward via the map (recall that we are assuming that both and map to ). Calling this measure , and using the same notation as in Claim 4.1, we have
[TABLE]
by an application of Theorem 2.7. Also, by applying Claim 6.3, we see that there is a set of density (possibly depending on the we chose according to ), such that for every the measure is an affine image of the measure . Thus, we obtain an analogue of (20). From here, we complete the proof as in the proof of Claim 4.1.
- •
We finish the proof of the Theorem by showing that meets the conditions of Claim 3.4. The proof is essentially the same as in the case of Theorem 1.2.
Proof of Theorem 6.1 Since we have Theorem 6.2 at our disposal, the proof is now essentially the same as the proof of Theorem 1.1. We remark that an analogue of Lemma 5.1 remains true in this case as well, which may be deduced from the results of ([13], Section 8.4).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Tim Bedford and Albert M. Fisher. Ratio geometry, rigidity and the scenery process for hyperbolic Cantor sets. Ergodic Theory Dynam. Systems , 17(3):531–564, 1997.
- 2[2] Thomas Bogenschütz and Hans Crauel. The Abramov-Rokhlin formula. In Ergodic theory and related topics, III (Güstrow, 1990) , volume 1514 of Lecture Notes in Math. , pages 32–35. Springer, Berlin, 1992.
- 3[3] J. W. S. Cassels. On a problem of Steinhaus about normal numbers. Colloq. Math. , 7:95–101, 1959.
- 4[4] Kenneth Falconer. Techniques in fractal geometry . John Wiley & Sons, Ltd., Chichester, 1997.
- 5[5] Kenneth J Falconer. The geometry of fractal sets , volume 85. Cambridge university press, 1986.
- 6[6] J. Feldman and M. Smorodinsky. Normal numbers from independent processes. Ergodic Theory Dynam. Systems , 12(4):707–712, 1992.
- 7[7] Harry Furstenberg. Disjointness in ergodic theory, minimal sets, and a problem in diophantine approximation. Theory of Computing Systems , 1(1):1–49, 1967.
- 8[8] Harry Furstenberg. Intersections of Cantor sets and transversality of semigroups. In Problems in analysis (Sympos. Salomon Bochner, Princeton Univ., Princeton, N.J., 1969) , pages 41–59. Princeton Univ. Press, Princeton, N.J., 1970.
