Singular Vectors on Fractals and Projections of Self-similar Measures
Osama Khalil

TL;DR
This paper establishes an upper bound on the Hausdorff dimension of singular vectors on self-similar fractals, linking Diophantine approximation properties with fractal geometry, and applies the results to translation flows on flat surfaces.
Contribution
It provides an optimal upper bound on the Hausdorff dimension of singular vectors on self-similar fractals, extending previous results and addressing open questions in the field.
Findings
Upper bound matches known exact dimensions for trivial fractals.
For specific fractals like Cantor sets, the bound is 2/3 of the fractal's dimension.
The method applies to translation flows, showing non-uniquely ergodic directions have dimension at most 1/2 of the fractal.
Abstract
Singular vectors are those for which the quality of rational approximations provided by Dirichlet's Theorem can be improved by arbitrarily small multiplicative constants. We provide an upper bound on the Hausdorff dimension of singular vectors lying on self-similar fractals in satisfying the open set condition. The bound is in terms of quantities which are closely tied to Frostman exponents of projections of the Hausdorff measure supported on the fractal. Our bound is optimal in the sense that it agrees with the exact dimension of singular vectors obtained by Cheung and Chevallier when the fractal is trivial (i.e. has non-empty interior). As a corollary, we show that if the fractal is the product of copies of Cantor's middle thirds set or the attractor of a planar homogeneous irrational IFS, then the upper bound is the dimension of the fractal. This addresses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Singular Vectors on Fractals and Projections of Self-similar Measures
Osama Khalil
Department of Mathematics, Ohio State University, Columbus, OH
Abstract.
Singular vectors are those for which the quality of rational approximations provided by Dirichlet’s Theorem can be improved by arbitrarily small multiplicative constants. We provide an upper bound on the Hausdorff dimension of singular vectors lying on self-similar fractals in satisfying the open set condition. The bound is in terms of quantities which are closely tied to Frostman exponents of projections of the Hausdorff measure supported on the fractal. Our bound is optimal in the sense that it agrees with the exact dimension of singular vectors obtained by Cheung and Chevallier when the fractal is trivial (i.e. has non-empty interior). As a corollary, we show that if the fractal is the product of copies of Cantor’s middle thirds set or the attractor of a planar homogeneous irrational IFS, then the upper bound is the dimension of the fractal. This addresses the upper bound part of a question raised by Bugeaud, Cheung and Chevallier. We apply our method in the setting of translation flows on flat surfaces to show that the dimension of non-uniquely ergodic directions belonging to a fractal is at most the dimension of the fractal.
Key words and phrases:
singular vectors, fractals, Hausdorff dimension, translation surfaces
2010 Mathematics Subject Classification:
11J13, 11J83, 28A80, 37A17
1. Introduction
1.1. Statement of results
The goal of this article is to study the dimension of singular vectors on fractals. The notion of singular vectors is motivated by Dirichlet’s theorem. It states that for every vector and for every , the inequalities
[TABLE]
admit a non-zero solution when . A vector is then said to be singular if for every , there exists so that the inequalities in (1.1) admit a non-zero integral solution for all . We will denote by the set of singular vectors in . It is well-known [DS70a, KW08] that the Lebesgue measure of is [math].
In a remarkable article, the Hausdorff dimension of was determined by Cheung [Che11]. This result was extended by Cheung and Chevallier in [CC16] to higher dimensions. They showed that for ,
[TABLE]
where denotes Hausdorff dimension. More recently, a sharp upper bound on the dimension of singular -matrices was found in [KKLM17] and the exact lower bound was later determined in [DFSU19]. On the other hand, the study of Diophantine properties of fractals has attracted a lot of interest in recent years, begining with the work of Kleinbock, Lindenstrauss, and Weiss in [KLW04]. This study is motivated by Sprindžuk’s conjecture, resolved in [KM98], and concerns finding optimal conditions on measures and subsets of under which they inherit the Diophantine properties of the ambient space. Subsequently, Bugeaud, Cheung, and Chevallier raised the natural question of determining the Hausdorff dimension of singular vectors on fractals in .
Question 1.1** (Problem 6, [BCC18]).**
What is the dimension of the set of vectors in whose coordinates belong to Cantor’s middle thirds set?
In this article, we give an upper bound on the dimension of singular vectors belonging to a large class of fractals which arise as the limit sets of iterated function systems (IFS) of contractive similarities on . This class includes such familiar examples as products of the same Cantor set, Koch snowflakes, Sierpiński gaskets, etc. Our result is new even in the setting of Question 1.1. We refer the reader to Section 2 for detailed definitions. For prior work on Diophantine properties of fractals, see Section 1.4.
To state the result, we need some notation. For , we denote by the collection of all affine subspaces of of dimension . For and , we use to denote the open -neighborhood of in the Euclidean metric. Given a compactly supported Borel measure on and , we define by
[TABLE]
The quantities quantify the concentration of the support of near proper subspaces: the smaller is, the more concentrated its support is near affine subspaces of dimension . On the other hand, proper rational affine subspaces of are contained in . Our first main result shows that one can provide an upper bound on the dimension of singular vectors on fractals in terms of these quantities. It is a special case of Theorem 1.12 below.
Theorem A**.**
Suppose is an IFS consisting of contractive similarities on and satisfying the open set condition. Let be the limit set of and be the restriction of the -dimensional Hausdorff measure to , where . Then,
[TABLE]
Remark 1.2**.**
It is known that in the setting of Theorem A, is finite and non-zero, and (Proposition 2.2). When is irreducible, i.e. no proper affine subspace is invariant by all the maps in , then for each (Corollary A.10). We also show in Proposition A.7 that for self-similar measures, the in the definition of is in fact a limit.
Remark 1.3**.**
If is the Lebesgue measure on , then . Moreover, one can realize the unit cube as the attractor of an IFS as in Theorem A. Since singular vectors are invariant by integer translations, Theorem A shows that has dimension at most , which agrees with the exact dimension obtained by Cheung and Chevallier.
In the setting of Question 1.1, Theorem A yields the following.
Corollary 1.4**.**
Suppose , where is Cantor’s middle thirds set. Then,
[TABLE]
The quantities are closely tied to Frostman exponents of projections of , cf. [Shm16, Section 1.3] and Section 1.3 below. In a breakthrough article, these exponents were determined by Shmerkin for planar homogeneous irrational fractals in [Shm16, Theorem 8.2]. This result, combined with Theorem 1.9 below, allows us to evaluate the formula in Theorem A explicitly, yielding Corollary 1.5. Recall that a planar IFS is homogeneous if each is of the form where and is a rotation by angle , each of which is independent of . We say is irrational if .
Corollary 1.5**.**
Suppose is a homogeneous irrational IFS on satisfying the open set condition and let be its limit set. Then,
[TABLE]
For more general self-similar fractals, when the co-dimension of is , it is possible to get an explicit, yet crude, estimate on , yielding the following corollary.
Corollary 1.6**.**
Suppose , , and are as in Theorem A. Assume further that . Then, and, hence,
[TABLE]
Remark 1.7**.**
Corollary 1.6 along with (1.2) show that , under the hypotheses of Theorem A, whenever .
We emphasize that we do not expect the estimated upper bound in Corollary 1.6 to agree with the exact lower bound in general. However, we conjecture that the upper bound in Theorem A (and, in particular, Corollary 1.4) is sharp when the fractal contains a dense set of rational vectors, cf. Question 1.11.
It is worth noting that computing Frostman exponents of projections of self-affine (and in particular self-similar) measures is a rather delicate problem in general. For instance, when is the natural measure supported on the product of Cantor sets with multiplicatively independent dissection ratios, this problem constitutes the content of Furstenberg’s well-known intersection conjecture, recently resolved by Shmerkin in [Shm16], and independently by Wu in [Wu16].
1.2. Divergent orbits of the Teichmüller flow
Under Dani’s correspondence, it is known that corresponds to certain divergent orbits on the space of unimodular lattices in , see Theorem 1.12 below for details. In Section 7, we adapt our techniques to the closely related problem of divergent orbits of the Teichmüller geodesic flow.
In what follows, we fix a stratum of abelian differentials over a compact oriented surface (see Section 7.1 for definitions). Then, admits a natural action by and, in particular, by the following one parameter subgroups:
[TABLE]
The action induces the Teichmüller geodesic flow on . For , we say the orbit is divergent on average, if for every compact set , one has
[TABLE]
where denotes the indicator function of .
Theorem B**.**
Suppose is an IFS consisting of similarities on satisfying the open set condition and let be its limit set. Then, for every , the Hausdorff dimension of the set of such that the orbit diverges on average in is at most .
Masur showed in [Mas92] that the set of directions around any point for which the orbit is divergent have dimension at most . This was recently extended in [AAE*+*17] to show that this upper bound in fact holds for divergent on average directions. Theorem B generalizes both these results. More recently, the lower bound of on the dimension of divergent on average directions was established in [AM18].
The motivation for studying divergent Teichmüller geodesics comes from the study of the ergodic properties of billiard flows and interval exchange transformations (IETs). Masur’s criterion states that if the vertical straight line flow on is non-uniquely ergodic (NUE), then diverges in [Mas92]. In this vein, Theorem B has the following corollary.
Corollary 1.8**.**
Let be as in Theorem B. Then, for every , the set of directions such that the vertical flow on is not uniquely ergodic has dimension at most .
The size of the set of NUE directions has been extensively studied. A celebrated theorem of Kerckhoff, Masur, and Smilie shows that the Lebesgue measure of the set of NUE directions is [math] [KMS86]. This result was generalized by Veech in [Vee99] to a broader class of measures which includes Lebesgue and the natural measure on a Cantor set. Additionally, it is known that first return maps of straight line flows give rise to IETs. This observation can be packaged in the form of a locally defined map from a stratum of abelian differentials to the space of parameters of IETs with a given permutation (cf. [Mas82] for details). Minsky and Weiss characterized the straight lines in the space of IETs that arise as the image of orbits of the horocycle flow on strata [MW14] under such maps and showed that non-uniquely ergodic IETs belonging to these special lines have [math] mass with respect to a broad class of measures
1.3. Relation to Frostman exponents of projections
In the Appendix, we study the relationship between the quantities and the Frostman exponents of the projections of and prove Theorem 1.9 below. This result allows us to apply a result of Shmerkin to deduce Corollary 1.5 from Theorem A. To motivate the theorem, we first give an equivalent definition of the quantities . Let be the Grassmanian of vector subspaces of of dimension . We identify subspaces with the associated canonical orthogonal projection from . For , , and , let be the ball of radius around . If is a measure space with measure and is a measurable map, we denote by the push-forward measure. Given a compactly supported Borel measure on and , can be alternatively defined by
[TABLE]
Given a finite measure on , the Frostman exponent of , denoted , is defined by
[TABLE]
By definition, we have . For self-similar measures, we show the following.
Theorem 1.9**.**
Suppose is an irrational homogeneous IFS on satisfying the open set condition with limit set . Let be the restriction of the -dimensional Hausdorff measure to , where . Then, for Lebesgue almost every ,
[TABLE]
It is reasonable to expect the first equality in Theorem 1.9 to hold for self-similar measures in greater generality. We hope to address this question in future work.
The proof proceeds by realizing the quantities in question as limits of a certain sub-additive cocycle over an irrational rotation and utilizing an extension of the classical sub-additive ergodic theorem due to Furman [Fur97]. This technique has been used for similar problems in [PS09, NPS12, GSSY16].
Remark 1.10**.**
- (1)
Under the hypotheses of Theorem 1.9, it is shown in [Shm16, Theorem 8.2] that for every . The point of Theorem 1.9 is establishing equality between and these Frostman exponents. This statement is perhaps not surprising to experts, though we could not locate a reference in the literature. 2. (2)
Suppose where is Cantor’s middle thirds set realized as the attractor of the natural IFS, denoted by . A remarkable result of Shmerkin in [Shm16, Theorem 6.2, Corollary 6.4] shows that whenever . On the other hand, the projections of on the coordinate axes have Frostman exponent equal to . This shows that the minimality assumption (irrationality of ) cannot be dropped for the second equality in the conclusion of Theorem 1.9.
Theorem 1.9 suggests an affirmative answer to the following question.
Question 1.11**.**
Suppose , and are as in Theorem A. Assume that the group generated by the rotation parts of the maps in the IFS is dense in . Is it true that for every ? If we further assume that the maps in are all defined by rational parameters, is it true that ?
1.4. Prior work
Kleinbock and Weiss showed in [KW05b] that irreducible self-similar measures satisfying the open set condition (OSC) give [math] mass to . Indeed, they establish this result for the much wider class of friendly measures introduced in [KLW04]. When is fixed, the set of vectors which admit non-trivial solutions to the inequalities in (1.1) for all large are referred to as Dirichlet -improvable and are denoted . In [SW16], generalizing Benoist and Quint’s fundamental measure rigidity results for random walks on homogeneous spaces, Simmons and Weiss showed that irreducible self-similar measures with the OSC give [math] mass to for every . Special cases of this result were obtained previously by Einsiedler, Fishman, and Shapira for measures admitting invariance by expanding maps [EFS11]. This latter result relied on entropy methods and measure rigidity results for higher rank diagonlizable actions.
The above results indicate scarcity of singular vectors (and indeed of vectors) in the support of self-similar measures. On the other hand, Kleinbock and Weiss showed that badly approximable vectors have full dimension in the support of absolutely friendly measures (which include the measures in Theorem A) [KW05a], cf. [Fis09]. An observation of Davenport and Schmidt [DS70b, Theorem 2] shows that badly approximable vectors belong to . It follows that the set has full dimension in the support of these measures. Finally, the reader may wish to consult [DFSU18] for more recent developments in the study of extremality of fractal measures.
On the fractal geometric side, dimensions of projections of self-similar sets and measures have been extensively studied, see [Shm15] for a survey. It is shown in [Shm16, Section 6] that the Frostman exponent of the projection in an irrational direction of the restriction of the Hausdorff measure to a Sierpiński carpet of dimension or to the -dimensional Sierpiński gasket in the plane is equal to . These results build on prior work of Hochman in [Hoc14] and the observation that the projections of such sets are themselves self-similar. Less is known about projections of these sets in rational directions. However, several regularity results on the dimension of slices of the aforementioned sets with lines of rational slopes have been established in [BFS12, BR14] and references therein. When is the limit set of an IFS satisfying a stronger separation condition than OSC and such that the rotation parts of the maps in generate a dense subgroup of , it is shown in [HS12] that the image of under a projection onto an -dimensional subspace has Hausdorff dimension . Moreover, under the same hypotheses, it is shown in loc.cit. that projections of the associated self-similar measures onto subspaces of dimension are exact-dimensional, with dimension equal to . The reader is referred to [PS09, NPS12, HS12] and references therein for more results in that direction.
1.5. Overview of the proof and reduction to dynamics
We will deduce Theorem A from a stronger dynamical statment, Theorem 1.12 below. Let , , and . For and , define the following elements of .
[TABLE]
where denotes the identity matrix. It was shown by Dani in [Dan85] that is singular if and only if the orbit diverges in . Theorem A follows from the following result.
Theorem 1.12**.**
Suppose is an irreducible IFS on satisfying the open set condition and let be its limit set. Let and be the restriction of the -dimensional Hausdorff measure to . Then, for every , the Hausdorff dimension of the set of vectors such that the forward orbit diverges on average in is at most
[TABLE]
For convenience of the reader, we outline the proof of Theorem 1.12. The proof has two main steps: a linear argument and a probablistic scheme. The linear argument is concerned with estimating the average rate of expansion of vectors in the exterior powers of the standard representation of with respect to any measure satisfying for every . Roughly, we show that , for every , where is any non-zero vector in , equipped with the standard representation of . This is the content of Section 5.
The proof of this step is based on a simple but crucial observation regarding transversality of the expanding coordinates of . This is carried out in Propositions 4.1 and 4.2. Roughly speaking, we show that if the projections of onto each expanding coordinate of are simultaneously small, this implies that belongs to a neighborhood of an affine subspace of low dimension.
The other ingredient is to use the method of integral inequalities, first introduced in [EMM98], to translate the results on linear expansion to recurrence results on the space . This is carried out in Section 6. We construct a Margulis function (see Def. 3.1) which measures the depth of orbits into the cusp. We show that the average value of , for a fixed , and with respect to any non-planar measure , is a fraction of , whenever is sufficiently deep in the cusp. This concludes the linear argument.
The probablistic scheme takes as input the average height contraction established in the previous step and converts it into an upper dimension estimate on singular vectors in the support of the measure, Theorem 3.4. This step is the technical heart of this article and is carried out in Section 3. Before explaining the strategy, it is worth noting at this stage that we have not yet used the assumption that the measures under consideration are self-similar. However, this assumption is indispensible for the probablistic scheme. For example, using Frostman’s Lemma, and the fact that , one can find a measure , whose support is contained , and satisfying . This in particular implies that for all (cf. proof of Lemma 2.6). The results of the previous step (the linear argument) apply to such a measure, however clearly singular vectors have full dimension in the support of .
The main step in proving Theorem 3.4 is Proposition 3.5. The key property of self-similar measures we use is a renormalization mechanism to convert global information (average contraction over the entire support) obtained via the linear argument at a small fixed time scale to local information (average contraction over small pieces of the support) at large time scales. We make use of the existence of a faithful representation , where is the group of similarities of , , and is the normalizer of in . Under this inclusion, the scaling subgroup of corresponds to .
Analogous steps have been studied before in [KKLM17] in the case of Lebesgue measure on and in [Kha18] for measures supported on curves. The strategy in those cases is to show that the probability that an orbit segment spends a large proportion of its time in the cusp decays exponentially with a precise rate. One then uses continuity of the flow to show that if is one such point, then a whole neighborhood of of radius has roughly the same behavior. This converts measure estimates into a count on covers and an estimate on the box dimension.
Unfortunately, this strategy seems to fail when the contraction ratios of the maps in the IFS are not all the same and this introduces considerable difficulties in our case. This is due to the fact that the natural neighborhoods used in constructing the covers are pieces of the fractal whose diameters have distinct exponential decay rates to [math]. We introduce a method which does not rely on counting covers and rather works by estimating the Hausdorff dimension directly. The key starting inequality of our estimates roughly takes the form:
[TABLE]
where the sum is over self-similar pieces of the limit set which meet the set of vectors whose orbit segment of length spends a large proportion of its time in the cusp. Here, we use the fact that is a Hausdorff measure and, in particular, that . This allows us to interpret this sum as an integral of a cocycle measuring the diameter of the self-similar pieces. Moreover, and crucially to our method, we interpret the factor as being a non-constant contraction ratio of the averaging operator given by integrating against as in the right side of (1.9). This is to be contrasted with the more standard integral inequalities of Margulis functions of the form , with a uniform contraction factor . We also estimate the indicator of by a product of the height function and the indicator of . Accordingly, our linear argument is modified to take into account the additional cocycle factor inside the integral. An inductive procedure is then carried out to bound the -Hausdorff measure of the set .
We conclude this introduction with a natural problem arising from our investigations.
Question 1.13**.**
What is the Hausdorff dimension of singular vectors belonging to the limit set of a Zariski-dense, convex cocompact, discrete group of Möbius transformations of ?
We refer the reader to [BGSV18] where a related notion of intrinsic singular vectors on the limit set of geometrically finite manifolds is introduced and studied. We remark that the Diophantine problems studied in [BGSV18] correspond with the recurrence behavior of the geodesic flow on hyperbolic manifolds, while Question 1.13 is related to the recurrence of diagonal flows on via Dani’s correspondence in the spirit of Theorem 1.12.
Acknowledgements**.**
I would like to thank the anonymous referees for a careful reading and for numerous valuable comments that improved the presentation and corrected several inaccuracies in the initial version of the article. I would like to thank Jon Chaika, Yitwah Cheung and Barak Weiss for their interest in the project and for comments on an earlier version of the manuscript. I would also like to thank Pablo Shmerkin for several valuable suggestions and for providing the proof of Lemma A.1.
2. Preliminaries
We recall some properties of self-similar measures to be used in later sections.
2.1. Hausdorff dimension
We recall the definition of the Hausdorff dimension. Let be a subset of a metric space . For any , we define
[TABLE]
The -dimensional Hausdorff measure of is defined to be
[TABLE]
The Hausdorff dimension of is defined by
[TABLE]
2.2. IFS Notation
Fix a finite set . An iterated function system (IFS for short) is a finite collection of contractive similarities of , i.e., for each , has the form
[TABLE]
where , , and . The similarity dimension of is defined to be the unique solution of the equation . It is shown in [Hut81] that there exists a unique compact set which is invariant by in the following sense.
[TABLE]
We refer to the set as the limit set of . Hutchinson introduced a notion of separation, the open set condition, that will allow us to treat the union in (2.2) as if it were disjoint, cf. Proposition 2.3 below. Following [Hut81], we say satisfies the open set condition (OSC for short) if there exists a non-empty open set such that the following holds:
[TABLE]
Given , we let
[TABLE]
The maps take the form , where
[TABLE]
It will be convenient for us to consider the numbers for finite prefixes of infinite words . We do this by means of a multiplicative cocycle. To this end, let be the shift map, i.e., . Consider the function defined as follows for
[TABLE]
Then, satisfies the cocycle relation: .
We wish to regard as a function on . Since members of may not be disjoint, there is ambiguity on the overlaps. For this purpose, we introduce a modified partition consisting of disjoint sets as follows. Using any fixed order on the elements of , we can endow with a lexicographic order and define for each :
[TABLE]
Let and let be the unique word such that . Then, we define
[TABLE]
The following lemma shows that form a refining sequence of partitions. This fact is used in the inductive procedure of constructing covers for singular vectors by elements of .
Lemma 2.1**.**
For every and every , we have , where the union is taken over words such that is the prefix of of length .
Proof.
Suppose is such that is a prefix of . Suppose is such that and suppose . By (2.2), we have , where denotes . Let be such that . Then, since and is a prefix of , it follows that by definition of the lexicographic order. This implies that . This shows that .
To show the reverse containment, fix . Let be the maximal word in in the lexicographic order having as a prefix and satisfying 1) , and 2) for every such that . Suppose for contradiction that . Then, there exists some such that and . Let be the prefix of . By maximality of , . However, by definition of the lexicographic order, since , it must be that . In particular, since , it follows that , contrary to our assumption and concluding the proof. ∎
2.3. Self-similar measures
Fix a probability vector with full support on . That is , for every , and . The Markov-Feller Operator is defined by
[TABLE]
for all Borel measures on . It is shown in [Hut81] that there exists a unique probability measure supported on and satisfying
[TABLE]
We refer to measures satisfying (2.9) as self-similar measures for . Simple induction applied to (2.9) shows that
[TABLE]
where . Under the open set condition, Hutchinson showed that the self-similar measure for the natural probability vector is in fact the -dimensional Hausdorff measure .
Proposition 2.2** (Theorem 5.3(1), [Hut81]).**
Suppose is an IFS satisfying the open set condition with similarity dimension and let denote its limit set. Then, . In particular, . Let denote the normalized restriction of to . Then, is the self-similar measure for the probability vector . Moreover, there exist constants , such that for every and every ,
[TABLE]
2.4. Consequences of null overlaps
In general, the overlap between members of causes serious problems in the analysis. However, the following result, obtained in [Hut81], shows that the OSC insures that these overlaps are negligible from the point of view of self-similar measures.
Proposition 2.3** (Proposition 5.1(4), Theorem 5.3(1)[Hut81]).**
Suppose satisfies the open set condition and let be its similarity dimension. Let and be the self-similar probability measure for . Then, for every and all , .
We now state two consequences of Proposition 2.3 which we use in our proof. For a Borel set and a Borel measure , we denote by the restriction of to . That is for every Borel set , . The following lemma will be useful in estimating Hausdorff dimension.
Lemma 2.4**.**
Suppose satisfies the open set condition. Then, for every and , and have the same diameter.
Proof.
It suffices to show that is dense in . For and , denote by its length prefix. Let denote the coding map defined by
[TABLE]
The space is endowed with the product topology induced from the discrete topology on . In this topology, the map is onto and continuous [Hut81, Theorem 3.1.3(vii)]. Moreover, if and is the unique self-similar measure satisfying (2.9), then
[TABLE]
Let and . It suffices to show that is dense in by continuity of . Note that is the cylinder set consisting of all sequences whose prefix of length is . Moreover, by Proposition 2.3, . Since has full support in , is dense in . ∎
We record another useful consequence of the null overlaps.
Lemma 2.5**.**
Suppose satisfies the open set condition, and . Then, for every and all ,
[TABLE]
Proof.
This follows from Proposition 2.3 and equation (2.10). ∎
2.5. Elementary facts on projections
We say is irreducible if no finite collection of proper affine subspaces of is invariant by each . When is the similarity dimension of , Proposition 2.2 implies that
[TABLE]
where is the restriction of the -Hausdorff measure to . Moreover, it is easy to see that
[TABLE]
The following lemma provides a simple lower esimtate for .
Lemma 2.6**.**
Suppose is an irreducible IFS on satisfying the open set condition and let be its limit set. Suppose . Then, for each , , where is the restriction of to .
Proof.
This result is well-known, we provide a proof for completeness. Given an affine subspace of dimension and , the boundedness of implies that we can cover the set with balls of radius . By Proposition 2.2, each such ball has measure at most . In particular, . As and were arbitrary, this completes the proof. ∎
3. The Contraction Hypothesis and Divergent Trajectories
In this section, we prove an abstract recurrence result for orbits of the diagonal flow in (1.8) starting from fractals in actions of on metric spaces. Theorem 3.4 is the main result of this section establishing a bound on the dimension of divergent orbits. In later sections, we verify the hypotheses of this theorem in the settings of the results stated in the introduction.
3.1. The Contraction Hypothesis for Actions of SL(n,R)
We fix the following more convenient parametrization of the diagonal subgroup of , defined in (1.8), which we denote by for ,
[TABLE]
where denotes the identity matrix. Note that in this parametrisation, one has for every ,
[TABLE]
Recall the definition of self-similar measures in (2.9). The following is the key recurrence property for the action and the measure which underlies the results stated in the introduction.
Definition 3.1** (The Contraction Hypothesis).**
Suppose is a metric space equipped with an action of and let be a self-similar probabilty measure for an IFS on . Given a collection of functions and real numbers , we say that satisfies the -contraction hypothesis on if the following properties hold:
- (1)
The set is independent of and is -invariant. 2. (2)
For every , is -invariant and uniformly log Lipschitz with respect to the action. That is for every bounded neighborhood of identity in , there exists a constant such that for every , and ,
[TABLE] 3. (3)
There exists a constant such that the following holds: for every and , there exists such that for all with ,
[TABLE]
where is the cocycle defined in (2.8) and is the limit set of the IFS .
The functions will be referred to as height functions.
The notion of height functions was introduced in homogeneous dynamics in [EMM98] and was used in [KKLM17] to find a sharp upper bound on the dimension of singular systems of linear forms. The “Contraction Hypothesis" terminology is due to [BQ11].
Remark 3.2**.**
- (1)
Inequality (3.3) should be thought of as a Margulis inequality with a non-uniform contraction ratio. To best illustrate this analogy, consider the case where for some fixed constant . Setting in (3.3) yields
[TABLE]
for all and and all , where is the constant in Def. 3.1(3). In particular, if is large enough so that , the inequality in (3.4) recovers the classical form of Margulis inequalities of the form (cf. [EMM98, EM04, BQ11, EM01, EMM15, KKLM17, Kha18])
[TABLE]
where , , , and . 2. (2)
The collection of height functions we use in our applications will be equivalent in the following sense: for all , there exists a constant , such that
[TABLE]
Allowing this flexibility in Def. 3.1 is necessary however for verifying the contraction inequality (3.3) in the settings of flows on homogeneous spaces and strata of abelian differentials. We refer the reader to the proofs of Theorem 6.3 and Corollary 7.3 where this dependence of the height functions on is exploited. For the purposes of the discussion in this section, this flexibility does not play a role in the proofs and the reader may wish to regard the collection as consisting of a single height function. 3. (3)
In our applications, the constant , in Def. 3.1(3), can be chosen to be independent of .
We note that allowing height functions to assume the value has proven useful in several important applications [BQ11, EMM15].
Definition 3.3**.**
In the presence of a collection of height functions on a metric space with a -action, we say an orbit is -divergent on average, if for every and every ,
[TABLE]
where is the indicator function of . We often drop from the notation: -divergent on average when is understood from context.
The following is the main result of this section.
Theorem 3.4**.**
Let be a metric space equipped with an action by . Suppose is an IFS on satisfying the open set condition and denote by its limit set. Let and let be the restriction of the -dimensional Hausdorff measure to . Assume that satisfies the -contraction hypothesis on for a collection of height functions and real numbers . Then, for all ,
[TABLE]
The main applications of our results are to the action on the space of unimodular lattices in and to actions on moduli spaces of abelian differentials. The remainder of this section is dedicated to the proof of Theorem 3.4.
3.2. Notation
Throughout the remainder of this section, we let and be as in Theorem 3.4. We fix a finite set so that and denote by the limit set of . We use the notation of Definition 3.1 pertaining to the height functions . In particular, for , we have
[TABLE]
Moreover, we retain the iterated function systems notation of Section 2. In particular, given a word , we use the notation:
[TABLE]
where is given by (2.4). For an integrable function on , we use and interchangeably.
3.3. Integral inequalities and covering estimates
The goal of this section is to use the contraction hypothesis to control the Hausdorff dimension of divergent orbits.
Using the log Lipschitz property of Definition 3.1, one can easily verify that the orbit diverges on average if and only if
[TABLE]
for all and , where is the indicator of . This observation is very useful in handling the case where the contraction ratios of maps in are not all the same. This case poses significant difficulties in the proof. We are thus naturally led to studying the following sets: for , , , and , define by
[TABLE]
The following proposition is one of the main technical results of this article.
Proposition 3.5**.**
There exists a constant , depending only on the support of and on the constant in Def. 3.1(3), such that the following holds. For every , and , there exists , so that for all the following holds. For all , , one has that
[TABLE]
where the sum is taken over words satisfying . Moreover, can be chosen uniformly as varies in a fixed sub-level set for any .
Notational Convention
For the remainder of this section, we use to denote to simplify notation.
We need technical preparation before the proof which occupies the next subsections. Define the following constants:
[TABLE]
Using of Definition 3.1, we can find such that
[TABLE]
for all , and all in a ball around of radius . We also fix a constant so that
[TABLE]
for all , and . For , and natural numbers , we define the following sets:
[TABLE]
Recall the definition of the sets and the partitions in (2.7). We frequently use the fact that for every and that
[TABLE]
These facts follow from Lemmas 2.4 and 2.5.
3.4. Averages of multiplicative cocycles
We record the following cocycle relation.
Lemma 3.6**.**
Let . Then, for every and ,
[TABLE]
Proof.
Let be such that , where is the concatenated word. In particular, by definition
[TABLE]
Then, we have , where are the contraction ratios of and respectively. Moreover, Lemma 2.1 implies that . Hence, . Finally, since , we see that and, in particular, . ∎
The next lemma is a special case of a general principle: averages of “locally constant" submultiplicative cocycles form a submultiplicative sequence.
Lemma 3.7**.**
For all and all ,
[TABLE]
Proof.
Let . Then, for all , by Lemma 3.6, we obtain
[TABLE]
By Lemma 2.5, applied with , it follows that
[TABLE]
Hence, we get that
[TABLE]
The lemma follows by induction. ∎
The following lemma allows us to control complete sums over covers.
Lemma 3.8**.**
Let . For every and ,
[TABLE]
where, for , denotes the concatenation of and . In particular, .
Proof.
By Lemma 2.4 and the cocycle property of , we have
[TABLE]
Moreover, we have that is constant almost everywhere on and equal to for every . Thus, using the fact that , we obtain
[TABLE]
By Lemma 2.8, for almost every
[TABLE]
It follows that
[TABLE]
By Lemma 2.5, for every integrable function , we have . This implies that
[TABLE]
∎
3.5. Consequences of the log-Lipschitz property
The next lemmas provide us with simple consequences of the log-Lipschitz property of the function in Definition 3.1.
Lemma 3.9**.**
Suppose is such that . Then,
[TABLE]
Proof.
Suppose . Let and let be such that is greater than . Let be any other vector. Note that is constant on elements of . This implies
[TABLE]
Let be such that and . The invariance of the Euclidean norm by implies
[TABLE]
Thus, since , the choice of the constant in (3.8) implies
[TABLE]
This being true for all concludes the proof. ∎
Lemma 3.10**.**
Suppose for some , and some . Then,
[TABLE]
where .
Proof.
The proof is completely anaolgous to that of Lemma 3.9. ∎
Lemma 3.11**.**
Let . Then, for all ,
[TABLE]
where is given by (3.8) and .
Proof.
The proof follows from the fact that everywhere on and the following estimate:
[TABLE]
for all , where is given by (3.7). ∎
3.6. Consequences of the contraction property
For every and , we define the following elemenets of :
[TABLE]
where is the rotation part of the similarity and is given by (2.5). Note that and that each commutes with . The following lemma is the first main step in the proof of Proposition 3.5.
Lemma 3.12**.**
Let be the constant in (3.8). Let and let be the constant provided by of Definition 3.1 with . Suppose that for some , , , and . Then,
[TABLE]
where is as in of Definition 3.1.
Proof.
By Lemma 3.6, it follows that
[TABLE]
where on the second line we used the fact that everywhere on .
Note that since , we have , where denotes the transpose of . Moreover, , where . Thus, the following identity holds.
[TABLE]
Observe that commutes with and recall that the function is -invariant. This implies
[TABLE]
By Lemma 2.5, for every ,
[TABLE]
Combining this fact with (3.13), we obtain, for ,
[TABLE]
Since commutes with and is invariant by , it follows that . In particular, by Lemma 3.10, we have . Thus, the contraction property of in of Definition 3.1 implies
[TABLE]
Finally, we apply Lemma 3.11 to get
[TABLE]
Combining (3.6), (3.14), and (3.15), along with the fact that for all yields the desired estimate and concludes the proof. ∎
The following lemma uses Lemma 3.12 as a base step in an inductive procedure to establish an exponentially decaying estimate for similar averages over points with long cusp excursions.
Lemma 3.13**.**
Let and be the constants in (3.8) and (3.9) respectively. Let and let be the constant provided by of Definition 3.1 with . For all , , and ,
[TABLE]
where is given by:
[TABLE]
and is as in of Definition 3.1.
Proof.
If , then and the statement follows trivially. Thus, we may assume that and that . Let . Lemma 2.1 implies that , where the union is taken over words so that . In particular, we get
[TABLE]
where is given by (3.16). Next, we note that the following inclusion holds by Lemma 2.1.
[TABLE]
Moreover, Proposition 2.3 shows that the indicator functions of the above unions are equal almost everywhere to sums of the indicator functions of the members of the union. Hence, combining (3.6), and (3.18) yields the following estimate
[TABLE]
By an iterated application of (3.6) and (3.18), we obtain
[TABLE]
To apply Lemma 3.12 at this stage, we need to ensure that for each such that , we have that for some . Recall that we are assuming that and . Let for some . Then, we have that
[TABLE]
Moreover, by Lemma 3.6, we can write , for some . Thus, by the choice of in (3.9), this implies that . Hence, we obtain
[TABLE]
Finally, by Lemma 3.9, it follows that
[TABLE]
∎
The next ingredient is to provide an upper estimate of the sum appearing in Proposition 3.5 using integral estimates of the height function . The next lemma is a first step in that direction.
Lemma 3.14**.**
For all , , , and any ,
[TABLE]
where is given by (3.7) and is as in (3.8).
Proof.
For any finite word , we have
[TABLE]
Moreover, we have that . Indeed, this follows from the self-similarity of in (2.9) and Proposition 2.3 showing that the distinct sets have null overlap. To simplify notation, let , and define
[TABLE]
The other ingredient is to note that for every . Hence, we get
[TABLE]
In view of Lemma 3.9, we have that
[TABLE]
Finally, we observe that the following inequality
[TABLE]
holds for all by definition, where for a set , denotes its indicator function. ∎
3.7. Proof of Proposition 3.5
Fix some and define
[TABLE]
Our hypothesis implies that satisfies the -contraction hypothesis (note that is self-similar with respect to ). In particular, Lemmas 3.6 - 3.14 hold with and in place of and respectively. In our proof below, the only dependence of the constants on is in the constant defined in (3.9) (since the cocycle for is given by for ), and given by of Definition 3.1. This dependence will appear only in the choice of the constant in (3.19) below. Since our conclusion states that depends on and by replacing and by and respectively, we may hence assume that . For simplicity, we use the following notation:
[TABLE]
Let and be given. Let be the constant provided by of Definition 3.1 with . We define as follows
[TABLE]
where and are the constants in (3.8) and (3.9) respectively. Suppose , and are given. To simplify notation, let
[TABLE]
where is the constant in of Definition 3.1.
Consider a subset containing at least elements. Define the following set of trajectories whose behavior is determined by :
[TABLE]
We decompose the set and its complement into maximal “connected" intervals as follows
[TABLE]
for some integers . Note that . We claim that
[TABLE]
Since the set is a union of at most subsets of the form , the claim of the proposition follows by taking .
Order the intervals and in the way they appear in the sequence . Write for the interval in this sequence for . For a subset , we use to denote its cardinality. For purposes of induction, we write for a set with one element and whenever .
Case 1:
so that . Let be such that . Then, we note that
[TABLE]
where the sets were defined in (3). Hence, we may apply Lemma 3.14 with and to get
[TABLE]
where . We can then apply Lemma 3.13 to get
[TABLE]
where is defined in (3.20).
Recall that was chosen so that . Moreover, the choice of implies that . In particular, since is a maximal sub-interval of , we see that for some . The choice of the constant in (3.8) then implies that for all .
Moreover, is constant everywhere on . It follows that
[TABLE]
In the last equality, we used the fact that and . This follows from Lemmas 2.4 and 2.5 respectively. We thus arrive at the following estimate
[TABLE]
In view of the inclusion , it follows that
[TABLE]
Case 2:
so that . In this case, we apply Lemma 3.8 to obtain
[TABLE]
Then, using Lemma 3.7, it follows that
[TABLE]
Finally, using that , we obtain
[TABLE]
Equipped with the estimates in (3.22) and (3.23), we iteratively bound the sum in (3.21) by similar sums over covers of sets of the form with and yielding the following upper bound by downward induction on :
[TABLE]
Recall that and . Moreover, since is positive and strictly less than , we have that , . Hence, since , it follows that
[TABLE]
This implies the main claim (3.21).
3.8. Proof of Theorem 3.4
Having established Proposition 3.5, the proof of Theorem 3.4 follows from the definition of Hausdorff dimension. Recall the definition of the Hausdorff (outer) measures given in Section 2.1.
Let and let denote the set of vectors for which the trajectory diverges on average. As we noted in (3.5), in view of the log Lipschitz property of Definition 3.1, the set is related to the sets , defined in (3.6), via the following inclusion:
[TABLE]
for all and . We wish to apply Proposition 3.5. Fix some and let be as in the conclusion of the proposition. Recall the definition of and in (3.20). Let be sufficiently close to such that
[TABLE]
Note that as . Choose large enough so that
[TABLE]
where we used Lemma 3.7 in the first equality. Let be as in Proposition 3.5 and suppose . For each , define
[TABLE]
Suppose is given. Then, for every , Proposition 3.5 and the first equality in (3.26) show that
[TABLE]
By taking to infinity, it follows that
[TABLE]
Recall that for any countable collection of Borel sets . Since is contained in the union of countably many sets of the form by (3.25), it follows that . This being true for , this implies that as desired.
4. Transversality of Expanding Coordinates
In this section, we establish the first step towards verifying the contraction hypothesis on the space of lattices . The goal is to prove a key observation which allows us to obtain optimal average contraction rates with respect to any measure for which for every , where is defined in (1.3). The main results are Proposition 4.1 and 4.2.
4.1. The exterior power representation
We begin by giving a description of the coordinates of the fundamental representation of on the following vector space
[TABLE]
An element acts on via the linear map . Consider the basis of , where denotes the standard basis element. For each index set , we let
[TABLE]
The collection of monomials gives a basis of for each . We denote by the standard Euclidean inner product on , making the monomials an orthonormal basis of . Note that this basis consists of joint eigenvectors of the linear maps , where is any diagonal matrix.
Let and write
[TABLE]
where the sum is over index sets of cardinality . Let . First, we note that fixes and maps to for . This implies the following.
[TABLE]
where the sign depends on . In particular, we get that
[TABLE]
4.2. Transversality of the expanding coordinates
Let and write for each index set . For each index set containing [math], let be the affine subspace defined by
[TABLE]
Note that it is possible that or . Denote by the normal vector of given by
[TABLE]
where denotes the standard basis of and the choice of the signs is the same as in (4.3). Given an index set () of size , define by
[TABLE]
We note if , then if and only if appears as a coordinate of .
The following elementary proposition is a key observation for our proof.
Proposition 4.1** (Transversality).**
Suppose and let be an index set. Let . Then,
[TABLE]
Proof.
Note that by definition and in particular . Consider the map defined by
[TABLE]
for every . One easily verifies that is a bijection. In particular, . We claim that for each , the following holds:
[TABLE]
Indeed, when , then by definition of in (4.4). Otherwise, if satisfies , then we observe that . Indeed, for some , not equal to . In this case, the definition of in (4.4) shows that the coefficient of is [math]. This completes the proof of (4.6).
Equation (4.6) implies that the -matrix is diagonal, up to a permutation of the rows, with on the diagonal. It then follows from the definition of the inner product on that
[TABLE]
This completes the proof. ∎
For an affine subspace and , recall that we denote by the open neighborhood of . More precisely,
[TABLE]
where is the Euclidean distance.
Proposition 4.2** (From Transversality to Integrability).**
Suppose are affine hyperplanes in with . Suppose that is a unit normal vector of for each . Assume further that
[TABLE]
for some . Then, there exists , depending only on and d, so that for all ,
[TABLE]
Proof.
Let . First, we claim that . For each , let be such that . Let denote the matrix whose th row is and write for the column vector whose th entry is . Then, is the set of solutions of the system . The assumption that implies that has full rank. In particular, this implies that is non-empty.
By applying a translation, we may assume that . Let denote the canonical projection parallel to . Let for each . We note that it suffices to show
[TABLE]
for every . Note that is a unit vector orthogonal to for each . We continue to denote by the image of under .
Suppose . Since each passes through the origin and each is a unit normal to , the Euclidean distance of to is given by . In particular, . Note that . Thus, our task is to show that for an appropriate uniform constant .
Denote by the square matrix whose rows are . As are unit vectors, there is a constant , depending only on , such that . Moreover, we have that and in particular that is invertible. Recall that if , then . Thus, the following norm estimates follow:
[TABLE]
Moreover, we have that
[TABLE]
Together those two inequalities imply that , where .
∎
5. Decay Exponents, Transversality, and Expansion
This section is dedicated to proving estimates on the average rate of expansion of vectors in linear representations with respect to general measures, Proposition 5.1. The main point of the result is the precise integrability exponent for the functions . A key ingredient in the proof is the transversality result obtained in the previous section. Recall the definition of in 1.3 and the parametrization of in (3.1).
Proposition 5.1**.**
Let for some . Suppose is a compactly supported Borel probability measure on and suppose . Let be given. Then, for all , there exists a constant so that the following holds: for every , , and every measurable function :
[TABLE]
where
[TABLE]
Remark 5.2**.**
It is worth noting that the constant in Proposition 5.1 has a delicate dependence on the measure . In particular, it depends on the rate of convergence of the in the definition of . Moreover, in our proof, as . In particular, it is not clear whether Proposition 5.1 holds with and with a finite constant .
Before the proof, we state two elementary lemmas which will be useful for us. The first lemma is immediate from the definition.
Lemma 5.3**.**
Suppose is a Borel measure on such that exists for some . Then, for every , there exists so that for all
[TABLE]
The next lemma is a simple application of Fubini’s Theorem.
Lemma 5.4**.**
Let be a Borel measure and a non-negative Borel function on a separable metric space . Then,
[TABLE]
Proof of Proposition 5.1.
Let and without loss of generality assume . Denote by the following collection of vectors
[TABLE]
We use to denote the linear span of and we let . Then, is the expanding subspace corresponding to . Denote by be the canonical projection. Denote by the image of under and let . Define as follows:
[TABLE]
Then, since is compactly supported. Observe that for , we have that
[TABLE]
Fix some and let be given. We wish to apply Hölder’s inequality. To this end, let
[TABLE]
Note that and are Hölder conjugates. Then, Hölder’s inequality and (5.2) imply
[TABLE]
Hence, it remains to show that the integral of is uniformly bounded. We split the analysis into two cases based on the size of . Recall the expression of in standard coordinates given in (4.3).
Case 1:
. Then, since , there is an index set containing [math] so that . It follows that for each
[TABLE]
This implies that and concludes the proof in this case.
Case 2:
. Then, there exists some index set , not containing [math] so that . Define by
[TABLE]
Let and define by
[TABLE]
where the choice of signs is as in (4.3). Note that appears as a coordinate of . In particular, . Moreover, Proposition 4.1 then shows that .
Consider the hyperplane . Then, a simple calculation shows
[TABLE]
where denotes Euclidean distance. For each , we define the set as follows:
[TABLE]
Suppose that . Then, for each index set containing [math], the following holds.
[TABLE]
It follows that . For simplicity, denote by . Hence, Proposition 4.2, applied with , implies
[TABLE]
for some constant depending only on and . Applying Lemma 5.4, we obtain
[TABLE]
The next ingredient is to apply Lemma 5.3. We observe that is strictly smaller than , since and . Let . Then, . Applying Lemma 5.3 with this , we get that there exists so that for all ,
[TABLE]
Here, we use the fact that is an affine subspace of dimension since it is the intersection of transverse affine hyperplanes by Proposition 4.1. Let be sufficiently large such that . Note that the choice of here depends only on and . Since is a probability measure, it follows that
[TABLE]
Moreover, by (5.4), we obtain
[TABLE]
Finally, note that our choice of implies that . In particular, we have that . Combining (5), (5.5), (5.6), and (5.7) concludes the proof.
∎
6. Height Functions and Integral Inequalities
In this section, we construct a proper function on the space of unimodular lattices. Using the results in the previous section, we verify that this function satisfies the properties listed in Definition 3.1. The key idea that allows converting integral estimates in linear representations into integral estimates over the space of lattices is the use of the so-called systems of integral inequalities which first appeared in [EMM98]. The main result of this section is Theorem 6.3.
6.1. Preliminary notation
Throughout this section, we set
[TABLE]
In view of Proposition 5.1, the results of this section apply to general Borel measures and time parametrizations on . However, for our application, we restrict ourselves to the setting of Theorem A. We fix a finite set and an irreducible IFS on satisfying the open set condition with limit set . We denote by the Hausdorff dimension of and the unique self-similar probability measure supported on for the canonical probability vector . Recall that in this case coincides with the normalized restriction of the -dimensional Hausdorff measure to .
We denote by and the following vector spaces, endowed with the standard representations of ,
[TABLE]
Motivated by Proposition 5.1, we define exponents of the form , for as follows:
[TABLE]
where was defined in (1.3). Corollary A.10 shows that since the IFS is irreducible.
The space is identified with the space of unimodular lattices in via the map . For , let denote the set of all primitive subgroups of the lattice . Recall that a subgroup of a lattice in is primitive if , where is the -span of any -basis of . We say a monomial is -integral if the abelian subgroup of generated by is primitive, i.e. belongs to .
For every , we define a function as follows: ,
[TABLE]
For , set . For a compact set , define
[TABLE]
where is the operator norm induced by the Euclidean norm on . It follows from the definitions that for every , and every ,
[TABLE]
6.2. The Contraction Hypothesis on X
We recall the following Lemma from [EMM98] which underlies the main property of Margulis functions we prove later in the section.
Lemma 6.1** (Lemma 5.6 in [EMM98]).**
Let and let . Then,
[TABLE]
The following proposition establishes the fundamental property of Margulis functions obtained in [EMM98]. It is obtained via the method of integral inequalities first introduced in [EMM98]
Proposition 6.2**.**
For every , there exists a constant , depending only on and , such that for every , there exists , so that for all and and all
[TABLE]
where
[TABLE]
and was defined in (6.1).
Proof.
Let be given. Fix and let
[TABLE]
Let as defined in (6.3) for as above. Following [EMM98], let denote the finite subset of of rank subgroups of satisfying
[TABLE]
The finiteness of follows from the discreteness of the lattice . Suppose that consists of a single element and denote it by . In this case, by (6.4), we see that for all ,
[TABLE]
Observe further that the definition of the exponents implies , for each . In particular, upon applying Proposition 5.1 with , , , and , we obtain
[TABLE]
where is the constant in Proposition 5.1 and and are as in (6.5).
Alternatively, suppose the cardinality of is at least . Let be such that , and let be another element of . Then, the group has rank for some . Moreover, by definition of , . Hence, in view of Lemma 6.1, for all ,
[TABLE]
Combining this estimate with (6.2), we get the desired conclusion with
[TABLE]
∎
Given and , we define the function by
[TABLE]
Define as follows
[TABLE]
We are now ready to verify the contraction hypothesis in our context. The idea of the deduction of the following result from Proposition 6.2 is due to Margulis and first appeared in [EMM98]. In that article, all the exponents of were the same. Since our exponents are distinct, this introduces a complication which we address in the proof. The reader may wish to consult variants of this idea in [KKLM17, Proposition 4.1] and [BQ11, Claim 5.9].
Theorem 6.3**.**
For every , there exists a constant , depending only on and , such that for every , there exists and , depending only on and , so that for all and and all satisfying ,
[TABLE]
Remark 6.4**.**
The proof of Theorem 6.3 will show that the lower bound restriction on is only for aesthetic reasons.
We first prove the following elementary but crucial lemma.
Lemma 6.5**.**
For all natural numbers , we have
[TABLE]
Proof.
We note that the function satisfies the following concavity property:
[TABLE]
It follows that
[TABLE]
Since and , the lemma follows. ∎
Proof of Theorem 6.3.
Fix and . Let be the constants provided by Proposition 6.2. To simplify notation, let
[TABLE]
where and are as in (6.5). Note that depends on . Let be a constant to be determined. Suppose . It follows from Proposition 6.2 that
[TABLE]
where we used the fact that . We observe that the exponents satisfy the following relation:
[TABLE]
for all . In particular, this implies
[TABLE]
Moreover, Lemma 6.5 shows that for all :
[TABLE]
Applying the estimates (6.11) and (6.12) to the last sum in (6.2), using the fact that , yields
[TABLE]
Choosing , , and , we obtain
[TABLE]
Note that also depends on . Since , Jensen’s inequality implies that
[TABLE]
thus completing the proof. ∎
To demonstrate the power of the method of integral inequalities, we state a consequence of the above analysis, which can be obtained with a little more work. We do not need this statement for our purposes, so we omit the proof and refer the interested reader to [BQ11, Claim 5.9] for the proof of a similar statement. Given , consider the function defined by
[TABLE]
where the maximum is taken over all -integral monomials and all .
Proposition 6.6**.**
For every compact set , there exist constants and , depending only on , such that for every , whenever , the set of -integral monomials satisfying
[TABLE]
contains at most one primitive vector up to a sign in each with .
6.3. Proof of Theorem 1.12 and its corollaries
Let be given. For every , let be the constant provided by Theorem 6.3. Consider the collection of height functions:
[TABLE]
Theorem 6.3 shows that the action of on satisfies the -contraction hypothesis with and given by:
[TABLE]
where was defined in (6.1). Indeed, the contraction property of Definition 3.1 follows from Theorem 6.3, where can be chosen as follows:
[TABLE]
The remaining properties follow directly from the definition of .
Hence, Theorem 3.4 applies and shows that the dimension of the set in the conclusion of Theorem 1.12 is at most . Since was arbitrary, this completes the proof.
Theorem A follows from Theorem 1.12 by taking to be the identity coset and using Dani’s correspondence. Corollary 1.4 follows from Theorem A along with Lemma A.1 showing that in this case. Finally, in the setting of Corollary 1.5, it is shown in [Shm16, Theorem 8.2] that for every , where is the projection of in direction and is the Frostman exponent of defined in (1.7). Hence, applying Theorem 1.9, we get that . Corollary 1.5 then follows from Theorem A.
7. Fractals and the Teichmüller Flow
The goal of this section is to prove Theorem B. The height functions needed to apply Theorem 3.4 in this context were constructed by Eskin and Masur in [EM01]. In using this construction, we apply Proposition 5.1 with . We remark that the proof of Proposition 5.1 simplifies in this case and does not require the results of Section 4.
7.1. Background and Definitions
For background on Teichmüller dynamics and translation flows, the reader is referred to [FM14] for an excellent survey. Suppose is a compact oriented surface of genus . An abelian differential on is an isotopy class of pairs , where is a Riemann surface structure on and is a holomorphic -form. Then, induces a (possibly singular) flat metric on . A unit area abelian differential is one in which has area in the induced flat metric. A saddle connection of is a flat geodesic segment joining zeros of .
If denotes the set of zeros of , then admits an atlas of charts to the complex plane so that all transition maps are given by translations. In these coordinates, is given by the pull-back of the canonical holomorphic -form on . Moreover, the “vertical" vector field (parallel to the imaginary axis) on induces a well-defined vector field on . The flow defined by this vector field is referred to as the vertical flow on . The induced area measure by is invariant for this flow. The flow is said to be uniquely ergodic if this is the only invariant measure.
Let be an integral partition of , i.e. and . By a stratum of abelian differentials of order , we mean the space of unit area abelian differentials on whose zeroes have multiplicities . Strata of abelian differentials are non-compact. This can be seen by taking a sequence of abelian differentials in which the systole on the associated Riemann surface tends to [math]. If the integral partition definining the stratum contains two distinct elements, then a sequence of abelian differentials may “diverge" if the distance between two distinct zeros tends to [math].
There are local coordinates on a stratum into for appropriate , called period coordinates (e.g., see [FM14, Section 2.3] for details), such that all changes of coordinates are given by affine maps. In period coordinates, acts naturally on each copy of . The Teichmüller geodesic flow is the flow induced by the action of the diagonal group in . The behavior of the orbit of under and its various subgroups determines many of the ergodic properties of the vertical flow. Most relevant to our application is Masur’s criterion asserting that the -orbit of diverges if the vertical flow defined by is not uniquely ergodic.
In the sequel, we fix one such stratum and denote it by . For simplicity, we use to denote elements of .
7.2. The Contraction Hypothesis on strata of abelian differentials
Suppose is a Borel probability measure on . We say is -linearly expanding on for some constants if for all the following holds
[TABLE]
For any , denote by the operator norm of in its action on . Given a compact set , define as follows:
[TABLE]
Theorem 7.1** (Lemma 7.5, [EM01]).**
There exist such that the following holds. Suppose is given. Then, there exist functions for each such that is a proper function and each satisfies the log-Lipschitz property in Def. 3.1(2). Moreover, suppose is a compactly supported Borel probability measure on which is -linearly expanding on . Then, there exist constants and , depending only on , and (cf. (7.1)), such that for every and :
[TABLE]
Proof.
Theorem 7.1 was obtained in [EM01, Lemma 7.5] for the special measures for any , where is the normalized Lebesgue measure on . The main part of the proof is the construction of the functions in [EM01, p.464] (denoted in loc. cit.) using the notion of admissible complexes. The definition of these functions depends on a parameter which we take to be in our notation. Inspection of the (short) proof of [EM01, Lemma 7.5] shows that the only input used to establish the desired contraction property is the -linear expansion and the compactness of the support of the measure. The other parts of the argument are independent of the shape of the measure. ∎
Remark 7.2**.**
The function in Theorem 7.1 is given by a power of the reciprocal of the length of the shortest saddle connection.
Corollary 7.3** (Lemma 2.10, [Ath06]).**
Suppose , , and are as in Theorem 7.1. Then, there exist constants , depending only on and , such that the following holds. Let and let . Then, for all :
[TABLE]
Proof.
Let . Then, one verifies that for each . It follows that
[TABLE]
∎
7.3. Proof of Theorem B and Corollary 1.8
We wish to apply Theorem 3.4. Let be an IFS as in the statement of the theorem and let be its limit set. Denote by and the normalized restriction of to . Then, by Proposition 2.2.
Fix and . Let and . Suppose is any number satisfying and let . Define a Borel measure on by
[TABLE]
for every compactly supported continuous function on .
Applying Proposition 5.1 with , , and then shows that is -linearly expanding on , with
[TABLE]
for a constant depending only on and . Hence, Theorem 7.1 and Corollary 7.3 apply and provide, for every , constants and (depending on ) and a function such that for every ,
[TABLE]
by Jensen’s inequality. For , define
[TABLE]
Fix and suppose that . Then, the above estimate becomes
[TABLE]
Let , , and . The above argument shows that satisfies the -contraction hypothesis. In particular, this completes the verification of the hypotheses of Theorem 3.4 in this setting and shows that the dimension of divergent on average directions belonging to is at most . Since was arbitrary, Theorem B follows.
To prove Corollary 1.8, we first observe that the restriction of the map to (the compact set) is bi-Lipschitz onto its image. Moreover, we note that for ,
[TABLE]
where for , denotes its transpose. Since contracts and commutes with , it follows that the orbit diverges on average in if and only if the orbit does. Since bi-Lipschitz maps preserve Hausdorff dimension, the corollary follows from Theorem B.
Appendix A Frostman Exponents of Projections of Self-similar Measures
The goal of this section is to complete the proofs of Corollary 1.4 by computing in this case in Lemma A.1. We also provide a proof of Theorem 1.9 relating to the Frostman exponents of projections of for planar homogeneous fractal measures, thus completing the proof of Corollary 1.5. Finally, we show that the limit in the definition of the exponents exists for every (Proposition A.7) in full generality.
Notation
If is a measure space and is a measurable map, we denote by the push-forward measure.
A.1. Projections of products of Cantor sets
Consider the IFS on given by maps of the form
[TABLE]
The limit set of coincides with a product of copies of Cantor’s middle thirds set. For convenience, let denote the set consisting of a single point and, for , denote by the identity mapping. For , takes the form , where is a rational vector satisfying .
The following lemma computes the value of in this case and completes the proof of Corollary 1.4. We thank Pablo Shmerkin for providing its proof.
Lemma A.1**.**
Suppose , where is Cantor’s middle-thirds set and let . Let denote . Then, .
Proof.
That follows from projecting onto the coordinate axes. For the reverse inequality, it suffices to show that for each and each affine line
[TABLE]
where the implied constant is absolute. Let . We claim that for every line , the number of squares of the form with which meet is , with an absolute implied constant. Assuming the claim, fix a line and , and let denote the lines parallel to and bounding the neighborhood . Then, we note that is contained in the union of the squares which meet either of , or , for some . Moreover, assigns mass to each such square. It follows that , and hence (A.1) follows.
For a line , denote by the -coordinate of the intersection of with the -axis. For , denote by the line of slope such that . Denote by the line of slope such that . We say a line is exceptional if
[TABLE]
We note that each of the exceptional lines and meet of the squares at their corners. Moreover, a line meets three squares of the form , if and only if is exceptional.
Suppose is an affine line which is not expceptional. Then, we observe that meets at most squares of the form , where . It follows by induction that meets at most squares of the form , for and . Indeed, if is the prefix of , then is a non-exceptional affine lineand hence meets at most squares of the form , .
Finally, assume is an exceptional line of slope , the case of slope being identical. Let be such that for . Writing , the line has the form:
[TABLE]
Since , it follows that , for coprime to . Moreover, suppose , for . Since is parallel to , if and only if . A calculation similar to that yielding (A.2) shows that , where is coprime to . Thus, it follows that
[TABLE]
Let denote the set of squares of the form which meet with and denote by the cardinality of . We show that (A.3) implies that , which will conclude the proof. Indeed, suppose and let . Then, since , it follows that meets at most squares of the form . Thus, . Alternatively, if , then meets at most such squares and, hence, . By induction, we conclude that as desired.
∎
A.2. Frostman exponents of projections and Theorem 1.9
Let the notation be as in the statement of the theorem. A key ingredient in the argument is a a generalization of the sub-additive ergodic theorem for uniquely ergodic systems due to Furman in [Fur97]. It gives information about the behavior of every orbit, as opposed to the almost everywhere statement of the subadditve ergodic theorem. Recall that if is a measure preserving system, a sequence of functions is a sub-additive cocycle over if for all and almost every :
[TABLE]
Theorem A.2** (Theorem 1, [Fur97]).**
Suppose is a uniquely ergodic probability measure preserving system, where is a compact metric space and is continuous. Let be a continuous sub-additive cocycle over . Then, the following holds
[TABLE]
uniformly over all .
Given an affine line and , we write whenever is orthogonal to any line of slope . Recall that denotes the collection of all affine subspaces of of dimension . For every and , , we define
[TABLE]
The first step in applying Theorem A.2 is the following continuity result.
Proposition A.3**.**
For every , the function is continuous on .
The key ingredient is the compactness of the support of , which is used in the proof of the following lemma.
Lemma A.4**.**
Suppose is an affine line and suppose is given. Let . Then, there exists an affine line which meets at an angle and such that for every ,
[TABLE]
where .
Proof.
If , we can take . Hence, we will assume . Let be a ball of radius containing . If , then and the statement follows trivially. Otherwise, let . Let be the line passing through at an angle with .
For , let be the unit normal vectors to so that the angle between and is . Then, for , . Let . We then have
[TABLE]
Since , we have . Moreover, using the law of sines, one verifies that
[TABLE]
where we used the fact that .
∎
Proof of Proposition A.3.
We may assume without loss of generality that the similarity dimension of is positive. Indeed, otherwise, consists of a single map, is a single point, and all the quantities in question are [math]. We then observe that the irrationality of the rotation angle implies that is irreducible and in particular, that by Corollary A.10. Fix some . It follows from the definition of that there exists so that for all and all affine lines ,
[TABLE]
Moreover, by Proposition 2.2, there exists a constant such that
[TABLE]
for every and every , where . Fix and assume is given. Denote by and let . Let be sufficiently small so that
[TABLE]
Suppose are two angles at distance at most in . Let be a line satisfying . By Lemma A.4, there exists a line such that
[TABLE]
Moreover, we can find lines parallel to and satisfying and . This, along with (A.5), imply
[TABLE]
The next ingredient is to observe that . This follows by taking to be a line passing through a point in . Then, contains a ball of radius centered in . The claim thus follows from the estimate in (A.6). Our choice of hence implies that
[TABLE]
Since was arbitrary, we see that . Since is independent of and , one can run the above argument with the roles of and reversed, to get the reverse inequality and conclude the proof. ∎
The next ingredient in the proof of Theorem 1.9 is establishing the following cocycle property of the functions .
Proposition A.5**.**
There exists a constant , such that for every and ,
[TABLE]
We will need the following doubling property of the functions .
Lemma A.6**.**
For every , there exists such that for every and ,
[TABLE]
Proof.
Let denote the cardinality of a finite cover of a ball of radius in by balls of radius . By applying scaling and translation, it follows that for every , any ball of radius in can be covered by at most balls of radius . Hence, for every Borel measure on , one has
[TABLE]
for all and all . ∎
Proof of Proposition A.5.
Let and be given and let . For every and , observe that . Moreover, one has Then, by equation (2.10) and Lemma 2.5, it follows that
[TABLE]
The other ingredient is the observation that for every satisfying , we have that , where . Indeed, this follows from the fact that the diameter of is . Moreover, we have for every . Hence, Proposition 2.3 on the null overlaps between the distinct implies that
[TABLE]
Finally, we apply the doubling property from Lemma A.6 with to get that , for a constant depending only on . ∎
We are now ready for the proof of Theorem 1.9.
Proof of Theorem 1.9.
Let , where is the constant in the conclusion of Proposition A.5. One can then verify that for every :
[TABLE]
Let be the Lebesgue probability measure on . Then, is a sub-additive cocycle over the transformation of . In particular, Kingman’s sub-additive ergodic theorem implies that
[TABLE]
Moreover, the cocycle is continuous by Proposition A.4. Hence, Theorem A.2 implies that
[TABLE]
It remains to show that . That follows by definition. For the reverse inequality, we use the uniformity in covergence provided by Theorem A.2. Fix . For every , let be such that
[TABLE]
Here we used the fact that since is a probability measure. From uniform convergence of the in in (A.7), we can find so that for all and for all ,
[TABLE]
Combining this with (A.8) and the fact that , we get
[TABLE]
for all . Since was arbitrary, it follows that as desired. ∎
A.3. Existence of the limit in Definition 1.3
Throughout this section, we fix a finite set and an IFS on satisfying the open set condition. We denote by the limit set of , the similarity dimension of , and the restriction of to .
Proposition A.7**.**
Suppose is as above. Then, for every , the limit in the definition of exists.
For every , and , , we define
[TABLE]
where is defined by (2.6). The functions satisfy the following doubling property.
Lemma A.8**.**
For every , there exists a constant , depending only on and , such that for every ,
[TABLE]
Proof.
The proof is completely analogous to Lemma A.6.
∎
The key step in proving Proposition A.7 is to show that is submultiplicative. This was essentially shown in [KLW04]. We include a proof for completeness.
Lemma A.9**.**
There exists a constant so that
[TABLE]
for all and all .
Before proving this lemma, we record the following useful corollary.
Corollary A.10** (Lemma 8.2, [KLW04]).**
Suppose is an irreducible IFS on satisfying the open set condition. Let be its limit set and let . Then, for each , , where is the restriction of to .
Proof.
Since Lemma 8.2 of [KLW04] is stated in a different form, we provide here a proof for completeness. It is shown over the course of the proof of [KLW04, Theorem 2.3] that the irreducibility of implies that for all proper affine subspaces . We claim that this implies that
[TABLE]
Indeed, suppose not. Then, there exist and sequences and such that and
[TABLE]
Since is compact, the subset of consisting of affine subspaces that meet is compact. In particular, by passing to a subsequence if necessary, we may assume that for some such that . Hence, it follows by the dominated convergence theorem that , which is a contradiction.
Let be the constant in Lemma A.9. Let be sufficiently small so that
[TABLE]
for all . Let and denote the largest and smallest contraction ratios of the maps in respectively. Then, . In particular, we can find sufficiently large so that . Fix one such . Then, for all , we have
[TABLE]
Fix some . It follows from Lemma A.9 and (A.9) that for all ,
[TABLE]
Moreover, note that for all since is a probability measure. In particular, for all . Combined with that fact that , this implies that
[TABLE]
for all . This proves that . To conclude the proof, suppose that is given and let be such that
[TABLE]
In particular, we have . Moreover, the choice of implies
[TABLE]
Thus, we obtain the following estimate:
[TABLE]
Note that as . In particular, . This shows that
[TABLE]
as desired. ∎
Following [KLW04], we say a finite set is a complete prefix set if every , there is a unique word which occurs as a prefix for . It is easy to check that the set
[TABLE]
forms a complete prefix set, where , and is as in (2.5). The following lemma allows us to handle the case where the contraction ratios are not all the same.
Lemma A.11**.**
Suppose is a complete prefix set. Then, for every continuous function on ,
[TABLE]
Proof.
Proposition 2.3 implies that forms a measurable partition of the support of with null overlaps. ∎
Proof of Lemma A.9.
Suppose and are given. Let be the set defined in (A.10) with . Let be an affine subspace of dimension . For simplicity, we write for every . Note that for each , . By Lemma A.11 and a similar argument to the proof of Proposition A.5, one obtains
[TABLE]
Proposition 2.3 implies that for every with . Moreover, by definition of , for every satisfying , we have that , where . Hence, arguing as in the proof of Proposition A.5, we obtain
[TABLE]
In view of the doubling property of provided by Lemma A.8, the conclusion follows by combining (A.3) and (A.12). ∎
We now deduce Proposition A.7 from Lemma A.9.
Proof of Proposition A.7.
Let be the constant given by Lemma A.9. Consider the function defined by . Lemma A.9 implies that is a subadditive cocycle over .
Consider the constant sequence , for some for some fixed . Then, the sequence is a subadditive sequence. This follows from the fact that . Thus, by Fekete’s lemma, we obtain the following equalities .
We claim that and in particular that the limit defining exists. To see this, suppose is a sequence tending to [math]. For each , let be such that . Since , it follows that
[TABLE]
In particular, we see that
[TABLE]
The opposite inequality involving the follows analogously.
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AAE + 17] H. Al-Saqban, P. Apisa, A. Erchenko, O. Khalil, S. Mirzadeh, and C. Uyanik, Exceptional directions for the Teichmüller geodesic flow and Hausdorff dimension , Journal of the Eur. Math. Soc., to appear. (2017), 1–48.
- 2[AM 18] Paul Apisa and Howard Masur, Divergent on average directions of Teichmuller geodesic flow , ar Xiv e-prints (2018), ar Xiv:1803.00093.
- 3[Ath 06] Jayadev S. Athreya, Quantitative recurrence and large deviations for Teichmuller geodesic flow , Geom. Dedicata 119 (2006), 121–140. MR 2247652
- 4[BCC 18] Yann Bugeaud, Yitwah Cheung, and Nicolas Chevallier, Hausdorff dimension and uniform exponents in dimension two , Mathematical Proceedings of the Cambridge Philosophical Society (2018), 1–36.
- 5[BFS 12] Balázs Bárány, Andrew Ferguson, and Károly Simon, Slicing the Sierpiński gasket , Nonlinearity 25 (2012), no. 6, 1753–1770. MR 2929601
- 6[BGSV 18] Victor Beresnevich, Anish Ghosh, David Simmons, and Sanju Velani, Diophantine approximation in Kleinian groups: singular, extremal, and bad limit points , J. Lond. Math. Soc. (2) 98 (2018), no. 2, 306–328. MR 3873110
- 7[BQ 11] Yves Benoist and Jean-Francois Quint, Random walks on finite volume homogeneous spaces , Inventiones mathematicae 187 (2011), no. 1, 37–59.
- 8[BR 14] Balázs Bárány and Michał Rams, Dimension of slices of Sierpiński-like carpets , J. Fractal Geom. 1 (2014), no. 3, 273–294. MR 3276836
