A variational principle in the parametric geometry of numbers, with applications to metric Diophantine approximation
Tushar Das, Lior Fishman, David Simmons, and Mariusz Urba\'nski

TL;DR
This paper introduces a variational principle linking metric Diophantine approximation with the parametric geometry of numbers, enabling the calculation of dimensions of sets related to matrix approximation and resolving several conjectures.
Contribution
It establishes a new variational principle and introduces the concept of templates, generalizing rigid systems, to compute dimensions of sets in Diophantine approximation.
Findings
Proved the Hausdorff and packing dimensions of singular matrices are both mn(1-1/(m+n)).
Resolved a conjecture of Kadyrov, Kleinbock, Lindenstrauss, and Margulis (2014).
Provided a framework to compute dimensions of sets related to Starkov and Schmidt conjectures.
Abstract
We establish a new connection between metric Diophantine approximation and the parametric geometry of numbers by proving a variational principle facilitating the computation of the Hausdorff and packing dimensions of many sets of interest in Diophantine approximation. In particular, we show that the Hausdorff and packing dimensions of the set of singular matrices are both equal to , thus proving a conjecture of Kadyrov, Kleinbock, Lindenstrauss, and Margulis (preprint 2014) as well as answering a question of Bugeaud, Cheung, and Chevallier (preprint 2016). We introduce the notion of a , which generalizes the notion of a (Roy, 2015) to the setting of matrix approximation. Our main theorem takes the following form: for any class of templates closed under finite perturbations, the Hausdorff and packing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A variational principle in the parametric geometry of numbers, with applications to metric Diophantine approximation
Tushar Das
University of Wisconsin – La Crosse, Department of Mathematics & Statistics, 1725 State Street, La Crosse, WI 54601, USA
tdas**@**uwlax.edu https://sites.google.com/a/uwlax.edu/tdas/ ,
Lior Fishman
University of North Texas, Department of Mathematics, 1155 Union Circle #311430, Denton, TX 76203-5017, USA
,
David Simmons
University of York, Department of Mathematics, Heslington, York YO10 5DD, UK
David.Simmons**@**york.ac.uk https://sites.google.com/site/davidsimmonsmath/ and
Mariusz Urbański
University of North Texas, Department of Mathematics, 1155 Union Circle #311430, Denton, TX 76203-5017, USA
urbanski**@**unt.edu http://www.urbanskimath.com/
Abstract.
We establish a new connection between metric Diophantine approximation and the parametric geometry of numbers by proving a variational principle facilitating the computation of the Hausdorff and packing dimensions of many sets of interest in Diophantine approximation. In particular, we show that the Hausdorff and packing dimensions of the set of singular matrices are both equal to mn\big{(}1-\frac{1}{m+n}\big{)}, thus proving a conjecture of Kadyrov, Kleinbock, Lindenstrauss, and Margulis as well as answering a question of Bugeaud, Cheung, and Chevallier. Other applications include computing the dimensions of the sets of points witnessing conjectures of Starkov and Schmidt.
Résumé. Nous établissons une nouvelle connexion entre l’approximation métrique diophantine et la géométrie paramétrique des nombres en prouvant un principe variationnel facilitant le calcul des dimensions d’Hausdorff et de packing de nombreux ensembles d’intérêt dans l’approximation diophantienne. Nous montrons que les dimensions précitées de l’ensemble des matrices singulières sont toutes deux égales à mn\big{(}1-\frac{1}{m+n}\big{)}, démontrant ainsi une conjecture de Kadyrov, Kleinbock, Lindenstrauss, et Margulis et répondant à une question de Bugeaud, Cheung, et Chevallier. D’autres applications comprennent le calcul des dimensions des ensembles des points témoignant des conjectures de Starkov et de Schmidt.
1. Main results
The notion of singularity (in the sense of Diophantine approximation) was introduced by Khintchine, first in 1937 in the setting of simultaneous approximation [11], and later in 1948 in the more general setting of matrix approximation [12].111Although Khintchine’s 1926 paper [10] includes a proof of the existence of and matrices possessing a certain property which implies that they are singular, it does not include a definition of singularity nor discuss any property equivalent to singularity. Since then this notion has been studied within Diophantine approximation and allied fields, see Moshchevitin’s 2010 survey [13]. An matrix is called singular if for all , there exists such that for all , there exist integer vectors and such that
[TABLE]
Here denotes an arbitrary norm on or . We denote the set of singular matrices by . For matrices (i.e. numbers), being singular is equivalent to being rational, and in general any matrix which satisfies an equation of the form , with integral and nonzero, is singular. However, Khintchine proved that there exist singular matrices whose entries are linearly independent over [10, Satz II], and his argument generalizes to the setting of matrices for all . The name singular derives from the fact that is a Lebesgue nullset for all , see e.g. [11, p.431] or [2, Chapter 5, §7]. Note that singularity is a strengthening of the property of Dirichlet improvability introduced by Davenport and Schmidt [6].
In contrast to the measure zero result mentioned above, the computation of the Hausdorff dimension of has been a challenge that so far only met with partial progress. The first breakthrough was made in 2011 by Cheung [3], who proved that the Hausdorff dimension of is ; this was extended in 2016 by Cheung and Chevallier [4], who proved that the Hausdorff dimension of is for all ; while most recently Kadyrov, Kleinbock, Lindenstrauss, and Margulis [8] proved that the Hausdorff dimension of is at most \delta_{m,n}:={mn}\big{(}1-\tfrac{1}{{m+n}}\big{)}, and went on to conjecture that their upper bound is sharp for all (see also [1, Problem 1]).
In this paper, we announce a proof that their conjecture is correct. We will also show that the packing dimension of is the same as its Hausdorff dimension, thus answering a question of Bugeaud, Cheung, and Chevallier [1, Problem 7]. To summarize:
Theorem 1.1**.**
For all , we have
[TABLE]
where and denote the Hausdorff and packing dimensions of a set , respectively.
1.1. Dani correspondence
The set of singular matrices is linked to homogeneous dynamics via the Dani correspondence principle. For each and for each matrix , let
[TABLE]
where denotes the -dimensional identity matrix. Finally, let , and for each , let denote the th successive minimum of a lattice (with respect to some fixed norm on ), i.e. the infimum of such that the set contains linearly independent vectors.. Then the Dani correspondence principle is a dictionary between the Diophantine properties of a matrix on the one hand, and the dynamical properties of the orbit on the other. A particular example is the following result:
Theorem 1.2** ([5, Theorem 2.14]).**
An matrix is singular if and only if the trajectory is divergent in the space of unimodular lattices in , or equivalently if
[TABLE]
It is natural to ask about the set of matrices such that the above limit occurs at a prescribed rate, such as the set of matrices such that grows linearly with respect to . This question is closely linked with the concept of uniform exponents of irrationality. The uniform exponent of irrationality of an matrix , denoted , is the supremum of such that for all sufficiently large, there exist integer vectors and such that
[TABLE]
By Dirichlet’s theorem, every matrix satisfies . Moreover, it is immediate from the definitions that any matrix satisfying is singular. We call a matrix very singular if it satisfies the inequality , in analogy with the set of very well approximable matrices, which satisfy a similar inequality for the regular (non-uniform) exponent of irrationality (see (1.2)). We denote the set of very singular matrices by . The relationship between uniform exponents of irrationality and very singular matrices on the one hand, and homogeneous dynamics on the other, is given as follows:
Theorem 1.3**.**
A matrix is very singular if and only , where
[TABLE]
Moreover, the quantities and are related by the formula
[TABLE]
This theorem is a straightforward example of the Dani correspondence principle and is probably well-known, but we have not been able to find a reference.
1.2. Dimensions of very singular matrices
Perhaps unsurprisingly, the set of very singular matrices has the same dimension properties as the set of singular matrices.
Theorem 1.4**.**
For all , we have
[TABLE]
One can also ask for more precise results regarding the function . Specifically, for each we can consider the levelset222For results considering the superlevelset, see Theorem 2.9.
[TABLE]
where is given by (1.1). It would be desirable to give precise formulas for the Hausdorff and packing dimensions of in terms of , , and , see e.g. [1, Problem 2]. However, this appears quite challenging at the present juncture, though we have made significant progress towards this question which we will describe in the next section. Thus, instead of precise formulas we will give asymptotic formulas of two types: estimates valid when is small and estimates valid when is large. Note that while the minimum value of is always (corresponding to ), the maximum value depends on whether or not is at least . If , then the maximum value of is (corresponding to ), while if , then the maximum value of (excluding rational points) is (corresponding to ). Consequently, we have two different asymptotic estimates of the dimensions of when is large corresponding to these two cases.
Recall that denotes any number such that for some uniform constant . In all of the formulas below, is given by (1.1).
Theorem 1.5**.**
Suppose that . Then for all sufficiently close to , we have
[TABLE]
unless , in which case
[TABLE]
Theorem 1.6**.**
Suppose that . Then for all sufficiently large, we have
[TABLE]
Theorem 1.7**.**
Suppose that and . Then for all sufficiently close to , we have
[TABLE]
Remark 1.8**.**
Call a matrix trivially singular if there exists such that
[TABLE]
Then all of the above formulas remain true if is replaced by the set
[TABLE]
Moreover, for we have
[TABLE]
and for , we have
[TABLE]
Note that the class of trivially singular matrices is smaller than the class of matrices with degenerate trajectories in the sense of [5, Definition 2.8], but larger than the class considered in [1, p.2] consisting of matrices such that the group does not have full rank. A or matrix is trivially singular if and only if it is contained in a rational hyperplane of .
1.3. and matrices
Although we cannot give precise formulas for the Hausdorff and packing dimensions of for all pairs , the special cases and are easier to handle.
Theorem 1.9**.**
For all we have
[TABLE]
(cf. Figure 1).
Remark**.**
By Jarník’s identity [7], for all we have
[TABLE]
where . Thus by applying an appropriate substitution to the above formulas, it is possible to get explicit formulas for and , either in terms of or in terms of . However, the resulting formulas are not very elegant so we omit them.
Remark**.**
The transition point in the above formula for Hausdorff dimension corresponds to , , , and . The transition point for packing dimension corresponds to , , , and .
Remark**.**
Theorem 1.9 implies that for all . This answers the first part of [1, Problem 7] in the affirmative.
Remark**.**
There has been a lot of partial progress towards the Hausdorff dimension part of Theorem 1.9. In particular, the direction follows from [1, Corollary 2 and Theorem 3]. For the upper bound follows from [1, Corollary 2] and for , a non-optimal upper bound is given in [1, Theorem 1]. We refer to [1] for a detailed history of the prior results.
1.4. Singularity on average
A different way of quantifying the notion of singularity is the notion of singularity on average introduced in [8]. Given a matrix , we define the proportion of time spent in the cusp to be the number
[TABLE]
where denotes Lebesgue measure. The matrix is said to be singular on average if . Clearly, every singular matrix is singular on average.
Theorem 1.10**.**
For all , we have
[TABLE]
In particular, the dimension of the set of matrices singular on average is .
Note that the fact that the Hausdorff dimension of the set of matrices singular on average is was proven in [8], while the fact that this number is follows from Theorem 1.1.
1.5. Starkov’s conjecture
In [22, p.213], Starkov asked whether there exists a singular vector (i.e. singular matrix) which is not very well approximable. Here, we recall that a matrix is called very well approximable if for some , there exist infinitely many pairs such that
[TABLE]
or equivalently in terms of the Dani correspondence principle, a matrix is very well approximable if . This question was answered affirmatively by Cheung [3, Theorem 1.4] in the case . In fact, Cheung showed that if is any function such that as , then there exists a singular vector which is not -approximable. Here, a matrix is called -approximable if there exist infinitely many pairs such that and
[TABLE]
The following theorem improves on Cheung’s result both by generalizing it to the case of arbitrary , and also by computing the dimension of the set of matrices with the given property:
Theorem 1.11**.**
If is any function such that as , then the set of singular matrices that are not -approximable has Hausdorff dimension . Equivalently, if is any function such that as , then the set of singular matrices such that for all sufficiently large has Hausdorff dimension . The same is true for the packing dimension.
Note that this theorem is optimal in the sense that if for some constant , then it is easy to check that every singular matrix is -approximable.
1.6. Schmidt’s conjecture
In [19, p.273], Schmidt conjectured that for all , there exists an matrix such that
[TABLE]
(Note that any matrix satisfying (1.3) is singular by Theorem 1.2.) This conjecture was proven by Moshchevitin [14], who constructed an matrix satisfying (1.3) and not contained in any rational hyperplane of (see also [9, 17]). We will improve Moshchevitin’s result by computing the Hausdorff and packing dimensions of the set of matrices witnessing this conjecture:
Theorem 1.12**.**
For all and for all , the Hausdorff and packing dimensions of the set of matrices that satisfy (1.3) are both equal to
[TABLE]
where
[TABLE]
Here denotes the fractional part of a real number . The same formulas are valid for the set of matrices that satisfy (1.3) and are not trivially singular.
Remark 1.13**.**
The function satisfies and . Moreover, for all we have . It follows that when or , the Hausdorff and packing dimensions of the set of matrices that satisfy (1.3) are both equal to .
Remark 1.14**.**
When or , the fractional parts appearing in (1.4) can be computed explicitly, leading to the formula
[TABLE]
However, this formula is not valid when .
Acknowledgements. This research began when the authors met at the American Institute of Mathematics via their SQuaRE program. We thank them for their hospitality. The first-named author was supported in part by a 2017-2018 Faculty Research Grant from the University of Wisconsin–La Crosse. The second-named author was supported in part by the Simons Foundation grant #245708. The third-named author was supported in part by the EPSRC Programme Grant EP/J018260/1. The fourth-named author was supported in part by the NSF grant DMS-1361677. We thank Nicolas Chevallier, Elon Lindenstrauss, Damien Roy, and Johannes Schleischitz for helpful comments.
2. The variational principle
All the theorems in the previous section (with the exception of Theorems 1.2 and 1.3) are consequences of a single variational principle in the parametric geometry of numbers. This variational principle is a quantitative analogue of theorems due to Schmidt and Summerer [21, §2] and Roy [15, Theorem 1.3]. However, we will state their results in language somewhat different from the language used in their papers, due to the fact that the fundamental object we consider is the one-parameter family of unimodular lattices used by the Dani correspondence principle, rather than a one-parameter family of (non-unimodular) convex bodies as is done in [21, 15]. We leave it to the reader to verify that the theorems we attribute below to [21] and [15] are indeed faithful translations of their results to our setting. We note that these papers, unlike ours, do not consider the case of matrices.
The fundamental question of our version of the parametric geometry of numbers will be as follows: given a matrix , what does the function defined by the formula
[TABLE]
look like? The function will be called the successive minima function of the matrix . The Dani correspondence principle shows that many interesting Diophantine questions about the matrix are equivalent to questions about its successive minima function.
The main restriction on the successive minima function comes from an application of Minkowski’s second theorem on successive minima to certain subgroups of . Specifically, fix and let be an interval such that for all . For each , let be the linear span of the set
[TABLE]
Then a continuity argument shows that the map is constant on , see [20, Lemma 2.1] for the case of simultaneous approximation. Write . By Minkowski’s second theorem, we have
[TABLE]
where denotes the covolume of a discrete group , and means that there exists a constant such that . Now an argument based on the exterior product formula for covolume and the definition of shows that for some convex, piecewise linear function whose slopes are in the set
[TABLE]
where for convenience we write , , and . This suggests that can be approximated by a piecewise linear function such that whenever on an interval , the function is convex on with slopes in . Moreover, it is obvious that , and the formula for implies that for all , we have wherever is differentiable. We therefore make the following definition:
Definition 2.1**.**
An template is a piecewise linear map with the following properties:
- (I)
.
- (II)
for all .
- (III)
For all and for every interval such that on , the function is convex on with slopes in . Here we use the convention that and .
When , templates are a slight generalization of reparameterized versions of the rigid systems of [15]. We denote the space of templates by .
The fundamental relation between templates and successive minima functions is given as follows:
Theorem 2.2**.**
- (i)
For every matrix , there exists an template such that .
- (ii)
For every template , there exists an matrix such that .
In the case , part (i) of Theorem 2.2 is due to Schmidt and Summerer [21, §2] and part (ii) is due to Roy ([15, Theorem 1.3] and [16, Corollary 4.7]).
Theorem 2.2(ii) asserts that for every template , the set
[TABLE]
is nonempty. It is natural to ask how big this set is in terms of Hausdorff and packing dimension. Moreover, given a collection of templates , we can ask the same question about the set
[TABLE]
It turns out to be easier to answer the second question than the first, assuming that the collection of templates is closed under finite perturbations. Here, is said to be closed under finite perturbations if whenever , we have .
Theorem 2.3** (Variational principle, version 1).**
Let be a collection of templates closed under finite perturbations. Then
[TABLE]
where the functions are as in Definition 2.5 below.
Corollary 2.4**.**
With as above, we have
[TABLE]
However, note that Theorem 2.3 does not imply that for an individual template , since the family is not closed under finite perturbations. And indeed, since the function is sensitive to finite perturbations, the formula cannot hold for all .
Definition 2.5**.**
We define the lower and upper average contraction rate of a template as follows. Let be an open interval on which is linear. For each such that on , let be chosen to satisfy and
[TABLE]
as guaranteed by (III) of Definition 2.1. An interval of equality for on is an interval , where satisfy
[TABLE]
Note that the collection of intervals of equality forms a partition of . If is an interval of equality for on , then we let , where
[TABLE]
and we let
[TABLE]
where the unions are taken over all intervals of equality for on . Note that by (II) of Definition 2.1, and further that and are disjoint and satisfy , and that and . Next, let
[TABLE]
and note that
[TABLE]
The lower and upper average contraction rates of are the numbers
[TABLE]
where
[TABLE]
Here we abuse notation by writing for all .
Definition 2.5 can be understood intuitively in terms of a simple version of one-dimensional physics with sticky collisions and conservation of momentum. Suppose that we observe particles travelling along trajectories during a time interval along which is linear, and we want to infer the velocities of these particles before they collided, based on the following background information: before the collision of the particles were travelling upwards at a speed of , and of the particles were travelling downwards at a speed of . When particles collide (that is, when the velocities of the particles of lower index are more upwards than the velocities of the particles of higher index at the same location), they join forces to move as a unit, and their new velocity is determined by conservation of momentum. However, we can still think of the group as being composed of a certain number of “upwards” particles and a certain number of “downwards” particles.
The equations (2.8) and (2.9) can be understood as suggesting a particular solution to this problem of inference: assume that within each group, all of the upwards-travelling particles started out below all of the downwards-travelling particles. This is not the only possible solution but it is the nicest one for certain purposes. Specifically, we can imagine a force of “gravity” attempting to bring all of the particles together, which acts between any two particles by imposing a fixed energy cost if the two particles are travelling away from each other.333This is of course unlike real gravity, which imposes an energy cost variable with respect to distance. The total energy cost is then the codimension defined by (2.11). The equations (2.8) and (2.9) can then be thought of as giving the solution that minimizes this cost.
The idea of codimension as an energy cost is also useful for computing the suprema (2.3) in certain circumstances, since it suggests principles like the conservation of energy. However, one needs to be careful since the stickiness of collisions means that some naive formulations of conservation of energy are violated.
In most cases of interest, the collection in Theorem 2.3 is defined by some Diophantine condition. In this case, generally rather than the set we are really interested in is the set of all matrices whose corresponding successive minima functions satisfy the same Diophantine condition. Although these two sets are a priori different, Theorem 2.2(i) implies that they are the same and thus Theorem 2.3 is equivalent modulo Theorem 2.2(i) to the following:
Theorem 2.6** (Variational principle, version 2).**
Let be a collection of functions from to which is closed under finite perturbations, and let
[TABLE]
Then
[TABLE]
Proof of equivalence.
Theorem 2.6 implies Theorem 2.3 since we can take . Conversely, Theorem 2.3 implies Theorem 2.6 modulo Theorem 2.2(i) since we can take . ∎
Theorem 2.6 can be thought of as a quantitative analogue of Theorem 2.2, as shown by the following equivalent formulation:
Theorem 2.7** (Variational principle, version 3).**
- (i)
Let be a set of matrices of Hausdorff (resp. packing) dimension . Then there exists a matrix and a template whose lower (resp. upper) average contraction rate is .
- (ii)
Let be a template whose lower (resp. upper) average contraction rate is . Then there exists a set of matrices of Hausdorff (resp. packing) dimension , such that for all .
Proof of equivalence.
Part (i) is equivalent to the direction of (2.13), and part (ii) to the direction. For the first equivalence, for the forwards direction take , and for the backwards direction take . For the second equivalence, for the backwards direction take and . ∎
It is worth stating the special case of Theorem 2.6 that occurs when the collection is defined by the Diophantine conditions defining and for some . Thus, we define the uniform dynamical exponent of a map to be the number
[TABLE]
Similarly, is said to be trivially singular if as for some . Letting or in Theorem 2.6 yields the following result:
Theorem 2.8** (Special case of variational principle).**
For all , we have
[TABLE]
where is as in (1.1).
Theorem 2.6 can also be used to compute the dimensions of the set
[TABLE]
Theorem 2.9** (Special case of variational principle).**
For all , we have
[TABLE]
(Theorem 2.9 is also true with the stars removed, but in that case it is not as interesting because is “too large”, whereas is the “correct” size according to Remark 1.8.)
It is natural to expect that the map is monotonically decreasing, in which case Theorem 2.9 would imply that
[TABLE]
Conjecture 2.10**.**
The functions
[TABLE]
are decreasing and continuous, and furthermore are computable in the sense of [23].
The main difficulty in proving this conjecture is the rigidity of templates – one would like to show that every template can be perturbed into a new template whose uniform dynamical exponent is either slightly larger, or much smaller, than that of the original template, and whose average contraction rates are not too much smaller than those of the original template. However, it is not at all clear how one would perform such a perturbation except in a few special cases.
3. A characterization of Hausdorff and packing dimensions using games
The proof of the variational principle is based on a new variant of Schmidt’s game which is in principle capable of computing the Hausdorff and packing dimensions of any set. In Schmidt’s game [18], players take turns choosing a descending sequence of balls and compete to determine whether or not the intersection point of these balls is in a certain target set. The key feature of our new variant is that instead of requiring the rate at which the players’ moves contribute information to the game to be constant, the new variant allows the rate of information transfer to be variable, with the first player, Alice, getting to choose the rate of information transfer. However, Alice is penalized if she exerts too much control over the game over long periods of time without giving her opponent Bob a chance to exert control over the game.
Definition 3.1**.**
Given , Alice and Bob play the -dimensional Hausdorff (resp. packing) -game as follows:
- •
The turn order is alternating, with Alice playing first. Thus, Bob’s th turn occurs after Alice’s th turn and before Alice’s st turn.
- •
Alice begins by choosing a starting radius .
- •
On the th turn, Alice chooses a nonempty -separated set , and Bob responds by choosing a ball , where and .
- •
On the first (0th) turn, Alice’s choice can be any finite set, but on subsequent turns she must choose it to satisfy
[TABLE]
Note that this condition guarantees that
[TABLE]
After infinitely many turns have passed, the point
[TABLE]
is computed (note that the right-hand side is always a singleton). It is called the outcome of the game. Also, we let , and we compute the numbers
[TABLE]
and
[TABLE]
which are called Alice’s lower and upper scores, respectively. Alice’s goal will be to ensure that the outcome is in a certain set , called the target set, and simultaneously to maximize her score while doing so.
To be precise, a set is said to be -dimensionally Hausdorff (resp. packing) -winning if Alice has a strategy to simultaneously ensure that the outcome is in , and that her lower (resp. upper) score is at least . It is said to be -dimensionally Hausdorff (resp. packing) winning if it is -dimensionally Hausdorff (resp. packing) -winning for all sufficiently small . The reader may contrast the definition of our game with that of Cheung’s self-similar coverings in [3, Section 3].
The following result is one of the key ingredients in the proof of the variational principle:
Theorem 3.2**.**
The Hausdorff (resp. packing) dimension of a Borel set is the supremum of such that is -dimensionally Hausdorff (resp. packing) winning.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Yann Bugeaud, Yitwah Cheung, and Nicolas Chevallier, Hausdorff dimension and uniform exponents in dimension two , https://arxiv.org/abs/1610.06374 , preprint 2016.
- 2[2] J. W. S. Cassels, An introduction to Diophantine approximation , Cambridge Tracts in Mathematics and Mathematical Physics, No. 45, Cambridge University Press, New York, 1957.
- 3[3] Yitwah Cheung, Hausdorff dimension of the set of singular pairs , Ann. of Math. (2) 173 (2011), no. 1, 127–167.
- 4[4] Yitwah Cheung and Nicolas Chevallier, Hausdorff dimension of singular vectors , Duke Math. J. 165 (2016), no. 12, 2273–2329. MR 3544282
- 5[5] Shrikrishna Gopal Dani, Divergent trajectories of flows on homogeneous spaces and Diophantine approximation , J. Reine Angew. Math. 359 (1985), 55–89.
- 6[6] Harold Davenport and Wolfgang M. Schmidt, Dirichlet’s theorem on diophantine approximation. II , Acta Arith. 16 (1969/1970), 413–424. MR 0279040
- 7[7] Vojtěch Jarník, Zum Khintchineschen “Übertragungssatz” , Trav. Inst. Math. Tbilissi 3 (1938), 193–212 (German).
- 8[8] Shirali Kadyrov, Dmitry Kleinbock, Elon Lindenstrauss, and Gregory Margulis, Singular systems of linear forms and non-escape of mass in the space of lattices , http://arxiv.org/abs/1407.5310 , preprint 2014.
