A case of the Rodriguez Villegas conjecture
Ted Chinburg, Eduardo Friedman, Fernando Rodriguez-Villegas, James, Sundstrom

TL;DR
This paper proves a high-rank case of Rodriguez Villegas's conjecture, which relates to bounds on units in number fields and interpolates between known cases of Lehmer's conjecture and bounds on regulators.
Contribution
It establishes the conjecture for number fields containing a subfield with a large degree extension, extending previous partial results.
Findings
Proves the conjecture when L contains a subfield K with large [L:K] relative to [K:Q]
Shows the kernel of the norm map plays a key role in the high-rank case
Extends understanding of bounds on units and regulators in number fields
Abstract
Let L be a number field and let E be any subgroup of the units O_L^* of L. If rank(E) = 1, Lehmer's conjecture predicts that the height of any non-torsion element of E is bounded below by an absolute positive constant. If rank(E) = rank(O_L^*), Zimmert proved a lower bound on the regulator of E which grows exponentially with [L:Q]. Fernando Rodriguez Villegas made a conjecture in 2002 that "interpolates" between these two extremes of rank. Here we prove a high-rank case of this conjecture. Namely, it holds if L contains a subfield K for which [L:K] >> [K:Q] and E contains the kernel of the norm map from O_L^* to O_K^*.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgebraic Geometry and Number Theory · Finite Group Theory Research
\markleft
T. Chinburg E. Friedman J. Sundstrom
A case of the Rodriguez Villegas conjecture
with an appendix by Fernando Rodriguez Villegas
Ted Chinburg, Eduardo Friedman and James Sundstrom
Department of Mathematics, University of Pennsylvania, David Rittenhouse Lab., 209 South 33rd Street, Philadelphia PA 19104-6395, USA
Departamento de Matemáticas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Ñuñoa, Santiago R.M., CHILE
The Abdus Salam International Centre for Theoretical Physics, ICTP Math Section, Strada Costiera 11, I-34151 Trieste, Italy
Department of Mathematics (038-16), Temple University, Wachman Hall, 1805 North Broad Street, Philadelphia PA 19122, USA
Abstract.
Let be a number field and let be any subgroup of the units of . If , Lehmer’s conjecture predicts that the height of any non-torsion element of is bounded below by an absolute positive constant. If ), Zimmert proved a lower bound on the regulator of which grows exponentially with . Fernando Rodriguez Villegas made a conjecture in 2002 that “interpolates” between these two extremes of rank. Here we prove a high-rank case of this conjecture. Namely, it holds if contains a subfield for which and contains the kernel of the norm map from to .
Key words and phrases:
Lehmer’s conjecture, Mahler measure, units.
2010 Mathematics Subject Classification:
11R06, 11R27
Partially supported by U.S. N.S.F. grant NSF FRG Grant DMS-1360767 (Chinburg and Sundstrom), U.S. N.S.F. SaTC Grants CNS-1513671/1701785 (Chinburg) and by Chilean FONDECYT grant 1170176 (Friedman).
1. Introduction
In 2002 Fernando Rodriguez Villegas conjectured a surprising lower bound on a natural -norm of any non-trivial element of the -th exterior power of the units of a number field. For minimal, i.e., , Rodriguez Villegas’ conjecture is equivalent to Lehmer’s 1933 conjectural lower bound on the height of an algebraic number [Le] [Sm2]. For maximal, i.e., , it is equivalent to Zimmert’s 1981 theorem stating that the regulator of a number field grows at least exponentially with the degree of the number field [Zi].
We now state his conjecture in its strongest possible form.111 The original 2002 write-up of this conjecture was kindly supplied to us by F. Rodriguez Villegas and appears with his permission for the first time in print here (see §7). The 2002 conjecture is somewhat weaker, but F. Rodriguez Villegas later strengthened it to the form given here.
RV Conjecture**.**
(Rodriguez Villegas) There exist two absolute constants and such that for any number field and any ,
[TABLE]
Here denotes the exterior power of the lattice , denotes the set of archimedean places of , and is defined by
[TABLE]
where is the absolute value associated to extending the usual absolute value on . To define the -norm in (1), we start with the usual orthonormal basis on , i.e., for
[TABLE]
This gives rise to the orthonormal basis of , where denotes the set of subsets of having cardinality , for each such we fix an ordering of and
[TABLE]
The -norm on in the RV conjecture (1) is defined with respect to this basis. Namely,222 Although Rodrigez Villegas phrased the 1-norm in terms of the archimedean embeddings rather than places (see §7.4), the 1-norm is unchanged as we inserted a factor of 2 at complex places in (2). However, the embedding using places gives a larger 2-norm if the field is not totally real, and so is better for our purposes. for , we let .
It is worth mentioning that Siegel [Sie] showed that the conjectural inequality (1) is not possible for the Euclidean norm . Indeed, if is a prime, if satisfies and , then . Hence, the RV conjecture is necessarily for the -norm, at least for .
However, for close to the maximal value , the -norm and the Euclidean norm are interchangeable for the purposes of Rodriguez Villegas’ conjecture. This is simply because on any Euclidean space , we have , provided the -norm is taken with respect to an orthonormal basis for . In this paper we will work only with the Euclidean norm and close to .
Aside from Zimmert’s theorem on the regulator [Zi] and the known cases of Lehmer’s conjecture [Sm2], the cleanest result in favor of the RV conjecture is
[TABLE]
proved for all , but only for totally real fields . This follows from work of Pohst [Po] dating back to 1978. Indeed, Pohst showed for totally real that
[TABLE]
Using estimates of Hermite’s constant, he deduced good lower bounds for the regulator of a totally real field. The same calculations show that the -dimensional co-volume of the lattice spanned by satisfies [CF, p. 293]
[TABLE]
Since
[TABLE]
a short numerical computation with (5) yields (4).
As far as we know, the only proved cases of the RV conjecture involve “pure wedges,” i.e., of the form , where the are independent elements of . If or , every element of is (trivially) a pure wedge, but this also holds if (see Lemma 22 below). In particular, if is a totally real field of degree over , then
[TABLE]
for all . In general, however, the RV conjecture makes a stronger prediction than simply a lower bound on the 1-norm of pure wedges.
Another known case of the RV conjecture occurs when
[TABLE]
is the group of relative units associated to an extension . Friedman and Skoruppa [FS] proved in 1999 that inequality (1) in the RV conjecture holds for pure wedges if for some absolute constant .333 The inequality proved in [FS] is for the relative regulator rather than for the co-volume of the relative units. This suffices since , where is the number of places of above . The proof of this relation between the co-volume and the relative regulator mimics the determinant manipulations in the case [BS, p. 115]. We note that J. Sundstrom, in the appendix to his doctoral thesis [Su1], corrected an error in Skoruppa and Friedman’s proof. Namely, in the bound on what is called in the proof of Lemma 5.5 of [FS], the real part of the error term in the exponential was neglected. This did not affect the proof of their Main Theorem, but it did affect the numerical constants claimed in Theorem 4.1 and its corollaries. By improving the asymptotic estimates in [FS] and using extensive computer calculations, Sundstrom was able to prove the estimate in Theorem 4.1 of [FS], with the constants as given there, In particular, . If we are willing to settle for , the proof in [FS] will do after adjusting the constants to correct for the error in the proof of Lemma 5.5. To prove their result, Friedman and Skoruppa defined a -type series associated to any subgroup of arbitrary rank and used it to produce a complicated inequality for the co-volume associated to the lattice . In the case of they obtained the desired inequality using the saddle-point method to estimate the terms in the series as . Although the saddle-point method in one variable is a standard tool, the difficulty in the asymptotic estimates in [FS, §5] was that the estimates needed to depend only on .
The results cited so far all pre-date the RV conjecture and essentially dealt with regulators or Lehmer’s conjecture. Inspired by the RV conjecture, Sundstrom [Su1] [Su2] dealt in his 2016 thesis with a new kind of subgroup of the units. Namely, suppose contains two distinct real quadratic subfields , and let The series is still defined and yields an inequality for the co-volume \mu\big{(}{\mathrm{LOG}}(E)\big{)}, but to estimate the terms in the inequality Sundstrom had to apply the saddle-point method to a triple integral. Keeping all estimates uniform in this case proved considerably harder than in the one-variable case treated in [FS]. In the end, Sundstrom was able to verify the RV conjecture in this case for pure wedges. More precisely, he proved the existence of absolute constants and such that for and .
Here we extend Sundstrom’s result, letting the be arbitrary, as follows. Let be subfields of a number field , let be the compositum of the , let be the subgroup of the units of whose norm to each is a root of unity, and let be independent elements of , where . Then there is an absolute constant such that
[TABLE]
*whenever . *
In fact the above is an immediate corollary of our
Main Theorem**.**
Suppose is a subgroup of the units of the number field such that for some subfield , where are the relative units defined in (7). Let be independent elements of , where . Then the RV conjecture (1) holds for and large enough compared to .
More precisely, there is an absolute constant such that if , then
[TABLE]
Our proof of the Main Theorem is again through an asymptotic analysis of the inequality for in [FS], but there are several new features which bring the proof closer to the case of a general high-rank subgroup .
In both [FS] and [Su2], the uniformity of the asymptotic estimates depends on having explicit expressions for the orthogonal complement of inside , but here we have very little knowledge of . As in [FS] and [Su2], we take a Mellin transform of the terms of and invert it to express each term in as a -dimensional complex contour integral (see Lemma 3 below). Here is the co-rank of , shifted by 1.
To apply the saddle-point method to our integral, we need a saddle point. In the case of [FS] one could easily write down a formula for the saddle point in terms of the logarithmic derivative of the classical -function. In [Su2] the equations for the critical point were explicit enough that monotonicity arguments proved the existence of the saddle point. In our case the equations are too complicated to analyse directly. Instead, in §3 we obtain the existence and uniqueness of the saddle point by re-interpreting it as the value of the Legendre transform of a convex function on , closely related to .
Since (what will prove to be) the main term in our asymptotic expansion depends on the saddle point , of which we can only control , in §4 we prove inequalities for the main term which depend only on . We need these inequalities to prove that the main term has the exponential growth claimed in the Main Theorem.
The results proved in §2-§4 are valid for any subgroup . In §5 we carry out the required uniform asymptotic estimates, assuming and to show that the purported main term actually dominates. Finally, in §6 we put everything together and prove the Main Theorem.
2. The -function
In this section we recall the series associated to a subgroup of the units and to a fractional ideal of the number field . We also recall the inequality for the co-volume of resulting from the functional equation of . This is all quoted from [FS, §2]. Our main new task here is to express the terms in the inequality as an inverse Mellin transform.
2.1. The basic inequality
Given a subgroup , we define as the group generated by all elements of the form
[TABLE]
Here is the multiplicative group of the positive real numbers, denotes the set of Archimedean places of , and is the (un-normalized) absolute value associated to the archimedean place . Thus, for we have
[TABLE]
Note that
[TABLE]
[TABLE]
and that acts on , via .
We fix a Haar measure on as follows. The standard Euclidean structure on , in which the in (3) form an orthonormal basis of , induces a Euclidean structure (and therefore a unique Haar measure) on any -subspace of . We give the Haar measure that results from pulling back the Haar measure on the -subspace via the isomorphism , and let be the measure of a fundamental domain for the action of on .
Following [FS, p. 120], for a fractional ideal and , we let
[TABLE]
where is the number of roots of unity in ,
[TABLE]
Note that the integral in (12) depends only on the -orbit of , and hence is independent of the representative taken for the -orbit of .
Our starting point for proving lower bounds on co-volumes is the inequality [FS, Corol. p. 121], valid for any and any fractional ideal of .
[TABLE]
Writing out the individual terms of (13), we have [FS, p. 121, eq. (2.6)] the
Basic Inequality**.**
[TABLE]
Note that in [FS] we find instead of in (14), but is arbitrary there too.
2.2. Mellin transforms
Our main task in this section is to re-write the -dimensional integral in (12) as an inverse Mellin transform. For this it will prove convenient to characterize not through generators, but rather through generators of the orthogonal complement in of . Here is the group isomorphism defined by
[TABLE]
Note that is not the traditional logarithmic embedding in (2), as we do not insert a factor of in (15). Instead we endow with a new inner product
[TABLE]
where or 2 as in (9). Let \big{\{}q_{j}\big{\}}_{j=1}^{k}=\big{\{}(q_{jv})_{v}\big{\}}_{j=1}^{k} be an -basis of the orthogonal complement of in such that
[TABLE]
Thus, for ,
[TABLE]
Let . Define a homomorphism by
[TABLE]
so that by (18) we have an exact sequence
[TABLE]
Let be a homomorphism splitting the exact sequence (20), i.e., is the identity map on . Such a splitting exists because and are real vector spaces. Let
[TABLE]
be the usual Haar measures on and .
Recall that in order to define in (12) we fixed a Haar measure on . In order to calculate Mellin transforms below, we will need to compare the Haar measure on with a Haar measure coming from . Namely, if is the isomorphism defined by the splitting , i.e.,
[TABLE]
then the measure is a Haar measure on . Hence
[TABLE]
where the positive constant is evaluated in the next lemma.
Lemma 1**.**
Let be the matrix whose rows are indexed by and whose columns are indexed by , with entry in the row and the column, with as in (17). Then in (23) is independent of the splitting in (22) and is given by
[TABLE]
where is the transpose of and is the number of complex places of .
Proof.
For and , let be the standard dot product . Recall that we defined in (16) another inner product on , namely . To relate these products, let be given by \big{(}T(x)\big{)}_{v}:=e_{v}x_{v}. Then
[TABLE]
Note that .
Let be an orthonormal basis of (with respect to the dot product), let C_{1}:=\big{\{}\sum_{\ell}x_{\ell}u_{\ell}\big{|}\,0\leq x_{\ell}\leq 1\big{\}}\subset V be the -cube spanned by the , and let . By the definition of the measure given in the paragraph preceding (12), .
We define next an analogous subset with . Let be the “standard” orthonormal basis of as an -vector space; that is, if , and otherwise. Let be the -cube spanned by , so that .
Set , so that . Thus in (23) satisfies
[TABLE]
Now, and is the measure on that maps by to the standard Haar measure on \big{(}see (15), (21) and (22)\big{)}. Hence, , where is the -matrix whose first columns are the vectors w_{\ell}:={\mathrm{Log}}_{G}\big{(}{\mathrm{LOG}}^{-1}(u_{\ell})\big{)}\in\mathbb{R}^{{\mathcal{A}_{L}}}\ \,(1\leq\ell\leq r). The remaining columns of are the vectors {\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}\ \,(1\leq j\leq k).
Suppose is another splitting of (20). Then , and therefore {\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}-{\mathrm{Log}}_{G}\big{(}\tilde{\sigma}(F_{j})\big{)} lies in the span of the columns . Hence is independent of the splitting , as claimed in the lemma. We are therefore free to use the splitting determined by
[TABLE]
Using (19) and the orthogonality relations (17), one checks that this is indeed a splitting of . With this , the last columns of are just {\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}=d_{j}^{-1}q_{j}\in\mathbb{R}^{\mathcal{A}_{L}}. As and \big{(}see (25)\big{)}, we have
[TABLE]
where is the -matrix whose columns are applied to the columns of , i.e., the columns of are , followed by .
To prove the lemma we must show that . We calculate as
[TABLE]
where is the -matrix whose columns are , followed by (i.e., ). Using the orthonormality of the ’s (with respect to the dot product), we see that can be divided into four blocks, the upper left one being the identity matrix . Below it, has a block with entries
[TABLE]
where we used (25) and the definition of the ’s as a basis of the orthogonal complement of \big{(}with respect to , see (18)\big{)}. Since the bottom right block of is , we find that . Thus, \det\!\big{(}R_{\phantom{l}}^{\intercal}R\big{)}=\sqrt{\mathrm{det}(Q_{\phantom{l}}^{\intercal}Q)}. A similar calculation shows , whence .∎
In order to study the -series (12), we need to consider integrals of the form
[TABLE]
for . For , define by substituting above:
[TABLE]
Note that the integral (27) depends only on modulo , so the function is independent of the choice of splitting the exact sequence (20). The fact that (27) depends only on modulo also shows that
[TABLE]
so we will concentrate on , a function of only variables.
Define a linear map by , where is the matrix whose column is , as in Lemma 1. Also define maps for each by S_{v}(s)=\big{(}S(s)\big{)}_{v}. That is,
[TABLE]
Note that is injective since the are linearly independent.
Our first aim is to calculate the (-dimensional) Mellin transform
[TABLE]
where \mathrm{Re}(s):=\big{(}\mathrm{Re}(s_{1}),\ldots,\mathrm{Re}(s_{k})\big{)}\in{\mathcal{D}}, with
[TABLE]
As for all v\in{\mathcal{A}_{L}}\ \big{(}see (17)\big{)}, for we have . Hence is a non-empty, open, convex subset of . We will presently prove that the Mellin transform in (31) converges if .
In the following calculation of the reader should initially consider only real , so that the integrand is positive. At the end of the calculation it will become clear that the integral converges for in the open subset of where .
[TABLE]
where in the last step we used Lemma 1 and \delta\big{(}\gamma(x,h)\big{)}=\delta\big{(}\sigma(h)x\big{)}=h, with as in (19). Next we substitute to get
[TABLE]
where is the number of real places of .
Lemma 2**.**
For any (see (32)), the Mellin inversion formula holds:
[TABLE]
where and is the product of the vertical lines , taken from to .
Proof.
The calculation (33) shows that the Mellin transform is defined for . Thus Mellin inversion will work provided that \int_{I_{\sigma}}\big{|}(M\psi)(s)h^{-s}\,ds\big{|}<\infty. Since \big{|}h^{-s}\big{|} and \big{|}e_{v}^{e_{v}S_{v}(s)/2}\big{|} are constant on , we turn to the factors in (33). Write , . In a strip , we have |\Gamma(z)|<C_{3}\exp\!\big{(}\!-|\mathrm{Im}(z)|\big{)}.444 In fact, holds for any [AAR, Cor. 1.4.4]. Since \mathrm{Re}\big{(}e_{v}S_{v}(s)\big{)}=e_{v}S_{v}(\sigma)>0 for ,
[TABLE]
where is the 1-norm on , and is the linear function from (30). Since is injective, there exists such that
[TABLE]
Thus is integrable over and Mellin inversion (34) holds. ∎
Let
[TABLE]
and
[TABLE]
We take the branch of which is real when is real and positive.
Lemma 3**.**
Let and . Then
[TABLE]
with as in (28), as in (36), as in Lemma 1, as in Lemma 2, and (resp. ) being the number of real (resp. complex) places of .
Proof.
If is complex, so , the duplication formula gives
[TABLE]
If is real, so , then
[TABLE]
From (33) and Mellin inversion (34) we get
[TABLE]
Now we apply the lemma to the Basic Inequality (14).
Corollary 4**.**
For and , define by
[TABLE]
Then, with \mathcal{L}:=\sqrt{\det(Q_{\phantom{l}}^{\intercal}Q)}/\big{(}2^{r_{1}}(2\sqrt{\pi})^{r_{2}}\pi^{k}\big{)}, for any we have
[TABLE]
Proof.
Define by . In view of (29) and Lemma 3, (39) will follow from \big{(}\delta(r)\big{)}_{j}=\mathrm{e}^{ny_{j}/2}. Indeed, by (19),
[TABLE]
If , then by (17) we have for all . Using (9) and (10) we find
[TABLE]
If , then \big{(}see (17)\big{)}, so
[TABLE]
as claimed. To prove (40), apply to (39), noting that for . ∎
3. Existence and uniqueness of the critical point
We shall show that for every there is a unique \sigma=\sigma(y)\in{\mathcal{D}}\ \,\big{(}see (32)\big{)} which is a critical point of , defined as
[TABLE]
with as in (36). The map taking to the critical point is closely related to the Legendre transform of , but we will develop the theory from scratch as ours is an easy case of the general theory of the Legendre transform [HUL, §E] [Sim, §1 and §5].
Lemma 5**.**
Let be as in (36). Then is steep [Sim, p. 30], i.e.,
[TABLE]
where the limit is taken over as its Euclidean norm tends to infinity.
Proof.
Recall that the linear map in (30) is injective. Hence there exists such that, for all ,
[TABLE]
For any , there is a such that S_{v_{0}}(\sigma)=\max_{v\in{\mathcal{A}_{L}}}\big{\{}S_{v}(\sigma)\big{\}}. The previous inequality says that
[TABLE]
The known behavior of for shows that there is a such that
[TABLE]
for all and all ( will do). Also, Stirling’s formula shows that
[TABLE]
for . It follows from (43), (42), and (44) that when is large,
[TABLE]
and the lemma follows. ∎
The next lemma amounts to the fact that the gradient of a steep and differentiable strictly convex function is a bijection. However, in our case the domain , which means that we would need to check the boundary behavior of before citing results from convex analysis. We prefer not to quote and instead adapt the usual proof [Sim, §1] [HUL, §E] to our nicely behaved function .
Lemma 6**.**
For any there is a unique such that .
Proof.
For any , let , , and let
[TABLE]
which we will now prove to be finite, i.e., . Let be a sequence in such that converges to . By (43), is bounded below, so it suffices to check that the sequence is bounded. By Lemma 5, for with sufficiently large. For such ,
[TABLE]
which shows that is bounded.
We now prove that the infimum defining is assumed at a point in the open set . Passing to a subsequence of the bounded sequence , we may assume that the converge to a point in the closure of in . Recall from (32) that is the (non-empty) open set consisting of such that for all . If , then for some . Since \log\Gamma_{v}\big{(}S_{v}(\tau^{(i)})\big{)}\to+\infty as , and the remaining summands in the definition of remain bounded from below (as does ), we conclude that . Since is an interior minimum of the smooth function , we have . By (41), , as claimed.
To prove the uniqueness of , it suffices to prove that is a strictly convex function on .555 That is, for all and all . Such a function cannot have more than one critical point. To prove this, let g(t):=F_{y}\big{(}t\tau+(1-t)\tilde{\tau}\big{)}. Assuming that is strictly convex, is a strictly convex function of a single real variable . Thus, , so has an increasing derivative . But would imply , whence is constant and therefore not strictly convex. The strict convexity of follows from the strict convexity of for . Indeed,
[TABLE]
with strict inequality holding for unless for all . But this is impossible because in (30) is injective. ∎
The function in (45) is a concave function of , being the infimum over of the set of concave (in fact, affine) functions . The convex function is known as the Legendre transform of .
4. Inequalities at the critical point
To take advantage of the inequality (14), we will later need to drop all terms in (14) corresponding to algebraic integers . For this we will need some control of the first coordinate of the function in Lemma 6. In this subsection we take advantage of the concavity of to find a lower bound for . Then we use the convexity of to find a lower bound for {\alpha}\big{(}\sigma(y)\big{)}. Let
[TABLE]
[TABLE]
These definitions ensure that (see (35)). Note that is a concave function of for . We also note that has an inverse function since is strictly increasing when , tends to as , and tends to as .
Writing out the -th coordinate of the equation in Lemma 6, we get
[TABLE]
which for simplifies to
[TABLE]
Lemma 7**.**
Let be a number field of degree , with complex places. For , let be the first coordinate of the function defined in Lemma 6. Then
[TABLE]
Proof.
We prove (50) using the concavity of . Namely, from (49),
[TABLE]
where the last step uses
[TABLE]
which follows from (17) since
[TABLE]
Inequality (50) now follows, since is an increasing function. ∎
Our next result is a similar inequality for .
Lemma 8**.**
With notation as in Lemma 7, we have
[TABLE]
Proof.
We compute directly from the definition (36) of , using the convexity of for and (51):
[TABLE]
We now prove a lower bound for in terms of and .
Lemma 9**.**
Let , , and let be as in Lemma 6. Assume that for some , and . Then or
[TABLE]
Proof.
We shall show below that both denominators in (53) are positive if , as we may assume. Replacing with in (49), we have
[TABLE]
Since is a monotone decreasing convex function on , we find
[TABLE]
From and the fact that for ,
[TABLE]
Hence, as we are assuming ,
[TABLE]
Since , the right-hand side above is negative. Hence the left-most inequality in (53) is proved.
Next recall [Ni, §71, eq. (11)],
[TABLE]
Whence for , and so
[TABLE]
Now the second inequality in (53) follows as before. ∎
5. Asymptotics
With a view to applying Corollary 4 and the Basic Inequality (14), in this section we will estimate integrals of the type
[TABLE]
where as in Lemma 6, and . We will let be a Gaussian approximating (see (63) below) in a bounded neighborhood of \big{(}see (85)\big{)}. As usual with the saddle point method, we decompose the integral (54) into four pieces
[TABLE]
The term (i.e., ) is readily computed and gives (as we will prove in this section) the main term in (55). Thus, we shall prove that the terms and are as , uniformly in .
From now on we always (and usually tacitly) assume that the relative units for some subfield \big{(}see(7)\big{)}. Define by
[TABLE]
Note that the complex places do not carry a factor of 2. Instead we use this factor in the inner product (16) on defined by . The usefulness of assuming lies in the following.
Lemma 10**.**
Suppose and lies in the orthogonal complement of inside with respect to the above inner product. Then whenever and lie above the same place of and
[TABLE]
Proof.
The lemma will follow from the fact that is contained in the -span of in . Clearly , so it suffices to prove that . This follows from and . ∎
Recall that in (17) we fixed a basis of such that for all and for . In view of Lemma 10, we will write for any extending .
For a place , let and denote respectively the number of real and complex places of extending , and let (cf. [FS, p. 134])
[TABLE]
Note that or , and that .
Lemma 10 implies that defined in (30) satisfies
[TABLE]
where is any place extending . We therefore rewrite in (36) as
[TABLE]
where we write if extends , and was defined in (57).
For each and \sigma\in{\mathcal{D}}\ \,\big{(}\text{see }\eqref{D}\big{)}, define by
[TABLE]
i.e., is the error in the degree-2 Taylor approximation of T\mapsto\alpha_{\kappa_{w}}\!\big{(}S_{w}(\sigma+iT)\big{)} at . We shall henceforth take any and let be the corresponding saddle point in Lemma 6. Thus . Using this and (59), we find
[TABLE]
It follows from (59)–(61) that
[TABLE]
The linear terms in have disappeared as is a critical point of .
For fixed and , define the following functions of :
[TABLE]
Although and depend on , we do not include in our notation.
5.1. The main term
In Lemma 1 we defined the matrix of rank whose coefficients are . We will write for the matrix with entries and rank . Recall that we write for any extending . Let be the set of -element subsets of . For , let be the submatrix of whose rows are indexed by the elements of . In the computation of in Lemma 3 the term appears. Using the smaller matrix we have
[TABLE]
as follows from
[TABLE]
Next we calculate some integrals such as in (55), and its derivatives.
Lemma 11**.**
Let and be as above, where , let , and define
[TABLE]
Then, with as in (58),
[TABLE]
*Furthermore, for any we have *
[TABLE]
and
[TABLE]
Proof.
Let be the matrix with entries Then for , considered as a matrix, satisfies . Hence
[TABLE]
The matrix is clearly positive semi-definite. The Cauchy-Binet formula gives , with as in (67).666 The Cauchy-Binet formula computes , where is a and is , in terms of the minors of and . But as for at least one , since has rank . Hence is positive definite, and so the integral in (68) is the well-known Gaussian integral attached to a positive definite quadratic form in variables, as claimed in (68).
The other equalities in Lemma 11 are obtained by differentiating (68) with respect to repeatedly. Indeed, noting that the partial derivative is independent of , i.e., , we have
[TABLE]
proving the equalities. The inequalities follow from , as . ∎
As for , we can now evaluate .
Corollary 12**.**
With notation as in (63), for we have
[TABLE]
where as in Lemma 6 and
[TABLE]
5.2. The small terms
We begin by quoting some one-variable estimates.
Lemma 13**.**
If , , and , then
[TABLE]
Proof.
The estimate (70) is proved in [Su2, Lemma 4.4]. We now prove (71). From [Su2, Lemma 4.11] we have
[TABLE]
while from [FS, Lemma 5.3] we have
[TABLE]
where is the floor of . Since [FS, p. 141], we have
[TABLE]
Indeed, for the last inequality is obvious, while for a much better inequality follows from . Hence
[TABLE]
Combining this with (72) we obtain (71). ∎
We will need the following inequality, proved by elementary calculus.
[TABLE]
Lemma 14**.**
Suppose , and let
[TABLE]
Then, for any ,
[TABLE]
and
[TABLE]
Proof.
Inequality (77) follows from
[TABLE]
where the first inequality is from [FS, p. 139] and the last one uses (74) with . To prove (76) we use [Su2, Lemma 4.5],
[TABLE]
where the second inequality again follows from (74). ∎
Next we deal with the second order remainder term in the Taylor expansion about of , taking and .
Lemma 15**.**
For \big{(}see (32)\big{)}, and as in (60), we have
[TABLE]
Proof.
The first inequalities in (78) and (79) are proved in [Su2, Lemma 4.7], as is also (81). The second inequalities in (78) and (79) follow from [FS, Lemma 5.2] and . The identities in (80) follow from (60) and . ∎
Lemma 16**.**
\big{(}[FS, (5.11)]\big{)}*
If with , then*
[TABLE]
We first estimate the easier “outer” terms, and in (55), i.e., where the region of integration is . For , let correspond to a maximal summand in (69), so
[TABLE]
Thus,
[TABLE]
and so
[TABLE]
For and , let \big{(}cf. (75)\big{)}
[TABLE]
Define the neighborhood of as
[TABLE]
The next lemma shows that and are small compared to in Corollary 12.
Lemma 17**.**
Suppose , and . Then, with as in (85), as in Lemma 6, and as in (63) and (65), we have
[TABLE]
Proof.
We first prove (86). Note that implies
[TABLE]
Using this, (65) and (59) we have,
[TABLE]
Let denote the -dimensional box
[TABLE]
and let denote its complement. Making the change of variables for , we have
[TABLE]
The latter integral is easy to bound using Lemmas 13 and 14. We integrate over (overlapping) regions, each of which has of the range over all of , and the remaining over . Since , we conclude that
[TABLE]
Now inequality (83) and Corollary 12 prove (86).
Next we prove (87). Changing variables as before, we have
[TABLE]
Once again, we bound using overlapping regions, one for each . The integral over the region given by all such that is bounded by
[TABLE]
We can use (77) to bound the first integral, and the remaining integrals are explicitly known. Hence, summing over the regions,
[TABLE]
We again conclude using (83). ∎
For the “inner” integral in (55), we can only expect estimates of the kind , whereas and are essentially O\big{(}I_{1}\exp(-m^{1/3})\big{)}. This allowed us to use simple estimates for the contribution of places . However, to estimate we shall need the following geometric result.
Lemma 18**.**
Let be an matrix of rank , and let . Define linear maps by where . For any -element subset , let denote the submatrix of given by \big{(}M_{\eta}\big{)}_{\ell,j}=m_{i_{\ell},j}. Define and let maximize . Then
[TABLE]
Proof.
Replacing with , we may assume . Hence simply maximizes . Fix , and define for by For , let denote with the row of replaced by the row. Then, by Cramer’s rule, so . Hence
[TABLE]
Lemma 19**.**
For and we have
[TABLE]
*with notation as in (55), and Z:=\big{(}\mathrm{e}^{\lvert{\mathcal{A}_{K}}\rvert k^{4}D^{4}m^{-1/3}}-1\big{)}/\big{(}\lvert{\mathcal{A}_{K}}\rvert k^{4}D^{4}m^{-1/3}\big{)}. *
Proof.
Lemma 18, applied to the matrix and a_{w}:=\sqrt{m_{w}\alpha_{\kappa_{w}}^{\prime\prime}\!\big{(}S_{w}(\sigma)\big{)}}, shows
[TABLE]
for and as in (82). Since is convex, we have,
[TABLE]
For and , by (84) and (85) we have
[TABLE]
Hence,
[TABLE]
Combining this with Lemma 15, we conclude that for ,
[TABLE]
Lemmas 15 and 16 now show that for ,
[TABLE]
where in the last step we used the convexity of .
By Lemma 15, \mathrm{Im}\big{(}e^{\rho(T)}\big{)} is odd, while \mathrm{Re}\big{(}e^{\rho(T)}\big{)} is even in . Furthermore, is a real and even function of , and is mapped to itself by . Hence, using (65) and (92),
[TABLE]
Using Lemma 11 and Corollary 12, we find
[TABLE]
Our next estimate will let us deal with the term in the Basic Inequality (14) and (40).
Lemma 20**.**
For and we have
[TABLE]
with as in (55), as in (59) and as in Lemma 6.
Proof.
By (51), for we have
[TABLE]
Hence we will need to bound integrals of the kind .
Let be as in (82) and let . Then, using (88) and changing variables as in the proof of Lemma 17,
[TABLE]
Using Lemma 13 and (83) we obtain,
[TABLE]
By inequality (91),
[TABLE]
where the last inequality uses and for [FS, (5.7)]. Hence, by (94),
[TABLE]
It follows that
[TABLE]
where the last equality uses Corollary 12. ∎
6. Proof of the Main Theorem
The next lemma will allow us to ensure that each integral in the Basic Inequality (14) is positive. As in §5, we always assume that .
Lemma 21**.**
There is an absolute constant such that if and , then for t:=\exp\!\big{(}\Psi(0.51+\frac{r_{2}}{2n})\big{)} we have and
[TABLE]
where is given by Corollary 4, , and
[TABLE]
Proof.
We note that is as in Corollary 4, except that we used (66) to express in terms of rather than . Letting , from Corollary 4 we have
[TABLE]
Again from Corollary 4, for ,
[TABLE]
Applying Lemma 7 to , since is increasing we have,
[TABLE]
Since by (56), we have . Thus, Lemma 20 yields
[TABLE]
for and some absolute . By (55) and (54) we have
[TABLE]
where . Taking in Lemmas 17 and 19, and after possibly enlarging , we obtain Hence,
[TABLE]
and so, since by (97),
[TABLE]
A glance at (95) shows that we are finished. ∎
We now prove the Main Theorem in §1, which we do not repeat here. Note that
[TABLE]
Take and t:=\exp\!\big{(}\Psi(0.51+\tfrac{r_{2}}{2n})\big{)} as in Lemma 21. In the Basic Inequality (14) take , so that the sum there includes only nonzero . By Lemma 21, each integral in the sum is positive. Retaining only the term corresponding to we have, again by Lemma 21,
[TABLE]
where and . Corollary 4 applied to gives
[TABLE]
We need an upper bound for \det\!\big{(}H(\sigma)\big{)} in (101). In view of (69), we look for an upper bound for \alpha_{\kappa_{w}}^{\prime\prime}\!\big{(}S_{w}(\sigma)\big{)}. Note that
[TABLE]
since is decreasing for . Note that by (97) and that
[TABLE]
From Lemma 9 we have
[TABLE]
Estimating the series by an integral, yields
[TABLE]
From (Cauchy-Binet), and (69),
[TABLE]
where we also used .
We now bound the term in (101) from below. From (102) and (103),
[TABLE]
Using the lower bound for in Lemma 8, we have
[TABLE]
We now distinguish two cases according to the size of . If , then \log\Gamma\big{(}\sigma_{1}+\textstyle\frac{r_{2}}{2n}\big{)}\geq\log(6). Since , after possibly increasing , the Main Theorem follows easily from (100), (101), (104) and (105).
We now turn to the remaining case, i.e., . (By Lemma 21, .) Then in (104) we can replace by 5. The critical points of r\mapsto\log\Gamma\big{(}r+\textstyle\frac{r_{2}}{2n}\big{)}-ry_{1} occur where
[TABLE]
But is injective, so is the only critical point of r\mapsto\log\Gamma\big{(}r+\textstyle\frac{r_{2}}{2n}\big{)}-ry_{1} , and it is a local (therefore global) minimum. Since ,
[TABLE]
Note that , for , and for . Hence
[TABLE]
is decreasing for . We conclude that
[TABLE]
Since and , after again possibly increasing , we can use the “spare” to control the term in (104).
We note that the our proof of the Main Theorem shows that the appearing in it can be replaced by \exp\!\big{(}nf(r_{2}/(2n))\big{)}, where is the number of complex places of and
[TABLE]
In particular, if is totally real, we can replace by . We can also replace above by for any .
Finally, we prove that every element of is represented by a pure wedge, as claimed in the Introduction.
Lemma 22**.**
Suppose is a -lattice in of rank . Then every element of has the form
[TABLE]
for some integer and some basis of as a -module.
Proof.
We may clearly assume . Define the homomorphism by . As , is torsion-free and so is a direct summand of of rank . Let be a -basis of such that is a -basis of , let , and define by . Notice that for .
For , write with . Then
[TABLE]
As the -pairing of with is non-degenerate, . ∎
7. Appendix by Fernando Rodriguez Villegas (May 2002)
Some remarks on Lehmer’s conjecture
7.1.
The logarithmic Mahler measure of a non-zero Laurent polynomial
is defined as
[TABLE]
and its Mahler measure as , the geometric mean of on the torus
[TABLE]
When Jensen’s formula gives the identity
[TABLE]
where , from which we clearly obtain that if . By a theorem of Kronecker if for then is cyclotomic, i.e., is monic and its roots are either [math] or roots of unity.
In the early 30’s Lehmer [Le] famously asked whether there is an absolute lower bound for when and . The purpose of this note is to point out a simple reformulation of this question in terms of the logarithmic embedding of units of a number field and, given this setting, to propose a natural generalization.
7.2.
We start with some general observations about . First of all, the fact that the integral in (106) is finite for all non-zero does need a proof. Here is a sketch. Using Jensen’s formula we find, as in (107) that
[TABLE]
where , , , and are the leading coefficient, roots and degree, respectively, of viewed as a polynomial in . The ’s are algebraic functions of , continuous and piecewise smooth, except at those ’s where vanishes (where some will go off to infinity).
We can apply the above procedure to any variable on the torus . It is not hard to see that we may change coordinates in such a way that is actually constant, completing the proof by induction on .
This last remark can be expanded. Let be the Newton polytope of ; i.e., the convex hull of the exponents of monomials such that if
[TABLE]
then .
We define a face of as the non-empty intersection of with a half-space in . Chose a parameterization of the affine subspace of smallest dimension containing ; is the dimension of the face . Define
[TABLE]
a polynomial whose own Newton polytope is . We call the face polynomial associated to the face . It depends on a choice of but note that by changing variables in the integral is actually independent of that choice.
It is not hard to see that for any facet (co-dimension face) we can choose and system of coordinates in so that, in the notation of (108), . By (108) and induction on we conclude [Sm1] that
[TABLE]
In particular,
[TABLE]
Also, since clearly , we have that
[TABLE]
Though Lehmer’s conjecture is about polynomials in one variable, polynomials in more variables are also relevant due to the following result [Bo]. For any and we have
[TABLE]
That is, there are one variable polynomials with as close to as desired. (We should note that (111) is not an immediate consequence of general results about integration but requires a somewhat delicate analysis.)
7.3.
Let us go back to polynomials in one variable. If we want to find polynomials with positive but small , by (109) and (110) (and Gauss’ lemma) we may as well restrict ourselves to minimal polynomials of algebraic units.
Let be a number field of degree . Let be the set of embeddings and the real vector space of formal linear combinations
[TABLE]
We have the decomposition
[TABLE]
where is the subspace of where complex conjugation acts like . We let (in terms of the standard notation and ).
By Dirichlet’s theorem the image of the unit group by the log map
[TABLE]
is a discrete subgroup of rank .
On we define the -norm
[TABLE]
and we let
[TABLE]
(the reason for this indexing will become clear shortly).
For any unit we have hence
[TABLE]
Let be the (monic) minimal polynomial of and
[TABLE]
This simple observation allows us to reformulate Lehmer’s conjecture as follows.
Conjecture**.**
(Lehmer) There exists an absolute constant such that
[TABLE]
7.4.
Let be a vector space over of dimension and a discrete subgroup of rank . A choice of basis for determines -norms on for by
[TABLE]
For each we define (with respect to the chosen basis)
[TABLE]
where the minimum is taken over all which are linearly independent over .
If is the integral matrix whose -th column consists of the coordinates of in the basis then, as it is easily seen,
[TABLE]
where runs over all minors of .
Returning to the number field situation of the previous section we define the invariants
[TABLE]
where, as before, is the image of the units of under the log map.
A general version of Lehmer’s conjecture would then be
Conjecture**.**
For each there exists an absolute constant such that
[TABLE]
A straightforward calculation shows that the top invariant , with the rank of the unit group , equals the regulator of . It is known [Zi], [Fr], [Sk] that the regulator of number fields is universally bounded below and hence the above conjecture is true for .
In summary, we have seen (18) that Lehmer’s conjecture can be phrased in terms of the -norm of units under the log map. The above conjecture is an attempt to quantify, in what seems to be the most natural way, the question of what is the general shape of , the discrete group of units under the log map.
7.5.
We may carry these ideas a little further still. Borel proved, generalizing Dirichlet’s result for units, that for each there is a regulator map
[TABLE]
whose image is a discrete subgroup of , with , of rank and covolume related to the value of the zeta function of at . Here are the groups defined by Quillen.
We now define
[TABLE]
and we may ask: what is the nature of these invariants, how do they depend on the field ? Does the analogue of Lehmer’s conjecture hold?
Apart from their formal analogy with Lehmer’s question, answers to such questions can be quite useful in practice as we now illustrate.
7.6.
For general , very little is known about the groups or the map . For , however, things can be made quite explicit (and of course corresponds to the case of units). Indeed, up to torsion, is isomorphic to the Bloch group , defined by generators and relations as follows.
For any field define
[TABLE]
where the corresponding term in the sum is omitted if and
[TABLE]
It is not hard to check that . Finally, let
[TABLE]
We recall the definition of the Bloch–Wigner dilogarithm. Starting with the usual dilogarithm
[TABLE]
one defines
[TABLE]
and checks that it extends to a real analytic function on , continuous on . See [Za] for an account of its many wonderful properties. It is obvious that
[TABLE]
The 5-term relation satisfied by guarantees that, extended by linearity to , it induces a well defined function on (still denoted by ).
For (114) can be formulated as follows
[TABLE]
((115) makes it clear that the image lies in ) whose image is a discrete subgroup of rank .
An a priori lower bound for even for the simplest case where is of rank (namely, for a field with only one complex embedding) would be quite useful. For example, in [BRV1] we find that an identity between the Mahler measure of certain two-variable polynomials is equivalent to the following
[TABLE]
This was proved by Zagier by showing that it is a consequence of series of 5-term relations. Such calculations, however, can be quite hard and at present there is no known algorithm that is guaranteed to produce the desired result. Clearly if we knew a reasonable lower bound for the possible non-zero values of for a simple numerical verification would be enough to prove (116).
Similarly, many identities [BRV2] between the Mahler measure of certain two-variable polynomials and for a corresponding number field , which by Borel’s theorem are known up to an unspecified rational number, could be proved by a numerical check. For example, we can show that
[TABLE]
with , where is the splitting field , of discriminant . However, though numerically appears to be equal to we cannot prove this at the moment. Again, a reasonable lower bound on for non-torsion elements would allow us to conclude that by checking it numerically to high enough precision.
There is also some evidence that might be universally bounded below, at least for fields with one complex embedding. Indeed, for a such a field one can construct a hyperbolic three dimensional manifold by taking the quotient of hyperbolic space by a torsion-free subgroup of the group of units of norm in a quaternion algebra over ramified at all its real places. Its associated Bloch group element , obtained from a triangulation of into ideal tetrahedra, satisfies . On the other hand, the volume of hyperbolic 3-manifolds is known to be universally bounded below. The question becomes then, that of obtaining an upper bound for the index in of the subgroup generated by all such . This index is likely to be rather small; in fact, if we accept a precise form of Lichtembaum’s conjecture, it should be essentially the order of , an analogue of a class group. Unfortunately, there is no known upper bound for in terms of, say, the degree and discriminant of .
Finally, to a hyperbolic 3-manifold with one cusp one may associate [CCGLS] a two variable polynomial , called the A-polynomial of . Its zero locus parameterizes deformations of the complete hyperbolic structure of .
It is known that
[TABLE]
for every face polynomial of and that is reciprocal, i.e. for some . It is interesting that these two properties, which have a topological and -theoretic origin, are, for irreducible, precisely the known necessary conditions for a polynomial in to have to have small Mahler measure (the first, an analogue of being the minimal polynomial of an algebraic unit, because of (109); the second because is known to be universally bounded below for non-reciprocal [Sm1]).
Though the whole picture is still not completely clear yet one can prove [BRV2] for many ’s identities of the form
[TABLE]
where is the Bloch group element associated to . This suggests a direct link between Lehmer’s conjecture and the size of the invariants .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AAR] G. Andrews, R. Askey and R. Roy, Special functions , Cambridge U. Press, Cambridge (1999).
- 2[BS] Z. I. Borevich and I. R. Shafarevich, Number Theory . Academic Press, New York (1966).
- 3[Bo] D.W. Boyd, Speculations concerning the range of Mahler’s measure , Canad. Math. Bull. 24 (1981) 453–469.
- 4[BRV 1] D.W. Boyd and F. Rodriguez Villegas, Mahler’s measure and the dilogarithm I , Canad. J. Math. 54 (2002) 468–492.
- 5[BRV 2] D.W. Boyd and F. Rodriguez Villegas, Mahler’s measure and the dilogarithm II ,
- 6[Ca] J. Cassels, An introduction to the geometry of numbers . Springer, Berlin (1959).
- 7[CCGLS] D. Cooper, M. Culler, H. Gillet, D. D. Long, and P. B. Shalen, Plane curves associated to character varieties of 3-manifolds , Invent. Math. 118 (1994) 47–84.
- 8[CF] A. Costa and E. Friedman, Ratios of regulators in totally real extensions of number fields , J. Number Th. 37 (1991) 288–297.
