A case of the Rodriguez Villegas conjecture

Ted Chinburg; Eduardo Friedman; Fernando Rodriguez-Villegas; James; Sundstrom

arXiv:1903.01384·math.NT·March 8, 2023

A case of the Rodriguez Villegas conjecture

Ted Chinburg, Eduardo Friedman, Fernando Rodriguez-Villegas, James, Sundstrom

PDF

Open Access

TL;DR

This paper proves a high-rank case of Rodriguez Villegas's conjecture, which relates to bounds on units in number fields and interpolates between known cases of Lehmer's conjecture and bounds on regulators.

Contribution

It establishes the conjecture for number fields containing a subfield with a large degree extension, extending previous partial results.

Findings

01

Proves the conjecture when L contains a subfield K with large [L:K] relative to [K:Q]

02

Shows the kernel of the norm map plays a key role in the high-rank case

03

Extends understanding of bounds on units and regulators in number fields

Abstract

Let L be a number field and let E be any subgroup of the units O_L^* of L. If rank(E) = 1, Lehmer's conjecture predicts that the height of any non-torsion element of E is bounded below by an absolute positive constant. If rank(E) = rank(O_L^*), Zimmert proved a lower bound on the regulator of E which grows exponentially with [L:Q]. Fernando Rodriguez Villegas made a conjecture in 2002 that "interpolates" between these two extremes of rank. Here we prove a high-rank case of this conjecture. Namely, it holds if L contains a subfield K for which [L:K] >> [K:Q] and E contains the kernel of the norm map from O_L^* to O_K^*.

Equations481

\|\omega\|_{1}\geq c_{0}c_{1}^{j}\qquad\qquad\Big{(}\forall\omega\in{\bigwedge}^{j}{\mathrm{LOG}}({\mathcal{O}_{L}^{*}})\subset{\bigwedge}^{j}\mathbb{R}^{\mathcal{A}_{L}},\ \,\omega\not=0\Big{)}.

\|\omega\|_{1}\geq c_{0}c_{1}^{j}\qquad\qquad\Big{(}\forall\omega\in{\bigwedge}^{j}{\mathrm{LOG}}({\mathcal{O}_{L}^{*}})\subset{\bigwedge}^{j}\mathbb{R}^{\mathcal{A}_{L}},\ \,\omega\not=0\Big{)}.

\big{(}{\mathrm{LOG}}(\gamma)\big{)}_{v}:=e_{v}\log|\gamma|_{v},\quad e_{v}:=\begin{cases}1\ &\text{if}\ v\ \text{is real},\\ 2\ &\text{if}\ v\ \text{is complex}\end{cases}\quad\quad(\gamma\in{\mathcal{O}_{L}^{*}},\ v\in{\mathcal{A}_{L}}),

\big{(}{\mathrm{LOG}}(\gamma)\big{)}_{v}:=e_{v}\log|\gamma|_{v},\quad e_{v}:=\begin{cases}1\ &\text{if}\ v\ \text{is real},\\ 2\ &\text{if}\ v\ \text{is complex}\end{cases}\quad\quad(\gamma\in{\mathcal{O}_{L}^{*}},\ v\in{\mathcal{A}_{L}}),

δ_{w}^{v} := {10 if w = v, if w \neq = v .

δ_{w}^{v} := {10 if w = v, if w \neq = v .

δ^{I} := δ^{v_{1}} \land δ^{v_{2}} \land \dots \land δ^{v_{j}} .

δ^{I} := δ^{v_{1}} \land δ^{v_{2}} \land \dots \land δ^{v_{j}} .

∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{1} > 0.001 \cdot 1. 4^{j},

∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{1} > 0.001 \cdot 1. 4^{j},

\|{\mathrm{LOG}}(\varepsilon)\|_{2}\geq\sqrt{[L:\mathbb{Q}]}\log\!\big{(}(1+\sqrt{5})/2\big{)}\qquad\qquad(\varepsilon\in{\mathcal{O}_{L}^{*}},\ \varepsilon\not=\pm 1).

\|{\mathrm{LOG}}(\varepsilon)\|_{2}\geq\sqrt{[L:\mathbb{Q}]}\log\!\big{(}(1+\sqrt{5})/2\big{)}\qquad\qquad(\varepsilon\in{\mathcal{O}_{L}^{*}},\ \varepsilon\not=\pm 1).

μ > \frac{([ L : Q ] / j ) ^{j /2} 1.40 6 ^{j}}{( j + 2 ) j} (1 \leq j < [L : Q]) .

μ > \frac{([ L : Q ] / j ) ^{j /2} 1.40 6 ^{j}}{( j + 2 ) j} (1 \leq j < [L : Q]) .

∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{1} \geq ∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{2} = μ,

∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{1} \geq ∥ LOG (ε_{1}) \land \dots \land LOG (ε_{j}) ∥_{2} = μ,

∥ ω ∥_{1} > 0.001 \cdot 1. 4^{n - 2},

∥ ω ∥_{1} > 0.001 \cdot 1. 4^{n - 2},

E=E(L/K):=\big{\{}\varepsilon\in{\mathcal{O}_{L}^{*}}\big{|}\,\text{Norm}_{L/K}(\varepsilon)\text{ is a root of unity}\big{\}}

E=E(L/K):=\big{\{}\varepsilon\in{\mathcal{O}_{L}^{*}}\big{|}\,\text{Norm}_{L/K}(\varepsilon)\text{ is a root of unity}\big{\}}

∥ ε_{1} \land \dots \land ε_{j} ∥_{1} \geq ∥ ε_{1} \land \dots \land ε_{j} ∥_{2} \geq 1. 1^{j},

∥ ε_{1} \land \dots \land ε_{j} ∥_{1} \geq ∥ ε_{1} \land \dots \land ε_{j} ∥_{2} \geq 1. 1^{j},

\|\varepsilon_{1}\wedge\cdots\wedge\varepsilon_{j}\|_{1}\geq\|\varepsilon_{1}\wedge\cdots\wedge\varepsilon_{j}\|_{2}\geq 1.1^{j}\qquad\qquad\big{(}j:=\mathrm{rank}_{\mathbb{Z}}(E)\big{)}.

\|\varepsilon_{1}\wedge\cdots\wedge\varepsilon_{j}\|_{1}\geq\|\varepsilon_{1}\wedge\cdots\wedge\varepsilon_{j}\|_{2}\geq 1.1^{j}\qquad\qquad\big{(}j:=\mathrm{rank}_{\mathbb{Z}}(E)\big{)}.

x=(x_{v})_{v\in{\mathcal{A}_{L}}}=\big{(}\,|\varepsilon|_{v^{{\phantom{-1}}}}^{\xi_{\phantom{-1}}}\!\!\!\!\big{)}_{v\in{\mathcal{A}_{L}}}\qquad\qquad\big{(}\varepsilon\in E,\ \,\xi\in\mathbb{R}\big{)}.

x=(x_{v})_{v\in{\mathcal{A}_{L}}}=\big{(}\,|\varepsilon|_{v^{{\phantom{-1}}}}^{\xi_{\phantom{-1}}}\!\!\!\!\big{)}_{v\in{\mathcal{A}_{L}}}\qquad\qquad\big{(}\varepsilon\in E,\ \,\xi\in\mathbb{R}\big{)}.

∣ Norm_{L / Q} (a) ∣ = v \in A_{L} \prod ∣ a ∣_{v}^{e_{v}}, (e_{v} := 1 if v is real, e_{v} := 2 if v is complex) .

∣ Norm_{L / Q} (a) ∣ = v \in A_{L} \prod ∣ a ∣_{v}^{e_{v}}, (e_{v} := 1 if v is real, e_{v} := 2 if v is complex) .

v \in A_{L} \sum e_{v} = [L : Q] =: n,

v \in A_{L} \sum e_{v} = [L : Q] =: n,

\prod_{v\in{\mathcal{A}_{L}}}x_{v}^{e_{v}}=1\qquad\qquad\big{(}x=(x_{v})_{v\in{\mathcal{A}_{L}}}\in{E_{\mathbb{R}}}\big{)},

\prod_{v\in{\mathcal{A}_{L}}}x_{v}^{e_{v}}=1\qquad\qquad\big{(}x=(x_{v})_{v\in{\mathcal{A}_{L}}}\in{E_{\mathbb{R}}}\big{)},

Θ_{E} (t; a) := \frac{μ _{E_{R}} ( E _{R} / E )}{∣ E _{tor} ∣} + a \in a / E a \neq = 0 \sum \int_{x \in E_{R}} e^{- c_{a} t ∥ a x ∥^{2}} d μ_{E_{R}} (x), ∥ a x ∥^{2} := v \in A_{L} \sum e_{v} ∣ a ∣_{v}^{2} x_{v}^{2},

Θ_{E} (t; a) := \frac{μ _{E_{R}} ( E _{R} / E )}{∣ E _{tor} ∣} + a \in a / E a \neq = 0 \sum \int_{x \in E_{R}} e^{- c_{a} t ∥ a x ∥^{2}} d μ_{E_{R}} (x), ∥ a x ∥^{2} := v \in A_{L} \sum e_{v} ∣ a ∣_{v}^{2} x_{v}^{2},

c_{\mathfrak{a}}:=\pi\big{(}\sqrt{|D_{L}|}\,\mathrm{Norm}_{L/\mathbb{Q}}(\mathfrak{a})\big{)}^{-2/n},\qquad\quad D_{L}:=\text{discriminant of }L,\quad n:=[L:\mathbb{Q}].

c_{\mathfrak{a}}:=\pi\big{(}\sqrt{|D_{L}|}\,\mathrm{Norm}_{L/\mathbb{Q}}(\mathfrak{a})\big{)}^{-2/n},\qquad\quad D_{L}:=\text{discriminant of }L,\quad n:=[L:\mathbb{Q}].

\Theta_{E}(t;\mathfrak{a})+\frac{2t\Theta_{E}^{\prime}(t;\mathfrak{a})}{n}\geq 0\qquad\qquad\qquad\Big{(}t>0,\ \,\Theta_{E}^{\prime}:=\frac{d\Theta_{E}}{dt}\Big{)}.

\Theta_{E}(t;\mathfrak{a})+\frac{2t\Theta_{E}^{\prime}(t;\mathfrak{a})}{n}\geq 0\qquad\qquad\qquad\Big{(}t>0,\ \,\Theta_{E}^{\prime}:=\frac{d\Theta_{E}}{dt}\Big{)}.

\frac{\mu_{E_{\mathbb{R}}}({E_{\mathbb{R}}}/E)}{\lvert E_{\mathrm{tor}}\rvert}\geq\sum_{\begin{subarray}{c}a\in\mathfrak{a}/E\\ a\not=0\end{subarray}}\int_{x\in{E_{\mathbb{R}}}}\Big{(}\frac{2t\|ax\|^{2}}{n}-1\Big{)}\mathrm{e}^{-t\,\|ax\|^{2}}\,d\mu_{E_{\mathbb{R}}}(x)\quad\quad(t>0).

\frac{\mu_{E_{\mathbb{R}}}({E_{\mathbb{R}}}/E)}{\lvert E_{\mathrm{tor}}\rvert}\geq\sum_{\begin{subarray}{c}a\in\mathfrak{a}/E\\ a\not=0\end{subarray}}\int_{x\in{E_{\mathbb{R}}}}\Big{(}\frac{2t\|ax\|^{2}}{n}-1\Big{)}\mathrm{e}^{-t\,\|ax\|^{2}}\,d\mu_{E_{\mathbb{R}}}(x)\quad\quad(t>0).

\big{(}{\mathrm{Log}}_{G}(g)\big{)}_{v}:=\log(g_{v})\qquad\qquad\big{(}v\in{\mathcal{A}_{L}},\ \,g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}\big{)}.

\big{(}{\mathrm{Log}}_{G}(g)\big{)}_{v}:=\log(g_{v})\qquad\qquad\big{(}v\in{\mathcal{A}_{L}},\ \,g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}\big{)}.

\langle\beta,\gamma\rangle:=\sum_{v\in{\mathcal{A}_{L}}}e_{v}\beta_{v}\gamma_{v}\qquad\qquad\big{(}\beta=(\beta_{v})_{v},\ \gamma=(\gamma_{v})_{v}\in\mathbb{R}^{\mathcal{A}_{L}}\big{)},

\langle\beta,\gamma\rangle:=\sum_{v\in{\mathcal{A}_{L}}}e_{v}\beta_{v}\gamma_{v}\qquad\qquad\big{(}\beta=(\beta_{v})_{v},\ \gamma=(\gamma_{v})_{v}\in\mathbb{R}^{\mathcal{A}_{L}}\big{)},

q_{1v}:=1\ \ (\forall v\in{\mathcal{A}_{L}}),\ \ \sum_{v\in{\mathcal{A}_{L}}}e_{v}q_{iv}q_{jv}=0\ \ \big{(}1\leq i\not=j\leq k:=1+\mathrm{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}}/E)\big{)}.

q_{1v}:=1\ \ (\forall v\in{\mathcal{A}_{L}}),\ \ \sum_{v\in{\mathcal{A}_{L}}}e_{v}q_{iv}q_{jv}=0\ \ \big{(}1\leq i\not=j\leq k:=1+\mathrm{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}}/E)\big{)}.

g \in E_{R} ⟺ v \in A_{L} \sum e_{v} q_{j v} lo g (g_{v}) = 0 (1 \leq j \leq k) .

g \in E_{R} ⟺ v \in A_{L} \sum e_{v} q_{j v} lo g (g_{v}) = 0 (1 \leq j \leq k) .

\big{(}\delta(g)\big{)}_{j}:=\prod_{v\in{\mathcal{A}_{L}}}g_{v}^{e_{v}q_{jv}}\qquad\qquad\big{(}1\leq j\leq k,\ \,g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}),

\big{(}\delta(g)\big{)}_{j}:=\prod_{v\in{\mathcal{A}_{L}}}g_{v}^{e_{v}q_{jv}}\qquad\qquad\big{(}1\leq j\leq k,\ \,g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}),

1 E_{R} G δ H 1.

1 E_{R} G δ H 1.

d μ_{G} := v \in A_{L} \prod \frac{d g _{v}}{g _{v}}, d μ_{H} := j = 1 \prod k \frac{d h _{j}}{h _{j}}

d μ_{G} := v \in A_{L} \prod \frac{d g _{v}}{g _{v}}, d μ_{H} := j = 1 \prod k \frac{d h _{j}}{h _{j}}

γ (x, h) := x σ (h),

γ (x, h) := x σ (h),

c μ_{G} \circ γ = μ_{E_{R}} \times μ_{H},

c μ_{G} \circ γ = μ_{E_{R}} \times μ_{H},

c = 2^{r_{2}} det (Q_{l}^{⊺} Q),

c = 2^{r_{2}} det (Q_{l}^{⊺} Q),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgebraic Geometry and Number Theory · Finite Group Theory Research

Full text

\markleft

T. Chinburg E. Friedman J. Sundstrom

A case of the Rodriguez Villegas conjecture

with an appendix by Fernando Rodriguez Villegas

Ted Chinburg, Eduardo Friedman and James Sundstrom

Department of Mathematics, University of Pennsylvania, David Rittenhouse Lab., 209 South 33rd Street, Philadelphia PA 19104-6395, USA

[email protected]

Departamento de Matemáticas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Ñuñoa, Santiago R.M., CHILE

[email protected]

The Abdus Salam International Centre for Theoretical Physics, ICTP Math Section, Strada Costiera 11, I-34151 Trieste, Italy

[email protected]

Department of Mathematics (038-16), Temple University, Wachman Hall, 1805 North Broad Street, Philadelphia PA 19122, USA

[email protected]

Abstract.

Let $L$ be a number field and let $E$ be any subgroup of the units ${\mathcal{O}_{L}^{*}}$ of $L$ . If $\mathrm{rank}_{\mathbb{Z}}(E)=1$ , Lehmer’s conjecture predicts that the height of any non-torsion element of $E$ is bounded below by an absolute positive constant. If $\mathrm{rank}_{\mathbb{Z}}(E)=\mathrm{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}}$ ), Zimmert proved a lower bound on the regulator of $E$ which grows exponentially with $[L:\mathbb{Q}]$ . Fernando Rodriguez Villegas made a conjecture in 2002 that “interpolates” between these two extremes of rank. Here we prove a high-rank case of this conjecture. Namely, it holds if $L$ contains a subfield $K$ for which $[L:K]\gg[K:\mathbb{Q}]$ and $E$ contains the kernel of the norm map from ${\mathcal{O}_{L}^{*}}$ to ${\mathcal{O}_{K}^{*}}$ .

Key words and phrases:

Lehmer’s conjecture, Mahler measure, units.

2010 Mathematics Subject Classification:

11R06, 11R27

Partially supported by U.S. N.S.F. grant NSF FRG Grant DMS-1360767 (Chinburg and Sundstrom), U.S. N.S.F. SaTC Grants CNS-1513671/1701785 (Chinburg) and by Chilean FONDECYT grant 1170176 (Friedman).

1. Introduction

In 2002 Fernando Rodriguez Villegas conjectured a surprising lower bound on a natural $1$ -norm of any non-trivial element of the $j$ -th exterior power of the units of a number field. For $j$ minimal, i.e., $j=1$ , Rodriguez Villegas’ conjecture is equivalent to Lehmer’s 1933 conjectural lower bound on the height of an algebraic number [Le] [Sm2]. For $j$ maximal, i.e., $j=\mathrm{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}})$ , it is equivalent to Zimmert’s 1981 theorem stating that the regulator of a number field grows at least exponentially with the degree of the number field [Zi].

We now state his conjecture in its strongest possible form.111 The original 2002 write-up of this conjecture was kindly supplied to us by F. Rodriguez Villegas and appears with his permission for the first time in print here (see §7). The 2002 conjecture is somewhat weaker, but F. Rodriguez Villegas later strengthened it to the form given here.

RV Conjecture.

(Rodriguez Villegas) There exist two absolute constants $c_{0}>0$ and $c_{1}>1$ such that for any number field $L$ and any $j\in\mathbb{N}$ ,

[TABLE]

Here ${\bigwedge}^{j}{\mathrm{LOG}}({\mathcal{O}_{L}^{*}})$ denotes the $j^{\mathrm{th}}$ exterior power of the lattice ${\mathrm{LOG}}({\mathcal{O}_{L}^{*}})\subset\mathbb{R}^{\mathcal{A}_{L}}$ , ${\mathcal{A}_{L}}$ denotes the set of archimedean places of $L$ , and ${\mathrm{LOG}}\colon{\mathcal{O}_{L}^{*}}\to\mathbb{R}^{{\mathcal{A}_{L}}}$ is defined by

[TABLE]

where $|\ |_{v}$ is the absolute value associated to $v\in{\mathcal{A}_{L}}$ extending the usual absolute value on $\mathbb{Q}$ . To define the $1$ -norm in (1), we start with the usual orthonormal basis $\{\delta^{v}\}_{v\in{\mathcal{A}_{L}}}$ on $\mathbb{R}^{\mathcal{A}_{L}}$ , i.e., for $w\in{\mathcal{A}_{L}}$

[TABLE]

This gives rise to the orthonormal basis $\{\delta^{I}\}_{I\in{\mathcal{A}^{[j]}_{L}}}$ of ${\bigwedge}^{j}\mathbb{R}^{\mathcal{A}_{L}}$ , where ${\mathcal{A}^{[j]}_{L}}$ denotes the set of subsets $I$ of ${\mathcal{A}_{L}}$ having cardinality $j$ , for each such $I$ we fix an ordering $\{v_{1},...,v_{j}\}$ of $I$ and

[TABLE]

The $1$ -norm on ${\bigwedge}^{j}\mathbb{R}^{\mathcal{A}_{L}}$ in the RV conjecture (1) is defined with respect to this basis. Namely,222 Although Rodrigez Villegas phrased the 1-norm in terms of the archimedean embeddings rather than places (see §7.4), the 1-norm is unchanged as we inserted a factor of 2 at complex places in (2). However, the embedding using places gives a larger 2-norm if the field is not totally real, and so is better for our purposes. for $\omega=\sum_{I\in{\mathcal{A}^{[j]}_{L}}}c_{I}\delta^{I}$ , we let $\|\omega\|_{1}:=\sum_{I\in{\mathcal{A}^{[j]}_{L}}}|c_{I}|$ .

It is worth mentioning that Siegel [Sie] showed that the conjectural inequality (1) is not possible for the Euclidean norm $\|\omega\|_{2}:=\sqrt{\sum_{I}c_{I}^{2}}$ . Indeed, if $p>2$ is a prime, if $\varepsilon\in\mathbb{C}$ satisfies $\varepsilon^{p}-\varepsilon+1=0$ and $L:=\mathbb{Q}(\varepsilon)$ , then $\|{\mathrm{LOG}}(\varepsilon)\|_{2}\leq\sqrt{2}\log(p)/\sqrt{p}$ . Hence, the RV conjecture is necessarily for the $1$ -norm, at least for $j=1$ .

However, for $j$ close to the maximal value $r_{L}=\mathrm{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}})$ , the $1$ -norm and the Euclidean norm are interchangeable for the purposes of Rodriguez Villegas’ conjecture. This is simply because on any Euclidean space $V$ , we have $\sqrt{\dim(V)}\,\|v\|_{2}\geq\|v\|_{1}\geq\|v\|_{2}$ , provided the $1$ -norm is taken with respect to an orthonormal basis for $V$ . In this paper we will work only with the Euclidean norm and $j$ close to $r_{L}$ .

Aside from Zimmert’s theorem on the regulator [Zi] and the known cases of Lehmer’s conjecture [Sm2], the cleanest result in favor of the RV conjecture is

[TABLE]

proved for all $j$ , but only for totally real fields $L$ . This follows from work of Pohst [Po] dating back to 1978. Indeed, Pohst showed for $L$ totally real that

[TABLE]

Using estimates of Hermite’s constant, he deduced good lower bounds for the regulator of a totally real field. The same calculations show that the $j$ -dimensional co-volume $\mu$ of the lattice spanned by ${\mathrm{LOG}}(\varepsilon_{1}),...,{\mathrm{LOG}}(\varepsilon_{j})$ satisfies [CF, p. 293]

[TABLE]

Since

[TABLE]

a short numerical computation with (5) yields (4).

As far as we know, the only proved cases of the RV conjecture involve “pure wedges,” i.e., $\omega$ of the form $\omega={\mathrm{LOG}}(\varepsilon_{1})\wedge\cdots\wedge{\mathrm{LOG}}(\varepsilon_{j})$ , where the $\varepsilon_{i}$ are independent elements of ${\mathcal{O}_{L}^{*}}$ . If $j=r_{L}$ or $j=1$ , every element of $\bigwedge^{j}$ is (trivially) a pure wedge, but this also holds if $j=r_{L}-1$ (see Lemma 22 below). In particular, if $L$ is a totally real field of degree $n$ over $\mathbb{Q}$ , then

[TABLE]

for all $\omega\in{\bigwedge}^{n-2}{\mathrm{LOG}}({\mathcal{O}_{L}^{*}})$ . In general, however, the RV conjecture makes a stronger prediction than simply a lower bound on the 1-norm of pure wedges.

Another known case of the RV conjecture occurs when

[TABLE]

is the group of relative units associated to an extension $L/K$ . Friedman and Skoruppa [FS] proved in 1999 that inequality (1) in the RV conjecture holds for pure wedges if $[L:K]\geq N_{0}$ for some absolute constant $N_{0}$ .333 The inequality proved in [FS] is for the relative regulator $\text{Reg}(L/K)$ rather than for the co-volume $\mu$ of the relative units. This suffices since $\mu=\text{Reg}(L/K)\prod_{v\in{\mathcal{A}_{K}}}\sqrt{r_{v}}\geq\text{Reg}(L/K)$ , where $r_{v}$ is the number of places of $L$ above $v$ . The proof of this relation between the co-volume and the relative regulator mimics the determinant manipulations in the case $K=\mathbb{Q}$ [BS, p. 115]. We note that J. Sundstrom, in the appendix to his doctoral thesis [Su1], corrected an error in Skoruppa and Friedman’s proof. Namely, in the bound on what is called $J_{1}$ in the proof of Lemma 5.5 of [FS], the real part of the error term $\rho$ in the exponential was neglected. This did not affect the proof of their Main Theorem, but it did affect the numerical constants claimed in Theorem 4.1 and its corollaries. By improving the asymptotic estimates in [FS] and using extensive computer calculations, Sundstrom was able to prove the estimate in Theorem 4.1 of [FS], with the constants as given there, In particular, $N_{0}=40$ . If we are willing to settle for $N_{0}=400$ , the proof in [FS] will do after adjusting the constants to correct for the error in the proof of Lemma 5.5. To prove their result, Friedman and Skoruppa defined a $\Theta$ -type series $\Theta_{E}$ associated to any subgroup $E\subset{\mathcal{O}_{L}^{*}}$ of arbitrary rank and used it to produce a complicated inequality for the co-volume $\mu(E)$ associated to the lattice ${\mathrm{LOG}}(E)$ . In the case of $E=E(L/K)$ they obtained the desired inequality using the saddle-point method to estimate the terms in the series $\Theta_{E}$ as $[L:K]\to\infty$ . Although the saddle-point method in one variable is a standard tool, the difficulty in the asymptotic estimates in [FS, §5] was that the estimates needed to depend only on $[L:K]$ .

The results cited so far all pre-date the RV conjecture and essentially dealt with regulators or Lehmer’s conjecture. Inspired by the RV conjecture, Sundstrom [Su1] [Su2] dealt in his 2016 thesis with a new kind of subgroup of the units. Namely, suppose $L$ contains two distinct real quadratic subfields $K_{1},K_{2}$ , and let $E:=E(L/K_{1})\cap E(L/K_{2}).$ The series $\Theta_{E}$ is still defined and yields an inequality for the co-volume $\mu\big{(}{\mathrm{LOG}}(E)\big{)}$ , but to estimate the terms in the inequality Sundstrom had to apply the saddle-point method to a triple integral. Keeping all estimates uniform in this case proved considerably harder than in the one-variable case treated in [FS]. In the end, Sundstrom was able to verify the RV conjecture in this case for pure wedges. More precisely, he proved the existence of absolute constants $N_{0},\,c_{0}>0$ and $c_{1}>1$ such that $\mu(E)\geq c_{0}c_{1}^{j}$ for $[L:\mathbb{Q}]\geq N_{0}$ and $j=\text{rank}_{\mathbb{Z}}(E)=\text{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}})-2$ .

Here we extend Sundstrom’s result, letting the $K_{i}$ be arbitrary, as follows. Let $K_{1},\ldots,K_{\ell}$ be subfields of a number field $L$ , let $K:=K_{1}\cdots K_{\ell}$ $\subset L$ be the compositum of the $K_{i}$ , let $E:=\bigcap_{i=1}^{\ell}E(L/K_{i})\subset{\mathcal{O}_{L}^{*}}$ be the subgroup of the units of $L$ whose norm to each $K_{i}$ is a root of unity, and let $\varepsilon_{1},...,\varepsilon_{j}$ be independent elements of $E$ , where $j:=\mathrm{rank}_{\mathbb{Z}}(E)$ . Then there is an absolute constant $N_{0}$ such that

[TABLE]

*whenever $[L:K]\geq N_{0}\cdot 2.01^{[K:\mathbb{Q}]}$ . *

In fact the above is an immediate corollary of our

Main Theorem.

Suppose $E\subset{\mathcal{O}_{L}^{*}}$ is a subgroup of the units of the number field $L$ such that $E(L/K)\subset E$ for some subfield $K\subset L$ , where $E(L/K)$ are the relative units defined in (7). Let $\varepsilon_{1},...,\varepsilon_{j}$ be independent elements of $E$ , where $j:=\mathrm{rank}_{\mathbb{Z}}(E)$ . Then the RV conjecture (1) holds for $\omega:=\varepsilon_{1}\wedge\cdots\wedge\varepsilon_{j}$ and $[L:K]$ large enough compared to $[K:\mathbb{Q}]$ .

More precisely, there is an absolute constant $N_{0}$ such that if $[L:K]\geq N_{0}\cdot 2.01^{[K:\mathbb{Q}]}$ , then

[TABLE]

Our proof of the Main Theorem is again through an asymptotic analysis of the inequality for $\Theta_{E}$ in [FS], but there are several new features which bring the proof closer to the case of a general high-rank subgroup $E\subset{\mathcal{O}_{L}^{*}}$ .

In both [FS] and [Su2], the uniformity of the asymptotic estimates depends on having explicit expressions for the orthogonal complement of ${\mathrm{LOG}}(E)$ inside $\mathbb{R}^{\mathcal{A}_{L}}$ , but here we have very little knowledge of ${\mathrm{LOG}}(E)^{\perp}$ . As in [FS] and [Su2], we take a Mellin transform of the terms of $\Theta_{E}$ and invert it to express each term in $\Theta_{E}$ as a $k$ -dimensional complex contour integral (see Lemma 3 below). Here $k:=1+\text{rank}_{\mathbb{Z}}({\mathcal{O}_{L}^{*}}/E)$ is the co-rank of $E\subset{\mathcal{O}_{L}^{*}}$ , shifted by 1.

To apply the saddle-point method to our integral, we need a saddle point. In the case of [FS] one could easily write down a formula for the saddle point in terms of the logarithmic derivative of the classical $\Gamma$ -function. In [Su2] the equations for the critical point were explicit enough that monotonicity arguments proved the existence of the saddle point. In our case the equations are too complicated to analyse directly. Instead, in §3 we obtain the existence and uniqueness of the saddle point by re-interpreting it as the value of the Legendre transform of a convex function on $\mathbb{R}^{k}$ , closely related to $\log\Gamma$ .

Since (what will prove to be) the main term in our asymptotic expansion depends on the saddle point $\sigma=(\sigma_{1},...,\sigma_{k})\in\mathbb{R}^{k}$ , of which we can only control $\sigma_{1}$ , in §4 we prove inequalities for the main term which depend only on $\sigma_{1}$ . We need these inequalities to prove that the main term has the exponential growth claimed in the Main Theorem.

The results proved in §2-§4 are valid for any subgroup $E\subset{\mathcal{O}_{L}^{*}}$ . In §5 we carry out the required uniform asymptotic estimates, assuming $E(L/K)\subset E$ and $[L:K]\gg 0$ to show that the purported main term actually dominates. Finally, in §6 we put everything together and prove the Main Theorem.

2. The $\Theta$ -function

In this section we recall the series $\Theta_{E}(t;\mathfrak{a})$ associated to a subgroup $E\subset{\mathcal{O}_{L}^{*}}$ of the units and to a fractional ideal $\mathfrak{a}$ of the number field $L$ . We also recall the inequality for the co-volume of ${\mathrm{LOG}}(E)$ resulting from the functional equation of $\Theta_{E}$ . This is all quoted from [FS, §2]. Our main new task here is to express the terms in the inequality as an inverse Mellin transform.

2.1. The basic inequality

Given a subgroup $E\subset{\mathcal{O}_{L}^{*}}$ , we define ${E_{\mathbb{R}}}\subset\mathbb{R}_{+}^{{\mathcal{A}_{L}}}$ as the group generated by all elements of the form

[TABLE]

Here $\mathbb{R}_{+}:=(0,\infty)$ is the multiplicative group of the positive real numbers, ${\mathcal{A}_{L}}$ denotes the set of Archimedean places of $L$ , and $|\ |_{v}$ is the (un-normalized) absolute value associated to the archimedean place $v\in{\mathcal{A}_{L}}$ . Thus, for $a\in L$ we have

[TABLE]

Note that

[TABLE]

and that $\varepsilon\in E$ acts on $x=(x_{v})_{v}\in{E_{\mathbb{R}}}$ , via $(\varepsilon\cdot x)_{v}:=|\varepsilon|_{v}\,x_{v}$ .

We fix a Haar measure on ${E_{\mathbb{R}}}\subset\mathbb{R}_{+}^{\mathcal{A}_{L}}$ as follows. The standard Euclidean structure on $\mathbb{R}^{\mathcal{A}_{L}}$ , in which the $\delta^{v}$ in (3) form an orthonormal basis of $\mathbb{R}^{\mathcal{A}_{L}}$ , induces a Euclidean structure (and therefore a unique Haar measure) on any $\mathbb{R}$ -subspace of $\mathbb{R}^{\mathcal{A}_{L}}$ . We give ${E_{\mathbb{R}}}$ the Haar measure $\mu_{E_{\mathbb{R}}}$ that results from pulling back the Haar measure on the $\mathbb{R}$ -subspace ${\mathrm{LOG}}({E_{\mathbb{R}}})$ via the isomorphism ${\mathrm{LOG}}$ , and let $\mu_{E_{\mathbb{R}}}({E_{\mathbb{R}}}/E)$ be the measure of a fundamental domain for the action of $E$ on ${E_{\mathbb{R}}}$ .

Following [FS, p. 120], for a fractional ideal $\mathfrak{a}\subset L$ and $t>0$ , we let

[TABLE]

where $\lvert E_{\mathrm{tor}}\rvert$ is the number of roots of unity in $E$ ,

[TABLE]

Note that the integral in (12) depends only on the $E$ -orbit of $a$ , and hence is independent of the representative $a\in\mathfrak{a}/E$ taken for the $E$ -orbit of $a$ .

Our starting point for proving lower bounds on co-volumes is the inequality [FS, Corol. p. 121], valid for any $t>0$ and any fractional ideal $\mathfrak{a}$ of $L$ .

[TABLE]

Writing out the individual terms of (13), we have [FS, p. 121, eq. (2.6)] the

Basic Inequality.

[TABLE]

Note that in [FS] we find $tc_{\mathfrak{a}}$ instead of $t$ in (14), but $t>0$ is arbitrary there too.

2.2. Mellin transforms

Our main task in this section is to re-write the $r$ -dimensional integral in (12) as an inverse Mellin transform. For this it will prove convenient to characterize ${E_{\mathbb{R}}}\subset G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}$ not through generators, but rather through generators of the orthogonal complement in $\mathbb{R}^{\mathcal{A}_{L}}$ of ${\mathrm{Log}}_{G}({E_{\mathbb{R}}})$ . Here ${\mathrm{Log}}_{G}\colon G\to\mathbb{R}^{\mathcal{A}_{L}}$ is the group isomorphism defined by

[TABLE]

Note that ${\mathrm{Log}}_{G}$ is not the traditional logarithmic embedding ${\mathrm{LOG}}$ in (2), as we do not insert a factor of $e_{v}$ in (15). Instead we endow $\mathbb{R}^{\mathcal{A}_{L}}$ with a new inner product

[TABLE]

where $e_{v}=1$ or 2 as in (9). Let $\big{\{}q_{j}\big{\}}_{j=1}^{k}=\big{\{}(q_{jv})_{v}\big{\}}_{j=1}^{k}$ be an $\mathbb{R}$ -basis of the orthogonal complement of ${\mathrm{Log}}_{G}({E_{\mathbb{R}}})$ in $\mathbb{R}^{\mathcal{A}_{L}}$ such that

[TABLE]

Thus, for $g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}$ ,

[TABLE]

Let $H:=\mathbb{R}_{+}^{k}$ . Define a homomorphism $\delta\colon G\to H$ by

[TABLE]

so that by (18) we have an exact sequence

[TABLE]

Let $\sigma\colon H\to G$ be a homomorphism splitting the exact sequence (20), i.e., $\delta\circ\sigma$ is the identity map on $H$ . Such a splitting exists because $G$ and $H$ are real vector spaces. Let

[TABLE]

be the usual Haar measures on $G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}$ and $H:=\mathbb{R}_{+}^{k}$ .

Recall that in order to define $\Theta_{E}$ in (12) we fixed a Haar measure $\mu_{E_{\mathbb{R}}}$ on ${E_{\mathbb{R}}}$ . In order to calculate Mellin transforms below, we will need to compare the Haar measure $\mu_{H}\times\mu_{E_{\mathbb{R}}}$ on $H\times{E_{\mathbb{R}}}$ with a Haar measure coming from $\mu_{G}$ . Namely, if $\gamma\colon{E_{\mathbb{R}}}\times H\to G$ is the isomorphism defined by the splitting $\sigma$ , i.e.,

[TABLE]

then the measure $\mu_{G}\circ\gamma$ is a Haar measure on ${E_{\mathbb{R}}}\times H$ . Hence

[TABLE]

where the positive constant $c$ is evaluated in the next lemma.

Lemma 1.

Let $Q$ be the $|{\mathcal{A}_{L}}|\times k$ matrix whose rows are indexed by $v\in{\mathcal{A}_{L}}$ and whose columns are indexed by $j=1,\ldots,k$ , with entry $Q_{v,j}:=q_{jv}$ in the $v^{\text{th}}$ row and the $j^{\text{th}}$ column, with $q_{jv}$ as in (17). Then $c$ in (23) is independent of the splitting $\sigma$ in (22) and is given by

[TABLE]

where $Q_{\phantom{l}}^{\intercal}$ is the transpose of $Q$ and $r_{2}$ is the number of complex places of $L$ .

Proof.

For $x=(x_{v})$ and $y=(y_{v})\in\mathbb{R}^{\mathcal{A}_{L}}$ , let $x\cdot y$ be the standard dot product $x\cdot y:=\sum_{v\in{\mathcal{A}_{L}}}x_{v}y_{v}$ . Recall that we defined in (16) another inner product on $\mathbb{R}^{\mathcal{A}_{L}}$ , namely $\langle x,y\rangle:=\sum_{v}e_{v}x_{v}y_{v}$ . To relate these products, let $T:\mathbb{R}^{\mathcal{A}_{L}}\to\mathbb{R}^{\mathcal{A}_{L}}$ be given by $\big{(}T(x)\big{)}_{v}:=e_{v}x_{v}$ . Then

[TABLE]

Note that $\det(T)=2^{r_{2}}$ .

Let $u_{1},...,u_{r}$ be an orthonormal basis of $V$ (with respect to the dot product), let $C_{1}:=\big{\{}\sum_{\ell}x_{\ell}u_{\ell}\big{|}\,0\leq x_{\ell}\leq 1\big{\}}\subset V$ be the $r$ -cube spanned by the $u_{\ell}$ , and let $B_{1}:={\mathrm{LOG}}^{-1}(C_{1})$ . By the definition of the measure $\mu_{E_{\mathbb{R}}}$ given in the paragraph preceding (12), $\mu_{E_{\mathbb{R}}}(B_{1})=1$ .

We define next an analogous subset $B_{2}\subset H:=\mathbb{R}_{+}^{k}$ with $\mu_{H}(B_{2})=1$ . Let $F_{1},\dotsc,F_{k}$ be the “standard” orthonormal basis of $\mathbb{R}_{+}^{k}$ as an $\mathbb{R}$ -vector space; that is, $(F_{j})_{i}=\mathrm{e}$ if $i=j$ , and $(F_{j})_{i}=1$ otherwise. Let $B_{2}\subset\mathbb{R}_{+}^{k}$ be the $k$ -cube spanned by $F_{1},\dotsc,F_{k}$ , so that $\mu_{H}(B_{2})=1$ .

Set $B:=B_{1}\times B_{2}\subset{E_{\mathbb{R}}}\times H$ , so that $(\mu_{E_{\mathbb{R}}}\times\mu_{H})(B)=1$ . Thus $c$ in (23) satisfies

[TABLE]

Now, $\gamma(x,h):=x\sigma(h)$ and $\mu_{G}$ is the measure on $G$ that maps by ${\mathrm{Log}}_{G}$ to the standard Haar measure on $\mathbb{R}^{\mathcal{A}_{L}}$ $\big{(}$ see (15), (21) and (22) $\big{)}$ . Hence, $c^{-1}=|\!\det(M)|$ , where $M$ is the $(|{\mathcal{A}_{L}}|\times|{\mathcal{A}_{L}}|)$ -matrix whose first $r$ columns are the vectors $w_{\ell}:={\mathrm{Log}}_{G}\big{(}{\mathrm{LOG}}^{-1}(u_{\ell})\big{)}\in\mathbb{R}^{{\mathcal{A}_{L}}}\ \,(1\leq\ell\leq r)$ . The remaining $k$ columns of $M$ are the vectors ${\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}\ \,(1\leq j\leq k)$ .

Suppose $\tilde{\sigma}$ is another splitting of (20). Then $\sigma(F_{j})\tilde{\sigma}(F_{j})^{-1}\in{E_{\mathbb{R}}}$ , and therefore ${\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}-{\mathrm{Log}}_{G}\big{(}\tilde{\sigma}(F_{j})\big{)}$ lies in the span of the columns $w_{1},...,w_{r}$ . Hence $c$ is independent of the splitting $\sigma$ , as claimed in the lemma. We are therefore free to use the splitting $\sigma$ determined by

[TABLE]

Using (19) and the orthogonality relations (17), one checks that this is indeed a splitting of $\delta$ . With this $\sigma$ , the last $k$ columns of $M$ are just ${\mathrm{Log}}_{G}\big{(}\sigma(F_{j})\big{)}=d_{j}^{-1}q_{j}\in\mathbb{R}^{\mathcal{A}_{L}}$ . As $T\circ{\mathrm{Log}}_{G}={\mathrm{LOG}}$ and $\det(T)=2^{r_{2}}$ $\big{(}$ see (25) $\big{)}$ , we have

[TABLE]

where $N$ is the $(|{\mathcal{A}_{L}}|\times|{\mathcal{A}_{L}}|)$ -matrix whose columns are $T$ applied to the columns of $M$ , i.e., the columns of $N$ are $u_{1},...,u_{r}$ , followed by $d_{1}^{-1}T(q_{1}),...,d_{k}^{-1}T(q_{k})$ .

To prove the lemma we must show that $|\!\det(N)|^{-1}=\sqrt{\mathrm{det}(Q_{\phantom{l}}^{\intercal}Q)}$ . We calculate $|\!\det(N)|$ as

[TABLE]

where $R$ is the $(|{\mathcal{A}_{L}}|\times|{\mathcal{A}_{L}}|)$ -matrix whose columns are $u_{1},...,u_{r}$ , followed by $q_{1},...,q_{k}$ (i.e., $Q$ ). Using the orthonormality of the $u_{\ell}$ ’s (with respect to the dot product), we see that $R_{\phantom{l}}^{\intercal}R$ can be divided into four blocks, the upper left one being the $r\times r$ identity matrix $I_{r\times r}$ . Below it, $R_{\phantom{l}}^{\intercal}R$ has a $k\times r$ block with entries

[TABLE]

where we used (25) and the definition of the $q_{j}$ ’s as a basis of the orthogonal complement of ${\mathrm{Log}}_{G}(E_{\mathbb{R}})\subset\mathbb{R}^{\mathcal{A}_{L}}$ $\big{(}$ with respect to $\langle\ \rangle$ , see (18) $\big{)}$ . Since the bottom right $k\times k$ block of $R_{\phantom{l}}^{\intercal}R$ is $Q_{\phantom{l}}^{\intercal}Q$ , we find that $R_{\phantom{l}}^{\intercal}R=\begin{pmatrix}I_{r\times r}&0_{r\times k}\\ 0_{k\times r}&Q_{\phantom{l}}^{\intercal}Q\end{pmatrix}$ . Thus, $\det\!\big{(}R_{\phantom{l}}^{\intercal}R\big{)}=\sqrt{\mathrm{det}(Q_{\phantom{l}}^{\intercal}Q)}$ . A similar calculation shows $R_{\phantom{l}}^{\intercal}N=\begin{pmatrix}I_{r\times r}&*_{r\times k}\\ 0_{k\times r}&I_{k\times k}\end{pmatrix}$ , whence $\det(R_{\phantom{l}}^{\intercal}N)=1$ .∎

In order to study the $\Theta$ -series (12), we need to consider integrals of the form

[TABLE]

for $g=(g_{v})_{v}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}$ . For $h=(h_{1},\ldots,h_{k})\in H:=\mathbb{R}_{+}^{k}$ , define $\psi$ by substituting $g=\sigma(h)$ above:

[TABLE]

Note that the integral (27) depends only on $g$ modulo ${E_{\mathbb{R}}}$ , so the function $\psi$ is independent of the choice of $\sigma$ splitting the exact sequence (20). The fact that (27) depends only on $g$ modulo ${E_{\mathbb{R}}}$ also shows that

[TABLE]

so we will concentrate on $\psi$ , a function of only $k$ variables.

Define a linear map $S\colon\mathbb{C}^{k}\to\mathbb{C}^{\mathcal{A}_{L}}$ by $S(s)=Qs$ , where $Q$ is the matrix whose $j^{\text{th}}$ column is $q_{j}\in\mathbb{R}^{\mathcal{A}_{L}}\subset\mathbb{C}^{\mathcal{A}_{L}}$ , as in Lemma 1. Also define maps $S_{v}\colon\mathbb{C}^{k}\to\mathbb{C}$ for each $v\in{\mathcal{A}_{L}}$ by $S_{v}(s)=\big{(}S(s)\big{)}_{v}$ . That is,

[TABLE]

Note that $S$ is injective since the $q_{j}\in\mathbb{R}^{\mathcal{A}_{L}}$ are linearly independent.

Our first aim is to calculate the ( $k$ -dimensional) Mellin transform

[TABLE]

where $\mathrm{Re}(s):=\big{(}\mathrm{Re}(s_{1}),\ldots,\mathrm{Re}(s_{k})\big{)}\in{\mathcal{D}}$ , with

[TABLE]

As $q_{1v}:=1$ for all $v\in{\mathcal{A}_{L}}\ \big{(}$ see (17) $\big{)}$ , for $t>0$ we have $(t,0,0,\ldots,0)\in{\mathcal{D}}$ . Hence ${\mathcal{D}}$ is a non-empty, open, convex subset of $\mathbb{R}^{k}$ . We will presently prove that the Mellin transform $(M\psi)(s)$ in (31) converges if $\mathrm{Re}(s)\in{\mathcal{D}}$ .

In the following calculation of $(M\psi)(s)$ the reader should initially consider only real $s_{j}$ , so that the integrand is positive. At the end of the calculation it will become clear that the integral converges for $s$ in the open subset of $\mathbb{C}^{k}$ where $\mathrm{Re}(s)\in{\mathcal{D}}$ .

[TABLE]

where in the last step we used Lemma 1 and $\delta\big{(}\gamma(x,h)\big{)}=\delta\big{(}\sigma(h)x\big{)}=h$ , with $\delta$ as in (19). Next we substitute $g=\gamma(x,h)$ to get

[TABLE]

where $r_{1}$ is the number of real places of $L$ .

Lemma 2.

For any $\sigma\in{\mathcal{D}}$ (see (32)), the Mellin inversion formula holds:

[TABLE]

where $s=(s_{1},...,s_{k})$ and $I_{\sigma}\subset\mathbb{C}^{k}$ is the product of the $k$ vertical lines $\mathrm{Re}(s_{j})=\sigma_{j}$ , taken from $\sigma_{j}-i\infty$ to $\sigma_{j}+i\infty$ .

Proof.

The calculation (33) shows that the Mellin transform $(M\psi)(s)$ is defined for $s\in I_{\sigma}$ . Thus Mellin inversion will work provided that $\int_{I_{\sigma}}\big{|}(M\psi)(s)h^{-s}\,ds\big{|}<\infty$ . Since $\big{|}h^{-s}\big{|}$ and $\big{|}e_{v}^{e_{v}S_{v}(s)/2}\big{|}$ are constant on $I_{\sigma}$ , we turn to the factors $|\Gamma(e_{v}S_{v}(s)/2)|$ in (33). Write $s=\sigma+iT$ , $T\in\mathbb{R}^{k}$ . In a strip $0<C_{1}\leq\mathrm{Re}(z)\leq C_{2}$ , we have $|\Gamma(z)|<C_{3}\exp\!\big{(}\!-|\mathrm{Im}(z)|\big{)}$ .444 In fact, $|\Gamma(z)|<C_{\varepsilon}\exp(-(\pi-\varepsilon)|\mathrm{Im}(z)|/2)$ holds for any $\varepsilon>0$ [AAR, Cor. 1.4.4]. Since $\mathrm{Re}\big{(}e_{v}S_{v}(s)\big{)}=e_{v}S_{v}(\sigma)>0$ for $s\in I_{\sigma}$ ,

[TABLE]

where $\|(m_{v})\|_{1}:=\sum_{v\in{\mathcal{A}_{L}}}|m_{v}|$ is the 1-norm on $\mathbb{R}^{\mathcal{A}_{L}}$ , and $S$ is the linear function from (30). Since $S$ is injective, there exists $C_{5}>0$ such that

[TABLE]

Thus $(M\psi)(s)h^{-s}$ is integrable over $I_{\sigma}$ and Mellin inversion (34) holds. ∎

Let

[TABLE]

and

[TABLE]

We take the branch of $\log\Gamma_{v}(z)$ which is real when $z$ is real and positive.

Lemma 3.

Let $y=(y_{1},\ldots,y_{k})\in\mathbb{R}^{k}$ and $\chi=\chi(y):=(\mathrm{e}^{y_{1}/2},\ldots,\mathrm{e}^{y_{k}/2})\in H:=\mathbb{R}_{+}^{k}$ . Then

[TABLE]

with $\psi$ as in (28), ${\alpha}$ as in (36), $Q$ as in Lemma 1, $I_{\sigma}$ as in Lemma 2, and $r_{1}$ (resp. $r_{2}$ ) being the number of real (resp. complex) places of $L$ .

Proof.

If $v$ is complex, so $e_{v}=2$ , the duplication formula gives

[TABLE]

If $v$ is real, so $e_{v}=1$ , then

[TABLE]

From (33) and Mellin inversion (34) we get

[TABLE]

Now we apply the lemma to the Basic Inequality (14).

Corollary 4.

For $t>0$ and $a\in L^{*}$ , define $y=y_{a,t}\in\mathbb{R}^{k}$ by

[TABLE]

Then, with $\mathcal{L}:=\sqrt{\det(Q_{\phantom{l}}^{\intercal}Q)}/\big{(}2^{r_{1}}(2\sqrt{\pi})^{r_{2}}\pi^{k}\big{)}$ , for any $\sigma\in\mathcal{D}$ we have

[TABLE]

Proof.

Define $r=r_{a,t}\in G:=\mathbb{R}_{+}^{\mathcal{A}_{L}}$ by $r_{v}:=t^{1/2}|a|_{v}$ . In view of (29) and Lemma 3, (39) will follow from $\big{(}\delta(r)\big{)}_{j}=\mathrm{e}^{ny_{j}/2}$ . Indeed, by (19),

[TABLE]

If $j=1$ , then by (17) we have $q_{jv}=1$ for all $v\in{\mathcal{A}_{K}}$ . Using (9) and (10) we find

[TABLE]

If $j>1$ , then $\sum_{v}e_{v}q_{jv}=0$ $\big{(}$ see (17) $\big{)}$ , so

[TABLE]

as claimed. To prove (40), apply $-\frac{2t}{n}\frac{d}{dt}$ to (39), noting that $\frac{dy_{j}}{dt}=0$ for $j\geq 2$ . ∎

3. Existence and uniqueness of the critical point

We shall show that for every $y\in\mathbb{R}^{k}$ there is a unique $\sigma=\sigma(y)\in{\mathcal{D}}\ \,\big{(}$ see (32) $\big{)}$ which is a critical point of $F_{y}\colon{\mathcal{D}}\to\mathbb{R}$ , defined as

[TABLE]

with ${\alpha}$ as in (36). The map taking $y\in\mathbb{R}^{k}$ to the critical point $\sigma(y)\in{\mathcal{D}}$ is closely related to the Legendre transform of ${\alpha}\colon{\mathcal{D}}\to\mathbb{R}$ , but we will develop the theory from scratch as ours is an easy case of the general theory of the Legendre transform [HUL, §E] [Sim, §1 and §5].

Lemma 5.

Let ${\alpha}\colon{\mathcal{D}}\to\mathbb{R}$ be as in (36). Then ${\alpha}$ is steep [Sim, p. 30], i.e.,

[TABLE]

where the limit is taken over $\sigma\in{\mathcal{D}}$ as its Euclidean norm $\|\sigma\|$ tends to infinity.

Proof.

Recall that the linear map $S$ in (30) is injective. Hence there exists $C>0$ such that, for all $\sigma\in{\mathcal{D}}$ ,

[TABLE]

For any $\sigma\in{\mathcal{D}}$ , there is a $v_{0}=v_{0}(\sigma)\in{\mathcal{A}_{L}}$ such that $S_{v_{0}}(\sigma)=\max_{v\in{\mathcal{A}_{L}}}\big{\{}S_{v}(\sigma)\big{\}}$ . The previous inequality says that

[TABLE]

The known behavior of $\Gamma(z)$ for $z>0$ shows that there is a $\kappa<0$ such that

[TABLE]

for all $z>0$ and all $v\in{\mathcal{A}_{L}}$ ( $\kappa=-1/5$ will do). Also, Stirling’s formula shows that

[TABLE]

for $z\gg 0$ . It follows from (43), (42), and (44) that when $\|\sigma\|$ is large,

[TABLE]

and the lemma follows. ∎

The next lemma amounts to the fact that the gradient $\nabla\!f$ of a steep and differentiable strictly convex function $f$ is a bijection. However, in our case the domain ${\mathcal{D}}\not=\mathbb{R}^{k}$ , which means that we would need to check the boundary behavior of ${\alpha}$ before citing results from convex analysis. We prefer not to quote and instead adapt the usual proof [Sim, §1] [HUL, §E] to our nicely behaved function ${\alpha}$ .

Lemma 6.

For any $y\in\mathbb{R}^{k}$ there is a unique $\sigma=\sigma(y)\in{\mathcal{D}}$ such that $y=\nabla\!{\alpha}(\sigma)$ .

Proof.

For any $y\in\mathbb{R}^{k}$ , let $F_{y}\colon{\mathcal{D}}\to\mathbb{R}$ , $\,F_{y}(\tau):=\alpha(\tau)-y\cdot\tau$ , and let

[TABLE]

which we will now prove to be finite, i.e., ${\alpha}^{\dagger}(y)\not=-\infty$ . Let $\tau^{(i)}$ be a sequence in ${\mathcal{D}}$ such that $F_{y}(\tau^{(i)})$ converges to ${\alpha}^{\dagger}(y)$ . By (43), ${\alpha}(\tau^{(i)})$ is bounded below, so it suffices to check that the sequence $\tau^{(i)}$ is bounded. By Lemma 5, ${\alpha}(\tau)>(\|y\|+1)\|\tau\|$ for $\tau\in{\mathcal{D}}$ with $\|\tau\|$ sufficiently large. For such $\tau$ ,

[TABLE]

which shows that $\tau^{(i)}$ is bounded.

We now prove that the infimum defining ${\alpha}^{\dagger}(y)$ is assumed at a point in the open set ${\mathcal{D}}\subset\mathbb{R}^{k}$ . Passing to a subsequence of the bounded sequence $\tau^{(i)}$ , we may assume that the $\tau^{(i)}\in{\mathcal{D}}$ converge to a point $\sigma$ in the closure of ${\mathcal{D}}$ in $\mathbb{R}^{k}$ . Recall from (32) that ${\mathcal{D}}$ is the (non-empty) open set consisting of $\tau\in\mathbb{R}^{k}$ such that $S_{v}(\tau)>0$ for all $v\in{\mathcal{A}_{L}}$ . If $\sigma\notin{\mathcal{D}}$ , then $S_{v}(\sigma)=0$ for some $v\in{\mathcal{A}_{L}}$ . Since $\log\Gamma_{v}\big{(}S_{v}(\tau^{(i)})\big{)}\to+\infty$ as $S_{v}(\tau^{(i)})\to 0^{+}$ , and the remaining summands in the definition of ${\alpha}$ remain bounded from below (as does $y\cdot\tau^{(i)}$ ), we conclude that $\sigma\in{\mathcal{D}}$ . Since $\sigma$ is an interior minimum of the smooth function $F_{y}$ , we have $\nabla\!F_{y}(\sigma)=0$ . By (41), $y=\nabla\!{\alpha}(\sigma)$ , as claimed.

To prove the uniqueness of $\sigma$ , it suffices to prove that $F_{y}$ is a strictly convex function on ${\mathcal{D}}$ .555 That is, $F_{y}(t\tau+(1-t)\tilde{\tau})<tF_{y}(\tau)+(1-t)F_{y}(\tilde{\tau})$ for all $t\in(0,1)$ and all $\tau\not=\tilde{\tau}\in{\mathcal{D}}$ . Such a function cannot have more than one critical point. To prove this, let $g(t):=F_{y}\big{(}t\tau+(1-t)\tilde{\tau}\big{)}$ . Assuming that $F_{y}$ is strictly convex, $g$ is a strictly convex function of a single real variable $t\in[0,1]$ . Thus, $g^{\prime\prime}\geq 0$ , so $g$ has an increasing derivative $g^{\prime}(t)=\nabla\!F_{y}(t\tau+(1-t)\tilde{\tau})\cdot(\tau-\tilde{\tau})$ . But $\nabla\!F_{y}(\tau)=0=\nabla\!F_{y}(\tilde{\tau})$ would imply $g^{\prime}(0)=0=g^{\prime}(1)$ , whence $g$ is constant and therefore not strictly convex. The strict convexity of $F_{y}$ follows from the strict convexity of $\log\Gamma(z)$ for $z>0$ . Indeed,

[TABLE]

with strict inequality holding for $t\in(0,1)$ unless $S_{v}(\tau)=S_{v}(\tilde{\tau})$ for all $v\in{\mathcal{A}_{L}}$ . But this is impossible because $S$ in (30) is injective. ∎

The function ${\alpha}^{\dagger}$ in (45) is a concave function of $y\in\mathbb{R}^{k}$ , being the infimum over $\tau\in{\mathcal{D}}$ of the set of concave (in fact, affine) functions $y\mapsto-y\cdot\tau+{\alpha}(\tau)$ . The convex function $-{\alpha}^{\dagger}$ is known as the Legendre transform of ${\alpha}$ .

4. Inequalities at the critical point

To take advantage of the inequality (14), we will later need to drop all terms in (14) corresponding to algebraic integers $a\not=1$ . For this we will need some control of the first coordinate $\sigma_{1}(y)$ of the function $\sigma$ in Lemma 6. In this subsection we take advantage of the concavity of $\Psi:=\Gamma^{\prime}/\Gamma$ to find a lower bound for $\sigma_{1}(y)$ . Then we use the convexity of $\log\Gamma$ to find a lower bound for ${\alpha}\big{(}\sigma(y)\big{)}$ . Let

[TABLE]

These definitions ensure that $\Psi_{v}(z)=\tfrac{d}{dz}\log\Gamma_{v}(z)=\Gamma_{v}^{\prime}(z)/\Gamma_{v}(z)$ (see (35)). Note that $\Psi_{v}(z)$ is a concave function of $z$ for $z>0$ . We also note that $\Psi_{v}\colon(0,\infty)\to\mathbb{R}$ has an inverse function $\Psi_{v}^{-1}\colon\mathbb{R}\to(0,\infty)$ since $\Psi(z)$ is strictly increasing when $z>0$ , tends to $-\infty$ as $z\to 0^{+}$ , and tends to $+\infty$ as $z\to+\infty$ .

Writing out the $\ell$ -th coordinate of the equation $y=\nabla\!{\alpha}(\sigma)$ in Lemma 6, we get

[TABLE]

which for $\ell=1$ simplifies to

[TABLE]

Lemma 7.

Let $L$ be a number field of degree $n$ , with $r_{2}$ complex places. For $y=(y_{1},y_{2},\ldots,y_{k})\in\mathbb{R}^{k}$ , let $\sigma_{1}(y)$ be the first coordinate of the function $\sigma(y)$ defined in Lemma 6. Then

[TABLE]

Proof.

We prove (50) using the concavity of $\Psi$ . Namely, from (49),

[TABLE]

where the last step uses

[TABLE]

which follows from (17) since

[TABLE]

Inequality (50) now follows, since $\Psi^{-1}$ is an increasing function. ∎

Our next result is a similar inequality for ${\alpha}(\sigma)$ .

Lemma 8.

With notation as in Lemma 7, we have

[TABLE]

Proof.

We compute directly from the definition (36) of ${\alpha}$ , using the convexity of $z\mapsto\log\Gamma(z)$ for $z>0$ and (51):

[TABLE]

We now prove a lower bound for $S_{v}(\sigma)$ in terms of $\sigma_{1}$ and $y_{1}$ .

Lemma 9.

Let $u\in{\mathcal{A}_{L}}$ , $y\in\mathbb{R}^{k}$ , and let $\sigma:=\sigma(ny)\in\mathcal{D}$ be as in Lemma 6. Assume that $y_{1}\geq t_{0}$ for some $t_{0}\in\mathbb{R}$ , and $n:=[L:\mathbb{Q}]\geq 2$ . Then $S_{u}(\sigma)\geq 2/5$ or

[TABLE]

Proof.

We shall show below that both denominators in (53) are positive if $S_{u}(\sigma)<2/5$ , as we may assume. Replacing $y$ with $ny$ in (49), we have

[TABLE]

Since $-\Psi$ is a monotone decreasing convex function on $(0,\infty)$ , we find

[TABLE]

From $x\Gamma(x)=\Gamma(x+1)$ and the fact that $\Psi(x)<0$ for $x<1.461$ ,

[TABLE]

Hence, as we are assuming $S_{u}(\sigma)<2/5$ ,

[TABLE]

Since $S_{u}(\sigma)>0$ , the right-hand side above is negative. Hence the left-most inequality in (53) is proved.

Next recall [Ni, §71, eq. (11)],

[TABLE]

Whence $\Psi(x)<\log(x)$ for $x>0$ , and so

[TABLE]

Now the second inequality in (53) follows as before. ∎

5. Asymptotics

With a view to applying Corollary 4 and the Basic Inequality (14), in this section we will estimate integrals of the type

[TABLE]

where $n:=[L:\mathbb{Q}],\ y=(y_{1},\ldots,y_{k})\in\mathbb{R}^{k},\ \sigma:=\sigma(ny)\in{\mathcal{D}}\subset\mathbb{R}^{k}$ as in Lemma 6, and $y\cdot s:=\sum_{j=1}^{k}y_{j}s_{j}$ . We will let $\mathcal{H}(T)$ be a Gaussian approximating $\mathcal{G}(T)$ (see (63) below) in a bounded neighborhood $\Delta\subset\mathbb{R}^{k}$ of $T=0$ $\big{(}$ see (85) $\big{)}$ . As usual with the saddle point method, we decompose the integral (54) into four pieces

[TABLE]

The term $I_{1}$ (i.e., $\int_{\mathbb{R}^{k}}\mathcal{H}$ ) is readily computed and gives (as we will prove in this section) the main term in (55). Thus, we shall prove that the terms $I_{2},I_{3}$ and $I_{4}$ are $o(I_{1})$ as $[L:K]\to\infty$ , uniformly in $y\in\mathbb{R}^{k}$ .

From now on we always (and usually tacitly) assume that the relative units $E(L/K)\subset E\subset{\mathcal{O}_{L}^{*}}$ for some subfield $K\subset L$ $\big{(}$ see(7) $\big{)}$ . Define ${\mathrm{Log}}\colon L^{*}\to\mathbb{R}^{\mathcal{A}_{L}}$ by

[TABLE]

Note that the complex places do not carry a factor of 2. Instead we use this factor in the inner product (16) on $\mathbb{R}^{\mathcal{A}_{L}}$ defined by $\langle\beta,\gamma\rangle:=\sum_{v\in{\mathcal{A}_{L}}}e_{v}\beta_{v}\gamma_{v}$ . The usefulness of assuming $E(L/K)\subset E$ lies in the following.

Lemma 10.

Suppose $E(L/K)\subset E\subset{\mathcal{O}_{L}^{*}}$ and $q=(q_{v})_{v\in{\mathcal{A}_{L}}}\in{\mathrm{Log}}(E)^{\perp}$ lies in the orthogonal complement of ${\mathrm{Log}}(E)$ inside $\mathbb{R}^{\mathcal{A}_{L}}$ with respect to the above inner product. Then $q_{v}=q_{v^{\prime}}$ whenever $v$ and $v^{\prime}$ lie above the same place of $K$ and

[TABLE]

Proof.

The lemma will follow from the fact that ${\mathrm{Log}}(E)^{\perp}$ is contained in the $\mathbb{R}$ -span of ${\mathrm{Log}}(K^{*})$ in $\mathbb{R}^{\mathcal{A}_{L}}$ . Clearly ${\mathrm{Log}}(E)^{\perp}\subset{\mathrm{Log}}(E(L/K))^{\perp}$ , so it suffices to prove that $\operatorname{span}{\mathrm{Log}}(K^{*})={\mathrm{Log}}(E(L/K))^{\perp}$ . This follows from $\operatorname{span}{\mathrm{Log}}(K^{*})\subset{\mathrm{Log}}(E(L/K))^{\perp}$ and $\dim(\operatorname{span}{\mathrm{Log}}(K^{*}))=\dim({\mathrm{Log}}(E(L/K))^{\perp})=|{\mathcal{A}_{K}}|$ . ∎

Recall that in (17) we fixed a basis $q_{1},...,q_{k}$ of ${\mathrm{Log}}(E)^{\perp}$ such that $q_{1v}=1$ for all $v\in{\mathcal{A}_{L}}$ and $\langle q_{1},q_{j}\rangle=0$ for $2\leq j\leq k$ . In view of Lemma 10, we will write $q_{jw}:=q_{jv}$ for any $v\in{\mathcal{A}_{L}}$ extending $w\in{\mathcal{A}_{K}}$ .

For a place $w\in{\mathcal{A}_{K}}$ , let $r_{1,w}$ and $r_{2,w}$ denote respectively the number of real and complex places of $L$ extending $w$ , and let (cf. [FS, p. 134])

[TABLE]

Note that $m_{w}=e_{w}[L:K]=[L:K]$ or $2[L:K]$ , and that $\frac{1}{2}\leq\kappa_{w}\leq 1$ .

Lemma 10 implies that $S_{v}$ defined in (30) satisfies

[TABLE]

where $v\in{\mathcal{A}_{L}}$ is any place extending $w\in{\mathcal{A}_{K}}$ . We therefore rewrite ${\alpha}$ in (36) as

[TABLE]

where we write $v\mid w$ if $v$ extends $w$ , and $\alpha_{\kappa_{w}}$ was defined in (57).

For each $w\in{\mathcal{A}_{K}}$ and $\sigma\in{\mathcal{D}}\ \,\big{(}\text{see }\eqref{D}\big{)}$ , define $\rho_{w}\colon\mathbb{R}^{k}\to\mathbb{C}$ by

[TABLE]

i.e., $\rho_{w}$ is the error in the degree-2 Taylor approximation of $T\mapsto\alpha_{\kappa_{w}}\!\big{(}S_{w}(\sigma+iT)\big{)}$ at $T=0$ . We shall henceforth take any $y\in\mathbb{R}^{k}$ and let $\sigma:=\sigma(ny)$ be the corresponding saddle point in Lemma 6. Thus $\nabla\alpha(\sigma)=ny$ . Using this and (59), we find

[TABLE]

It follows from (59)–(61) that

[TABLE]

The linear terms in $T$ have disappeared as $\sigma$ is a critical point of $s\mapsto\alpha(s)-ny\cdot s$ .

For fixed $y\in\mathbb{R}^{k}$ and $\sigma:=\sigma(ny)\in{\mathcal{D}}$ , define the following functions of $T\in\mathbb{R}^{k}$ :

[TABLE]

Although $\mathcal{H},H,\mathcal{G}$ and $\rho$ depend on $y\in\mathbb{R}^{k}$ , we do not include $y$ in our notation.

5.1. The main term

In Lemma 1 we defined the $|{\mathcal{A}_{L}}|\times k$ matrix $Q$ of rank $k$ whose coefficients are $Q_{v,j}:=q_{jv}$ . We will write $\mathcal{Q}$ for the $|{\mathcal{A}_{K}}|\times k$ matrix with entries $\mathcal{Q}_{wj}:=q_{jw}$ and rank $k$ . Recall that we write $q_{jw}:=q_{jv}$ for any $v\in{\mathcal{A}_{L}}$ extending $w\in{\mathcal{A}_{K}}$ . Let ${\mathcal{A}^{[k]}_{K}}$ be the set of $k$ -element subsets of ${\mathcal{A}_{K}}$ . For $\eta\in{\mathcal{A}^{[k]}_{K}}$ , let $\mathcal{Q}_{\eta}$ be the $k\times k$ submatrix of $\mathcal{Q}$ whose rows are indexed by the elements of $\eta$ . In the computation of $\psi(\chi)$ in Lemma 3 the term $\det(Q_{\phantom{l}}^{\intercal}Q)$ appears. Using the smaller matrix $\mathcal{Q}$ we have

[TABLE]

as follows from

[TABLE]

Next we calculate some integrals such as $I_{1}$ in (55), and its derivatives.

Lemma 11.

Let $\mathcal{Q}$ and $\mathcal{Q}_{\eta}$ be as above, where $\eta\in{\mathcal{A}^{[k]}_{K}}$ , let $(b_{w})_{w\in{\mathcal{A}_{K}}}\in\mathbb{R}_{+}^{{\mathcal{A}_{K}}}$ , and define

[TABLE]

Then, with $S_{w}$ as in (58),

[TABLE]

*Furthermore, for any $w_{0}\in{\mathcal{A}_{K}}$ we have *

[TABLE]

and

[TABLE]

Proof.

Let $P=(P_{w,j})$ be the $|{\mathcal{A}_{K}}|\times k$ matrix with entries $P_{w,j}:=\sqrt{b_{w}}q_{jw}\ \,(w\in{\mathcal{A}_{K}},\ 1\leq j\leq k).$ Then for $T=(T_{1},...,T_{k})\in\mathbb{R}^{k}$ , considered as a $k\times 1$ matrix, $PT\in\mathbb{R}^{{\mathcal{A}_{K}}}$ satisfies $(PT)_{w}=\sqrt{b_{w}}S_{w}(T)$ . Hence

[TABLE]

The $k\times k$ matrix $H$ is clearly positive semi-definite. The Cauchy-Binet formula gives $\det(H)={\mathfrak{D}}$ , with ${\mathfrak{D}}$ as in (67).666 The Cauchy-Binet formula computes $\det(AB)$ , where $A$ is a $k\times\ell$ and $B$ is $\ell\times k$ , in terms of the $k\times k$ minors of $A$ and $B$ . But ${\mathfrak{D}}>0$ as ${\mathfrak{D}}_{\eta}>0$ for at least one $\eta\in{\mathcal{A}^{[k]}_{K}}$ , since $\mathcal{Q}$ has rank $k$ . Hence $H$ is positive definite, and so the integral in (68) is the well-known Gaussian integral attached to a positive definite quadratic form $H$ in $k$ variables, as claimed in (68).

The other equalities in Lemma 11 are obtained by differentiating (68) with respect to $b_{w_{0}}$ repeatedly. Indeed, noting that the partial derivative $\frac{\partial{\mathfrak{D}}}{\partial b_{w_{0}}}=b_{w_{0}}^{-1}\sum_{{\eta}\ni w_{0}}{\mathfrak{D}}_{{\eta}}$ is independent of $b_{w_{0}}$ , i.e., $\frac{\partial^{2}{\mathfrak{D}}}{\partial b_{w_{0}}^{2}}=0$ , we have

[TABLE]

proving the equalities. The inequalities follow from $\sum_{{\eta}\ni w_{0}}{\mathfrak{D}}_{{\eta}}\leq{\mathfrak{D}}$ , as ${\mathfrak{D}}_{{\eta}}\geq 0$ . ∎

As $\alpha_{\kappa}^{\prime\prime}(t)>0$ for $t>0$ , we can now evaluate $I_{1}$ .

Corollary 12.

With notation as in (63), for $y\in\mathbb{R}^{k}$ we have

[TABLE]

where $\sigma:=\sigma(ny)\in{\mathcal{D}}$ as in Lemma 6 and

[TABLE]

5.2. The small terms

We begin by quoting some one-variable estimates.

Lemma 13.

If $m\geq 1000$ , $\kappa\in[\tfrac{1}{2},1]$ , and $r>0$ , then

[TABLE]

Proof.

The estimate (70) is proved in [Su2, Lemma 4.4]. We now prove (71). From [Su2, Lemma 4.11] we have

[TABLE]

while from [FS, Lemma 5.3] we have

[TABLE]

where $\lfloor r\rfloor$ is the floor of $r$ . Since $0<r^{2}\alpha_{\kappa}^{\prime\prime}(r)<1+r$ [FS, p. 141], we have

[TABLE]

Indeed, for $0<r<1$ the last inequality is obvious, while for $r\geq 1$ a much better inequality follows from $m\kappa\geq 500$ . Hence

[TABLE]

Combining this with (72) we obtain (71). ∎

We will need the following inequality, proved by elementary calculus.

[TABLE]

Lemma 14.

Suppose $m\geq 1000,\ \frac{1}{2}\leq\kappa\leq 1,\ 0<D\leq m^{1/3}\kappa$ , and let

[TABLE]

Then, for any $r>0$ ,

[TABLE]

and

[TABLE]

Proof.

Inequality (77) follows from

[TABLE]

where the first inequality is from [FS, p. 139] and the last one uses (74) with $x:=m^{1/3}D^{2}/2$ . To prove (76) we use [Su2, Lemma 4.5],

[TABLE]

where the second inequality again follows from (74). ∎

Next we deal with the second order remainder term in the Taylor expansion about $a$ of $\log\Gamma(a+ib)$ , taking $a=S_{w}(\sigma)$ and $b=S_{w}(T)$ .

Lemma 15.

For $w\in{\mathcal{A}_{K}},\ \sigma\in{\mathcal{D}}$ $\big{(}$ see (32) $\big{)}$ , $T\in\mathbb{R}^{k}$ and $\rho_{w}$ as in (60), we have

[TABLE]

Proof.

The first inequalities in (78) and (79) are proved in [Su2, Lemma 4.7], as is also (81). The second inequalities in (78) and (79) follow from [FS, Lemma 5.2] and $\kappa_{w}\geq\tfrac{1}{2}$ . The identities in (80) follow from (60) and $\log\Gamma(\overline{z})=\overline{\log\Gamma(z)}$ . ∎

Lemma 16.

$\big{(}$ [FS, (5.11)] $\big{)}$ *

If $u,v\in\mathbb{R}$ with $0\leq u\leq R$ , then*

[TABLE]

We first estimate the easier “outer” terms, $I_{2}$ and $I_{3}$ in (55), i.e., where the region of integration is $\mathbb{R}^{k}-\Delta$ . For $y\in\mathbb{R}^{k}$ , let $\eta_{0}=\eta_{0}(y)\in{\mathcal{A}^{[k]}_{K}}$ correspond to a maximal summand in (69), so

[TABLE]

Thus,

[TABLE]

and so

[TABLE]

For $y\in\mathbb{R}^{k},\ w\in\eta_{0}(y)$ and $D>0$ , let $\big{(}$ cf. (75) $\big{)}$

[TABLE]

Define the neighborhood $\Delta\subset\mathbb{R}^{k}$ of $T=0\in\mathbb{R}^{k}$ as

[TABLE]

The next lemma shows that $I_{2}$ and $I_{3}$ are small compared to $I_{1}$ in Corollary 12.

Lemma 17.

Suppose $m:=[L:K]\geq 1000,\ \,0<D<m^{1/3}/\sqrt{2}$ , and $y\in\mathbb{R}^{k}$ . Then, with $\Delta$ as in (85), $\sigma:=\sigma(ny)\in{\mathcal{D}}$ as in Lemma 6, $\mathcal{H}$ and $\mathcal{G}$ as in (63) and (65), we have

[TABLE]

Proof.

We first prove (86). Note that $\Gamma(z)=\int_{0}^{\infty}x^{z}\mathrm{e}^{-x}\frac{dx}{x}$ implies

[TABLE]

Using this, (65) and (59) we have,

[TABLE]

Let $B\subset\mathbb{R}^{\eta_{0}}$ denote the $k$ -dimensional box

[TABLE]

and let $B^{c}:=\mathbb{R}^{\eta_{0}}-B$ denote its complement. Making the change of variables $\tilde{T}_{w}:=S_{w}(T)$ for $w\in{\eta}_{0}$ , we have

[TABLE]

The latter integral is easy to bound using Lemmas 13 and 14. We integrate over $k$ (overlapping) regions, each of which has $k-1$ of the $\tilde{T}_{w}$ range over all of $\mathbb{R}$ , and the remaining $\tilde{T}_{w_{0}}$ over $|\tilde{T}_{w_{0}}|>\delta_{w_{0}}$ . Since $m_{w}\geq m:=[L:K]$ , we conclude that

[TABLE]

Now inequality (83) and Corollary 12 prove (86).

Next we prove (87). Changing variables as before, we have

[TABLE]

Once again, we bound $\int_{B^{c}}$ using $k$ overlapping regions, one for each $w_{0}\in\eta_{0}$ . The integral over the region given by all $\tilde{T}\in\mathbb{R}^{\eta_{0}}$ such that $|\tilde{T}_{w_{0}}|>\delta_{w_{0}}$ is bounded by

[TABLE]

We can use (77) to bound the first integral, and the remaining integrals are explicitly known. Hence, summing over the $k$ regions,

[TABLE]

We again conclude using (83). ∎

For the “inner” integral $I_{4}=\int_{\Delta}(\mathcal{G}-\mathcal{H})$ in (55), we can only expect estimates of the kind $O(I_{1}/m)$ , whereas $I_{2}$ and $I_{3}$ are essentially $O\big{(}I_{1}\exp(-m^{1/3})\big{)}$ . This allowed us to use simple estimates for the contribution of places $w\notin\eta_{0}$ . However, to estimate $I_{4}$ we shall need the following geometric result.

Lemma 18.

Let $M=(m_{ij})$ be an $N\times k$ matrix of rank $k$ , and let $a_{i}>0\ \,(1\leq i\leq N)$ . Define linear maps $P_{i}\colon\mathbb{R}^{k}\to\mathbb{R}$ by $P_{i}(T):=\sum_{j=1}^{k}m_{ij}T_{j},$ where $T=(T_{1},...,T_{k})$ . For any $k$ -element subset $\eta=\{i_{1},\ldots,i_{k}\}\subset\{1,2,\ldots,N\}$ , let $M_{\eta}$ denote the $k\times k$ submatrix of $M$ given by $\big{(}M_{\eta}\big{)}_{\ell,j}=m_{i_{\ell},j}$ . Define $E_{\eta}:=\lvert\det(M_{\eta})\rvert\prod_{i\in\eta}a_{i},$ and let $\eta_{0}$ maximize $E_{\eta}$ . Then

[TABLE]

Proof.

Replacing $m_{ij}$ with $a_{i}m_{ij}$ , we may assume $a_{i}=1$ . Hence $\eta_{0}$ simply maximizes $\lvert\det(M_{\eta})\rvert$ . Fix $i\in\{1,2,\dotsc,N\}$ , and define $\lambda_{j}\in\mathbb{R}$ for $j\in\eta_{0}$ by $P_{i}=\sum_{j\in\eta_{0}}\lambda_{j}P_{j}.$ For $j\in\eta_{0}$ , let $M_{j}$ denote $M_{\eta}$ with the $j^{\text{th}}$ row of $M$ replaced by the $i^{\text{th}}$ row. Then, by Cramer’s rule, $\lvert\lambda_{j}\det(M_{\eta})\rvert=\lvert\det(M_{j})\rvert\leq\lvert\det(M_{\eta})\rvert,$ so $\lvert\lambda_{j}\rvert\leq 1$ . Hence

[TABLE]

Lemma 19.

For $y\in\mathbb{R}^{k}$ and $D>0$ we have

[TABLE]

*with notation as in (55), $m:=[L:K]$ and $Z:=\big{(}\mathrm{e}^{\lvert{\mathcal{A}_{K}}\rvert k^{4}D^{4}m^{-1/3}}-1\big{)}/\big{(}\lvert{\mathcal{A}_{K}}\rvert k^{4}D^{4}m^{-1/3}\big{)}.$ *

Proof.

Lemma 18, applied to the matrix $\mathcal{Q}$ and $a_{w}:=\sqrt{m_{w}\alpha_{\kappa_{w}}^{\prime\prime}\!\big{(}S_{w}(\sigma)\big{)}}$ , shows

[TABLE]

for $w\in{\mathcal{A}_{K}},\ T\in\mathbb{R}^{k}$ and $\eta_{0}$ as in (82). Since $x\mapsto x^{4}$ is convex, we have,

[TABLE]

For $T\in\Delta$ and $w_{0}\in\eta_{0}$ , by (84) and (85) we have

[TABLE]

Hence,

[TABLE]

Combining this with Lemma 15, we conclude that for $T\in\Delta$ ,

[TABLE]

Lemmas 15 and 16 now show that for $T\in\Delta$ ,

[TABLE]

where in the last step we used the convexity of $x\mapsto x^{2}$ .

By Lemma 15, $\mathrm{Im}\big{(}e^{\rho(T)}\big{)}$ is odd, while $\mathrm{Re}\big{(}e^{\rho(T)}\big{)}$ is even in $T$ . Furthermore, $\mathcal{H}(T)$ is a real and even function of $T$ , and $\Delta$ is mapped to itself by $T\mapsto-T$ . Hence, using (65) and (92),

[TABLE]

Using Lemma 11 and Corollary 12, we find

[TABLE]

Our next estimate will let us deal with the term $\int_{E_{\mathbb{R}}}\|ax\|^{2}\mathrm{e}^{-t\,\|ax\|^{2}}\,d\mu(x)$ in the Basic Inequality (14) and (40).

Lemma 20.

For $y\in\mathbb{R}^{k}$ and $m:=[L:K]\geq 1000$ we have

[TABLE]

with $I_{1}$ as in (55), $\alpha$ as in (59) and $\sigma=(\sigma_{1},\ldots,\sigma_{k}):=\sigma(ny)$ as in Lemma 6.

Proof.

By (51), for $T\in\mathbb{R}^{k}$ we have

[TABLE]

Hence we will need to bound integrals of the kind $\int_{\mathbb{R}^{k}}|S_{w}(T)\mathrm{e}^{\alpha(\sigma+iT)}|\,dT$ .

Let $\eta_{0}$ be as in (82) and let $w_{0}\in\eta_{0}$ . Then, using (88) and changing variables as in the proof of Lemma 17,

[TABLE]

Using Lemma 13 and (83) we obtain,

[TABLE]

By inequality (91),

[TABLE]

where the last inequality uses $m_{w_{0}}\leq 2m_{w}$ and $x^{2}\alpha_{\kappa_{w}}^{\prime\prime}(x)>\kappa_{w}\geq 1/2$ for $x>0$ [FS, (5.7)]. Hence, by (94),

[TABLE]

It follows that

[TABLE]

where the last equality uses Corollary 12. ∎

6. Proof of the Main Theorem

The next lemma will allow us to ensure that each integral in the Basic Inequality (14) is positive. As in §5, we always assume that $E(L/K)\subset E\subset{\mathcal{O}_{L}}$ .

Lemma 21.

There is an absolute constant $N_{0}$ such that if $[L:K]\geq N_{0}\cdot 2.01^{[K:\mathbb{Q}]}$ and $a\in{\mathcal{O}_{L}},\ a\not=0$ , then for $t:=\exp\!\big{(}\Psi(0.51+\frac{r_{2}}{2n})\big{)}$ we have $\sigma_{1}(ny_{a,t})\geq 0.51$ and

[TABLE]

where $y_{a,t}$ is given by Corollary 4, $\Psi(x):=\Gamma^{\prime}(x)/\Gamma(x)$ , and

[TABLE]

Proof.

We note that $\mathcal{L}$ is as in Corollary 4, except that we used (66) to express $\mathcal{L}$ in terms of $\mathcal{Q}$ rather than $Q$ . Letting $y:=y_{a,t}$ , from Corollary 4 we have

[TABLE]

Again from Corollary 4, for $a\in{\mathcal{O}_{L}},\ a\not=0$ ,

[TABLE]

Applying Lemma 7 to $ny$ , since $\Psi^{-1}$ is increasing we have,

[TABLE]

Since $k\leq|{\mathcal{A}_{K}}|\leq[K:\mathbb{Q}]$ by (56), we have $|{\mathcal{A}^{[k]}_{K}}|=\binom{|{\mathcal{A}_{K}}|}{k}\leq 2^{|{\mathcal{A}_{K}}|}\leq 2^{[K:\mathbb{Q}]}$ . Thus, Lemma 20 yields

[TABLE]

for $m\geq N_{0}\cdot 2.01^{[K:\mathbb{Q}]}$ and some absolute $N_{0}\geq 500$ . By (55) and (54) we have

[TABLE]

where $I_{j}=I_{j}(ny)$ . Taking $D=1$ in Lemmas 17 and 19, and after possibly enlarging $N_{0}$ , we obtain $|I_{2}|+|I_{3}|+|I_{4}|\leq 0.01I_{1}.$ Hence,

[TABLE]

and so, since $\sigma_{1}\geq 0.51$ by (97),

[TABLE]

A glance at (95) shows that we are finished. ∎

We now prove the Main Theorem in §1, which we do not repeat here. Note that

[TABLE]

Take $N_{0}$ and $t:=\exp\!\big{(}\Psi(0.51+\tfrac{r_{2}}{2n})\big{)}$ as in Lemma 21. In the Basic Inequality (14) take $\mathfrak{a}:={\mathcal{O}_{L}}$ , so that the sum there includes only nonzero $a\in{\mathcal{O}_{L}}$ . By Lemma 21, each integral in the sum is positive. Retaining only the term corresponding to $a=1\in{\mathcal{O}_{L}}$ we have, again by Lemma 21,

[TABLE]

where $y:=y_{1,t}$ and $\sigma:=\sigma(ny)$ . Corollary 4 applied to $a=1$ gives

[TABLE]

We need an upper bound for $\det\!\big{(}H(\sigma)\big{)}$ in (101). In view of (69), we look for an upper bound for $\alpha_{\kappa_{w}}^{\prime\prime}\!\big{(}S_{w}(\sigma)\big{)}.$ Note that

[TABLE]

since $\Psi^{\prime}(x)$ is decreasing for $x>0$ . Note that $\sigma_{1}\geq 0.51$ by (97) and that

[TABLE]

From Lemma 9 we have

[TABLE]

Estimating the series by an integral, $\Psi^{\prime}(x)=\sum_{k=0}^{\infty}\frac{1}{(k+x)^{2}}<\frac{1}{x}+\frac{1}{x^{2}},$ yields

[TABLE]

From $\det(\mathcal{Q}_{\phantom{l}}^{\intercal}\mathcal{Q})=\sum_{\eta\in{\mathcal{A}^{[k]}_{K}}}\det^{2}(\mathcal{Q}_{\eta})$ (Cauchy-Binet), $r_{1,w}+r_{2,w}\geq m_{w}/2$ and (69),

[TABLE]

where we also used $k\leq|{\mathcal{A}_{K}}|\leq[K:\mathbb{Q}]$ .

We now bound the term $\mathrm{e}^{\alpha(\sigma)-ny\cdot\sigma}$ in (101) from below. From (102) and (103),

[TABLE]

Using the lower bound for $\alpha(\sigma)$ in Lemma 8, we have

[TABLE]

We now distinguish two cases according to the size of $\sigma_{1}$ . If $\sigma_{1}\geq 4$ , then $\log\Gamma\big{(}\sigma_{1}+\textstyle\frac{r_{2}}{2n}\big{)}\geq\log(6)$ . Since $-n\sigma_{1}y_{1}>n\sigma_{1}$ , after possibly increasing $N_{0}$ , the Main Theorem follows easily from (100), (101), (104) and (105).

We now turn to the remaining case, i.e., $0.51\leq\sigma_{1}<4$ . (By Lemma 21, $\sigma_{1}\geq 0.51$ .) Then in (104) we can replace $\log(23\sigma_{1})$ by 5. The critical points $r\in(0,\infty)$ of $r\mapsto\log\Gamma\big{(}r+\textstyle\frac{r_{2}}{2n}\big{)}-ry_{1}$ occur where

[TABLE]

But $\Psi\colon(0,\infty)\to\mathbb{R}$ is injective, so $r=0.51$ is the only critical point of $r\mapsto\log\Gamma\big{(}r+\textstyle\frac{r_{2}}{2n}\big{)}-ry_{1}$ , and it is a local (therefore global) minimum. Since $\sigma_{1}\geq 0.51$ ,

[TABLE]

Note that $0\leq\textstyle\frac{r_{2}}{2n}\leq\frac{1}{4}$ , $\Psi(r)<-1$ for $0<r<0.76$ , and $\Psi^{\prime}(r)>0$ for $r>0$ . Hence

[TABLE]

is decreasing for $0\leq x\leq\frac{1}{4}$ . We conclude that

[TABLE]

Since $\mathrm{e}^{0.0955}>1.1$ and $j:=\text{rank}_{\mathbb{Z}}(E)\leq|{\mathcal{A}_{L}}|\leq n$ , after again possibly increasing $N_{0}$ , we can use the “spare” $\exp(0.0045n)$ to control the term in (104). $\square$

We note that the our proof of the Main Theorem shows that the $1.1^{j}$ appearing in it can be replaced by $\exp\!\big{(}nf(r_{2}/(2n))\big{)}$ , where $r_{2}$ is the number of complex places of $L$ and

[TABLE]

In particular, if $L$ is totally real, we can replace $1.1^{j}$ by $2.3^{n}$ . We can also replace $0.51$ above by $\epsilon+1/2$ for any $\epsilon>0$ .

Finally, we prove that every element of $\bigwedge^{r_{L}-1}{\mathrm{LOG}}({\mathcal{O}_{L}^{*}})$ is represented by a pure wedge, as claimed in the Introduction.

Lemma 22.

Suppose $M$ is a $\mathbb{Z}$ -lattice in $\mathbb{R}^{n}$ of rank $n\geq 1$ . Then every element of $w\in\bigwedge^{n-1}M$ has the form

[TABLE]

for some integer $d$ and some basis $\{\epsilon_{1},\ldots,\epsilon_{n}\}$ of $M$ as a $\mathbb{Z}$ -module.

Proof.

We may clearly assume $\omega\not=0$ . Define the homomorphism $\wedge_{\omega}:M\to\bigwedge^{n}M$ by $\wedge_{\omega}(m):=\omega\wedge m$ . As $\bigwedge^{n}M\cong\mathbb{Z}$ , $M/\ker(\wedge_{\omega})$ is torsion-free and so $\ker(\wedge_{\omega})$ is a direct summand of $M$ of rank $n-1$ . Let $\epsilon_{1},...,\epsilon_{n}$ be a $\mathbb{Z}$ -basis of $M$ such that $\epsilon_{1},...,\epsilon_{n-1}$ is a $\mathbb{Z}$ -basis of $\ker(\wedge_{\omega})$ , let $\eta:=\epsilon_{1}\wedge\cdots\wedge\epsilon_{n-1}\in\bigwedge^{n-1}M$ , and define $d\in\mathbb{Z}$ by $\omega\wedge\epsilon_{n}=d\eta\wedge\epsilon_{n}$ . Notice that $\eta\wedge\epsilon_{i}=0=\omega\wedge\epsilon_{i}$ for $1\leq i\leq n-1$ .

For $m\in M$ , write $m=\sum_{i=1}^{n}a_{i}\epsilon_{i}$ with $a_{i}\in\mathbb{Z}$ . Then

[TABLE]

As the $\wedge$ -pairing of $\bigwedge^{n-1}M$ with $M$ is non-degenerate, $\omega=d\eta=d\epsilon_{1}\wedge\cdots\wedge\epsilon_{n-1}$ . ∎

7. Appendix by Fernando Rodriguez Villegas (May 2002)

Some remarks on Lehmer’s conjecture

7.1.

The logarithmic Mahler measure of a non-zero Laurent polynomial $P\in$

$\mathbb{C}[x_{1}^{\pm 1},\ldots,\;x_{n}^{\pm 1}]$ is defined as

[TABLE]

and its Mahler measure as $M(P)=e^{m(P)}$ , the geometric mean of $|P|$ on the torus

[TABLE]

When $n=1$ Jensen’s formula gives the identity

[TABLE]

where $P(x)=a_{0}\prod_{\nu=1}^{d}(x-\alpha_{\nu})$ , from which we clearly obtain that $M(P)\geq 1$ if $P\in\mathbb{Z}[x]$ . By a theorem of Kronecker if $M(P)=1$ for $P\in\mathbb{Z}[x]$ then $P$ is cyclotomic, i.e., $P$ is monic and its roots are either [math] or roots of unity.

In the early 30’s Lehmer [Le] famously asked whether there is an absolute lower bound for $M(P)$ when $P\in\mathbb{Z}[x]$ and $M(P)>1$ . The purpose of this note is to point out a simple reformulation of this question in terms of the logarithmic embedding of units of a number field and, given this setting, to propose a natural generalization.

7.2.

We start with some general observations about $m(P)$ . First of all, the fact that the integral in (106) is finite for all non-zero $P$ does need a proof. Here is a sketch. Using Jensen’s formula we find, as in (107) that

[TABLE]

where $y=(y_{1},\cdots,y_{n-1})$ , $dy/y=dy_{1}/y_{1}\cdots dy_{n-1}/y_{n-1}$ , $\log^{+}(x)=\max\{\log|x|,0\}$ , and $a_{0}(y),\alpha_{v}(y),d$ are the leading coefficient, roots and degree, respectively, of $P$ viewed as a polynomial in $x_{n}$ . The $\alpha_{\nu}$ ’s are algebraic functions of $y\in\mathbb{C}^{n-1}$ , continuous and piecewise smooth, except at those $y$ ’s where $a_{0}(y)$ vanishes (where some will go off to infinity).

We can apply the above procedure to any variable $x_{n}$ on the torus $T^{n}$ . It is not hard to see that we may change coordinates in such a way that $a_{0}(y)$ is actually constant, completing the proof by induction on $n$ .

This last remark can be expanded. Let $\Delta$ be the Newton polytope of $P$ ; i.e., the convex hull of the exponents $m\in\mathbb{Z}^{n}$ of monomials $x^{m}=x_{1}^{m_{1}}\cdots x_{n}^{m_{n}}$ such that if

[TABLE]

then $c_{m}\neq 0$ .

We define a face $\tau$ of $\Delta$ as the non-empty intersection of $\Delta$ with a half-space in $\mathbb{R}^{n}$ . Chose a parameterization $\phi:\mathbb{R}^{k}\longrightarrow\mathbb{R}^{n}$ of the affine subspace of smallest dimension containing $\tau$ ; $k$ is the dimension of the face $\tau$ . Define

[TABLE]

a polynomial whose own Newton polytope is $\phi^{-1}(\tau)$ . We call $P_{\tau}$ the face polynomial associated to the face $\tau$ . It depends on a choice of $\phi$ but note that by changing variables in the integral $m(P_{\tau})$ is actually independent of that choice.

It is not hard to see that for any facet (co-dimension $1$ face) $\tau\subset\Delta$ we can choose $\phi$ and system of coordinates in $T^{n}$ so that, in the notation of (108), $P_{\tau}=a_{0}(y)$ . By (108) and induction on $n$ we conclude [Sm1] that

[TABLE]

In particular,

[TABLE]

Also, since clearly $m(PQ)=m(P)+m(Q)$ , we have that

[TABLE]

Though Lehmer’s conjecture is about polynomials in one variable, polynomials in more variables are also relevant due to the following result [Bo]. For any $0\neq P\in\mathbb{Z}[x_{1},x_{1}^{-1},\ldots,x_{n},x_{n}^{-1}]$ and $0\neq(a_{1},\ldots,a_{n})\in\mathbb{Z}^{n}$ we have

[TABLE]

That is, there are one variable polynomials $Q$ with $m(Q)$ as close to $m(P)$ as desired. (We should note that (111) is not an immediate consequence of general results about integration but requires a somewhat delicate analysis.)

7.3.

Let us go back to polynomials in one variable. If we want to find polynomials $P\in\mathbb{Z}[x]$ with positive but small $m(P)$ , by (109) and (110) (and Gauss’ lemma) we may as well restrict ourselves to minimal polynomials of algebraic units.

Let $F$ be a number field of degree $n$ . Let $I$ be the set of embeddings $\sigma:F\longrightarrow\mathbb{C}$ and $V$ the real vector space of formal linear combinations

[TABLE]

We have the decomposition

[TABLE]

where $V^{\pm}$ is the subspace of $V$ where complex conjugation acts like $\pm 1$ . We let $n_{\pm}=\dim_{\mathbb{R}}V^{\pm}$ (in terms of the standard notation $n_{+}=r_{1}+r_{2}$ and $n_{-}=r_{2}$ ).

By Dirichlet’s theorem the image of the unit group $\mathcal{O}_{F}^{*}$ by the log map

[TABLE]

is a discrete subgroup $L_{1}\subset V$ of rank $r=n^{+}-1$ .

On $V$ we define the $L^{1}$ -norm

[TABLE]

and we let

[TABLE]

(the reason for this indexing will become clear shortly).

For any unit $\epsilon\in\mathcal{O}_{F}^{*}$ we have $|\mathbb{N}_{F/\mathbb{Q}}(\epsilon)|=1$ hence

[TABLE]

Let $P\in\mathbb{Z}[x]$ be the (monic) minimal polynomial of $\epsilon$ and

[TABLE]

This simple observation allows us to reformulate Lehmer’s conjecture as follows.

Conjecture.

(Lehmer) There exists an absolute constant $\delta_{1}>0$ such that

[TABLE]

7.4.

Let $V$ be a vector space over $\mathbb{R}$ of dimension $n$ and $L\subset V$ a discrete subgroup of rank $r\geq 1$ . A choice of basis $v_{1},\ldots,v_{n}$ for $V$ determines $L^{1}$ -norms on $\Lambda^{k}V$ for $k=1,\ldots,n$ by

[TABLE]

For each $1\leq k\leq r$ we define (with respect to the chosen basis)

[TABLE]

where the minimum is taken over all $l_{1},\ldots,l_{k}\in L$ which are linearly independent over $\mathbb{R}$ .

If $A$ is the $n\times k$ integral matrix whose $i$ -th column consists of the coordinates of $l_{i}$ in the basis $v_{1},\ldots,v_{n}$ then, as it is easily seen,

[TABLE]

where $A^{\prime}$ runs over all $k\times k$ minors of $A$ .

Returning to the number field situation of the previous section we define the invariants

[TABLE]

where, as before, $L_{1}$ is the image of the units of $F$ under the log map.

A general version of Lehmer’s conjecture would then be

Conjecture.

For each $k\in\mathbb{N}$ there exists an absolute constant $\delta_{k}>0$ such that

[TABLE]

A straightforward calculation shows that the top invariant $\mu_{1,r}(F)$ , with $r=n^{+}-1$ the rank of the unit group $\mathcal{O}_{F}^{*}$ , equals the regulator of $F$ . It is known [Zi], [Fr], [Sk] that the regulator of number fields is universally bounded below and hence the above conjecture is true for $k=r$ .

In summary, we have seen (18) that Lehmer’s conjecture can be phrased in terms of the $L^{1}$ -norm of units under the log map. The above conjecture is an attempt to quantify, in what seems to be the most natural way, the question of what is the general shape of $L_{1}$ , the discrete group of units under the log map.

7.5.

We may carry these ideas a little further still. Borel proved, generalizing Dirichlet’s result for units, that for each $j>1$ there is a regulator map $\operatorname{reg}_{j}$

[TABLE]

whose image is a discrete subgroup $L_{j}$ of $V^{\pm}$ , with $\pm=(-1)^{j-1}$ , of rank $n^{\pm}$ and covolume related to the value of the zeta function $\zeta_{F}$ of $F$ at $s=j$ . Here $K_{2j-1}(F)$ are the $K$ groups defined by Quillen.

We now define

[TABLE]

and we may ask: what is the nature of these invariants, how do they depend on the field $F$ ? Does the analogue of Lehmer’s conjecture hold?

Apart from their formal analogy with Lehmer’s question, answers to such questions can be quite useful in practice as we now illustrate.

7.6.

For general $j$ , very little is known about the groups $K_{2j-1}(F)$ or the map $\operatorname{reg}_{j}$ . For $j=2$ , however, things can be made quite explicit (and of course $j=1$ corresponds to the case of units). Indeed, up to torsion, $K_{3}(F)$ is isomorphic to the Bloch group ${\mathcal{B}}(F)$ , defined by generators and relations as follows.

For any field $F$ define

[TABLE]

where the corresponding term in the sum is omitted if $z_{i}=0,1$ and

[TABLE]

It is not hard to check that $\mathcal{C}(F)\subset{\mathcal{A}}(F)$ . Finally, let

[TABLE]

We recall the definition of the Bloch–Wigner dilogarithm. Starting with the usual dilogarithm

[TABLE]

one defines

[TABLE]

and checks that it extends to a real analytic function on $\mathbb{C}\setminus\{0,1\}$ , continuous on $\mathbb{C}$ . See [Za] for an account of its many wonderful properties. It is obvious that

[TABLE]

The 5-term relation satisfied by $D$ guarantees that, extended by linearity to ${\mathcal{A}}(F)$ , it induces a well defined function on ${\mathcal{B}}(\mathbb{C})$ (still denoted by $D$ ).

For $j=2$ (114) can be formulated as follows

[TABLE]

((115) makes it clear that the image $L_{2}$ lies in $V^{-}$ ) whose image $L_{2}$ is a discrete subgroup of rank $n^{-}$ .

An a priori lower bound for $\left|\left|l_{2}(\xi)\right|\right|_{1}$ even for the simplest case where $L_{2}$ is of rank $1$ (namely, for a field with only one complex embedding) would be quite useful. For example, in [BRV1] we find that an identity between the Mahler measure of certain two-variable polynomials is equivalent to the following

[TABLE]

This was proved by Zagier by showing that it is a consequence of series of 5-term relations. Such calculations, however, can be quite hard and at present there is no known algorithm that is guaranteed to produce the desired result. Clearly if we knew a reasonable lower bound for the possible non-zero values of $|D(\xi)|$ for $\xi\in{\mathcal{B}}(\mathbb{Q}(\sqrt{-7}))$ a simple numerical verification would be enough to prove (116).

Similarly, many identities [BRV2] between the Mahler measure of certain two-variable polynomials and $\zeta_{F}(2)$ for a corresponding number field $F$ , which by Borel’s theorem are known up to an unspecified rational number, could be proved by a numerical check. For example, we can show that

[TABLE]

with $s\in\mathbb{Q}^{*}$ , where $F$ is the splitting field $x^{4}-2x^{3}-2x+1$ , of discriminant $-1728$ . However, though numerically $s$ appears to be equal to $1$ we cannot prove this at the moment. Again, a reasonable lower bound on $|D(\xi)|$ for non-torsion elements $\xi\in{\mathcal{B}}(F)$ would allow us to conclude that $s=1$ by checking it numerically to high enough precision.

There is also some evidence that $\mu_{2,1}(F)$ might be universally bounded below, at least for fields with one complex embedding. Indeed, for a such a field one can construct a hyperbolic three dimensional manifold $M$ by taking the quotient of hyperbolic space by a torsion-free subgroup of the group of units of norm $1$ in a quaternion algebra over $F$ ramified at all its real places. Its associated Bloch group element $\xi(M)$ , obtained from a triangulation of $M$ into ideal tetrahedra, satisfies $D(\xi(M))=\operatorname{vol}(M)$ . On the other hand, the volume of hyperbolic 3-manifolds is known to be universally bounded below. The question becomes then, that of obtaining an upper bound for the index in ${\mathcal{B}}(F)$ of the subgroup generated by all such $\xi(M)$ . This index is likely to be rather small; in fact, if we accept a precise form of Lichtembaum’s conjecture, it should be essentially the order of $K_{2}(\mathcal{O}_{F})$ , an analogue of a class group. Unfortunately, there is no known upper bound for $|K_{2}(\mathcal{O}_{F})|$ in terms of, say, the degree and discriminant of $F$ .

Finally, to a hyperbolic 3-manifold $M$ with one cusp one may associate [CCGLS] a two variable polynomial $A(x,y)\in\mathbb{Z}[x,y]$ , called the A-polynomial of $M$ . Its zero locus parameterizes deformations of the complete hyperbolic structure of $M$ .

It is known that

[TABLE]

for every face polynomial of $A$ and that $A$ is reciprocal, i.e. $A(1/x,1/y)=x^{a}y^{b}A(x,y)$ for some $a,b\in\mathbb{Z}$ . It is interesting that these two properties, which have a topological and $K$ -theoretic origin, are, for $A$ irreducible, precisely the known necessary conditions for a polynomial in $\mathbb{Z}[x,y]$ to have to have small Mahler measure (the first, an analogue of being the minimal polynomial of an algebraic unit, because of (109); the second because $m(P)$ is known to be universally bounded below for $P$ non-reciprocal [Sm1]).

Though the whole picture is still not completely clear yet one can prove [BRV2] for many $M$ ’s identities of the form

[TABLE]

where $\xi(M)$ is the Bloch group element associated to $M$ . This suggests a direct link between Lehmer’s conjecture and the size of the invariants $\mu_{2,1}$ .

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AAR] G. Andrews, R. Askey and R. Roy, Special functions , Cambridge U. Press, Cambridge (1999).
2[BS] Z. I. Borevich and I. R. Shafarevich, Number Theory . Academic Press, New York (1966).
3[Bo] D.W. Boyd, Speculations concerning the range of Mahler’s measure , Canad. Math. Bull. 24 (1981) 453–469.
4[BRV 1] D.W. Boyd and F. Rodriguez Villegas, Mahler’s measure and the dilogarithm I , Canad. J. Math. 54 (2002) 468–492.
5[BRV 2] D.W. Boyd and F. Rodriguez Villegas, Mahler’s measure and the dilogarithm II ,
6[Ca] J. Cassels, An introduction to the geometry of numbers . Springer, Berlin (1959).
7[CCGLS] D. Cooper, M. Culler, H. Gillet, D. D. Long, and P. B. Shalen, Plane curves associated to character varieties of 3-manifolds , Invent. Math. 118 (1994) 47–84.
8[CF] A. Costa and E. Friedman, Ratios of regulators in totally real extensions of number fields , J. Number Th. 37 (1991) 288–297.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

A case of the Rodriguez Villegas conjecture

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

RV Conjecture**.**

Main Theorem**.**

2. The Θ\ThetaΘ-function

2.1. The basic inequality

Basic Inequality**.**

2.2. Mellin transforms

Lemma 1**.**

Proof.

Lemma 2**.**

Proof.

Lemma 3**.**

Proof.

Corollary 4**.**

Proof.

3. Existence and uniqueness of the critical point

Lemma 5**.**

Proof.

Lemma 6**.**

Proof.

4. Inequalities at the critical point

Lemma 7**.**

Proof.

Lemma 8**.**

Proof.

Lemma 9**.**

Proof.

5. Asymptotics

Lemma 10**.**

Proof.

5.1. The main term

Lemma 11**.**

Proof.

Corollary 12**.**

5.2. The small terms

Lemma 13**.**

Proof.

Lemma 14**.**

Proof.

Lemma 15**.**

Proof.

Lemma 16**.**

Lemma 17**.**

Proof.

Lemma 18**.**

Proof.

Lemma 19**.**

Proof.

Lemma 20**.**

Proof.

6. Proof of the Main Theorem

Lemma 21**.**

Proof.

Lemma 22**.**

Proof.

7. Appendix by Fernando Rodriguez Villegas (May 2002)

7.1.

7.2.

7.3.

Conjecture**.**

7.4.

Conjecture**.**

7.5.

7.6.

RV Conjecture.

Main Theorem.

2. The $\Theta$ -function

Basic Inequality.

Lemma 1.

Lemma 2.

Lemma 3.

Corollary 4.

Lemma 5.

Lemma 6.

Lemma 7.

Lemma 8.

Lemma 9.

Lemma 10.

Lemma 11.

Corollary 12.

Lemma 13.

Lemma 14.

Lemma 15.

Lemma 16.

Lemma 17.

Lemma 18.

Lemma 19.

Lemma 20.

Lemma 21.

Lemma 22.

Conjecture.

Conjecture.