Simultaneous approximation to values of the exponential function over the adeles

Damien Roy

arXiv:1905.01678·math.NT·February 2, 2022

TL;DR

This paper demonstrates that Hermite's approximations to exponential function values at algebraic numbers are nearly optimal from an adelic viewpoint, considering all completions of a number field.

Contribution

It introduces an adelic framework to evaluate the optimality of Hermite's approximations across different completions of number fields.

Findings

01

Hermite's approximations are nearly optimal adelically.

02

The approach accounts for both Archimedean and p-adic valuations.

03

Provides a unified adelic perspective on exponential value approximations.

Abstract

We show that Hermite's approximations to values of the exponential function at given algebraic numbers are nearly optimal when considered from an adelic perspective. We achieve this by taking into account the ratio of these values whenever they make sense in the various completions (Archimedean or $p$ -adic) of a number field containing these algebraic numbers.

Equations

$$e=[2,(1,2n,1)_{n=1}^{\infty}]=[2,1,2,1,1,4,1,1,6,1,\dots].$$

$$e^{z}=\sum_{k=0}^{\infty}\frac{z^{k}}{k!}$$

$$|x|\,|xe^{\alpha}-y|\geq c\,(\log|x|)^{-2g-1}$$

$${\mathcal{C}}_{n},\qquad\Lambda_{n}$$

$$(cn^{2})^{-1}\leq\lambda_{1}({\mathcal{C}}_{n},\Lambda_{n})\leq\lambda_{2}({\mathcal{C}}_{n},\Lambda_{n})\leq cn^{2},$$

$$|x|\,|xe^{3}-y|\geq c_{\epsilon}|x|^{-\epsilon}$$

$$|x|\,|xe^{3}-y|\geq(3\log|x|\,\log\log|x|)^{-1}\quad\text{if}\quad 4\leq|x|\leq 10^{500000}.$$

$$|x_{1}|\,|x_{1}e^{3}-x_{2}|\,|x_{1}e^{3}-x_{3}|_{3}\geq(\log|x_{1}|)^{-g}$$

$$|x_{1}|\,|x_{1}e^{\alpha_{2}}-x_{2}|\cdots|x_{1}e^{\alpha_{s}}-x_{s}|\geq c_{\epsilon}|x_{1}|^{-\epsilon}$$

$$\mu(K_{\mathbb{A}}/K)=2^{-r_{2}}|D(K)|^{1/2},$$

$${\mathcal{C}}=\prod_{v}{\mathcal{C}}_{v}\subset K_{\mathbb{A}}^{s},$$

$$\lambda{\mathcal{C}}=\prod_{v\mid\infty}\lambda{\mathcal{C}}_{v}\times\prod_{v\nmid\infty}{\mathcal{C}}_{v}$$

$$2^{sr_{1}}(s!)^{-d}\leq\big(\lambda_{1}({\mathcal{C}})\cdots\lambda_{s}({\mathcal{C}})\big)^{d}\mu({\mathcal{C}})\leq 2^{s(r_{1}+r_{2})}|D(K)|^{s/2}.$$

$$f_{\mathbf{n}}(z)=(z-\alpha_{1})^{n_{1}}\cdots(z-\alpha_{s})^{n_{s}}\quad\text{and}\quad P_{\mathbf{n}}(z)=\sum_{k=0}^{N}f_{\mathbf{n}}^{(k)}(z)$$

$$N=n_{1}+\cdots+n_{s}$$

$$a_{\mathbf{n}}:=\big(P_{\mathbf{n}}(\alpha_{1}),\dots,P_{\mathbf{n}}(\alpha_{s})\big)\in K^{s}.$$

$$\frac{d}{dz}\big(P_{\mathbf{n}}(z)e^{-z}\big)=\big(P_{\mathbf{n}}^{\prime}(z)-P_{\mathbf{n}}(z)\big)e^{-z}=-f_{\mathbf{n}}(z)e^{-z}.$$

$$P_{\mathbf{n}}(\alpha_{i})e^{-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})e^{-\alpha_{j}}=\int_{\alpha_{i}}^{\alpha_{j}}f_{\mathbf{n}}(z)e^{-z}\,dz,$$

$$\max_{z\in[\alpha_{i},\alpha_{j}]}|f_{\mathbf{n}}(z)|\leq R^{N}\quad\text{with}\quad R=\max_{1\leq k,\ell\leq s}|\alpha_{k}-\alpha_{\ell}|,$$

$$\big|P_{\mathbf{n}}(\alpha_{i})e^{-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})e^{-\alpha_{j}}\big|\leq c_{1}R^{N}$$

$$P_{\mathbf{n}}(\alpha_{i})=\int_{0}^{\infty}f_{\mathbf{n}}(z+\alpha_{i})e^{-z}\,dz,$$

$$|P_{\mathbf{n}}(\alpha_{i})|\leq\int_{0}^{\infty}(t+R)^{N}e^{-t}\,dt\leq e^{R}\int_{0}^{\infty}t^{N}e^{-t}\,dt=e^{R}N!.$$

$$R_{v}=\max_{1\leq k,\ell\leq s}|\alpha_{k}-\alpha_{\ell}|_{v}$$

$$\big|P_{\mathbf{n}}(\alpha_{i})e^{-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})e^{-\alpha_{j}}\big|_{v},\qquad|P_{\mathbf{n}}(\alpha_{i})|_{v}$$

$$\frac{b^{N}}{n_{i}!}P_{\mathbf{n}}(\alpha_{i})=\sum_{k=n_{i}}^{N}\frac{b^{N}}{n_{i}!}f^{(k)}(\alpha_{i})=\sum_{k=n_{i}}^{N}\frac{b^{k}k!}{n_{i}!}\cdot\frac{g^{(k)}(b\alpha_{i})}{k!}\in{\mathcal{O}}_{K}.$$

$$\Delta_{\mathbf{n}}:=\det(a_{\mathbf{n}-\mathbf{e}_{1}},\dots,a_{\mathbf{n}-\mathbf{e}_{s}})=\prod_{i=1}^{s}\Big((n_{i}-1)!\prod_{k\neq i}(\alpha_{i}-\alpha_{k})^{n_{k}}\Big)\neq 0.$$

$$|x_{i}|_{v}\leq e^{R_{v}}(N-1)!\quad\text{and}\quad|x_{i}e^{\alpha_{j}-\alpha_{i}}-x_{j}|_{v}\leq\max_{1\leq k\leq s}\Big|\int_{\sigma(\alpha_{i})}^{\sigma(\alpha_{j})}f_{\mathbf{n}-\mathbf{e}_{k}}^{\sigma}(z)e^{\sigma(\alpha_{j})-z}\,dz\Big|$$

$$|x_{i}|_{v}\leq p^{3}N\prod_{1\leq k\leq s}\max\big\{|\alpha_{i}-\alpha_{k}|_{v},\,p^{-1/(p-1)}\big\}^{n_{k}}$$

$$|x_{i}e^{\alpha_{j}-\alpha_{i}}-x_{j}|_{v}\leq p^{3}N\prod_{1\leq k\leq s}\max\big\{|\alpha_{i}-\alpha_{k}|_{v},|\alpha_{j}-\alpha_{k}|_{v}\big\}^{n_{k}}.$$


Full text

Key words and phrases:

adeles, exponential function, geometry of numbers, Hermite approximations, measures of approximation, roots of polynomials, semi-resultant, steepest ascent, volumes.

2010 Mathematics Subject Classification:

Primary 11J13; Secondary 11J61, 11J82, 11H06.

Research partially supported by NSERC

1. Introduction

We know by Euler that the number $e$ admits a continued fraction expansion consisting of intertwined arithmetic progressions

$$e=[2,(1,2n,1)_{n=1}^{\infty}]=[2,1,2,1,1,4,1,1,6,1,\dots].$$
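As a quick numerical illustration of this expansion, the following Python sketch generates the partial quotients from the pattern $(1,2n,1)$ and evaluates the corresponding convergents in exact rational arithmetic. The function names are illustrative and do not come from the paper.

```python
from fractions import Fraction
import math


def e_partial_quotients(count):
    """First `count` partial quotients of e, following the pattern (1, 2n, 1)."""
    quotients = [2]
    n = 1
    while len(quotients) < count:
        quotients.extend([1, 2 * n, 1])
        n += 1
    return quotients[:count]


def convergent(quotients):
    """Exact value of the finite continued fraction [q0, q1, ..., qk]."""
    value = Fraction(quotients[-1])
    for a in reversed(quotients[:-1]):
        value = a + 1 / value
    return value


if __name__ == "__main__":
    for length in range(1, 11):
        c = convergent(e_partial_quotients(length))
        print(length, c, abs(float(c) - math.e))
```

With ten partial quotients the convergent is 1457/536, which agrees with $e$ to within $536^{-2}$.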

Euler, Sundman and Hurwitz also obtained similar expansions for the numbers $e^{2/m}$ where $m$ is a non-zero integer [13, §§31-32]. Consequently, one may derive very good measures of rational approximation to these numbers (see for example the fully explicit results of Bundschuh [6, Satz 2], in the case where $m$ is even). This is the aspect that interests us here. We propose the following heuristic explanation: the ratios $2/m$ with $m\in\mathbb{Z}\setminus\{0\}$ are the only non-zero rational numbers $z$ for which the usual power series

$$e^{z}=\sum_{k=0}^{\infty}\frac{z^{k}}{k!}\tag{1.1}$$

converges only as a real number. Indeed, let $p$ be a prime number and let $\mathbb{C}_{p}$ denote the completion of the algebraic closure $\bar{\mathbb{Q}}$ of $\mathbb{Q}$ for the $p$ -adic absolute value of $\mathbb{Q}$ extended to $\bar{\mathbb{Q}}$ , with $|p|_{p}=p^{-1}$ . We know that, for $z\in\mathbb{C}_{p}$ , the series (1.1) converges in $\mathbb{C}_{p}$ if and only if $|z|_{p}<p^{-1/(p-1)}$ . In particular, for a rational number $z$ , viewed as an element of $\mathbb{C}_{p}$ , this series converges if and only if the numerator of $z$ is divisible by $p$ when $p\neq 2$ , and by $4$ when $p=2$ .
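The convergence criterion for rational $z$ can be checked mechanically. The sketch below is only an illustration under the stated criterion $|z|_{p}<p^{-1/(p-1)}$, rewritten as $v_{p}(z)>1/(p-1)$ for the $p$-adic valuation $v_{p}$; the helper names are hypothetical.

```python
from fractions import Fraction


def p_adic_valuation(z, p):
    """p-adic valuation of a non-zero rational number z."""
    z = Fraction(z)
    if z == 0:
        raise ValueError("the valuation of 0 is infinite")
    valuation = 0
    num, den = z.numerator, z.denominator
    while num % p == 0:
        num //= p
        valuation += 1
    while den % p == 0:
        den //= p
        valuation -= 1
    return valuation


def exp_series_converges(z, p):
    """True when the exponential series converges p-adically at the rational z."""
    if Fraction(z) == 0:
        return True
    # |z|_p < p^(-1/(p-1)) is equivalent to v_p(z) * (p - 1) > 1.
    return p_adic_valuation(z, p) * (p - 1) > 1


if __name__ == "__main__":
    print(exp_series_converges(3, 3))               # the series for e^3 in Q_3
    print(exp_series_converges(Fraction(2, 5), 2))  # a ratio of the form 2/m
```

For odd $p$ the test reduces to the numerator being divisible by $p$, and for $p=2$ to divisibility by $4$, so no ratio $2/m$ passes it for any prime.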

This phenomenon also extends to algebraic numbers. Indeed, let $K$ be a number field, namely an algebraic extension of $\mathbb{Q}$ of finite degree. Then any absolute value on $K$ induces the same topology on $K$ as an absolute value coming from an embedding of $K$ into $\mathbb{C}$ or into $\mathbb{C}_{p}$ for a prime number $p$ . We say that such embeddings define the same place $v$ of $K$ if they induce the same absolute value on $K$ , denoted $|\ |_{v}$ . We then denote by $K_{v}$ the completion of $K$ for this absolute value. When the place $v$ comes from an embedding of $K$ into $\mathbb{C}$ , the place $v$ is called Archimedean and we write $v\mid\infty$ . Otherwise it is called ultrametric, and we write $v\mid p$ if it comes from an embedding of $K$ into $\mathbb{C}_{p}$ . When $\alpha\in K$ , the series for $e^{\alpha}$ converges in each Archimedean completion of $K$ but only in a finite number of ultrametric completions. In particular, when $K$ admits a single Archimedean place, which happens when $K=\mathbb{Q}$ or when $K$ is quadratic imaginary, then it may occur that $e^{\alpha}$ has a meaning only for this place. Then, we obtain the following estimate where ${\mathcal{O}}_{K}$ denotes the ring of integers of $K$ .

Proposition 1.1.

Let $K\subset\mathbb{C}$ be the field $\mathbb{Q}$ or a quadratic imaginary extension of $\mathbb{Q}$ , and let $\alpha$ be a non-zero element of $K$ such that $|\alpha|_{v}\geq p^{-1/(p-1)}$ for each prime number $p$ and each place $v$ of $K$ with $v\mid p$ . Then, for any $x,y\in{\mathcal{O}}_{K}$ with $x\neq 0$ , we have

$$|x|\,|xe^{\alpha}-y|\geq c\,(\log|x|)^{-2g-1}$$

where $g$ stands for the number of places $v$ of $K$ with $v\mid\infty$ or $|\alpha|_{v}\neq 1$ , and where $c>0$ is a constant depending only on $\alpha$ and $K$ .

For example if $K=\mathbb{Q}(\sqrt{-2})$ , we may take $\alpha=2(1\pm\sqrt{-2})/m$ where $m\in{\mathcal{O}}_{K}\setminus\{0\}$ . If $K=\mathbb{Q}(\sqrt{-23})$ , we may take $\alpha=(1\pm\sqrt{-23})/(2m)$ where $m\in{\mathcal{O}}_{K}\setminus\{0\}$ . In some cases, $e^{\alpha}$ admits a generalized continued fraction expansion similar to the one of $e$ (with partial quotients in ${\mathcal{O}}_{K}$ ) but we do not consider this question here.

More generally, let $\alpha_{1},\dots,\alpha_{s}$ be distinct elements of a number field $K\subset\mathbb{C}$ . The Lindemann-Weierstrass theorem [18] tells us that their exponentials $e^{\alpha_{1}},\dots,e^{\alpha_{s}}\in\mathbb{C}$ are linearly independent over $K$ and the classical proof, in all variants (see [11, Appendix]), is based on Hermite’s approximations which we recall in the next section. Our goal is to show that these approximations are nearly optimal in the context of geometry of numbers in the adeles of $K$ , when taking into account all places $v$ of $K$ and all pairs of indices $i,j$ with $1\leq i<j\leq s$ for which the series for $e^{\alpha_{i}-\alpha_{j}}$ converges in $K_{v}$ . It is possible that this observation reflects a much wider property of the values of the exponential function.

For example the series for $e^{3}$ converges in $\mathbb{R}$ and in $\mathbb{Q}_{3}$ but not in any $\mathbb{Q}_{p}$ for a prime number $p\neq 3$ . Then our approach leads to the following result.

Proposition 1.2.

For any integer $n\geq 1$ , we define a convex body ${\mathcal{C}}_{n}$ of $\mathbb{R}^{2}$ and a lattice $\Lambda_{n}$ of $\mathbb{R}^{2}$ by

[TABLE]

For $i=1,2$ , let $\lambda_{i}({\mathcal{C}}_{n},\Lambda_{n})$ denote the $i$ -th minimum of ${\mathcal{C}}_{n}$ with respect to $\Lambda_{n}$ , that is the smallest $\lambda>0$ such that $\lambda{\mathcal{C}}_{n}$ contains at least $i$ $\mathbb{Q}$ -linearly independent elements of $\Lambda_{n}$ . Then we have

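$$(cn^{2})^{-1}\leq\lambda_{1}({\mathcal{C}}_{n},\Lambda_{n})\leq\lambda_{2}({\mathcal{C}}_{n},\Lambda_{n})\leq cn^{2},$$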

for a constant $c>1$ that does not depend on $n$ .

Using the fact that $3^{n}\mathbb{Z}^{2}\subset\Lambda_{n}$ , one deduces that $\lambda_{1}({\mathcal{C}}_{n},\mathbb{Z}^{2})\geq(cn^{2}3^{n})^{-1}$ for any integer $n\geq 1$ . Consequently, for each $\epsilon>0$ , there exists a constant $c_{\epsilon}>0$ such that

$$|x|\,|xe^{3}-y|\geq c_{\epsilon}|x|^{-\epsilon}$$

for all $(x,y)\in\mathbb{Z}^{2}$ with $x\neq 0$ . One may even derive slightly sharper estimates (see [6, Satz 1]). However, numerical computations described in section 12 yield

$$|x|\,|xe^{3}-y|\geq(3\log|x|\,\log\log|x|)^{-1}\quad\text{if}\quad 4\leq|x|\leq 10^{500000}.\tag{1.2}$$
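The inequality above can be spot-checked on a very small range by brute force. The sketch below is not the method of section 12 (which relies on the continued fraction expansion of $e^{3}$); it simply tests every $x$ up to a hypothetical bound of $2000$ against the nearest integer $y$, using Python's `decimal` module for a high-precision value of $e^{3}$.

```python
from decimal import Decimal, getcontext
import math

getcontext().prec = 60        # working precision, far more than this range needs
E_CUBED = Decimal(3).exp()    # high-precision value of e^3


def approximation_quality(x):
    """|x| * |x e^3 - y| where y is the integer nearest to x e^3."""
    product = E_CUBED * x
    nearest = product.to_integral_value()
    return float(abs(x) * abs(product - nearest))


def lower_bound(x):
    """Right-hand side (3 log|x| log log|x|)^(-1), meaningful for |x| >= 4."""
    return 1.0 / (3 * math.log(x) * math.log(math.log(x)))


if __name__ == "__main__":
    # ASSUMPTION: the bound 2000 is an arbitrary small cut-off for illustration.
    worst = min(range(4, 2001),
                key=lambda x: approximation_quality(x) / lower_bound(x))
    print(worst, approximation_quality(worst), lower_bound(worst))
```

Taking $y$ to be the nearest integer suffices, since any other choice of $y$ only increases the left-hand side.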

More involved computations which we do not describe here even suggest the existence of a real number $g>0$ such that

$$|x_{1}|\,|x_{1}e^{3}-x_{2}|\,|x_{1}e^{3}-x_{3}|_{3}\geq(\log|x_{1}|)^{-g}$$

for any $(x_{1},x_{2},x_{3})\in\mathbb{Z}^{3}$ with $|x_{1}|$ large enough. Finally, an important result of Baker [2] shows that if $\alpha_{2},\dots,\alpha_{s}\in\mathbb{Q}$ are distinct non-zero rational numbers then, for each $\epsilon>0$ , there also exists a constant $c_{\epsilon}>0$ such that

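$$|x_{1}|\,|x_{1}e^{\alpha_{2}}-x_{2}|\cdots|x_{1}e^{\alpha_{s}}-x_{s}|\geq c_{\epsilon}|x_{1}|^{-\epsilon}$$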

for each $(x_{1},\dots,x_{s})\in\mathbb{Z}^{s}$ with $|x_{1}|\neq 0$ . The properties of Hermite’s approximations suggest that the right hand side $c_{\epsilon}|x_{1}|^{-\epsilon}$ in this inequality could be replaced by $(\log|x_{1}|)^{-g}$ for a constant $g>0$ depending only on $(\alpha_{2},\dots,\alpha_{s})$ , when $|x_{1}|$ is large enough.

In this paper, $\mathbb{N}$ stands for the set of non-negative integers and $\mathbb{N}_{+}=\mathbb{N}\setminus\{0\}$ for the set of positive integers.

Acknowledgments: I warmly thank Michel Waldschmidt for numerous exchanges on these questions. In particular, his course notes [17] were a source of inspiration.

2. Statement of the main result

Let $K$ be a number field, let ${\mathcal{O}}_{K}$ be its ring of integers, let $d=[K:\mathbb{Q}]$ be its degree over $\mathbb{Q}$ , and let $s\in\mathbb{N}_{+}$ . For any ultrametric place $v$ of $K$ , we denote by ${\mathcal{O}}_{v}=\{x\in K_{v}\,;\,|x|_{v}\leq 1\}$ the ring of integers of $K_{v}$ and by $d_{v}=[K_{v}:\mathbb{Q}_{p}]$ the local degree of $K_{v}$ , where $p$ stands for the prime number below $v$ (notation $v\mid p$ ), namely the prime number $p$ for which $|\ |_{v}$ extends the $p$ -adic absolute value on $\mathbb{Q}$ . Following McFeat [12, §2.2], we denote by $\mu_{v}$ the Haar measure on $K_{v}$ normalized so that $\mu_{v}({\mathcal{O}}_{v})=1$ . For an Archimedean place (notation $v\mid\infty$ ), we again denote by $d_{v}=[K_{v}:\mathbb{R}]$ the local degree of $K_{v}$ , and define $\mu_{v}$ as the Lebesgue measure on $K_{v}$ (this field is $\mathbb{R}$ or $\mathbb{C}$ ). We denote by $r_{1}$ (resp. $r_{2}$ ) the number of places $v\mid\infty$ with $d_{v}=1$ (resp. $d_{v}=2$ ), so that $d=r_{1}+2r_{2}$ .

The ring of adeles of $K$ is the product $K_{\mathbb{A}}=\prod_{v}K_{v}$ running over all places $v$ of $K$ , with the restricted topology. This is a locally compact ring that we equip with the Haar measure $\mu$ , product of the $\mu_{v}$ . We identify $K$ as a subfield of $K_{\mathbb{A}}$ via the diagonal embedding. Then $K$ becomes a discrete subgroup of $K_{\mathbb{A}}$ and, with the above normalization, we have

$$\mu(K_{\mathbb{A}}/K)=2^{-r_{2}}|D(K)|^{1/2},$$

where $D(K)$ stands for the discriminant of $K$ . By abuse of notation, we also write $\mu$ for the product measure of $s$ copies of $\mu$ on $K_{\mathbb{A}}^{s}$ . Similarly, for each place $v$ of $K$ , we also write $\mu_{v}$ for the product measure of $s$ copies of $\mu_{v}$ on $K_{v}^{s}$ . With our normalization of the absolute value on $K_{v}$ , if $T\colon K_{v}^{s}\to K_{v}^{s}$ is a $K_{v}$ -linear map and if $E$ is a measurable subset of $K_{v}^{s}$ , the set $T(E)$ is measurable with measure $\mu_{v}(T(E))=|\det T|_{v}^{d_{v}}\mu_{v}(E)$ .

2.1. Minima of adelic convex bodies

An adelic convex body of $K^{s}$ is a product

$${\mathcal{C}}=\prod_{v}{\mathcal{C}}_{v}\subset K_{\mathbb{A}}^{s},$$

indexed by all places $v$ of $K$ , which satisfies the following properties:

(i)

if $v\mid\infty$ , then ${\mathcal{C}}_{v}$ is a convex body of $K_{v}^{s}$ , namely a compact connected neighborhood of $0$ in $K_{v}^{s}$ such that $\alpha\,{\mathcal{C}}_{v}={\mathcal{C}}_{v}$ for any $\alpha\in K_{v}$ with $|\alpha|_{v}=1$ ;

(ii)

if $v\nmid\infty$ , then ${\mathcal{C}}_{v}$ is a finite type (thus free) sub- ${\mathcal{O}}_{v}$ -module of $K_{v}^{s}$ of rank $s$ ;

(iii)

${\mathcal{C}}_{v}={\mathcal{O}}_{v}^{s}$ for all but finitely many places $v$ of $K$ with $v\nmid\infty$ .

Suppose that ${\mathcal{C}}$ is such a product. For each $i=1,\dots,s$ , we define its $i$ -th minimum $\lambda_{i}({\mathcal{C}})$ as the smallest $\lambda>0$ for which the adelic convex body

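$$\lambda{\mathcal{C}}=\prod_{v\mid\infty}\lambda{\mathcal{C}}_{v}\times\prod_{v\nmid\infty}{\mathcal{C}}_{v}$$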

contains at least $i$ linearly independent elements of $K^{s}$ over $K$ . With this notation and our normalization of measures, the adelic version of Minkowski’s theorem reads as follows.

Theorem 2.1 (McFeat, Bombieri and Vaaler).

For any adelic convex body ${\mathcal{C}}$ of $K^{s}$ , we have

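$$2^{sr_{1}}(s!)^{-d}\leq\big(\lambda_{1}({\mathcal{C}})\cdots\lambda_{s}({\mathcal{C}})\big)^{d}\mu({\mathcal{C}})\leq 2^{s(r_{1}+r_{2})}|D(K)|^{s/2}.$$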

We refer the reader to [12, Theorem 5] and [4, Theorem 3] for the upper bound on the product of the minima (see also the upper bound of Thunder in [16, Theorem 1 and Corollary]). The lower bound given here is taken from [12, Theorem 6]; it is slightly weaker than the one of [4, Theorem 6].

2.2. Hermite’s approximations

Let $\alpha_{1},\dots,\alpha_{s}$ be distinct elements of $K$ . For each $s$ -tuple $\mathbf{n}:=(n_{1},\dots,n_{s})\in\mathbb{N}^{s}$ , we define polynomials of $K[z]$ by

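$$f_{\mathbf{n}}(z)=(z-\alpha_{1})^{n_{1}}\cdots(z-\alpha_{s})^{n_{s}}\quad\text{and}\quad P_{\mathbf{n}}(z)=\sum_{k=0}^{N}f_{\mathbf{n}}^{(k)}(z)$$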

where

$$N=n_{1}+\cdots+n_{s}$$

represents the degree of $f_{\mathbf{n}}$ , and where $f_{\mathbf{n}}^{(k)}$ denotes the $k$ -th derivative of $f_{\mathbf{n}}$ for each integer $k\geq 0$ . We then form the point

$$a_{\mathbf{n}}:=\big(P_{\mathbf{n}}(\alpha_{1}),\dots,P_{\mathbf{n}}(\alpha_{s})\big)\in K^{s}.$$

We call it the Hermite approximation of order $\mathbf{n}$ for the $s$ -tuple $(\alpha_{1},\dots,\alpha_{s})$ . Our goal is to give a precise meaning to the term “approximation”, by working in the adeles of $K$ .
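To make the definition concrete, here is a minimal Python sketch that computes the point $a_{\mathbf{n}}$ for rational $\alpha_{1},\dots,\alpha_{s}$ (so $K=\mathbb{Q}$, an assumption made only to stay within exact `Fraction` arithmetic). Polynomials are stored as coefficient lists, lowest degree first; the function name `hermite_point` is illustrative.

```python
from fractions import Fraction


def hermite_point(alphas, n):
    """Return (P_n(alpha_1), ..., P_n(alpha_s)) where P_n is the sum of all
    derivatives of f_n(z) = prod_i (z - alpha_i)^(n_i)."""
    f = [Fraction(1)]
    for alpha, multiplicity in zip(alphas, n):
        for _ in range(multiplicity):
            # Multiply f by (z - alpha).
            f = [(f[k - 1] if k > 0 else 0) - alpha * (f[k] if k < len(f) else 0)
                 for k in range(len(f) + 1)]
    point = [Fraction(0)] * len(alphas)
    while f:
        # Add the current derivative evaluated at each alpha_i.
        for i, alpha in enumerate(alphas):
            point[i] += sum(c * Fraction(alpha) ** k for k, c in enumerate(f))
        # Differentiate: drop the constant term and scale the other coefficients.
        f = [k * c for k, c in enumerate(f)][1:]
    return point


if __name__ == "__main__":
    x1, x2 = hermite_point([0, 1], [2, 2])
    print(x1, x2, float(x2 / x1))
```

For $(\alpha_{1},\alpha_{2})=(0,1)$ and $\mathbf{n}=(2,2)$ this gives the point $(14,38)$, whose ratio $38/14\approx 2.714$ is close to $e^{\alpha_{2}-\alpha_{1}}=e$.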

We first recall some properties of these points. For simplicity, we start by assuming that $K\subset\mathbb{C}$ . We find

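$$\frac{d}{dz}\big(P_{\mathbf{n}}(z)e^{-z}\big)=\big(P_{\mathbf{n}}^{\prime}(z)-P_{\mathbf{n}}(z)\big)e^{-z}=-f_{\mathbf{n}}(z)e^{-z}.\tag{2.1}$$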

So, for any pair $i,j\in\{1,\dots,s\}$ , we obtain

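$$P_{\mathbf{n}}(\alpha_{i})e^{-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})e^{-\alpha_{j}}=\int_{\alpha_{i}}^{\alpha_{j}}f_{\mathbf{n}}(z)e^{-z}\,dz,$$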

independently of the path of integration from $\alpha_{i}$ to $\alpha_{j}$ in $\mathbb{C}$ . Upon integrating along the line segment $[\alpha_{i},\alpha_{j}]$ joining those two points and observing that

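$$\max_{z\in[\alpha_{i},\alpha_{j}]}|f_{\mathbf{n}}(z)|\leq R^{N}\quad\text{with}\quad R=\max_{1\leq k,\ell\leq s}|\alpha_{k}-\alpha_{\ell}|,$$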

we deduce that

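$$\big|P_{\mathbf{n}}(\alpha_{i})e^{-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})e^{-\alpha_{j}}\big|\leq c_{1}R^{N}$$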

for a constant $c_{1}>0$ that is independent of the choice of $i$ , $j$ and $\mathbf{n}$ . Similarly, for $i=1,\dots,s$ , the formula (2.1) yields

$$P_{\mathbf{n}}(\alpha_{i})=\int_{0}^{\infty}f_{\mathbf{n}}(z+\alpha_{i})e^{-z}\,dz,$$

by integrating along $[0,\infty)\subset\mathbb{R}$ . Since $|f_{\mathbf{n}}(t+\alpha_{i})|\leq(t+R)^{N}$ for all $t\geq 0$ , we deduce that

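$$|P_{\mathbf{n}}(\alpha_{i})|\leq\int_{0}^{\infty}(t+R)^{N}e^{-t}\,dt\leq e^{R}\int_{0}^{\infty}t^{N}e^{-t}\,dt=e^{R}N!.$$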
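These two Archimedean estimates can be observed numerically in the simplest case. The sketch below assumes $K=\mathbb{Q}$, $(\alpha_{1},\alpha_{2})=(0,1)$ and $\mathbf{n}=(k,k)$, so that $R=1$ and $N=2k$; in this particular example $|f_{\mathbf{n}}|\leq 4^{-k}$ on $[0,1]$, which gives a bound even smaller than $R^{N}$. The helper names are hypothetical.

```python
from fractions import Fraction
from math import e, factorial


def hermite_point(alphas, n):
    """Values P_n(alpha_i), with P_n the sum of all derivatives of f_n."""
    f = [Fraction(1)]
    for alpha, multiplicity in zip(alphas, n):
        for _ in range(multiplicity):
            f = [(f[k - 1] if k > 0 else 0) - alpha * (f[k] if k < len(f) else 0)
                 for k in range(len(f) + 1)]
    point = [Fraction(0)] * len(alphas)
    while f:
        for i, alpha in enumerate(alphas):
            point[i] += sum(c * Fraction(alpha) ** k for k, c in enumerate(f))
        f = [k * c for k, c in enumerate(f)][1:]
    return point


def archimedean_data(k):
    """Return (|P(0) e^0 - P(1) e^-1|, max |P(alpha_i)|, e^R N!) for n = (k, k)."""
    p0, p1 = hermite_point([0, 1], [k, k])
    gap = abs(float(p0) - float(p1) / e)
    size = max(abs(float(p0)), abs(float(p1)))
    return gap, size, e * factorial(2 * k)


if __name__ == "__main__":
    for k in range(1, 5):
        print(k, archimedean_data(k))
```

The first entry of each tuple decreases geometrically in $k$ while the second stays below $e^{R}N!$, as the displayed inequalities predict.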

More generally, let $v$ be any Archimedean place of $K$ . Put

$$R_{v}=\max_{1\leq k,\ell\leq s}|\alpha_{k}-\alpha_{\ell}|_{v}\tag{2.2}$$

and choose an embedding $\sigma\colon K\to\mathbb{C}$ such that $|\alpha|_{v}=|\sigma(\alpha)|$ for all $\alpha\in K$ . Then, for any pair of indices $i,j\in\{1,\dots,s\}$ , the above computations yield

[TABLE]

where $f^{\sigma}_{\mathbf{n}}$ denotes the image of $f_{\mathbf{n}}$ under the ring homomorphism from $K[z]$ to $\mathbb{C}[z]$ which fixes $z$ and coincides with $\sigma$ on $K$ , and where $c_{v}>0$ depends only on $v$ and $\alpha_{1},\dots,\alpha_{s}$ . Thus, $a_{\mathbf{n}}$ is a projective approximation to $(e^{\alpha_{1}},\dots,e^{\alpha_{s}})$ at each Archimedean place of $K$ .

In this paper, we establish an upper bound for the integral in (2.3) which is sharper than $c_{v}R_{v}^{N}$ for each Archimedean place $v$ of $K$ . We also provide analogs of (2.3) and of (2.4) for the ultrametric places $v$ of $K$ whenever their left hand side makes sense in $K_{v}$ . More precisely, as $e^{\alpha_{j}-\alpha_{i}}$ could make sense in $K_{v}$ without $e^{\alpha_{i}}$ and $e^{\alpha_{j}}$ making sense, we consider instead the quantities $|P_{\mathbf{n}}(\alpha_{i})e^{\alpha_{j}-\alpha_{i}}-P_{\mathbf{n}}(\alpha_{j})|_{v}$ . Here again, we will need sharp estimates while usually the ultrametric places are treated in an expeditious manner. In general, one chooses a common denominator $b$ of $\alpha_{1},\dots,\alpha_{s}$ , that is an integer $b\geq 1$ such that $b\alpha_{1},\dots,b\alpha_{s}\in{\mathcal{O}}_{K}$ . Then the polynomial $g(z):=b^{N}f(z/b)$ has coefficients in ${\mathcal{O}}_{K}$ and, for each $i=1,\dots,s$ , we find

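$$\frac{b^{N}}{n_{i}!}P_{\mathbf{n}}(\alpha_{i})=\sum_{k=n_{i}}^{N}\frac{b^{N}}{n_{i}!}f^{(k)}(\alpha_{i})=\sum_{k=n_{i}}^{N}\frac{b^{k}k!}{n_{i}!}\cdot\frac{g^{(k)}(b\alpha_{i})}{k!}\in{\mathcal{O}}_{K}.$$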

For example, if $n_{1}=\cdots=n_{s}=n$ , this implies that $(b^{N}/n!)a_{\mathbf{n}}\in{\mathcal{O}}_{K}^{s}$ .
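The integrality statement is easy to test for $K=\mathbb{Q}$, where ${\mathcal{O}}_{K}=\mathbb{Z}$. The following sketch, with illustrative names and sample values that are not taken from the paper, scales the diagonal Hermite point by $b^{N}/n!$ and inspects the denominators.

```python
from fractions import Fraction
from math import factorial


def hermite_point(alphas, n):
    """Values P_n(alpha_i), with P_n the sum of all derivatives of f_n."""
    f = [Fraction(1)]
    for alpha, multiplicity in zip(alphas, n):
        for _ in range(multiplicity):
            f = [(f[k - 1] if k > 0 else 0) - alpha * (f[k] if k < len(f) else 0)
                 for k in range(len(f) + 1)]
    point = [Fraction(0)] * len(alphas)
    while f:
        for i, alpha in enumerate(alphas):
            point[i] += sum(c * Fraction(alpha) ** k for k, c in enumerate(f))
        f = [k * c for k, c in enumerate(f)][1:]
    return point


def scaled_diagonal_point(alphas, b, n):
    """(b^N / n!) * a_n for the diagonal tuple (n, ..., n), where N = s * n and
    b is a common denominator of the alphas."""
    N = n * len(alphas)
    scale = Fraction(b ** N, factorial(n))
    return [scale * x for x in hermite_point(alphas, [n] * len(alphas))]


if __name__ == "__main__":
    # Sample data: alphas = (0, 1/2) with common denominator b = 2.
    print(scaled_diagonal_point([Fraction(0), Fraction(1, 2)], 2, 2))
```

For $(\alpha_{1},\alpha_{2})=(0,1/2)$, $b=2$ and $n=2$, the scaled point is $(148,244)$, an integer point as expected.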

The above estimates are key ingredients in the classical proof of the Lindemann-Weierstrass theorem asserting that $e^{\alpha_{1}},\dots,e^{\alpha_{s}}$ are linearly independent over $K$ . However, two more ingredients are missing. The first one is a reduction step of Weierstrass which is explained in [11, Appendix, §3] (see also [3, Chapter 1, §3]). The second one is the existence of families of $s$ linearly independent approximations over $K$ . Hermite himself noticed this problem and solved it in order to prove the transcendence of $e$ . We will use here the following remarkable result of Mahler.

Theorem 2.2 (Mahler).

Suppose that $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ has positive coordinates. Let $\mathbf{e}_{1}=(1,0,\dots,0),\dots,\mathbf{e}_{s}=(0,\dots,0,1)$ denote the canonical basis elements of $\mathbb{Z}^{s}$ . Then, we have

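$$\Delta_{\mathbf{n}}:=\det(a_{\mathbf{n}-\mathbf{e}_{1}},\dots,a_{\mathbf{n}-\mathbf{e}_{s}})=\prod_{i=1}^{s}\Big((n_{i}-1)!\prod_{k\neq i}(\alpha_{i}-\alpha_{k})^{n_{k}}\Big)\neq 0.\tag{2.5}$$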

The proof of Mahler is clever. It is presented in [10, §8] and again in [11, Appendix, §16]. In the case where $n_{1}=\dots=n_{s}$ , the result is due to Hermite [9]. Hermite’s proof is different. It is based on the recurrence relations satisfied by the points $\mathbf{a}_{\mathbf{n}}$ which we generalize in Appendix A.
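Mahler's determinant formula can be verified on small examples. The sketch below assumes rational $\alpha_{i}$ and uses a naive cofactor expansion for the determinant, which is adequate only for small $s$; all function names are illustrative.

```python
from fractions import Fraction
from math import factorial


def hermite_point(alphas, n):
    """Values P_n(alpha_i), with P_n the sum of all derivatives of f_n."""
    f = [Fraction(1)]
    for alpha, multiplicity in zip(alphas, n):
        for _ in range(multiplicity):
            f = [(f[k - 1] if k > 0 else 0) - alpha * (f[k] if k < len(f) else 0)
                 for k in range(len(f) + 1)]
    point = [Fraction(0)] * len(alphas)
    while f:
        for i, alpha in enumerate(alphas):
            point[i] += sum(c * Fraction(alpha) ** k for k, c in enumerate(f))
        f = [k * c for k, c in enumerate(f)][1:]
    return point


def det(matrix):
    """Determinant by cofactor expansion along the first row."""
    if len(matrix) == 1:
        return matrix[0][0]
    total = 0
    for j, entry in enumerate(matrix[0]):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def hermite_determinant(alphas, n):
    """det(a_{n - e_1}, ..., a_{n - e_s}) for a tuple n with positive entries."""
    rows = []
    for i in range(len(n)):
        shifted = list(n)
        shifted[i] -= 1
        rows.append(hermite_point(alphas, shifted))
    return det(rows)


def mahler_product(alphas, n):
    """prod_i (n_i - 1)! * prod_{k != i} (alpha_i - alpha_k)^(n_k)."""
    value = Fraction(1)
    for i, a_i in enumerate(alphas):
        value *= factorial(n[i] - 1)
        for k, a_k in enumerate(alphas):
            if k != i:
                value *= Fraction(a_i - a_k) ** n[k]
    return value


if __name__ == "__main__":
    print(hermite_determinant([0, 1, 2], [1, 1, 1]), mahler_product([0, 1, 2], [1, 1, 1]))
```

For $(\alpha_{1},\alpha_{2},\alpha_{3})=(0,1,2)$ and $\mathbf{n}=(1,1,1)$, both sides equal $-4$.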

2.3. Statement of the main result

With the above notation, let $E$ be the finite set consisting of all Archimedean places of $K$ together with the ultrametric places $v$ of $K$ such that $|\alpha_{i}-\alpha_{j}|_{v}\neq 1$ for at least one pair of indices $i,j\in\{1,\dots,s\}$ with $i\neq j$ . For each $s$ -tuple $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ , we let $N$ denote its sum and we define an adelic convex body ${\mathcal{C}}_{\mathbf{n}}=\prod_{v}{\mathcal{C}}_{\mathbf{n},v}$ of $K^{s}$ as follows.

(i)

If $v\,|\,\infty$ is the place attached to an embedding $\sigma\colon K\hookrightarrow\mathbb{C}$ , we define $R_{v}$ by (2.2). Then ${\mathcal{C}}_{\mathbf{n},v}$ is the set of points $(x_{1},\dots,x_{s})\in K_{v}^{s}$ which satisfy

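$$|x_{i}|_{v}\leq e^{R_{v}}(N-1)!\quad\text{and}\quad|x_{i}e^{\alpha_{j}-\alpha_{i}}-x_{j}|_{v}\leq\max_{1\leq k\leq s}\Big|\int_{\sigma(\alpha_{i})}^{\sigma(\alpha_{j})}f_{\mathbf{n}-\mathbf{e}_{k}}^{\sigma}(z)e^{\sigma(\alpha_{j})-z}\,dz\Big|\tag{2.6}$$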

for each pair of indices $i,j\in\{1,\dots,s\}$ with $i\neq j$ .

(ii)

If $v\in E$ and if $v\,|\,p$ for a prime number $p$ , then ${\mathcal{C}}_{\mathbf{n},v}$ is the set of points $(x_{1},\dots,x_{s})$ in $K_{v}^{s}$ which satisfy

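$$|x_{i}|_{v}\leq p^{3}N\prod_{1\leq k\leq s}\max\big\{|\alpha_{i}-\alpha_{k}|_{v},\,p^{-1/(p-1)}\big\}^{n_{k}}\tag{2.7}$$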

for $i=1,\dots,s$ , as well as

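$$|x_{i}e^{\alpha_{j}-\alpha_{i}}-x_{j}|_{v}\leq p^{3}N\prod_{1\leq k\leq s}\max\big\{|\alpha_{i}-\alpha_{k}|_{v},|\alpha_{j}-\alpha_{k}|_{v}\big\}^{n_{k}}\tag{2.8}$$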

for each pair of integers $i,j\in\{1,\dots,s\}$ such that $0<|\alpha_{j}-\alpha_{i}|_{v}<p^{-1/(p-1)}$ .

(iii)

Finally, if $v\notin E$ , then ${\mathcal{C}}_{\mathbf{n},v}$ is the set of points $(x_{1},\dots,x_{s})\in K_{v}^{s}$ satisfying

[TABLE]

for $i=1,\dots,s$ .

The crucial feature of these adelic convex bodies ${\mathcal{C}}_{\mathbf{n}}$ is that the linear forms which define them involve only the complex or $p$ -adic values of the exponential function at the points $\alpha_{i}$ or $\alpha_{j}-\alpha_{i}$ . In view of the estimates in §2.2, their component ${\mathcal{C}}_{\mathbf{n},v}$ contains the points $a_{\mathbf{n}-\mathbf{e}_{1}},\dots,a_{\mathbf{n}-\mathbf{e}_{s}}$ for each Archimedean place $v$ of $K$ . We will show in the next section that this holds in fact for all places of $K$ , yielding the first assertion in the following result.

Theorem 2.3.

Let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ . Then the adelic convex body ${\mathcal{C}}_{\mathbf{n}}$ contains the points $a_{\mathbf{n}-\mathbf{e}_{1}},\dots,a_{\mathbf{n}-\mathbf{e}_{s}}$ . Moreover, upon setting $N=n_{1}+\cdots+n_{s}$ , we have the following volume estimates.

(i)

If $v\mid\infty$ , then

[TABLE]

for a constant $c_{v}>0$ depending only on $\alpha_{1},\dots,\alpha_{s}$ and $v$ .

(ii)

If $v\in E$ and if $v\mid p$ for a prime number $p$ , then

[TABLE]

(iii)

If $v\notin E$ , then $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})^{1/d_{v}}=|\Delta_{\mathbf{n}}|_{v}$ .

Note that, for each place $v$ of $K$ , these estimates enclose the volume of ${\mathcal{C}}_{\mathbf{n},v}$ between limits whose ratio is a polynomial in $N$ while these limits themselves grow like $|\Delta_{\mathbf{n}}|_{v}$ , that is roughly like an exponential in $N$ if $v\nmid\infty$ or like $N!$ if $v\mid\infty$ . When $v\mid\infty$ , we give an explicit value for the constant $c_{v}$ in Theorem 8.1.

The lower bounds for $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})$ follow easily from the definition of $\Delta_{\mathbf{n}}$ as a determinant in (2.5), if we take for granted the fact that ${\mathcal{C}}_{\mathbf{n},v}$ contains the points $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{i}}$ for $i=1,\dots,s$ . Indeed, let $T\colon K_{v}^{s}\to K_{v}^{s}$ be the $K_{v}$ -linear map defined by

[TABLE]

for each $(x_{1},\dots,x_{s})\in K_{v}^{s}$ . Then ${\mathcal{C}}_{\mathbf{n},v}$ contains $T({\mathcal{E}}_{v})$ where ${\mathcal{E}}_{v}$ is given by

[TABLE]

As $|\det T|_{v}=|\Delta_{\mathbf{n}}|_{v}$ , we have $\mu_{v}(T({\mathcal{E}}_{v}))=|\Delta_{\mathbf{n}}|_{v}^{d_{v}}\mu_{v}({\mathcal{E}}_{v})$ . If $v\mid\infty$ , we also have $\mu_{v}({\mathcal{E}}_{v})\geq(s!)^{-d_{v}}$ , thus $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})^{1/d_{v}}\geq(s!)^{-1}|\Delta_{\mathbf{n}}|_{v}$ . If $v\nmid\infty$ , we simply have $\mu_{v}({\mathcal{E}}_{v})=1$ , thus $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})^{1/d_{v}}\geq|\Delta_{\mathbf{n}}|_{v}$ .

Our main contribution therefore lies in the upper bounds for the volume of the components ${\mathcal{C}}_{\mathbf{n},v}$ , and we explain our strategy below. These upper bounds in turn yield an upper bound for the volume of ${\mathcal{C}}_{\mathbf{n}}$ from which we derive the following conclusion thanks to the adelic Minkowski theorem.

Corollary 2.4.

In the notation of Theorem 2.3, we have

[TABLE]

and where $c>0$ is a constant depending only on $\alpha_{1},\dots,\alpha_{s}$ .

Proof.

Since $\prod_{v}|\Delta_{\mathbf{n}}|_{v}^{d_{v}}=1$ and since $E$ contains all Archimedean places of $K$ , we find

[TABLE]

where $c_{1}>0$ is independent of $\mathbf{n}$ . Since ${\mathcal{C}}_{\mathbf{n}}$ contains the points $a_{\mathbf{n}-\mathbf{e}_{1}},\dots,a_{\mathbf{n}-\mathbf{e}_{s}}$ of $K^{s}$ and since, by Theorem 2.2, these points are linearly independent over $K$ , we also have

[TABLE]

Thus, by Theorem 2.1, we obtain

[TABLE]

so $\lambda_{1}({\mathcal{C}}_{\mathbf{n}})\geq cN^{-g}$ with $c=1/(c_{1}s!)$ . ∎
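The first step of this proof rests on the product formula $\prod_{v}|x|_{v}^{d_{v}}=1$ for non-zero $x\in K$. As a small illustration in the case $K=\mathbb{Q}$, where every local degree $d_{v}$ equals $1$, the following sketch multiplies the usual absolute value of a rational number by its $p$-adic absolute values at the primes dividing its numerator or denominator. The function names are hypothetical.

```python
from fractions import Fraction


def prime_factors(n):
    """Set of primes dividing the non-zero integer n (trial division)."""
    n = abs(n)
    factors = set()
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors.add(p)
            n //= p
        p += 1
    if n > 1:
        factors.add(n)
    return factors


def p_adic_abs(x, p):
    """|x|_p = p^(-v_p(x)) for a non-zero rational x, as an exact fraction."""
    valuation = 0
    num, den = x.numerator, x.denominator
    while num % p == 0:
        num //= p
        valuation += 1
    while den % p == 0:
        den //= p
        valuation -= 1
    return Fraction(p) ** (-valuation)


def product_of_absolute_values(x):
    """Product of |x|_v over all places v of Q, for a non-zero rational x."""
    x = Fraction(x)
    result = abs(x)
    # Primes not dividing the numerator or denominator contribute a factor 1.
    for p in prime_factors(x.numerator) | prime_factors(x.denominator):
        result *= p_adic_abs(x, p)
    return result


if __name__ == "__main__":
    print(product_of_absolute_values(Fraction(-360, 77)))
```

The printed value is $1$ for every non-zero rational input.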

The proof of Theorem 2.3 uses general results on univariate polynomials $f(z)\in\mathbb{C}[z]$ which we could not find in the literature. Suppose that $f$ has degree $N\geq 1$ . Let $A$ be its set of roots in $\mathbb{C}$ and let $B$ be the set of roots of its derivative $f^{\prime}$ which do not belong to $A$ . In section 5, we consider the paths of steepest descent for $|f|$ starting from an arbitrary point $\beta$ of $\mathbb{C}$ . These paths necessarily end in an element of $A$ . We show that they are contained in the convex hull of $A\cup\{\beta\}$ , with length at most $\pi RN$ where $R$ is the radius of any disk containing $A\cup\{\beta\}$ . In section 6, for each $\beta\in B$ , we denote by $m(\beta)$ the multiplicity of $\beta$ as a root of $f^{\prime}$ and, starting from $\beta$ , we choose $m(\beta)+1$ paths of steepest descent for $|f|$ which are locally distinct in a neighborhood of $\beta$ . These paths draw a graph on $A\cup B$ and we show that this graph is in fact a tree. We extract from it a sub-graph $G$ on $A$ which is also a tree with edges indexed by $B$ . Then, for each edge of $G$ with end points $\alpha,\alpha^{\prime}\in A$ , indexed by $\beta\in B$ , we obtain a path joining $\alpha$ to $\alpha^{\prime}$ passing through $\beta$ , with length at most $2\pi RN$ , along which $|f|$ is maximal at the point $\beta$ .

For the proof of Theorem 2.3 (i), we may assume that the given place $v\mid\infty$ comes from an inclusion $K\subset\mathbb{C}$ . We then apply the above construction, choosing $f$ to be the gcd of the polynomials $f_{\mathbf{n}-\mathbf{e}_{1}},\dots,f_{\mathbf{n}-\mathbf{e}_{s}}$ . If the coordinates of $\mathbf{n}\in\mathbb{N}_{+}^{s}$ are all $\geq 2$ , we thus obtain a tree $G$ on $A=\{\alpha_{1},\dots,\alpha_{s}\}$ . Then, for each edge of $G$ with end points $\alpha_{i},\alpha_{j}$ , we bound from above the integrals in (2.6) as a function of $|f(\beta)|$ where $\beta\notin A$ is the corresponding root of $f^{\prime}$ . From this, we deduce in section 8 an upper bound for the volume of the convex body ${\mathcal{C}}_{\mathbf{n},v}$ in terms of the product of the values $|f(\beta)|^{m(\beta)}$ with $\beta\in B$ , this being the Chudnovsky semi-resultant of $f$ and $f^{\prime}$ . The upper bound for $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})$ then follows thanks to the computation of this semi-resultant in section 7. The general case where at least one coordinate of $\mathbf{n}$ is equal to $1$ requires a slight adjustment.

The treatment of the ultrametric places $v\nmid\infty$ is simpler. In section 3, we show that ${\mathcal{C}}_{\mathbf{n},v}$ contains the points $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{1}},\dots,\mathbf{a}_{\mathbf{n}-\mathbf{e}_{s}}$ . Afterwards, in section 9, we construct a rooted forest on $\{\alpha_{1},\dots,\alpha_{s}\}$ associated with the place $v$ . This allows us to select $s$ inequalities among (2.7) and (2.8) and to deduce from them the required upper bound on the volume of ${\mathcal{C}}_{\mathbf{n},v}$ in section 10. The relevant notions from graph theory are recalled in section 4.

In section 11, we restrict to “diagonal” approximations to two exponentials, namely to the case $s=2$ and $n_{1}=n_{2}$ . In this situation, we provide a refined form of our main result whose proof relies only on the estimates from sections 2.2 and 3. We then use it to prove Propositions 1.1 and 1.2 from the introduction.

We conclude in section 12 by explaining how Hermite’s recurrence formulas recalled in Appendix A can be used to compute efficiently the partial quotients in the continued fraction expansion of $e^{3}$ . This in turn makes it possible to validate the inequalities (1.2) in less than two hours of computation on a small desktop computer.

3. Ultrametric estimates

Let $v$ be a place of $K$ above a prime number $p$ . In this section, we complete the proof of the first assertion in Theorem 2.3 by showing that the component ${\mathcal{C}}_{\mathbf{n},v}$ of ${\mathcal{C}}_{\mathbf{n}}$ contains the points $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{1}},\dots,\mathbf{a}_{\mathbf{n}-\mathbf{e}_{s}}$ for each $\mathbf{n}\in\mathbb{N}_{+}^{s}$ . To this end, we use the following notation and results.

For each $a\in\mathbb{C}_{p}$ and each $r>0$ , we denote by

[TABLE]

the closed disk of $\mathbb{C}_{p}$ with center $a$ and radius $r$ (both closed and open in $\mathbb{C}_{p}$ ). For such a disk $B=B(a,r)$ and for any analytic function $g\colon B\to\mathbb{C}_{p}$ , we define

[TABLE]

This quantity can also be computed from the Taylor series expansion of $g$ around the point $a$ via the formula

[TABLE]

which yields the $p$ -adic form of Cauchy’s inequalities

[TABLE]

(see [14, §1.5]). For the computations, we also use the estimates

[TABLE]

which follow from the formula $|k!|_{p}=p^{-m}$ where $m=\sum_{\ell=1}^{\infty}\lfloor k/p^{\ell}\rfloor$ .
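The formula for $|k!|_{p}$ is Legendre's formula for the exponent $m$ of $p$ in $k!$. The sketch below computes $m$ both from the sum $\sum_{\ell\geq 1}\lfloor k/p^{\ell}\rfloor$ and directly from $k!$, and also checks the classical equivalent form $m=(k-s_{p}(k))/(p-1)$, where $s_{p}(k)$ is the sum of the base-$p$ digits of $k$. The function names are illustrative.

```python
from math import factorial


def legendre_exponent(k, p):
    """Exponent m of p in k!, computed as the sum of floor(k / p^l) over l >= 1."""
    m = 0
    power = p
    while power <= k:
        m += k // power
        power *= p
    return m


def direct_exponent(k, p):
    """Exponent of p in k!, computed by dividing k! by p repeatedly."""
    value = factorial(k)
    m = 0
    while value % p == 0:
        value //= p
        m += 1
    return m


def digit_sum(k, p):
    """Sum of the digits of k written in base p."""
    total = 0
    while k > 0:
        total += k % p
        k //= p
    return total


if __name__ == "__main__":
    for k in (5, 10, 27, 100):
        print(k, legendre_exponent(k, 3), direct_exponent(k, 3))
```

In particular $m<k/(p-1)$ for every $k\geq 1$, so $|k!|_{p}>p^{-k/(p-1)}$, which is the source of the threshold $p^{-1/(p-1)}$ for the convergence of the exponential series.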

Lemma 3.1.

Let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}^{s}$ , let $N=n_{1}+\cdots+n_{s}$ , and let $i,j\in\{1,\dots,s\}$ . Then, we have

[TABLE]

If $|\alpha_{i}-\alpha_{k}|_{v}\leq 1$ for $k=1,\dots,s$ , we also have

[TABLE]

Finally, if $\rho=|\alpha_{i}-\alpha_{j}|_{v}$ satisfies $0<\rho<\delta$ , we have

[TABLE]

Proof.

To simplify, we may assume that $K\subset\mathbb{C}_{p}$ and that $|\alpha|_{v}=|\alpha|_{p}$ for each $\alpha\in K$ . Then, the polynomial $f_{\mathbf{n}}(z)\in K[z]$ can be viewed as an analytic function $f_{\mathbf{n}}\colon\mathbb{C}_{p}\to\mathbb{C}_{p}$ . To estimate $|P_{\mathbf{n}}(\alpha_{i})|_{v}=|P_{\mathbf{n}}(\alpha_{i})|_{p}$ , we set

[TABLE]

For $k=0,1,\dots,N$ , Cauchy’s inequalities together with (3.1) yield

[TABLE]

thus

[TABLE]

This proves (3.2) since

[TABLE]

If $|\alpha_{i}-\alpha_{k}|_{v}\leq 1$ for each $k$ , a similar computation yields $|f_{\mathbf{n}}|_{B}\leq 1$ with $B=B(\alpha_{i},1)$ . Then Cauchy’s inequalities give $|f_{\mathbf{n}}^{(k)}(\alpha_{i})|_{p}\leq|k!|_{p}$ for each $k\in\mathbb{N}$ . Since we have $f_{\mathbf{n}}^{(k)}(\alpha_{i})=0$ for $k=0,\dots,n_{i}-1$ , we deduce that $|f_{\mathbf{n}}^{(k)}(\alpha_{i})|_{v}\leq|n_{i}!|_{v}$ for each $k\in\mathbb{N}$ and the upper bound (3.3) follows.

Suppose now that $0<\rho=|\alpha_{i}-\alpha_{j}|_{p}<\delta$ . To prove (3.4), we use instead

[TABLE]

Since $\rho<\delta$ , the function $g\colon B\to\mathbb{C}_{p}$ given by

[TABLE]

is analytic with $g(\alpha_{j})=0$ and

[TABLE]

For each integer $\ell=0,1,\dots,N$ , we have

[TABLE]

Since $\displaystyle f_{\mathbf{n}}^{(\ell)}=0$ for $\ell>N$ , this remains valid for each $\ell\in\mathbb{N}$ . Then, by (3.5), the Leibniz formula for the derivative of a product yields, for each integer $k\geq 1$ ,

[TABLE]

Since $\alpha_{i}\in B$ and $g(\alpha_{j})=0$ , we deduce that

[TABLE]

The upper bound (3.4) follows since

[TABLE]

Theorem 3.2.

Let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ . Then the subset ${\mathcal{C}}_{\mathbf{n},v}$ of $K_{v}^{s}$ defined in Section 2.3 contains the points $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{1}},\dots,\mathbf{a}_{\mathbf{n}-\mathbf{e}_{s}}$ .

Proof.

Fix an integer $\ell\in\{1,\dots,s\}$ and put $P=P_{\mathbf{n}-\mathbf{e}_{\ell}}$ . To show that ${\mathcal{C}}_{\mathbf{n},v}$ contains the point $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{\ell}}=(P(\alpha_{1}),\dots,P(\alpha_{s}))$ , we fix arbitrary $i,j\in\{1,\dots,s\}$ . Since $\max\{|\alpha_{i}-\alpha_{\ell}|_{v},\,\delta\}\geq\delta\geq 1/p$ , the inequality (3.2) of Lemma 3.1 provides

[TABLE]

If $|\alpha_{i}-\alpha_{k}|_{v}=1$ for each $k=1,\dots,s$ with $k\neq i$ , the inequality (3.3) of the same lemma also provides

[TABLE]

Finally, if $\rho=|\alpha_{j}-\alpha_{i}|_{v}$ satisfies $0<\rho<\delta$ , then, since $\max\{|\alpha_{i}-\alpha_{\ell}|_{v},\,|\alpha_{j}-\alpha_{\ell}|_{v}\}\geq\rho$ , the inequality (3.4) yields

[TABLE]

4. Preliminaries of graph theory

A graph $G$ is a pair of finite sets $(V,E)$ where $E$ consists of subsets of $V$ with two elements. The elements of $V$ are called the vertices of $G$ and those of $E$ the edges of $G$ in agreement with the usual graphic representation.

Let $G=(V,E)$ be a graph. An elementary chain in $G$ is a sequence $(\alpha_{1},\dots,\alpha_{m})$ of $m\geq 2$ distinct elements of $V$ such that $\{\alpha_{i},\alpha_{i+1}\}\in E$ for $i=1,\dots,m-1$ . We say that $G$ is connected if, for each pair of distinct elements $\alpha,\beta$ of $V$ , there exists at least one elementary chain $(\alpha_{1},\dots,\alpha_{m})$ in $G$ with $\alpha_{1}=\alpha$ and $\alpha_{m}=\beta$ . We say that $G$ is a tree if there exists exactly one such chain for each choice of $\alpha,\beta\in V$ with $\alpha\neq\beta$ . When $G$ is connected, we have $|V|\leq|E|+1$ with equality if and only if $G$ is a tree.

In general, for a graph $G=(V,E)$ , there exists one and only one choice of integer $r\geq 1$ and partitions $V=V_{1}\cup\cdots\cup V_{r}$ and $E=E_{1}\cup\cdots\cup E_{r}$ of $V$ and $E$ into $r$ disjoint subsets such that $G_{i}=(V_{i},E_{i})$ is a connected graph for $i=1,\dots,r$ . We say that $G_{1},\dots,G_{r}$ are the connected components of $G$ . If these are trees, we say that $G$ is a forest. When $G$ admits $r$ connected components, we have $|V|\leq|E|+r$ with equality if and only if $G$ is a forest.
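The counting criterion above ( $|V|=|E|+r$ exactly when $G$ is a forest) is easy to test on small examples. The sketch below is only an illustration: the function names are hypothetical, edges are given as pairs of vertices, and the connected components are counted with a standard union-find structure.

```python
def count_components(vertices, edges):
    """Number r of connected components of the graph (vertices, edges)."""
    parent = {v: v for v in vertices}

    def find(v):
        # Follow parent links to the representative, compressing the path.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for a, b in edges:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb
    return len({find(v) for v in vertices})


def is_forest(vertices, edges):
    """A graph is a forest exactly when |V| = |E| + r."""
    return len(vertices) == len(edges) + count_components(vertices, edges)


if __name__ == "__main__":
    V = [1, 2, 3, 4, 5]
    print(is_forest(V, [(1, 2), (2, 3), (4, 5)]))          # two trees
    print(is_forest(V, [(1, 2), (2, 3), (1, 3), (4, 5)]))  # contains a cycle
```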

A rooted forest is a triple $G=(R,V,E)$ where $(V,E)$ is a forest and where $R$ is a subset of $V$ containing exactly one vertex from each connected component of $(V,E)$ . We say that $R$ is the set of roots of $G$ . Then, for each $\beta\in V\setminus R$ , there is a unique elementary chain $(\alpha_{1},\dots,\alpha_{m})$ with $\alpha_{1}\in R$ and $\alpha_{m}=\beta$ . So we obtain a partial ordering on $V$ by defining $\alpha<\beta$ if $\beta\notin R\cup\{\alpha\}$ and if the elementary chain which links $\beta$ to an element of $R$ contains $\alpha$ . In particular, any edge $\{\alpha,\beta\}\in E$ can be ordered so that $\alpha<\beta$ . The resulting pairs $(\alpha,\beta)$ are called the directed edges of $G$ . For fixed $\alpha\in V$ , we say that $D_{G}(\alpha)=\{\beta\in V\,;\,\alpha<\beta\}$ is the set of descendants of $\alpha$ . The set $S_{G}(\alpha)$ of minimal elements of $D_{G}(\alpha)$ is called the set of successors of $\alpha$ . Note that the pairs $(\alpha,\beta)\in V\times V$ with $\beta\in S_{G}(\alpha)$ are exactly the directed edges of $G$ . Moreover, any $\beta\in V\setminus R$ is the successor of a unique $\alpha\in V$ . This allows us to formulate the following result.

Proposition 4.1.

Let $G=(R,V,E)$ be a rooted tree, let $K$ be a field, let $(x_{\alpha})_{\alpha\in V}$ be a family of indeterminates over $K$ indexed by $V$ , and let $\varphi\colon E\to K$ be a function. For each $\beta\in V$ , we define

[TABLE]

Then, upon extending the partial ordering on $V$ to a total ordering, the matrix of the linear forms $(L_{\beta})_{\beta\in V}$ with respect to the basis $(x_{\alpha})_{\alpha\in V}$ is lower triangular with $1$ everywhere on the diagonal.

5. Paths of steepest ascent

In this section, we fix a non-constant monic polynomial $f(z)\in\mathbb{C}[z]$ , a compact convex subset ${\mathcal{K}}$ of $\mathbb{C}$ containing all the roots of $f$ , and a closed disk $D$ of $\mathbb{C}$ containing ${\mathcal{K}}$ . We denote by $N$ the degree of $f$ , and by $R$ the radius of $D$ . The main goal of this section is to prove the following result.

Theorem 5.1.

Let $\beta\in{\mathcal{K}}$ . There exists a root $\alpha$ of $f$ and a path $\gamma\colon[0,1]\to\mathbb{C}$ linking $\gamma(0)=\alpha$ to $\gamma(1)=\beta$ , such that $f(\gamma(t))=tf(\beta)$ for each $t\in[0,1]$ . The image of such a path is contained in ${\mathcal{K}}$ , with length at most $\pi RN$ .

By a path we mean here a continuous piecewise differentiable map $\gamma\colon I\to\mathbb{C}$ on a closed subinterval $I$ of $\mathbb{R}$ . For a path $\gamma$ as in the statement of the theorem, $\gamma(0)$ is necessarily a root of $f$ and we have $\max\{|f(\gamma(t))|\,;\,0\leq t\leq 1\}=|f(\beta)|$ . We will see that, in fact, $\gamma$ is a path of steepest ascent for $|f|$ .

For the proof, we consider the polynomial $f$ as a covering of Riemann surfaces $f\colon\mathbb{C}\to\mathbb{C}$ of degree $N$ , ramified in a finite number of points. Then any path $\gamma\colon[0,1]\to\mathbb{C}$ lifts into $N$ paths $\gamma_{1},\dots,\gamma_{N}\colon[0,1]\to\mathbb{C}$ such that $f^{-1}(\gamma(t))=\{\gamma_{1}(t),\dots,\gamma_{N}(t)\}$ for all $t\in[0,1]$ . The latter are not unique in general, because of ramification, and are constructed by pasting as in the proof of [8, Theorem 4.14]. For a path $\gamma$ of the form $\gamma(t)=tf(\beta)$ with $f(\beta)\neq 0$ , this leads to the following statement.

Lemma 5.2.

Let $\beta\in\mathbb{C}$ with $f(\beta)\neq 0$ , and let $m=m(\beta)\geq 0$ denote the order of the derivative of $f$ at $\beta$ . Then, there exist $\delta\in(0,1)$ and $m+1$ paths $\gamma_{0},\dots,\gamma_{m}$ from $[0,1]$ to $\mathbb{C}$ such that

(i)

$\gamma_{0}(1)=\cdots=\gamma_{m}(1)=\beta$ ,

(ii)

$f(\gamma_{0}(t))=\cdots=f(\gamma_{m}(t))=tf(\beta)$ for each $t\in[0,1]$ ,

(iii)

$\gamma_{0}(t),\dots,\gamma_{m}(t)$ are $m+1$ distinct numbers for each $t\in(1-\delta,1)$ .

Moreover, for each $j=0,1,\dots,m$ and each $t\in(0,1)$ such that $f^{\prime}(\gamma_{j}(t))\neq 0$ , the function $\gamma_{j}$ is analytic at $t$ and its derivative $\gamma_{j}^{\prime}(t)$ heads in the direction where the norm $|f|$ of $f$ grows fastest.

The last assertion of the lemma means that $\gamma_{0},\dots,\gamma_{m}$ are paths of steepest ascent for the norm of $f$ . This is true in fact for any path $\gamma$ such that $f(\gamma(t))=ct$ ( $0\leq t\leq 1$ ) with a fixed $c\in\mathbb{C}\setminus\{0\}$ because the image of the map $t\mapsto ct$ with $t\geq 0$ is a half line that is orthogonal to the circles centered at the origin. As the map $f\colon\mathbb{C}\to\mathbb{C}$ is conformal outside of the ramification points, the preimage $\gamma$ of this curve is orthogonal to the level curves of $|f|$ outside of these points. We will revisit the construction of the paths $\gamma_{j}$ in Lemma 6.3.
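A path with $f(\gamma(t))=tf(\beta)$ can be approximated numerically when the segment from $f(\beta)$ to $0$ avoids the critical values of $f$. The following sketch is an illustration only and not the construction used in the proofs. It follows the lift from $t=1$ down to $t=0$ by Newton's method, starting each step from the previous point. The names `horner`, `lift_ray` and `length`, the step count, and the sample polynomial $z^{3}-1$ with $\beta=1+i$ are all assumptions made for the example.

```python
import math


def horner(coeffs, z):
    """Return (f(z), f'(z)) for the polynomial with coefficients high-to-low."""
    val, der = 0j, 0j
    for c in coeffs:
        der = der * z + val
        val = val * z + c
    return val, der


def lift_ray(coeffs, beta, steps=2000, newton_iters=20):
    """Approximate gamma with f(gamma(t)) = t * f(beta), from t = 1 down to t = 0.

    Assumes that the path stays away from the zeros of f'.
    """
    f_beta, _ = horner(coeffs, beta)
    z = complex(beta)
    path = [(1.0, z)]
    for k in range(steps - 1, -1, -1):
        t = k / steps
        for _ in range(newton_iters):
            val, der = horner(coeffs, z)
            z -= (val - t * f_beta) / der
        path.append((t, z))
    return f_beta, path


def length(path):
    """Length of the polygonal approximation of the path."""
    return sum(abs(path[i + 1][1] - path[i][1]) for i in range(len(path) - 1))


if __name__ == "__main__":
    coeffs = [1, 0, 0, -1]            # f(z) = z^3 - 1
    f_beta, path = lift_ray(coeffs, 1 + 1j)
    print("end point:", path[-1][1])
    print("approximate length:", length(path))
    print("bound pi*R*N with R = 1.5, N = 3:", math.pi * 1.5 * 3)
```

In this example the disk of radius $1.5$ centered at the origin contains both $\beta$ and the three roots, so it can play the role of $D$ in Theorem 5.1, and the printed length can be compared with $\pi RN$.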

Proof of Theorem 5.1

If $f(\beta)=0$ , the constant path $\gamma(t)=\beta$ for each $t\in[0,1]$ is the only possible choice and it has the required properties. Suppose from now on that $f(\beta)\neq 0$ . Then the preceding lemma provides a path $\gamma$ of the required type linking $\beta$ to a root of $f$ . Fix such a path. For the computations, we denote by $\alpha_{1},\dots,\alpha_{s}$ the distinct roots of $f$ in $\mathbb{C}$ and by $n_{1},\dots,n_{s}$ their respective multiplicities so that

$$f(z)=(z-\alpha_{1})^{n_{1}}\cdots(z-\alpha_{s})^{n_{s}}.$$

We also denote by $B$ the set of zeros of the derivative $f^{\prime}$ of $f$ .

By the Gauss-Lucas theorem, the set $B$ is contained in the convex hull of the roots of $f$ , thus $B\subset{\mathcal{K}}$ . The fact that the image of $\gamma$ is contained in ${\mathcal{K}}$ admits a similar proof. Indeed, suppose by contradiction that the image escapes from ${\mathcal{K}}$ . Then, since ${\mathcal{K}}$ is convex, there exists a half-plane containing ${\mathcal{K}}$ but not the image of $\gamma$ . More precisely, there exist $a,b\in\mathbb{C}$ with $|a|=1$ such that $\operatorname{Re}(az+b)\leq 0$ for each $z\in{\mathcal{K}}$ and $\operatorname{Re}(a\gamma(t)+b)>0$ for at least one $t\in[0,1]$ . Choose $t_{0}\in[0,1]$ for which $\operatorname{Re}(a\gamma(t_{0})+b)$ is maximal, and set $z_{0}=\gamma(t_{0})$ . Since $\operatorname{Re}(az_{0}+b)>0$ , we have $z_{0}\notin{\mathcal{K}}$ , thus $t_{0}\in(0,1)$ and $z_{0}\notin B$ . Therefore $\gamma$ is differentiable at $t_{0}$ with $\operatorname{Re}(a\gamma^{\prime}(t_{0}))=0$ . However, by differentiating both sides of the equality $f(\gamma(t))=tf(\beta)$ at $t=t_{0}$ , we obtain

[TABLE]

As $\operatorname{Re}(a(z_{0}-\alpha_{\ell}))=\operatorname{Re}(az_{0}+b)-\operatorname{Re}(a\alpha_{\ell}+b)\geq\operatorname{Re}(az_{0}+b)>0$ for $\ell=1,\dots,s$ , we deduce that $\operatorname{Re}(a\gamma^{\prime}(t_{0}))>0$ , a contradiction.

To estimate the length $L(\gamma)$ of $\gamma$ , we use the Cauchy-Crofton formula

[TABLE]

(see for example the beautiful proof of [1]). Fix $r,\theta\in\mathbb{R}$ and consider the polynomial

$$g_{r,\theta}(u)=\operatorname{Im}\left(\frac{f((r+iu)e^{i\theta})}{f(\beta)}\right).$$

If $t_{0}\in[0,1]$ satisfies $\operatorname{Re}(\gamma(t_{0})e^{-i\theta})=r$ , we may write $\gamma(t_{0})=(r+iu_{0})e^{i\theta}$ for some $u_{0}\in\mathbb{R}$ . Then we have $f((r+iu_{0})e^{i\theta})=t_{0}f(\beta)$ and consequently $g_{r,\theta}(u_{0})=0$ . As $\gamma$ is injective on $[0,1]$ (because $f\circ\gamma$ is), this means that $N(r,\theta)$ is at most equal to the number of real roots of $g_{r,\theta}$ . But, as $f$ has degree $N$ , the polynomial $g_{r,\theta}(u)$ has degree at most $N$ and its coefficient of $u^{N}$ is $\operatorname{Im}((ie^{i\theta})^{N}/f(\beta))$ . Thus, except possibly for the $2N$ values of $\theta\in[0,2\pi)$ for which this coefficient vanishes, we have $g_{r,\theta}\neq 0$ and thus $N(r,\theta)\leq N$ .

For fixed $\theta$ , the set $\{\operatorname{Re}(ze^{-i\theta})\,;\,z\in D\}$ is an interval $I_{\theta}$ of $\mathbb{R}$ of length $2R$ . As the image of $\gamma$ is contained in ${\mathcal{K}}\subset D$ , we have $N(r,\theta)=0$ if $r\notin I_{\theta}$ . We conclude that $A(\theta)\leq 2RN$ except for at most $2N$ values of $\theta\in[0,2\pi)$ , and thus $L(\gamma)\leq\pi RN$ .

6. A tree of paths between complex roots

As in the preceding section, we fix a non-constant monic polynomial $f(z)\in\mathbb{C}[z]$ . We denote by $N$ its degree, by $A=\{\alpha_{1},\dots,\alpha_{s}\}$ the set of its complex roots, by ${\mathcal{K}}$ the convex hull of $A$ , and by $R$ the radius of a closed disk $D$ containing $A$ . We also denote by $B=\{\beta_{1},\dots,\beta_{p}\}$ the set of roots of $f^{\prime}(z)$ which are not roots of $f(z)$ , that is the set of zeros of the logarithmic derivative $f^{\prime}(z)/f(z)$ . Then we may write

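$$f(z)=\prod_{i=1}^{s}(z-\alpha_{i})^{n_{i}}\tag{6.1}$$

$$f^{\prime}(z)=N\prod_{i=1}^{s}(z-\alpha_{i})^{n_{i}-1}\prod_{j=1}^{p}(z-\beta_{j})^{m_{j}}\tag{6.2}$$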

for integers $n_{1},\dots,n_{s}\geq 1$ with sum $N$ , and integers $m_{1},\dots,m_{p}\geq 1$ with sum $s-1$ .

For each $\beta\in\mathbb{C}$ , we denote by $m(\beta)$ the order of $f^{\prime}(z)$ at $\beta$ . With this notation, we have $m_{j}=m(\beta_{j})$ for $j=1,\dots,p$ . The goal of this section is to prove the following result.

Theorem 6.1.

There exists a tree $G$ with the following properties:

(i)

Its set of vertices is $A$ .

(ii)

It has $s-1$ edges, each one indexed by an element of $B$ .

(iii)

For each $\beta\in B$ , there are exactly $m(\beta)$ edges indexed by $\beta$ .

(iv)

If $\{\alpha,\alpha^{\prime}\}$ is an edge of $G$ indexed by $\beta$ , there exists a path $\gamma\colon[0,1]\to\mathbb{C}$ of length at most $2\pi RN$ , contained in ${\mathcal{K}}$ , linking $\gamma(0)=\alpha$ to $\gamma(1)=\alpha^{\prime}$ , such that

[TABLE]

When all the roots of $f(z)$ are real, we have $f(z)\in\mathbb{R}[z]$ and we can give a very simple proof of the theorem. To this end, we may assume that the roots are labelled in increasing order $\alpha_{1}<\cdots<\alpha_{s}$ . Then, in each interval $[\alpha_{j},\alpha_{j+1}]$ with $1\leq j\leq s-1$ , the function $|f(z)|$ achieves its maximum at a zero $\beta_{j}$ of $f^{\prime}(z)$ with $\alpha_{j}<\beta_{j}<\alpha_{j+1}$ . Since $B$ has cardinality $p\leq s-1$ , this exhausts all the elements of $B$ : we have $p=s-1$ and $m_{1}=\cdots=m_{s-1}=1$ . We take for $G$ the graph with set of vertices $A$ , whose edges are the pairs $\{\alpha_{j},\alpha_{j+1}\}$ indexed by $\beta_{j}$ for $j=1,\dots,s-1$ . Then $G$ is a tree and, for each $j=1,\dots,s-1$ , the piecewise affine linear path $\gamma_{j}$ with $\gamma_{j}(0)=\alpha_{j}$ , $\gamma_{j}(1/2)=\beta_{j}$ and $\gamma_{j}(1)=\alpha_{j+1}$ fulfills the conditions in (iv). Moreover its length is $\alpha_{j+1}-\alpha_{j}\leq 2R$ .
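In the real case, the points $\beta_{j}$ can be located numerically. The sketch below is an illustration with hypothetical names. It uses the fact that the logarithmic derivative $f^{\prime}(x)/f(x)=\sum_{i}n_{i}/(x-\alpha_{i})$ is strictly decreasing on each open interval $(\alpha_{j},\alpha_{j+1})$ , going from $+\infty$ to $-\infty$ , so that bisection finds its unique zero there.

```python
def log_derivative(x, roots, mults):
    """Value of f'(x)/f(x) = sum of n_i / (x - alpha_i)."""
    return sum(n / (x - a) for a, n in zip(roots, mults))


def critical_points(roots, mults, iters=200):
    """For strictly increasing real roots, return the zero of f'/f
    lying between each pair of consecutive roots (found by bisection)."""
    betas = []
    for lo, hi in zip(roots, roots[1:]):
        a, b = lo, hi
        for _ in range(iters):
            mid = (a + b) / 2
            if mid == a or mid == b:
                break  # interval cannot be refined further in floating point
            if log_derivative(mid, roots, mults) > 0:
                a = mid  # the zero lies to the right of mid
            else:
                b = mid
        betas.append((a + b) / 2)
    return betas


if __name__ == "__main__":
    # f(z) = z (z - 1) (z - 2): one critical point in each of (0, 1) and (1, 2).
    print(critical_points([0.0, 1.0, 2.0], [1, 1, 1]))
    # f(z) = z^2 (z - 1): a single critical point in (0, 1).
    print(critical_points([0.0, 1.0], [2, 1]))
```

For $f(z)=z(z-1)(z-2)$ one has $f^{\prime}(z)=3z^{2}-6z+2$ , with zeros $1\pm 1/\sqrt{3}$ , and for $f(z)=z^{2}(z-1)$ one has $f^{\prime}(z)=z(3z-2)$ , whose only zero outside $A$ is $2/3$ .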

Step 1

The proof of the general case requires several lemmas. For each $\beta\in B$ , we choose once and for all $m(\beta)+1$ paths $\gamma_{\beta,0},\dots,\gamma_{\beta,m(\beta)}$ with end point $\beta$ as in Lemma 5.2. Then we have $\gamma_{\beta,j}(0)\in A$ for $j=0,\dots,m(\beta)$ . Our goal is to show that these $m(\beta)+1$ points of $A$ are distinct and that the graph $G$ with vertices $\alpha_{1},\dots,\alpha_{s}$ and edges $\{\gamma_{\beta,0}(0),\gamma_{\beta,j}(0)\}$ with $\beta\in B$ and $1\leq j\leq m(\beta)$ satisfies the properties (i) to (iv) from the theorem. We start with property (iv).

Lemma 6.2.

Let $\beta\in B$ and $j\in\{1,\dots,m(\beta)\}$ . Then the path $\tilde{\gamma}$ from $\gamma_{\beta,0}(0)$ to $\gamma_{\beta,j}(0)$ given by

[TABLE]

is contained in ${\mathcal{K}}$ , with length at most $2\pi RN$ . Moreover, it satisfies

[TABLE]

Proof.

We have $B\subset{\mathcal{K}}$ by the Gauss-Lucas theorem. Then, for each $\beta\in B$ , Theorem 5.1 shows that the paths $\gamma_{\beta,0}$ and $\gamma_{\beta,j}$ are contained in ${\mathcal{K}}$ with length at most $\pi RN$ . The conclusion follows since these are paths of steepest ascent for $|f|$ . ∎

Step 2

We first prove the following result where $S=\mathbb{C}\cup\{\infty\}$ stands for the Riemann sphere with its usual topology. Afterwards, we use it to construct a tree ${H}$ on $A\cup B$ .

Lemma 6.3.

Let $\beta\in B$ and let $m=m(\beta)$ . There exist $\delta>0$ and $m+1$ continuous functions $\gamma^{+}_{0},\dots,\gamma^{+}_{m}$ from $[1,\infty]$ to $S=\mathbb{C}\cup\{\infty\}$ such that

(i)

$\gamma^{+}_{0}(1)=\cdots=\gamma^{+}_{m}(1)=\beta$ ,

(ii)

$f(\gamma^{+}_{0}(t))=\cdots=f(\gamma^{+}_{m}(t))=tf(\beta)$ for each $t\in[1,\infty]$ ,

(iii)

$\gamma^{+}_{0}(t),\dots,\gamma^{+}_{m}(t)$ are $m+1$ distinct numbers for each $t\in(1,1+\delta)$ .

Then, the curves $\Gamma^{+}_{0}=\gamma^{+}_{0}([1,\infty]),\dots,\Gamma^{+}_{m}=\gamma^{+}_{m}([1,\infty])$ meet only at the points $\beta$ and $\infty$ on $S$ . Moreover, their complement $S\setminus(\Gamma^{+}_{0}\cup\cdots\cup\Gamma^{+}_{m})$ is the union of $m+1$ disjoint connected open subsets ${\mathcal{R}}_{0},\dots,{\mathcal{R}}_{m}$ of $\mathbb{C}$ such that $\gamma_{\beta,j}([0,1))\subseteq{\mathcal{R}}_{j}$ for $j=0,\dots,m$ .

The proof is based on the Jordan curve theorem and is illustrated in Figure 1.

Proof.

Upon putting $\ell=m+1$ , we may write $f(z)=f(\beta)(1+(z-\beta)^{\ell}g(z))$ where $g(z)$ is a polynomial with $g(\beta)\neq 0$ . Then, for sufficiently small $\epsilon>0$ , there exist an open neighborhood $V$ of $\beta$ and a biholomorphic function $h$ from $V$ to $B(0,\epsilon)=\{z\in\mathbb{C}\,;\,|z|<\epsilon\}$ satisfying $h(\beta)=0$ and

[TABLE]

for each $z\in V$ . Fix such a choice of $\epsilon$ , $V$ and $h$ , and set $\delta=\epsilon^{\ell}$ and $\rho=e^{\pi i/\ell}$ . For $j=0,\dots,m$ , we define a continuous function $\gamma^{+}_{j}\colon[1,1+\delta)\to V$ by

[TABLE]

Then, for fixed $t\in(1,1+\delta)$ , the numbers $z=\gamma^{+}_{0}(t),\dots,\gamma^{+}_{m}(t)$ are the $\ell$ distinct solutions of $f(z)=tf(\beta)$ with $z\in V$ . In particular, $\gamma^{+}_{0},\dots,\gamma^{+}_{m}$ satisfy Conditions (i) and (iii) of the lemma, as well as (ii) for each $t\in[1,1+\delta)$ . For $j=0,\dots,m$ , we extend $\gamma^{+}_{j}$ to a continuous function $\gamma^{+}_{j}\colon[1,\infty]\to S$ satisfying $f(\gamma^{+}_{j}(t))=tf(\beta)$ for each $t\in[1,\infty]$ .

Similarly, for $j=0,\dots,m$ , we define a continuous function $\gamma^{-}_{j}\colon(1-\delta,1]\to V$ by

[TABLE]

For fixed $t\in(1-\delta,1)$ , the numbers $z=\gamma^{-}_{0}(t),\dots,\gamma^{-}_{m}(t)$ are the $\ell$ distinct solutions of $f(z)=tf(\beta)$ with $z\in V$ , thus they form a permutation of $\gamma_{\beta,0}(t),\dots,\gamma_{\beta,m}(t)$ . This permutation being independent of $t$ , there is no loss of generality in assuming that $\gamma^{-}_{j}$ is the restriction of $\gamma_{\beta,j}$ to $(1-\delta,1]$ for $j=0,\dots,m$ . Then we extend each $\gamma_{\beta,j}\colon[0,1]\to\mathbb{C}$ to a continuous function $\gamma^{-}_{j}\colon[-\infty,1]\to S$ such that $f(\gamma^{-}_{j}(t))=tf(\beta)$ for each $t\in[-\infty,1]$ .

Put $\Gamma^{-}_{j}=\gamma^{-}_{j}([-\infty,1])$ and $\Gamma^{+}_{j}=\gamma^{+}_{j}([1,\infty])$ for $j=0,\dots,m$ , and fix $j,k\in\{0,1,\dots,m\}$ . The curves $\Gamma^{-}_{j}$ and $\Gamma^{+}_{k}$ meet only at the points $\beta$ and $\infty$ because if $\gamma^{-}_{j}(t)=\gamma^{+}_{k}(u)$ for some $t\in[-\infty,1]$ and $u\in[1,\infty]$ , then $tf(\beta)=uf(\beta)$ , thus $t=u=1$ or $-t=u=\infty$ . Suppose now that $j<k$ . As the curves $\Gamma^{+}_{j}$ and $\Gamma^{+}_{k}$ meet at infinity, there exists a smallest $r\in[1+\delta,\infty]$ such that $\gamma^{+}_{j}(r)=\gamma^{+}_{k}(r)$ . For this choice of $r$ , the union $\gamma^{+}_{j}([1,r])\cup\gamma^{+}_{k}([1,r])$ is a simple closed curve $\Gamma$ . By the Jordan curve theorem, its complement in $S$ is thus the union of two connected open sets ${\mathcal{R}}$ and ${\mathcal{R}}^{\prime}$ with boundary $\Gamma$ . On the other hand, we have

[TABLE]

Moreover, $B(0,\epsilon)\setminus P$ is the union of two disjoint connected open sets ${\mathcal{U}}$ and ${\mathcal{U}}^{\prime}$ (open sectors of the disk $B(0,\epsilon)$ ), where ${\mathcal{U}}$ contains the rays $(0,\epsilon)\rho^{2i+1}$ with $j\leq i<k$ and ${\mathcal{U}}^{\prime}$ those with $0\leq i<j$ or $k\leq i\leq m$ . As $h\colon V\to B(0,\epsilon)$ is a homeomorphism, $h^{-1}({\mathcal{U}})$ and $h^{-1}({\mathcal{U}}^{\prime})$ are disjoint connected open subsets of $S$ whose union is $V\setminus\Gamma$ . We may assume that $h^{-1}({\mathcal{U}})\subset{\mathcal{R}}$ and $h^{-1}({\mathcal{U}}^{\prime})\subset{\mathcal{R}}^{\prime}$ . Then, we obtain

[TABLE]

However, ${\mathcal{R}}$ and ${\mathcal{R}}^{\prime}$ share the same boundary, contained in $\Gamma^{+}_{j}\cup\Gamma^{+}_{k}$ . Thus none of the sets $\Gamma^{-}_{i}\setminus\{\beta,\infty\}=\gamma_{i}^{-}((-\infty,1))$ meet this boundary. As these are connected curves, we conclude that $\Gamma^{-}_{i}\setminus\{\beta,\infty\}$ is contained in ${\mathcal{R}}$ if $j\leq i<k$ and in ${\mathcal{R}}^{\prime}$ otherwise. In particular, none of the open subsets ${\mathcal{R}}$ and ${\mathcal{R}}^{\prime}$ of $\mathbb{C}$ is bounded and consequently we must have $r=\infty$ . This means that $\Gamma^{+}_{j}$ and $\Gamma^{+}_{k}$ meet only at $\beta$ and $\infty$ .

With the above notation, we define ${\mathcal{R}}_{j}={\mathcal{R}}$ for the choice of $j\in\{0,\dots,m-1\}$ and $k=j+1$ . We also define ${\mathcal{R}}_{m}={\mathcal{R}}^{\prime}$ for the choice of $j=0$ and $k=m$ . These are connected open subsets of $\mathbb{C}$ with $\gamma_{\beta,j}([0,1))\subset\Gamma^{-}_{j}\setminus\{\beta,\infty\}\subset{\mathcal{R}}_{j}$ for $j=0,\dots,m$ . It remains to show that ${\mathcal{R}}_{0},\dots,{\mathcal{R}}_{m}$ are pairwise disjoint. To this end, we first note that if $j\neq k$ , then ${\mathcal{R}}_{j}\not\subseteq{\mathcal{R}}_{k}$ since $\Gamma^{-}_{j}\setminus\{\beta,\infty\}$ is contained in ${\mathcal{R}}_{j}$ but not in ${\mathcal{R}}_{k}$ . So if ${\mathcal{R}}_{j}$ and ${\mathcal{R}}_{k}$ intersect, then ${\mathcal{R}}_{j}$ meets the boundary of ${\mathcal{R}}_{k}$ . Then ${\mathcal{R}}_{j}$ contains at least one point of $\Gamma^{+}_{i}\setminus\{\beta,\infty\}$ for some $i\in\{0,1,\dots,m\}$ . However, by the choice of ${\mathcal{R}}_{j}$ , we have $\gamma^{+}_{i}(t)\notin{\mathcal{R}}_{j}$ for each $t\in(1,1+\delta)$ . Thus the curve $\Gamma^{+}_{i}\setminus\{\beta,\infty\}$ is not fully contained in ${\mathcal{R}}_{j}$ and, as it is a connected set, it meets the boundary of ${\mathcal{R}}_{j}$ without being fully contained in it. This is impossible because that boundary is the union of two curves among $\Gamma^{+}_{0},\dots,\Gamma^{+}_{m}$ . ∎

Lemma 6.4.

For each $\beta\in B$ , the $m(\beta)+1$ points $\gamma_{\beta,j}(0)\in A$ with $0\leq j\leq m(\beta)$ are distinct. Moreover, let ${H}$ be the graph whose set of vertices is $A\cup B$ and whose edges are the pairs $\{\beta,\gamma_{\beta,j}(0)\}$ with $\beta\in B$ and $0\leq j\leq m(\beta)$ . Then ${H}$ is a tree.

Proof.

The first assertion is a direct consequence of the preceding lemma because, for $\beta\in B$ and $m=m(\beta)$ , this lemma provides disjoint connected open sets ${\mathcal{R}}_{0},\dots,{\mathcal{R}}_{m}$ such that $\gamma_{\beta,j}(0)\in{\mathcal{R}}_{j}$ for $j=0,\dots,m$ .

Suppose that ${H}$ is not a forest. Then ${H}$ contains a simple cycle: an elementary chain $(a_{1},\dots,a_{k})$ with $k\geq 3$ such that $\{a_{k},a_{1}\}$ is an edge of ${H}$ . Then, $k$ is an even integer and the $a_{i}$ ’s belong alternately to $A$ or $B$ according to the parity of $i$ . By permuting cyclically the elements of this chain if necessary, we may assume that $a_{1}\in B$ and that $|f(a_{1})|\geq|f(a_{i})|$ for $i=1,\dots,k$ . Let $m=m(a_{1})$ and let ${\mathcal{R}}_{0},\dots,{\mathcal{R}}_{m}$ be the connected open sets associated to the point $a_{1}\in B$ by Lemma 6.3. For each point $z\neq a_{1}$ outside of these open sets, we have $f(z)=tf(a_{1})$ for a real number $t>1$ , thus $|f(z)|>|f(a_{1})|$ . We set $a_{k+1}=a_{1}$ and, for $i=1,\dots,k$ , we denote by $\gamma_{i}$ the path of the form $\gamma_{\beta,j}$ which links $a_{i}$ and $a_{i+1}$ . For each $t\in[0,1]$ , we have $f(\gamma_{i}(t))=tf(a_{i})$ if $i$ is odd and $f(\gamma_{i}(t))=tf(a_{i+1})$ if $i$ is even. In both cases, this yields $|f(\gamma_{i}(t))|\leq|f(a_{1})|$ , with strict inequality if $t\neq 1$ . As $a_{1},\dots,a_{k}$ are distinct and as $\gamma_{i}(1)\in\{a_{3},\dots,a_{k-1}\}$ when $2\leq i\leq k-1$ , we deduce that the curve

[TABLE]

is contained in ${\mathcal{R}}_{0}\cup\cdots\cup{\mathcal{R}}_{m}$ . As this is a connected subset of $\mathbb{C}$ , it is therefore fully contained in ${\mathcal{R}}_{j}$ for some $j$ . Since $\gamma_{1}(1)=\gamma_{k}(1)=a_{1}$ , this implies that $\gamma_{1}=\gamma_{k}$ , thus $a_{2}=\gamma_{1}(0)=\gamma_{k}(0)=a_{k}$ , which is impossible.

So ${H}$ is a forest. Therefore, its number of connected components is equal to its number of vertices minus its number of edges, that is

$$|A\cup B|-\sum_{\beta\in B}(m(\beta)+1)=(s+p)-((s-1)+p)=1.$$

Thus ${H}$ is in fact a tree. ∎

Step 4. Proof of Theorem 6.1

Let $G$ be the graph whose set of vertices is $A$ and whose edges are the pairs

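$$\{\gamma_{\beta,0}(0),\gamma_{\beta,j}(0)\}\quad\text{with}\quad\beta\in B\ \text{ and }\ 1\leq j\leq m(\beta).\tag{6.3}$$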

Since ${H}$ is connected, so is the graph $G$ . Since $G$ possesses $s=|A|$ vertices and since $\sum_{\beta\in B}m(\beta)=s-1$ , we deduce that the $s-1$ edges (6.3) are distinct and that $G$ is a tree. In particular, for each $\beta\in B$ , there are exactly $m(\beta)$ edges of $G$ indexed by $\beta$ and Lemma 6.2 shows that, for each of them, there exists a path satisfying Condition (iv) of the theorem.

7. Computation of a semi-resultant

We first prove the following formula.

Proposition 7.1.

With the notation of the preceding section, we have

[TABLE]

The left hand side of this equality is the semi-resultant of $f(z)$ and $f^{\prime}(z)$ in the sense of Chudnovsky [5, 7].

Proof.

The formula for the derivative of a product applied to the factorization (6.1) of $f(z)$ yields

[TABLE]

where

[TABLE]

By comparison with the factorization (6.2) of $f^{\prime}(z)$ , we also find that

[TABLE]

Upon evaluating both expressions for $g(z)$ at $z=\alpha_{k}$ , we obtain

[TABLE]

Since $m_{1}+\cdots+m_{p}=s-1$ , these equalities may be rewritten as

[TABLE]

As stated, this yields

[TABLE]

Corollary 7.2.

With the same notation, we have

[TABLE]

Proof.

Since $N=n_{1}+\cdots+n_{s}$ , we find

[TABLE]

This yields $N!\prod_{i=1}^{s}n_{i}^{n_{i}}\leq N^{N}\prod_{i=1}^{s}n_{i}!$ , and the conclusion follows. ∎
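The inequality $N!\prod_{i}n_{i}^{n_{i}}\leq N^{N}\prod_{i}n_{i}!$ can be checked with exact integer arithmetic. It is consistent with the multinomial theorem, since $\frac{N!}{n_{1}!\cdots n_{s}!}\,n_{1}^{n_{1}}\cdots n_{s}^{n_{s}}$ is one of the non-negative terms in the expansion of $(n_{1}+\cdots+n_{s})^{N}=N^{N}$ . The sketch below uses a hypothetical helper name and only illustrates this numerically.

```python
from itertools import product
from math import factorial, prod


def both_sides(ns):
    """Return (N! * prod n_i^n_i, N^N * prod n_i!) for positive integers ns."""
    N = sum(ns)
    lhs = factorial(N) * prod(n ** n for n in ns)
    rhs = N ** N * prod(factorial(n) for n in ns)
    return lhs, rhs


if __name__ == "__main__":
    # Exhaustive check over all triples with entries in 1..6.
    for ns in product(range(1, 7), repeat=3):
        lhs, rhs = both_sides(ns)
        assert lhs <= rhs, ns
    print(both_sides((2, 3)))
```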

8. Volume of the Archimedean components

We are now ready to prove the upper bound estimate in Theorem 2.3 (i). The notation is as in Section 2.

Theorem 8.1.

Let $v$ be an Archimedean place of $K$ and let ${\mathcal{C}}_{\mathbf{n},v}$ be the convex body of $K_{v}^{s}$ defined in Section 2.3 for the choice of an $s$ -tuple $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ . Then, we have

[TABLE]

where $N=n_{1}+\cdots+n_{s}$ , $R_{v}=\max_{1\leq i<j\leq s}|\alpha_{i}-\alpha_{j}|_{v}$ , and $\mathbf{1}=(1,\dots,1)$ .

Proof.

To simplify, we may assume that $K\subset\mathbb{C}$ and that $|\alpha|_{v}=|\alpha|$ for each $\alpha\in K$ . By permuting $\alpha_{1},\dots,\alpha_{s}$ if necessary, we may also assume that $n_{1}\geq\cdots\geq n_{s}$ form a decreasing sequence. We denote by $D$ the closed disk of radius $R_{v}$ and center $(\alpha_{1}+\dots+\alpha_{s})/s$ in $\mathbb{C}$ . As this disk contains $\alpha_{1},\dots,\alpha_{s}$ , it also contains the convex hull ${\mathcal{K}}$ of these points.

Suppose first that $n_{1}\geq 2$ and let $r$ be the largest index such that $n_{r}\geq 2$ . We form the polynomial

[TABLE]

The set of its roots is $A=\{\alpha_{1},\dots,\alpha_{r}\}$ and its degree is $N-s$ . Its derivative factors as

[TABLE]

where $B=\{\beta_{1},\dots,\beta_{p}\}$ is the set of roots of $f^{\prime}(z)$ outside of $A$ , and where $m_{j}$ is the multiplicity of $\beta_{j}$ for $j=1,\dots,p$ . We choose a tree $G$ as in Theorem 6.1 for this polynomial $f(z)$ . By construction, the set of vertices of $G$ is $A$ . We now extend $G$ to a graph $\tilde{G}$ on $\{\alpha_{1},\dots,\alpha_{s}\}$ in the following way. For each $j=r+1,\dots,s$ , we choose a path $\gamma_{j}\colon[0,1]\to\mathbb{C}$ such that $\gamma_{j}(1)=\alpha_{j}$ and $f(\gamma_{j}(t))=tf(\alpha_{j})$ as in Theorem 5.1. Then $\gamma_{j}(0)$ is a root of $f$ , thus an element of $A$ , and we add the edge $\{\gamma_{j}(0),\alpha_{j}\}$ to the graph $G$ . Finally, we choose $\alpha_{1}\in A$ as a root of the resulting tree $\tilde{G}$ . Then, ${\mathcal{C}}_{\mathbf{n},v}$ is contained in the set ${\mathcal{K}}_{v}$ of all points $(x_{1},\dots,x_{s})\in K_{v}^{s}$ satisfying

[TABLE]

as well as

[TABLE]

for each directed edge $(\alpha_{i},\alpha_{j})$ of $\tilde{G}$ with $\alpha_{i}<\alpha_{j}$ . Since $\tilde{G}$ is a rooted tree, Proposition 4.1 shows that the $s$ linear forms defining ${\mathcal{K}}_{v}$ are linearly independent, with determinant $\pm 1$ . Thus ${\mathcal{K}}_{v}$ is a convex body of $K_{v}^{s}$ with

[TABLE]

where $E$ stands for the set of directed edges of $\tilde{G}$ .

For now, fix $(\alpha_{i},\alpha_{j})\in E$ and $k\in\{1,\dots,s\}$ . By construction, we have $i\leq r$ , that is $\alpha_{i}\in A$ . If $j\leq r$ , we also have $\alpha_{j}\in A$ , and $\{\alpha_{i},\alpha_{j}\}$ is an edge of $G$ . Then, Theorem 6.1 associates to this edge a point $\beta\in B$ and a path $\gamma\colon[0,1]\to\mathbb{C}$ of length at most $2\pi R_{v}N$ , contained in ${\mathcal{K}}$ , joining $\alpha_{i}$ and $\alpha_{j}$ , such that

[TABLE]

This yields

[TABLE]

since $|z-\alpha_{\ell}|\leq R_{v}$ for any $z\in{\mathcal{K}}$ and $\ell=1,\dots,s$ . Finally, if $j>r$ , we have $\alpha_{i}=\gamma_{j}(0)$ for the path $\gamma_{j}$ chosen earlier. By Theorem 5.1, the image of $\gamma_{j}$ is contained in ${\mathcal{K}}$ , of length at most $\pi R_{v}N\leq 2\pi R_{v}N$ . Thus the same computation as above yields

[TABLE]

Since each $\beta_{j}$ is associated to $m_{j}$ edges of $G$ and since $m_{1}+\cdots+m_{p}=r-1$ , we deduce from (8.1) that

[TABLE]

As $n_{k}=1$ for $k>r$ , Corollary 7.2 gives

[TABLE]

For $i=r+1,\dots,s$ , we also find that

[TABLE]

This implies that

[TABLE]

Substituting this upper bound in (8.2), we conclude that $\mu_{v}({\mathcal{C}}_{\mathbf{n},v})^{1/d_{v}}\leq c_{v}N^{2s-2}|\Delta_{\mathbf{n}}|_{v}$ , as in the statement of the theorem. ∎

9. A forest at ultrametric places

Let $v$ be an ultrametric place of $K$ . In this section we use the terminology for graphs explained in section 4 to build a rooted forest on an arbitrary non-empty finite subset of $K_{v}$ . We start with a preliminary construction.

Proposition 9.1.

Let $A$ be a non-empty finite subset of $K_{v}$ and let $\alpha_{0}\in A$ . There exists a tree $G$ rooted in $\alpha_{0}$ having $A$ as its set of vertices, such that, for each $\alpha,\beta,\gamma\in A$ with $\beta\in S_{G}(\alpha)$ , we have

[TABLE]

Proof.

We proceed by induction on the cardinality $|A|$ of $A$ . If $|A|=1$ , there is nothing to prove. Suppose that $|A|\geq 2$ . Let $\rho$ be the largest distance between two elements of $A$ , and let $\{\alpha_{0},\dots,\alpha_{k}\}$ be a maximal subset of $A$ containing $\alpha_{0}$ , whose elements are at mutual distance $|\alpha_{i}-\alpha_{j}|_{v}=\rho$ for $0\leq i<j\leq k$ . Since $v$ is ultrametric, we have $k\geq 1$ and the sets

[TABLE]

form a partition of $A$ . For $i=0,\dots,k$ , we have $\alpha_{i}\in A_{i}$ and $|A_{i}|<|A|$ , thus we may assume the existence of a rooted tree $G_{i}=(\alpha_{i},A_{i},E_{i})$ which fulfils Condition (9.1) for each choice of $\alpha,\beta,\gamma\in A_{i}$ with $\beta\in S_{G_{i}}(\alpha)$ . We set

[TABLE]

Then $G=(\alpha_{0},A,E)$ is a rooted tree. Let $\alpha,\beta,\gamma\in A$ with $\beta\in S_{G}(\alpha)$ , and let $i$ be the index for which $\alpha\in A_{i}$ . If $\beta\in A_{i}$ , then $\beta\in S_{G_{i}}(\alpha)$ and $D_{G}(\beta)=D_{G_{i}}(\beta)$ , thus

[TABLE]

If instead $\beta\in A_{j}$ for some $j\neq i$ , then we must have $i=0$ , $\alpha=\alpha_{0}$ and $\beta=\alpha_{j}$ . Then $|\alpha-\beta|_{v}=\rho$ and $D_{G}(\beta)=A_{j}\setminus\{\alpha_{j}\}$ . So we find

[TABLE]

Thus $G$ has the required property. ∎
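The recursive construction in this proof can be sketched in code. The following is only an illustration under stated assumptions: it replaces $K_{v}$ by $\mathbb{Q}$ with the $p$-adic absolute value, it takes $A_{i}$ to be the set of elements of $A$ at distance less than $\rho$ from $\alpha_{i}$ , and it takes $E$ to be the union of the $E_{i}$ together with the edges $(\alpha_{0},\alpha_{j})$ for $j=1,\dots,k$ . These two formulas are inferred from the argument above, and the names `padic_abs` and `build_tree` are hypothetical.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value |x|_p of a rational number x, as an exact Fraction."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, k = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        k += 1
    while den % p == 0:
        den //= p
        k -= 1
    return Fraction(1, p) ** k

def build_tree(A, root, p):
    """Directed edges (parent, child) of a tree on the distinct points A, rooted at root."""
    A = list(A)
    if len(A) == 1:
        return []
    dist = lambda a, b: padic_abs(a - b, p)
    rho = max(dist(a, b) for a in A for b in A)
    # Greedy choice of a maximal subset containing the root whose elements
    # are at mutual distance rho (this choice is not unique in general).
    centers = [root]
    for a in A:
        if all(dist(a, c) == rho for c in centers):
            centers.append(a)
    edges = []
    for c in centers:
        # Assumed definition of A_i: the points at distance < rho from the center.
        ball = [a for a in A if dist(a, c) < rho]
        edges += build_tree(ball, c, p)
        if c != root:
            edges.append((root, c))
    return edges

# Example with p = 3 on a small set of integers.
A = [0, 1, 3, 9, 10]
edges = build_tree(A, 0, 3)
```

The greedy loop is one possible way to pick $\alpha_{1},\dots,\alpha_{k}$ ; a different order of the elements of $A$ may produce a different tree.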

As the proof shows, the graph $G$ constructed in this way is not unique in general (since the choice $\alpha_{1},\dots,\alpha_{k}\in A$ is not unique). This leads to the following construction which in general is not unique either.

Theorem 9.2.

Let $A$ be a non-empty finite subset of $K_{v}$ , let $\delta>0$ , and let $R$ be a maximal subset of $A$ whose elements are at mutual distance at least $\delta$ . Then, there exists a rooted forest $G$ having $A$ as its set of vertices and $R$ as its set of roots, which satisfies the following properties:

(i)

for any $\beta\in R$ and $\gamma\in A$ , we have

[TABLE]

(ii)

for any $\alpha,\beta,\gamma\in A$ with $\beta\in S_{G}(\alpha)$ , we have

[TABLE]

Proof.

For each $\rho\in R$ , we define

[TABLE]

and we choose a rooted tree $G^{(\rho)}=(\rho,A^{(\rho)},E^{(\rho)})$ as in Proposition 9.1. Since the sets $A^{(\rho)}$ with $\rho\in R$ form a partition of $A$ , the union of these graphs constitutes a rooted forest $G=(R,A,E)$ where $E=\cup_{\rho\in R}E^{(\rho)}$ . By construction, it satisfies Condition (i). To show that Condition (ii) is also fulfilled, fix $\alpha,\beta,\gamma\in A$ with $\beta\in S_{G}(\alpha)$ , and let $\rho\in R$ be such that $\alpha\in A^{(\rho)}$ . Since $\beta\in S_{G}(\alpha)$ , we have $\beta\in A^{(\rho)}$ and $D_{G}(\beta)=D_{G^{(\rho)}}(\beta)$ . Moreover, if $\gamma$ satisfies $|\alpha-\beta|_{v}>|\beta-\gamma|_{v}$ then $|\beta-\gamma|_{v}<\delta$ and so $\gamma\in A^{(\rho)}$ . Thus Condition (ii) for $\alpha,\beta,\gamma$ is satisfied in $G$ since it is satisfied in $G^{(\rho)}$ . ∎
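The first step of this proof, the choice of the roots $R$ and of the partition of $A$ into the sets $A^{(\rho)}$ , can be illustrated as follows. This is a minimal sketch, assuming that $A^{(\rho)}$ consists of the elements of $A$ at distance less than $\delta$ from $\rho$ (as the argument above suggests) and working over $\mathbb{Q}$ with the $p$-adic absolute value in place of $K_{v}$ . The function names are hypothetical.

```python
from fractions import Fraction

def padic_abs(x, p):
    """p-adic absolute value |x|_p of a rational number x."""
    x = Fraction(x)
    if x == 0:
        return Fraction(0)
    num, den, k = x.numerator, x.denominator, 0
    while num % p == 0:
        num //= p
        k += 1
    while den % p == 0:
        den //= p
        k -= 1
    return Fraction(1, p) ** k

def delta_clusters(A, p, delta):
    """Greedy maximal delta-separated subset R of A, with the cluster around each root."""
    roots = []
    for a in A:
        if all(padic_abs(a - r, p) >= delta for r in roots):
            roots.append(a)
    # Assumed definition of A^(rho): points at distance < delta from the root rho.
    clusters = {r: [a for a in A if padic_abs(a - r, p) < delta] for r in roots}
    return roots, clusters

# Example with p = 3 and delta = p^(-1/(p-1)), the value used in Section 10.
A = [0, 1, 3, 9, 10]
roots, clusters = delta_clusters(A, 3, 3 ** (-1 / 2))
```

Since the absolute value is ultrametric, each element of $A$ lands in exactly one cluster, so the clusters form a partition of $A$ .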

In terms of elementary chains, Conditions (i) and (ii) of the theorem can be reformulated as follows: given $\gamma\in A$ , a sequence $(\gamma_{1},\dots,\gamma_{k})$ in $G$ , with $k\geq 1$ and $\gamma_{k}\neq\gamma$ , starting on a root $\gamma_{1}\in R$ , can be extended to an elementary chain $(\gamma_{1},\dots,\gamma_{\ell})$ ending on $\gamma_{\ell}=\gamma$ if and only if either we have $k=1$ and $\delta>|\gamma_{1}-\gamma|_{v}>0$ or the sequence $(\gamma_{1},\dots,\gamma_{k})$ is an elementary chain with $k\geq 2$ and $|\gamma_{k-1}-\gamma_{k}|_{v}>|\gamma_{k}-\gamma|_{v}>0$ .

10. Volume of the ultrametric components

We now complete the proof of Theorem 2.3 by proving the remaining estimates in parts (ii) and (iii). The notation is as in Section 2.

Theorem 10.1.

Let $v$ be a place of $K$ above a prime number $p$ , let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ and let $N=n_{1}+\cdots+n_{s}$ . Then the sub- ${\mathcal{O}}_{v}$ -module ${\mathcal{C}}_{\mathbf{n},v}$ of $K_{v}^{s}$ defined in Section 2.3 satisfies

[TABLE]

Moreover, if $|\alpha_{i}-\alpha_{j}|_{v}=1$ for each $i,j\in\{1,\dots,s\}$ with $i\neq j$ , then we also have

[TABLE]

Proof.

We apply Theorem 9.2 to the set $A=\{\alpha_{1},\dots,\alpha_{s}\}$ with $\delta=p^{-1/(p-1)}$ . It provides a rooted forest $G$ with roots $R$ , vertices $A$ , and edges $E$ . For each $\alpha\in A$ , we define $x_{\alpha}=x_{i}$ and $n_{\alpha}=n_{i}$ where $i$ is the index for which $\alpha=\alpha_{i}$ . Then, ${\mathcal{C}}_{\mathbf{n},v}$ is contained in the set ${\mathcal{K}}_{v}$ of points $(x_{1},\dots,x_{s})\in K_{v}^{s}$ satisfying

[TABLE]

for each root $\beta\in R$ , as well as

[TABLE]

for each directed edge $(\alpha,\beta)\in E$ or equivalently for each pair $\{\alpha,\beta\}$ with $\beta\in S_{G}(\alpha)$ (since we then have $|\beta-\alpha|_{v}<\delta$ ). By Proposition 4.1, the above $s$ linear forms are linearly independent, with determinant $\pm 1$ . So ${\mathcal{K}}_{v}$ is a free sub- ${\mathcal{O}}_{v}$ -module of $K_{v}^{s}$ of rank $s$ with

[TABLE]

where

[TABLE]

Let $\beta,\gamma\in A$ . If $\beta\in R$ , Theorem 9.2 (i) yields

[TABLE]

Otherwise, there exists a unique $\alpha\in A$ such that $\beta\in S_{G}(\alpha)$ and, since

[TABLE]

Theorem 9.2 (ii) yields

[TABLE]

Since $D_{G}(\beta)\cup\{\beta\}$ runs through all connected components of $G$ as $\beta$ runs through $R$ and since we have $\sum_{\gamma\in A}n_{\gamma}=N$ , the equality (10.1) implies that

[TABLE]

Furthermore, the equality (10.2) implies that

[TABLE]

As a result we obtain

[TABLE]

Since $\delta^{N}=\prod_{\beta\in A}\delta^{n_{\beta}}\leq\prod_{\beta\in A}|n_{\beta}!|_{v}\leq\prod_{\beta\in A}|(n_{\beta}-1)!|_{v}$ , we conclude that

[TABLE]

Finally, if $|\alpha_{i}-\alpha_{j}|_{v}=1$ for each $i,j\in\{1,\dots,s\}$ with $i\neq j$ , then ${\mathcal{C}}_{\mathbf{n},v}$ consists of all points $(x_{1},\dots,x_{s})\in K_{v}^{s}$ satisfying

[TABLE]

for $i=1,\dots,s$ , thus

[TABLE]

11. A special case

The adelic convex bodies ${\mathcal{C}}_{\mathbf{n}}$ associated to a point $(\alpha_{1},\dots,\alpha_{s})\in K^{s}$ depend only on the differences $\alpha_{j}-\alpha_{i}$ with $1\leq i<j\leq s$ . So, we may always assume that $\alpha_{1}=0$ . Then for $s=2$ , we simply have a point $(0,\alpha)\in K^{2}$ . The proposition below is an explicit form of Corollary 2.4 for such a point and for diagonal pairs $\mathbf{n}=(n,n)\in\mathbb{N}_{+}^{2}$ . In this statement, the adelic convex body is rescaled so that its $v$ -adic component is contained in ${\mathcal{O}}_{v}^{2}$ for each ultrametric place $v$ of $K$ . We use it afterwards to prove Propositions 1.1 and 1.2 from the introduction. The notation is the same as in Section 2.

Proposition 11.1.

Let $\alpha\in K\setminus\{0\}$ , and let $E$ be the finite set of places $v$ of $K$ with $v\mid\infty$ or $|\alpha|_{v}\neq 1$ . For each place $v$ of $K$ with $v\nmid\infty$ , we set $B_{v}=\min\big\{1,p^{1/(p-1)}|\alpha|_{v}\big\}$ where $p$ is the prime number below $v$ . We also set

[TABLE]

Finally, for each $n\in\mathbb{N}_{+}$ , we denote by $\tilde{{\mathcal{C}}}_{n}$ the adelic convex body of $K^{2}$ whose components $\tilde{{\mathcal{C}}}_{n,v}$ are defined as follows.

(i)

If $v\mid\infty$ , then $\tilde{{\mathcal{C}}}_{n,v}$ is the set of points $(x,y)\in K_{v}^{2}$ such that

[TABLE]

(ii)

If $v\mid p$ for a prime number $p$ and if $|\alpha|_{v}<p^{-1/(p-1)}$ , then $\tilde{{\mathcal{C}}}_{n,v}$ consists of the points $(x,y)\in K_{v}^{2}$ such that

[TABLE]

(iii)

If $v\mid p$ for a prime number $p$ and if $|\alpha|_{v}\geq p^{-1/(p-1)}$ , then $\tilde{{\mathcal{C}}}_{n,v}={\mathcal{O}}_{v}^{2}$ .

Then we have

[TABLE]

for constants $c_{3},c_{4}>0$ that depend only on $\alpha$ and $K$ .

Proof.

Let $n\in\mathbb{N}_{+}$ . We consider the adelic convex body ${\mathcal{C}}_{\mathbf{n}}$ constructed in Section 2.3 for the choice of $\alpha_{1}=0$ , $\alpha_{2}=\alpha$ and $\mathbf{n}=(n,n)$ . For an Archimedean place $v$ of $K$ associated to an embedding $\sigma\colon K\hookrightarrow\mathbb{C}$ and for $k=1,2$ , we find

[TABLE]

Thus the points $(x,y)$ of ${\mathcal{C}}_{\mathbf{n},v}$ satisfy

[TABLE]

This implies that $a_{v}{\mathcal{C}}_{\mathbf{n},v}\subseteq\tilde{{\mathcal{C}}}_{n,v}$ for

[TABLE]

For each prime number $p$ and each place $v$ of $K$ with $v\mid p$ , we also find that $a_{v}{\mathcal{C}}_{\mathbf{n},v}\subseteq\tilde{{\mathcal{C}}}_{n,v}$ for

[TABLE]

where $t_{v}$ is the integer for which

[TABLE]

if $v\in E$ , and $t_{v}=0$ otherwise. This computation is based simply on the fact that $|(n-1)!|_{v}\geq|n!|_{v}\geq p^{-n/(p-1)}$ . Thus we obtain $a\,{\mathcal{C}}_{\mathbf{n}}\subseteq\tilde{{\mathcal{C}}}_{n}$ for the idele $a=(a_{v})_{v}\in K_{\mathbb{A}}^{\times}$ .
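The inequality $|(n-1)!|_{v}\geq|n!|_{v}\geq p^{-n/(p-1)}$ used here is a consequence of Legendre's formula for the exponent of $p$ in $n!$ . The sketch below checks it numerically, assuming the normalization $|p|_{v}=p^{-1}$ so that the inequality reads $(p-1)\,v_{p}(n!)\leq n$ . The function names are hypothetical.

```python
def vp_factorial(n, p):
    """Exponent of the prime p in n!, by Legendre's formula sum_k floor(n / p^k)."""
    total, q = 0, p
    while q <= n:
        total += n // q
        q *= p
    return total

def digit_sum(n, p):
    """Sum of the digits of n written in base p."""
    s = 0
    while n:
        s += n % p
        n //= p
    return s

def check_factorial_bound(p, n_max):
    """Check (p-1) v_p(n!) = n - s_p(n) <= n and v_p((n-1)!) <= v_p(n!) for n <= n_max."""
    for n in range(1, n_max + 1):
        v = vp_factorial(n, p)
        assert (p - 1) * v == n - digit_sum(n, p)
        assert (p - 1) * v <= n
        assert vp_factorial(n - 1, p) <= v
    return True
```

The identity $(p-1)\,v_{p}(n!)=n-s_{p}(n)$ , where $s_{p}(n)$ is the sum of the base $p$ digits of $n$ , gives the bound at once since $s_{p}(n)\geq 0$ .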

The product ${\mathcal{D}}=\prod_{v}\{x\in K_{v}\,;\,|x|_{v}\leq|a_{v}|_{v}\}\subset K_{\mathbb{A}}$ is an adelic convex body of $K$ . By the product formula applied to the principal idele $\alpha^{n}(n-1)!\in K^{\times}$ , we find that the volume of ${\mathcal{D}}$ is

[TABLE]

Since $\prod_{v\mid\infty}B^{d_{v}}=B^{d}=\prod_{v\nmid\infty}B_{v}^{-d_{v}}$ , this can be rewritten as

[TABLE]

with $c_{1}=2^{r_{1}}\pi^{r_{2}}\prod_{v\mid\infty}(4e^{|\alpha|_{v}})^{-d_{v}}$ . Since $p^{t_{v}}B_{v}^{n}=1$ if $v\notin E$ and $p^{t_{v}}B_{v}^{n}<2np^{4}$ if $v\in E$ and $v\mid p$ , this yields

[TABLE]

where $E^{\prime}=\{v\in E\,;\,v\nmid\infty\}$ and $c_{2}=c_{1}\prod_{v\in E^{\prime}}(2p^{4})^{-d_{v}}$ . By Theorem 2.1 (with $s=1$ ), we thus have $\lambda_{1}({\mathcal{D}})\leq c_{3}$ where $c_{3}=(2^{r_{1}+r_{2}}|D(K)|^{1/2}c_{2}^{-1})^{1/d}$ . This means that there exists $\beta\in K^{\times}$ satisfying $|\beta|_{v}\leq c_{3}|a_{v}|_{v}$ for all Archimedean places $v$ of $K$ and $|\beta|_{v}\leq|a_{v}|_{v}$ for all other places. So, we obtain

[TABLE]

which yields

[TABLE]

since $\beta{\mathcal{C}}_{\mathbf{n}}$ contains the $K$ -linearly independent points $\beta\mathbf{a}_{\mathbf{n}-\mathbf{e}_{1}},\beta\mathbf{a}_{\mathbf{n}-\mathbf{e}_{2}}$ of $K^{2}$ . By Theorem 2.1 (with $s=2$ ), this implies that

[TABLE]

Finally, for each place $v$ of $K$ , we find that

[TABLE]

Since $B^{d}\prod_{v\nmid\infty}B_{v}^{d_{v}}=1$ , this implies that $\mu(\tilde{{\mathcal{C}}}_{n})^{1/d}\leq 4n^{2g-1}$ , and so (11.1) follows with $c_{4}=(8c_{3}^{2})^{-1}$ . ∎

Proof of Proposition 1.1.

Under the hypotheses of this proposition, the field $K$ admits a single Archimedean place $\infty$ , induced by the inclusion $K\subset\mathbb{C}$ . Moreover, in the notation of Proposition 11.1, the choice of $\alpha$ leads to $B_{v}=1$ for any other place $v$ of $K$ . Thus, for each $n\in\mathbb{N}_{+}$ , we obtain

[TABLE]

where $\tilde{{\mathcal{C}}}_{n,\infty}$ consists of all points $(x,y)$ of $K_{\infty}^{2}\subseteq\mathbb{C}^{2}$ satisfying

[TABLE]

Moreover, by (11.1), we have $\lambda_{1}(\tilde{{\mathcal{C}}}_{n})\geq c_{4}n^{-2g+1}$ for a constant $c_{4}>0$ depending only on $\alpha$ and $K$ .

Let $(x,y)\in{\mathcal{O}}_{K}^{2}$ with $x\neq 0$ . The above implies that, for each $n\in\mathbb{N}_{+}$ ,

[TABLE]

If $|x|$ is large enough, we can find an integer $n\geq 2$ such that $e^{n}\leq h(n-1)\leq|x|<h(n)$ . Then we have $n\leq\log|x|$ and we obtain

[TABLE]

with $c_{5}=c_{4}^{2}|\alpha|/8$ . Since ${\mathcal{O}}_{K}$ is a discrete subset of $\mathbb{C}$ , this leaves out a finite number of values of $x$ . To include them in the final lower bound, it suffices to replace $c_{5}$ by a sufficiently small constant $c>0$ . ∎

Proof of Proposition 1.2.

We apply Proposition 11.1 with $K=\mathbb{Q}$ and $\alpha=3$ . In this context, we have $g=2$ and $B=B_{3}^{-1}=3^{1/2}$ . For a given $n\in\mathbb{N}_{+}$ , a simple computation shows that the Archimedean component $\tilde{{\mathcal{C}}}_{n,\infty}$ of the adelic convex body $\tilde{{\mathcal{C}}}_{n}$ satisfies

[TABLE]

where ${\mathcal{C}}_{n}$ is the convex body of $\mathbb{R}^{2}$ defined in Proposition 1.2. For its ultrametric components, we find that

[TABLE]

and $\tilde{{\mathcal{C}}}_{n,p}=\mathbb{Z}_{p}^{2}$ for each prime number $p\neq 3$ . Thus the points of $\mathbb{Q}^{2}$ which belong to the latter components are exactly those of the lattice $\Lambda_{n}$ in Proposition 1.2. Therefore, the minima of $\tilde{{\mathcal{C}}}_{n}$ with respect to $\mathbb{Q}^{2}$ in the adelic sense are also the minima of $\tilde{{\mathcal{C}}}_{n,\infty}$ with respect to $\Lambda_{n}$ in the classical sense. In view of the inclusions (11.2), this implies that $c_{4}n^{-2}\leq\lambda_{1}({\mathcal{C}}_{n},\Lambda_{n})\leq\lambda_{2}({\mathcal{C}}_{n},\Lambda_{n})\leq c_{3}n^{2}$ for the constants $c_{3}$ and $c_{4}$ given by Proposition 11.1. ∎
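The value $B=B_{3}^{-1}=3^{1/2}$ used in this proof can be recovered by a short computation. The lines below are a sketch, assuming the usual normalization $|3|_{3}=3^{-1}$ on $\mathbb{Q}$ and using the relation $B^{d}=\prod_{v\nmid\infty}B_{v}^{-d_{v}}$ from the proof of Proposition 11.1 with $d=d_{v}=1$ .

```latex
\begin{align*}
B_{3} &= \min\big\{1,\,3^{1/(3-1)}\,|3|_{3}\big\}
       = \min\big\{1,\,3^{1/2}\cdot 3^{-1}\big\} = 3^{-1/2},\\
B_{p} &= \min\big\{1,\,p^{1/(p-1)}\,|3|_{p}\big\}
       = \min\big\{1,\,p^{1/(p-1)}\big\} = 1 \qquad (p\neq 3),\\
B     &= \prod_{p} B_{p}^{-1} = B_{3}^{-1} = 3^{1/2}.
\end{align*}
```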

12. Numerical computations

The formulas in Appendix A allow us to compute recursively the diagonal Hermite approximations to $(1,e^{3})$ . In this last section, we explain how they can be used to compute efficiently the partial quotients in the continued fraction expansion of $e^{3}\in\mathbb{R}$ , and then to verify the inequalities (1.2) from the introduction. Our reference for continued fractions is [15, Ch. I].

Let $e^{3}=[a_{0},a_{1},a_{2},\dots]$ denote the continued fraction expansion of $e^{3}$ . Its first terms are

[TABLE]

without any noticeable regularity. For each integer $n\geq 0$ , we form the $n$ -th convergent of $e^{3}$

$\frac{p_{n}}{q_{n}}=[a_{0},a_{1},\dots,a_{n}]$

with $p_{n}\in\mathbb{Z}$ , $q_{n}\in\mathbb{N}_{+}$ and $\gcd(p_{n},q_{n})=1$ . The table below lists all integers $n\geq 1$ with $q_{n-1}\leq 10^{500\,000}$ for which

[TABLE]

For each of those integers, it provides the corresponding value of $a_{n}$ as well as the value of $\log(q_{n-1})$ truncated at the first decimal place.

[TABLE]

To show how this implies the estimates (1.2), define $\psi(x)=3\log(x)\log(\log(x))$ for each $x\geq e$ . For each pair $(p,q)\in\mathbb{Z}^{2}$ with $q\geq 1$ , there exists an integer $n\geq 1$ such that $q_{n-1}\leq q<q_{n}$ . By a theorem of Lagrange [15, Chapter I, Theorem 5E], we have

[TABLE]

Assuming $q\geq 3$ , this implies that

[TABLE]

It is easy to check that the right hand side of (12.1) is $\geq 1$ for all entries $n$ of the table with $n\geq 10$ . Thus it is also $\geq 1$ for each integer $n\geq 10$ with $q_{n-1}\leq 10^{500\,000}$ . A quick computation shows that this is also true for $n=2,\dots,9$ . Thus the left hand side of (12.1) is $\geq 1$ if $11\leq q\leq 10^{500\,000}$ . Finally, one checks that this is still true when $4\leq q\leq 10$ .
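For small denominators, the inequality (1.2) can also be checked by brute force. The sketch below does this in double precision floating point for $4\leq q\leq 10^{4}$ ; it is only a sanity check and not the method used for the full range up to $10^{500\,000}$ . It relies on the fact that, for a given $q$ , the nearest integer to $qe^{3}$ minimizes $|qe^{3}-p|$ . The function names are hypothetical.

```python
import math

E3 = math.exp(3)

def psi(x):
    """psi(x) = 3 log(x) log(log(x)), as defined in the text."""
    return 3 * math.log(x) * math.log(math.log(x))

def worst_ratio(q_min, q_max):
    """Minimum over q_min <= q <= q_max of q * |q e^3 - p| * psi(q), p nearest integer."""
    worst = math.inf
    for q in range(q_min, q_max + 1):
        p = round(q * E3)
        worst = min(worst, q * abs(q * E3 - p) * psi(q))
    return worst

# The inequality (1.2) amounts to worst_ratio(4, Q) >= 1.
holds_up_to_10000 = worst_ratio(4, 10_000) >= 1
```

The same quantity computed at $q=3$ is smaller than $1$ , which is consistent with the restriction $|x|\geq 4$ in (1.2).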

To compute the partial quotients $a_{n}$ , put

[TABLE]

for each $n\in\mathbb{N}_{+}$ . By Corollary A.3 in the Appendix, the rows of $(n-1)!A_{n}$ are Hermite’s approximations $\mathbf{a}_{n-1,n}$ and $\mathbf{a}_{n,n-1}$ to $(1,e^{3})$ . Thus we have

[TABLE]

We also note that, for each $n\geq 2$ , the matrices $C_{n}$ and $A_{n}$ belong to the set

[TABLE]

This is clear for the matrices $C_{n}$ . For the matrices $A_{n}$ , this follows from the fact that ${\mathcal{M}}$ is closed under matrix multiplication.

In general, if $A=\begin{pmatrix}t&u\\ t^{\prime}&u^{\prime}\end{pmatrix}\in{\mathcal{M}}$ , the ratios $t/u$ and $t^{\prime}/u^{\prime}$ admit unique continued fraction expansions

[TABLE]

with $a_{0}=a^{\prime}_{0}=0$ , $a_{\ell}\geq 2$ if $\ell\geq 1$ , and $a^{\prime}_{\ell^{\prime}}\geq 2$ if $\ell^{\prime}\geq 1$ . Let $(a_{0},\dots,a_{k})$ be the common initial part of the sequences $(a_{0},\dots,a_{\ell})$ and $(a^{\prime}_{0},\dots,a^{\prime}_{\ell^{\prime}})$ . When $k=0$ , that is when $t=0$ or $t^{\prime}=0$ or $\lfloor u/t\rfloor\neq\lfloor u^{\prime}/t^{\prime}\rfloor$ , we say that $A$ is reduced. Then, we find that

[TABLE]

where $R\in{\mathcal{M}}$ is reduced, with the convention that the right hand side is $R$ when $k=0$ . In particular, for each $n\geq 2$ , we obtain

[TABLE]

for a reduced matrix $R_{n}\in{\mathcal{M}}$ , integers $0\leq k(1)\leq k(2)\leq\cdots$ and positive integers $a_{1},a_{2},\dots$ such that

[TABLE]

with the convention that the product on the right is $R_{n+1}$ when $k(n+1)=k(n)$ . By (12.2), the integers $k(n)$ go to infinity with $n$ and so we conclude that

[TABLE]

are the respective continued fraction expansions of $e^{-3}$ and $e^{3}$ . Therefore, to compute their partial quotients $a_{k}$ , it suffices to compute recursively the matrices $R_{n}$ whose coefficients are in practice much smaller than those of $A_{n}$ (we may also at each step factor out the power of $3$ dividing $R_{n}$ ). To further save computation time we do not compute the integers $q_{n}$ exactly but keep only a floating point approximation of them (in practice we use 10 significant decimal digits). In this way, it takes slightly more than an hour of CPU time to produce the tables using MAPLE software on a 64-bit Intel i5 processor.
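As an independent cross-check of the first partial quotients, one can bracket $e^{3}$ between two rational numbers and keep the common initial part of their continued fraction expansions. The sketch below does not use the matrix method described above: it is a naive baseline, assuming only the Taylor series of $e^{3}$ and the tail bound $\sum_{k>K}3^{k}/k!\leq 2\cdot 3^{K+1}/(K+1)!$ , valid for $K\geq 4$ . The function names are hypothetical.

```python
from fractions import Fraction

def cf_rational(x):
    """Continued fraction expansion of a rational number by the Euclidean algorithm."""
    terms = []
    num, den = x.numerator, x.denominator
    while den:
        q, r = divmod(num, den)
        terms.append(q)
        num, den = den, r
    return terms

def cf_exp3_prefix(K=80):
    """Partial quotients of e^3 that are certified by a rational enclosure lo < e^3 < hi."""
    assert K >= 4  # needed for the tail bound below
    term = Fraction(1)   # term = 3^k / k!
    lo = Fraction(0)
    for k in range(K + 1):
        lo += term
        term = term * 3 / (k + 1)
    hi = lo + 2 * term   # here term = 3^(K+1) / (K+1)!
    prefix = []
    for a, b in zip(cf_rational(lo), cf_rational(hi)):
        if a != b:
            break
        prefix.append(a)
    return prefix

quotients = cf_exp3_prefix()
```

Because $e^{3}$ is irrational and lies strictly between the two bounds, every partial quotient shared by both expansions is also a partial quotient of $e^{3}$ . This approach needs arithmetic on rationals whose size grows with the number of quotients requested, which is the cost that working with the reduced matrices $R_{n}$ avoids.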

Appendix A Recurrence relations

The notation being as in Section 2.2 we extend the definition of $f_{\mathbf{n}}(z)$ , $P_{\mathbf{n}}(z)$ and $a_{\mathbf{n}}$ to any $s$ -tuple $\mathbf{n}\in\mathbb{Z}^{s}$ by setting

[TABLE]

For each $\mathbf{n}\in\mathbb{N}_{+}^{s}$ , we denote by $A_{\mathbf{n}}$ the matrix whose $\ell$ -th row is $\mathbf{a}_{\mathbf{n}-\mathbf{e}_{\ell}}$ for $\ell=1,\dots,s$ . In [9, §§IX-X], Hermite provides a recurrence formula linking $A_{\mathbf{n}+\mathbf{1}}$ to $A_{\mathbf{n}}$ where $\mathbf{1}=(1,\dots,1)$ . Here we give more general recurrence relations based on the same principle. The formula (A.1) below is due to Hermite [9, §IX, p. 230] when $\mathbf{n}\in\mathbb{N}_{+}^{s}$ .

Proposition A.1.

Let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}^{s}$ . We have

[TABLE]

Moreover, if $k,\ell\in\{1,\dots,s\}$ with $n_{k}\geq 1$ , we also have

[TABLE]

Proof.

Leibniz formula for the derivative of a product gives

[TABLE]

Taking the sum of all derivatives on both sides of this equality, we obtain

[TABLE]

and (A.1) follows. The formula (A.2) is trivial if $k=\ell$ . Suppose that $k\neq\ell$ and $n_{k}\geq 1$ so that $\mathbf{n}-\mathbf{e}_{k}\in\mathbb{N}^{s}$ . Then we find

[TABLE]

Taking again the sum of the derivatives, this yields

[TABLE]

and (A.2) follows. ∎

Corollary A.2.

Let $\mathbf{n}=(n_{1},\dots,n_{s})\in\mathbb{N}_{+}^{s}$ and $\ell\in\{1,\dots,s\}$ . Then we have

[TABLE]

where

[TABLE]

Proof.

As the entries of $\mathbf{n}$ are positive, the polynomial $f_{\mathbf{n}}$ vanishes at all points $\alpha_{1},\dots,\alpha_{s}$ and the formulas of Proposition A.1 yield

[TABLE]

When $s=2$ , this provides a quick way of computing the matrices $A_{n,n}$ .

Corollary A.3.

Suppose that $s=2$ , $\alpha_{1}=0$ and $\alpha_{2}=\alpha\in K\setminus\{0\}$ . Then, for each $n\in\mathbb{N}_{+}$ , we have

[TABLE]

where

[TABLE]

Proof.

We find that $P_{0,1}(z)=z+1-\alpha$ and $P_{1,0}(z)=z+1$ , thus $A_{1,1}=C_{1}$ . In general, for an integer $n\geq 1$ , the formulas of the preceding corollary give

[TABLE]

and the conclusion follows by induction on $n$ . ∎
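The polynomials $P_{\mathbf{n}}$ can also be computed directly, which gives a way to test the recurrences on small cases. The sketch below assumes that $f_{\mathbf{n}}(z)=\prod_{i}(z-\alpha_{i})^{n_{i}}$ and that $P_{\mathbf{n}}$ is the sum of all derivatives of $f_{\mathbf{n}}$ ; these definitions belong to Section 2.2 and are inferred here from the values $P_{0,1}(z)=z+1-\alpha$ and $P_{1,0}(z)=z+1$ in the proof above. Polynomials are stored as coefficient lists in increasing degree, and all function names are hypothetical.

```python
import math
from fractions import Fraction

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def poly_deriv(a):
    """Derivative of a polynomial; the derivative of a constant is the empty list."""
    return [i * a[i] for i in range(1, len(a))]

def poly_eval(a, x):
    """Evaluate a polynomial at x by Horner's rule."""
    value = 0
    for c in reversed(a):
        value = value * x + c
    return value

def hermite_P(alphas, ns):
    """Sum of all derivatives of f(z) = prod (z - alpha_i)^(n_i) (assumed definition)."""
    f = [1]
    for alpha, n in zip(alphas, ns):
        for _ in range(n):
            f = poly_mul(f, [-alpha, 1])
    P = [0] * len(f)
    g = f
    while g:
        for i, c in enumerate(g):
            P[i] += c
        g = poly_deriv(g)
    return P

def diagonal_ratio(alpha, n):
    """Exact ratio P(alpha) / P(0) for the point (0, alpha) and the pair (n, n)."""
    P = hermite_P([0, alpha], [n, n])
    return Fraction(poly_eval(P, alpha), poly_eval(P, 0))

approx = float(diagonal_ratio(3, 10))  # should be close to e^3
```

With these definitions, Hermite's identity $\int_{0}^{\alpha}f(z)e^{-z}\,dz=P(0)-e^{-\alpha}P(\alpha)$ explains why the ratio $P(\alpha)/P(0)$ approaches $e^{\alpha}$ as $n$ grows.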

Bibliography

[1] S. Ayarai and S. Dubuc, La formule de Cauchy sur la longueur d’une courbe, Canad. Math. Bull. 40 (1997), 3–9.
[2] A. Baker, On some Diophantine inequalities involving the exponential function, Canad. J. Math. 17 (1965), 616–626.
[3] A. Baker, Transcendental number theory, Cambridge University Press, London-New York, 1975, x+147 pp.
[4] E. Bombieri and J. Vaaler, On Siegel’s Lemma, Invent. Math. 73 (1983), 11–32.
[5] W. D. Brownawell, Some remarks on semi-resultants, Transcendence Theory: Advances and Applications, Chapter 14 (A. Baker and D. W. Masser, eds.), New York, Academic Press, 1977.
[6] P. Bundschuh, Irrationalitätsmaße für $e^{a}$ , $a\neq 0$ rational oder Liouville-Zahl, Math. Ann. 192 (1971), 229–242.
[7] G. V. Chudnovsky, Some analytic methods in the theory of transcendental numbers, Chapter 1 in Math. Surveys Monogr., Vol. 19, Amer. Math. Soc., Providence, R. I., 1984.
[8] O. Forster, Lectures on Riemann surfaces, Graduate texts in Math. Vol. 81, Springer-Verlag, New York, 1981.
