Coloring the Voronoi tessellation of lattices

Mathieu Dutour Sikiri\'c; David Madore; Philippe Moustrou; Frank; Vallentin

arXiv:1907.09751·math.CO·October 12, 2021

Coloring the Voronoi tessellation of lattices

Mathieu Dutour Sikiri\'c, David Madore, Philippe Moustrou, Frank, Vallentin

PDF

TL;DR

This paper introduces the concept of the chromatic number for lattice Voronoi tessellations, computes it for various lattices, and explores its asymptotic behavior and spectral bounds, connecting geometry, algebra, and spectral graph theory.

Contribution

It defines the lattice chromatic number, computes it for key lattices, and establishes spectral bounds, advancing understanding of lattice coloring and its asymptotics.

Findings

01

Chromatic number computed for root lattices, duals, and Leech lattice.

02

Spectral lower bounds for lattice chromatic number established.

03

Asymptotic behavior of lattice chromatic number analyzed in high dimensions.

Abstract

In this paper we define the chromatic number of a lattice: It is the least number of colors one needs to color the interiors of the cells of the Voronoi tessellation of a lattice so that no two cells sharing a facet are of the same color. We compute the chromatic number of the root lattices, their duals, and of the Leech lattice, we consider the chromatic number of lattices of Voronoi's first kind, and we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity. We introduce a spectral lower bound for the chromatic number of lattices in spirit of Hoffman's bound for finite graphs. We compute this bound for the root lattices and relate it to the character theory of the corresponding Lie groups.

Tables3

Table 1. Table 1 . The chromatic number of important lattices, in particular the (irreducible) root lattices and their duals.

lattice	chromatic number
$ℤ^{n}$	$2$	Sec. 1, Thm. 2.5, Sec. 3
$𝖠_{n}$	$n + 1$	Thm. 3.6, Thm. 5.3
$𝖠_{n}^{*}$	$n + 1$	Thm. 3.6
$𝖣_{n}$	$χ (\frac{1}{2} H_{n})$	Thm. 6.1
$𝖣_{n}^{*}$	$4$	Sec. 1
$𝖤_{6}$	$9$	Sec. 5.2.4, Thm. 6.3, Thm. B.2
$𝖤_{6}^{*}$	$16$	Thm. 6.4
$𝖤_{7}$	$14$	Sec. 5.2.3, Thm. 6.3, Thm. B.4
$𝖤_{7}^{*}$	$16$	Thm. 6.4
$𝖤_{8}$	$16$	Sec. 4.1, Sec. 5.2.3, Thm. 6.3, Thm. B.3
$Λ_{24}$	$4096$	Sec. 4.1

Table 2. Table 2 . The chromatic number of 3 3 3 -dimensional lattices

Voronoi
lattice	$ℤ^{3}$	$𝖠_{2} ⟂ ℤ$	$𝖠_{3}$	$ℤ (\begin{matrix} 2 \\ 0 \\ 0 \end{matrix}) \oplus ℤ (\begin{matrix} 0 \\ 2 \\ 0 \end{matrix}) \oplus ℤ (\begin{matrix} - 1 \\ - 1 \\ 2 \end{matrix})$	$𝖠_{3}^{*}$

cell
	cube	hexagonal	rhombic	elongated	truncated
	cube	prism	dodecahedron	dodecahedron	octahedron

Delaunay
graph
chromatic	$2$	$3$	$4$	$4$	$4$
number	$2$	$3$	$4$	$4$	$4$

Table 3. Table 3 . The spectral lower bounds on the chromatic number for the irreducible root lattices.

lattice	spectral lower bound
$𝖠_{n}$	$n + 1$	Thm. 5.3
$𝖣_{n}$	$n$ , when $n$ even	Thm. 5.4
	$n + 1$ , when $n$ odd	Thm. 5.4
$𝖤_{6}$	$9$	Thm. B.2, Sec. 5.2.4
$𝖤_{7}$	$10$	Thm. B.4, Sec. 5.2.3
$𝖤_{8}$	$16$	Thm. B.3, Sec. 5.2.3

Equations236

V (Λ) = {x \in R^{n} : ∥ x ∥ \leq ∥ x - v ∥ for all v \in Λ} .

V (Λ) = {x \in R^{n} : ∥ x ∥ \leq ∥ x - v ∥ for all v \in Λ} .

χ (Λ) = χ (Cayley (Λ, Vor (Λ))) .

χ (Λ) = χ (Cayley (Λ, Vor (Λ))) .

\frac{1}{2} H_{n} = conv {x \in {0, 1}^{n} : k = 1 \sum n x_{k} is even},

\frac{1}{2} H_{n} = conv {x \in {0, 1}^{n} : k = 1 \sum n x_{k} is even},

χ (G) \geq 1 - \frac{M ( A )}{m ( A )} .

χ (G) \geq 1 - \frac{M ( A )}{m ( A )} .

χ (Λ) \geq 1 - x \in R^{n} in f \frac{1}{∣ Vor ( Λ ) ∣} u \in Vor (Λ) \sum e^{2 π i u \cdot x}^{- 1},

χ (Λ) \geq 1 - x \in R^{n} in f \frac{1}{∣ Vor ( Λ ) ∣} u \in Vor (Λ) \sum e^{2 π i u \cdot x}^{- 1},

χ (Λ_{n}) \geq 2 \cdot 2^{(0.0990 \dots - o (1)) n} .

χ (Λ_{n}) \geq 2 \cdot 2^{(0.0990 \dots - o (1)) n} .

Λ^{*} = {x \in R^{n} : x \cdot y \in Z for all y \in Λ},

Λ^{*} = {x \in R^{n} : x \cdot y \in Z for all y \in Λ},

V (Λ) = {x \in R^{n} : ∥ x ∥ \leq ∥ x - v ∥ for all v \in Λ} .

V (Λ) = {x \in R^{n} : ∥ x ∥ \leq ∥ x - v ∥ for all v \in Λ} .

Vor (Λ) = {u \in Λ ∖ {0} : \pm u only shortest vectors in u + 2Λ} .

Vor (Λ) = {u \in Λ ∖ {0} : \pm u only shortest vectors in u + 2Λ} .

χ (Λ) = χ (Cayley (Λ, Vor (Λ))) .

χ (Λ) = χ (Cayley (Λ, Vor (Λ))) .

V = Λ/ Λ^{'} and E = {{v + Λ^{'}, w + Λ^{'}} : v - w + u \in Vor (Λ) for some u \in Λ^{'}} .

V = Λ/ Λ^{'} and E = {{v + Λ^{'}, w + Λ^{'}} : v - w + u \in Vor (Λ) for some u \in Λ^{'}} .

χ (Λ) \leq χ (G) \leq Δ (G) + 1,

χ (Λ) \leq χ (G) \leq Δ (G) + 1,

A_{n} = {x \in Z^{n + 1} : k = 0 \sum n x_{k} = 0} .

A_{n} = {x \in Z^{n + 1} : k = 0 \sum n x_{k} = 0} .

D_{n} = {x \in Z^{n} : k = 1 \sum n x_{k} is even}

D_{n} = {x \in Z^{n} : k = 1 \sum n x_{k} is even}

D_{n}^{*} = Z^{n} \cup ((1/2, \dots, 1/2) + Z^{n}) .

D_{n}^{*} = Z^{n} \cup ((1/2, \dots, 1/2) + Z^{n}) .

E_{8} = D_{8} \cup ((1/2, \dots, 1/2) + D_{8}) .

E_{8} = D_{8} \cup ((1/2, \dots, 1/2) + D_{8}) .

E_{7} = {(x_{1}, \dots, x_{8}) \in E_{8} : x_{7} = x_{8}}

E_{7} = {(x_{1}, \dots, x_{8}) \in E_{8} : x_{7} = x_{8}}

E_{6} = {(x_{1}, \dots, x_{8}) \in E_{8} : x_{6} = x_{7} = x_{8}} .

E_{6} = {(x_{1}, \dots, x_{8}) \in E_{8} : x_{6} = x_{7} = x_{8}} .

u = \frac{1}{4} (1, 1, 1, 1, 1, 1, - 3, - 3) and v = \frac{1}{3} (0, - 2, - 2, 1, 1, 1, 1, 0),

u = \frac{1}{4} (1, 1, 1, 1, 1, 1, - 3, - 3) and v = \frac{1}{3} (0, - 2, - 2, 1, 1, 1, 1, 0),

E_{7}^{*} = E_{7} \cup (u + E_{7}) and E_{6}^{*} = E_{6} \cup (v + E_{6}) \cup (- v + E_{6}) .

E_{7}^{*} = E_{7} \cup (u + E_{7}) and E_{6}^{*} = E_{6} \cup (v + E_{6}) \cup (- v + E_{6}) .

Λ = Λ_{1} ⊥ Λ_{2} ⊥ \dots ⊥ Λ_{m}, with m \in N,

Λ = Λ_{1} ⊥ Λ_{2} ⊥ \dots ⊥ Λ_{m}, with m \in N,

Vor (Λ) = i = 1 ⋃ m Vor (Λ_{i}) .

Vor (Λ) = i = 1 ⋃ m Vor (Λ_{i}) .

∥ w ∥^{2} = ∥ v_{1} + 2 u_{1} ∥^{2} + ∥2 u_{2} ∥^{2} \geq ∥ v_{1} + 2 u_{1} ∥^{2} \geq ∥ v_{1} ∥^{2}

∥ w ∥^{2} = ∥ v_{1} + 2 u_{1} ∥^{2} + ∥2 u_{2} ∥^{2} \geq ∥ v_{1} + 2 u_{1} ∥^{2} \geq ∥ v_{1} ∥^{2}

Λ = Λ_{1} ⊥ Λ_{2} ⊥ \dots ⊥ Λ_{m}, with m \in N .

Λ = Λ_{1} ⊥ Λ_{2} ⊥ \dots ⊥ Λ_{m}, with m \in N .

χ (Λ) = i \in {1, \dots, m} max χ (Λ_{i}) .

χ (Λ) = i \in {1, \dots, m} max χ (Λ_{i}) .

\begin{array}[]{ccccc}c&:&\Lambda&\to&\mathbb{Z}/k\mathbb{Z}\\ &&v_{1}+v_{2}&\mapsto&c_{1}(v_{1})+c_{2}(v_{2})\bmod k\\ \end{array}

\begin{array}[]{ccccc}c&:&\Lambda&\to&\mathbb{Z}/k\mathbb{Z}\\ &&v_{1}+v_{2}&\mapsto&c_{1}(v_{1})+c_{2}(v_{2})\bmod k\\ \end{array}

c (v) = c_{1} (u_{1} + w_{1}) + c_{2} (u_{2}) \neq = c_{1} (u_{1}) + c_{2} (u_{2}) = c (u) mod k .

c (v) = c_{1} (u_{1} + w_{1}) + c_{2} (u_{2}) \neq = c_{1} (u_{1}) + c_{2} (u_{2}) = c (u) mod k .

v_{0} = (- \frac{n}{n + 1}, \frac{1}{n + 1}, \dots, \frac{1}{n + 1}), \dots, v_{n} = (\frac{1}{n + 1}, \dots, \frac{1}{n + 1}, - \frac{n}{n + 1}) .

v_{0} = (- \frac{n}{n + 1}, \frac{1}{n + 1}, \dots, \frac{1}{n + 1}), \dots, v_{n} = (\frac{1}{n + 1}, \dots, \frac{1}{n + 1}, - \frac{n}{n + 1}) .

v_{i} \cdot v_{j} = - 1 for every 0 \leq i < j \leq n .

v_{i} \cdot v_{j} = - 1 for every 0 \leq i < j \leq n .

Vor (Λ) \subseteq {v_{I} = i \in I \sum v_{i} : I \subseteq {0, \dots, n}, I \neq = \emptyset, I \neq = {0, \dots, n}} .

Vor (Λ) \subseteq {v_{I} = i \in I \sum v_{i} : I \subseteq {0, \dots, n}, I \neq = \emptyset, I \neq = {0, \dots, n}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Coloring the Voronoi tessellation of lattices

Mathieu Dutour Sikirić

M. Dutour Sikirić, Rudjer Bosković Institute, Bijenička 54, 10000 Zagreb, Croatia

[email protected]

,

David A. Madore

D. Madore, Télécom Paris / LTCI, Département INFRES, 19 place Marguerite Perey, 91120 Palaiseau, France

[email protected]

,

Philippe Moustrou

P. Moustrou, Department of Mathematics and Statistics, UiT The Arctic University of Norway, N-9037 Tromsø, Norway

[email protected]

and

Frank Vallentin

F. Vallentin, Department Mathematik/Informatik, Abteilung Mathematik, Universität zu Köln, Weyertal 86–90, 50931 Köln, Germany

[email protected]

(Date: March 22, 2021)

Abstract.

In this paper we define the chromatic number of a lattice: It is the least number of colors one needs to color the interiors of the cells of the Voronoi tessellation of a lattice so that no two cells sharing a facet are of the same color.

We compute the chromatic number of the root lattices, their duals, and of the Leech lattice, we consider the chromatic number of lattices of Voronoi’s first kind, and we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity.

We introduce a spectral lower bound for the chromatic number of lattices in spirit of Hoffman’s bound for finite graphs. We compute this bound for the root lattices and relate it to the character theory of the corresponding Lie groups.

Key words and phrases:

lattice, Voronoi cell, chromatic number

1991 Mathematics Subject Classification:

05C15, 52C07

1 Introduction
1.1 Determination of the chromatic number
1.2 Generic and extremal behaviour of the chromatic number
2 Background and first observations
2.1 Lattices, Voronoi cells, Voronoi vectors
2.2 Simple upper bounds for the chromatic number
2.3 Simple lower bounds for the chromatic number
2.4 Root lattices and their duals
2.5 The chromatic number of an orthogonal sum of lattices
3 On the chromatic number of lattices of Voronoi’s first kind
3.1 Definitions and first examples
3.2 Interpretation in terms of graphs and more general results
3.3 Application: the chromatic number of $3$ -dimensional lattices
4 Sphere packing lower bounds
4.1 First application: Chromatic number of $\mathsf{E}_{8}$ and of the Leech lattice
4.2 Second application: Exponential growth of the chromatic number
5 Spectral lower bounds
5.1 Setup
5.2 Computing the spectral bound for irreducible root lattices
5.2.1 $\mathsf{A}_{n}$
5.2.2 $\mathsf{D}_{n}$
5.2.3 $\mathsf{E}_{7}$ , $\mathsf{E}_{8}$ and sums of squares
5.2.4 $\mathsf{E}_{6}$
6 The chromatic number of irreducible root lattices and their duals
6.1 $\mathsf{D}_{n}$ and its dual
6.2 $\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}$ and their duals
A Recollections on compact Lie groups
B Serre’s result on characters of compact Lie groups

1. Introduction

Let $\Lambda\subseteq\mathbb{R}^{n}$ be an $n$ -dimensional lattice in $n$ -dimensional Euclidean space. One can tessellate space by lattice translates of the lattice’ Voronoi cell, which is defined as

[TABLE]

By $V(\Lambda)^{\circ}$ we denote the topological interior of $V(\Lambda)$ . Now we consider translates $v+V(\Lambda)^{\circ}$ , with $v\in L$ , as colored tiles of an $n$ -dimensional mosaic in which one has infinitesimal small interstices between the mosaic tiles. How many colors does one need at least to get a colorful mosaic? In a colorful mosaic two neighboring tiles receive different colors. This defines the chromatic number $\chi(\Lambda)$ of the lattice.

More formally, we can also define the chromatic number of a lattice in graph theoretical terms: Two distinct lattice translates of Voronoi cells $v+V(\Lambda)$ and $w+V(\Lambda)$ , with $v\neq w$ , are defining neighboring tiles whenever they share a facet, i.e. their intersection is a polytope of maximal dimension $n-1$ . The differences $v-w$ are called strict Voronoi vectors and the set of these vectors is denoted by $\operatorname{Vor}(\Lambda)$ . Now the chromatic number of $\Lambda$ equals the chromatic number of the Cayley graph on the additive group $\Lambda$ with generating set $\operatorname{Vor}(\Lambda)$ :

[TABLE]

Here, the set of vertices of the Cayley graph are all elements of $\Lambda$ and two vertices $v,w$ are adjacent whenever the difference $v-w$ lies in the set of strict Voronoi vectors $\operatorname{Vor}(\Lambda)$ . Note that the Cayley graph is an $r$ -regular infinite graph with $r=|\operatorname{Vor}(\Lambda)|$ .

In Section 2.1 we recall all the definitions and properties of lattices, their Voronoi cells, and the Voronoi vectors, which we need later.

The chromatic number of a lattice seems to be a natural parameter. However, to the best of the authors’ knowledge, see [40] and [25], $\chi(\Lambda)$ has not been considered before. The aim of this paper is to start a systematic investigation of it. Then, the following questions immediately come to mind:

1.1. Determination of the chromatic number

What is the chromatic number of some interesting lattices? How to find lower and upper bounds? Is there an algorithm to determine $\chi(\Lambda)$ for a given lattice $\Lambda$ ?

For instance, it is obvious that the chromatic number of the integer lattice $\mathbb{Z}^{n}$ is two, an optimal coloring is given by the black/white checkerboard pattern; see also Theorem 2.5.

We discuss simple lower and upper bounds for the chromatic number of a general lattice in Section 2.2 and in Section 2.3. For instance, we show that $\chi(\Lambda)$ is at most $2^{n}$ .

All two- and three-dimensional lattices are of Voronoi’s first kind. We consider the chromatic number of this class of lattices in Section 3 where we compute the chromatic number of all three-dimensional lattices. It would be interesting to have a better understanding of the chromatic number of this class of lattices.

One of the most important classes of lattices are the root lattices. We recall the definitions and classification in Section 2.4. One main result of our paper is the determination of the chromatic number of all root lattices and their duals. Table 1 summarizes our results.

Note that we currently do not know the numerical value of $\chi(\mathsf{D}_{n})$ . We only know it is equal to the chromatic number of the (finite) vertex-edge-graph of the half-cube polytope

[TABLE]

which at the moment is only known up to dimension $n=9$ : For $n=4$ , $5$ , $6$ , $7$ , $8$ and $9$ we have $\chi(\frac{1}{2}H_{n})=4$ , $8$ , $8$ , $8$ , $8$ and $13$ (see [36]).

For the proof we use a generalization of a lower bound for the chromatic number of finite graphs originally due to Hoffman [28]. Hoffman’s bound is based on spectral considerations: Let $A\in\mathbb{R}^{V\times V}$ be the adjacency matrix of a finite graph $G=(V,E)$ . Let $m(A)$ be the smallest eigenvalue of $A$ and respectively let $M(A)$ be the largest eigenvalue of $A$ , then

[TABLE]

Bachoc, DeCorte, Oliveira, and Vallentin [2] showed how to generalize the spectral bound (and its weighted variant due to Lovász [38]) from finite to infinite graphs. In Section 5.1 we review this generalization and specialize it to $\chi(\Lambda)$ . Here, classical Fourier analysis is used. We show in Corollary 5.2 that

[TABLE]

holds.

In Section 5 we compute this bound for all irreducible root lattices. Surprisingly, the result of this computation can already be found in an Oberwolfach report by Serre [50] albeit in a different language and with a different motivation. In his report Serre computed all critical values of the characters of the adjoint representation of compact Lie groups. However, the report does not contain proofs. In Section 5.2 we provide proofs for the easy cases $\mathsf{A}_{n}$ and $\mathsf{D}_{n}$ . The cases $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ , and $\mathsf{E}_{8}$ are much harder and we give Serre’s proof in Appendix B after recalling relevant facts about compact Lie groups in Appendix A. We sketch an alternative, computational proof, which is based on optimization, in particular using sum of squares for the cases $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ at the end of Section 5.2. The case $\mathsf{E}_{6}$ is easier and does not require computer assistance.

Then, in Section 6, we construct several efficient colorings of irreducible root lattices.

It would be nice to know the chromatic number of more important lattices. Following the book [13] by Conway and Sloane the next candidates one should consider are the $12$ -dimensional Coxeter-Todd lattice $\mathsf{K}_{12}$ and the $16$ -dimensional Barnes-Wall lattice $\mathsf{BW}_{16}$ . We expect that the spectral lower bound gives a close approximation to the chromatic number.

We show in Section 4.1 that the chromatic number of the Leech lattice $\Lambda_{24}$ in $24$ dimensions is $4096$ . This is a consequence of the sphere packing optimality of $\Lambda_{24}$ . It would be nice to have an independent (spectral) proof of this fact.

Going back to general lattices: At the moment we do not know whether there is a finite algorithm to compute the chromatic number of a lattice which is given for example by a basis. Determining the strict Voronoi vectors—and thus the Cayley graph $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ and the Voronoi cell $V(\Lambda)$ —is possible by a finite algorithm, see for example [20], although it can occupy exponential space (and therefore needs exponential time). The theorem of de Bruijn and Erdős [9] implies that the chromatic number of $\Lambda$ is equal to the largest chromatic number of all finite subgraphs of $\Lambda$ . This shows that the decision problem: “Is $\chi(\Lambda)\leq k$ ?” is at least semidecidable.

1.2. Generic and extremal behaviour of the chromatic number

What is $\chi(\Lambda)$ of a random $n$ -dimensional lattice? How fast can $\chi(\Lambda)$ grow depending on the dimension $n$ ?

In Section 4.2 we prove that the chromatic number of a generic $n$ -dimensional lattice grows exponentially with the dimension. There we show that there are $n$ -dimensional lattices $\Lambda_{n}$ with

[TABLE]

It would be very interesting to understand the extremal behaviour.

2. Background and first observations

2.1. Lattices, Voronoi cells, Voronoi vectors

A lattice $\Lambda$ is a discrete free $\mathbb{Z}$ -module in an $n$ -dimensional Euclidean space. If its rank is strictly lower than $n$ , then $\Lambda$ also defines a lattice in its linear span over $\mathbb{R}$ . We implicitly identify these two lattices, and assume for the following definitions that $\Lambda$ is a full-rank lattice in $\mathbb{R}^{n}$ . We denote by $\Lambda^{*}$ the dual lattice of $\Lambda$ :

[TABLE]

where $x\cdot y$ denotes the standard Euclidean scalar product between $x$ and $y$ . A fundamental region of $\Lambda$ is a region $\mathcal{R}\subset\mathbb{R}^{n}$ such that for any $u\neq v\in\Lambda$ , the volume of $(u+\mathcal{R})\cap(v+\mathcal{R})$ is [math], and $\mathbb{R}^{n}=\bigcup_{v\in\Lambda}(v+\mathcal{R})$ . The volume $\operatorname{vol}(\mathbb{R}^{n}/\Lambda)$ of $\Lambda$ is defined as the volume of any of its fundamental region. A fundamental region of particular interest is the Voronoi cell of $\Lambda$ :

[TABLE]

A vector $u\in\Lambda\setminus\{0\}$ is called a strict Voronoi vector, or sometimes a “relevant” vector, if the intersection $(u+V(\Lambda))\cap V(\Lambda)$ is a facet, a face of dimension $n-1$ , of $V(\Lambda)$ . By a well-known characterization of Voronoi (see for example [13, Chapter 21, Theorem 10] or [15]) the set of these vectors is

[TABLE]

Now the chromatic number of $\Lambda$ equals the chromatic number of the Cayley graph on the additive group $\Lambda$ with generating set $\operatorname{Vor}(\Lambda)$ :

[TABLE]

Here, the set of vertices of the Cayley graph are all elements of $\Lambda$ and two vertices $v,w$ are adjacent whenever the difference $v-w$ lies in the set of strict Voronoi vectors $\operatorname{Vor}(\Lambda)$ .

2.2. Simple upper bounds for the chromatic number

One can color a lattice $\Lambda$ periodically by using translates of one of its sublattices $\Lambda^{\prime}$ which does not contain Voronoi vectors. More precisely it is enough to color the vertices of the graph $G=(V,E)$ with

[TABLE]

This immediately gives the following upper bound on $\chi(\Lambda)$ :

Lemma 2.1.

Let $\Lambda^{\prime}\subset\Lambda$ be a sublattice of $\Lambda$ with $\Lambda^{\prime}\cap\operatorname{Vor}(\Lambda)=\emptyset$ . Then, $\chi(\Lambda)$ is at most $|\Lambda/\Lambda^{\prime}|$ .

Sometimes, we can improve this bound by coloring the vertices of the graph $G=(V,E)$ greedily. This shows, see for example [6, Chapter V.1], that

[TABLE]

where $\Delta(G)$ is the largest degree of a vertex in $G$ .

Now we take $\Lambda^{\prime}=2\Lambda$ . Lemma 2.1 implies that $\chi(\Lambda)\leq 2^{n}$ . If the number of Voronoi vectors is not maximal, if $|\operatorname{Vor}(\Lambda)|<2(2^{n}-1)$ , then we can improve this bound by using (2):

Lemma 2.2.

The chromatic number of $\Lambda$ is at most $|\operatorname{Vor}(\Lambda)|/2+1$ .

For generic lattices, the number of Voronoi vectors is $2(2^{n}-1)$ , so that Lemma 2.2 also gives an upper bound of $2^{n}$ for the chromatic number of $\Lambda$ .

2.3. Simple lower bounds for the chromatic number

For a general graph $G$ one has $\chi(G)\geq\chi(H)$ for every induced subgraph $H$ of $G$ . In particular, when choosing $H$ to be a largest complete subgraph of $G$ , we have $\chi(G)\geq\omega(G)$ , where $\omega(G)$ is the clique number of $G$ .

Canonical finite induced subgraphs of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ are the vertex-edge graphs of Delaunay polytopes of $\Lambda$ . A Delaunay polytope of the lattice $\Lambda$ is defined as follows: Let $x$ be a vertex of the Voronoi cell $V(\Lambda)$ . Consider all vectors $v_{1},\ldots,v_{m}\in\Lambda$ so that $x$ is contained in all the translates $v_{1}+V(\Lambda),\ldots,v_{m}+V(\Lambda)$ . Then, the convex hull $P=\operatorname{conv}\{v_{1},\ldots,v_{m}\}$ of $v_{1},\ldots,v_{m}$ is a Delaunay polytope of $\Lambda$ . Clearly, all edges of $P$ lie in $\operatorname{Vor}(\Lambda)$ .

Lemma 2.3.

The chromatic number of a lattice $\Lambda$ is at least the chromatic number of the vertex-edge graph of any Delaunay polytope of $\Lambda$ .

2.4. Root lattices and their duals

One of the most important classes of lattices are the root lattices. Assume that $\Lambda\subseteq\mathbb{R}^{n}$ is an even lattice, i.e. we have $v\cdot v\in 2\mathbb{Z}$ for all $v\in\Lambda$ . Lattice vectors $v\in\Lambda$ with $v\cdot v=2$ are called root vectors, or simply roots. A root lattice $\Lambda\subseteq\mathbb{R}^{n}$ is an even lattice which is spanned by roots. Root lattices have been classified by Witt in 1941, see for example [22, Section 1.4], and they are orthogonal direct sums of the irreducible root lattices $\mathsf{A}_{n}$ , $\mathsf{D}_{n}$ , $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ , and $\mathsf{E}_{8}$ . The strict Voronoi vectors of root lattices are precisely the root vectors [13, Chapter 21, §3.A] (and in fact, this condition that only the shortest nonzero vectors are relevant characterizes root lattices, see [47]). Also, the combinatorial description of the Voronoi cells of root lattices is well known: it is described in more detail in [41]. Here we recall the definitions of the irreducible root lattices and their duals.

Living inside the hyperplane $\Pi:=\{x\in\mathbb{R}^{n+1}:\sum_{k=0}^{n}x_{k}=0\}$ (the coordinates being here numbered [math] through $n$ ), the irreducible root lattice $\mathsf{A}_{n}$ is defined as

[TABLE]

The dual lattice $\mathsf{A}_{n}^{*}$ naturally lives in the vector space $\Pi^{*}$ dual to $\Pi$ , which can be identified with the quotient of $\mathbb{R}^{n+1}$ by the diagonal line $\{(t,\ldots,t):t\in\mathbb{R}\}$ . But we can identify $\Pi^{*}$ with $\Pi$ itself by choosing the representatives $(x_{0},\ldots,x_{n})$ of $\mathbb{R}^{n+1}$ modulo the diagonal which belong to $\Pi$ .

For every $n\geq 4$ , the irreducible root lattice $\mathsf{D}_{n}$ is defined as

[TABLE]

and its dual lattice $\mathsf{D}_{n}^{*}$ equals

[TABLE]

The root lattice $\mathsf{E}_{8}$ can be constructed as the union of two translates of the lattice $\mathsf{D}_{8}$ :

[TABLE]

The lattice $\mathsf{E}_{8}$ is unimodular, i.e. $\mathsf{E}_{8}^{*}=\mathsf{E}_{8}$ . The lattice $\mathsf{E}_{7}$ (resp. $\mathsf{E}_{6}$ ) can be defined as a $7$ -dimensional (resp. $6$ -dimensional) sublattice of $\mathsf{E}_{8}$ :

[TABLE]

and

[TABLE]

Then, if we define

[TABLE]

the dual lattices of $\mathsf{E}_{7}$ and $\mathsf{E}_{6}$ are

[TABLE]

2.5. The chromatic number of an orthogonal sum of lattices

Eichler [23] showed that one can decompose every lattice as a pairwise orthogonal sum of indecomposable lattices and that this decomposition is unique up to permutation of the summand; Kneser [34] gave a constructive and much simpler proof of Eichler’s result.

We prove that the chromatic number of a lattice is the maximum of the chromatic numbers of its orthogonal summands.

This reduces in particular the study of the chromatic number of root lattices to the irreducible root lattices $\mathsf{A}_{n}$ , $\mathsf{D}_{n}$ , $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ , and $\mathsf{E}_{8}$ .

Lemma 2.4.

Let $\Lambda\subseteq\mathbb{R}^{n}$ be a lattice which can be written as the orthogonal direct sum of lattices $\Lambda_{1},\ldots,\Lambda_{m}\subseteq\mathbb{R}^{n}$ :

[TABLE]

so that every lattice vector $v\in\Lambda$ can be uniquely decomposed as $v=v_{1}+\cdots+v_{m}$ with $v_{i}\in\Lambda_{i}$ and $v_{i}$ is orthogonal to $v_{j}$ whenever $i\neq j$ . Then,

[TABLE]

Proof.

By induction, we may assume that $\Lambda=\Lambda_{1}\perp\Lambda_{2}$ .

Every $v\in\operatorname{Vor}(\Lambda)$ we can write as $v=v_{1}+v_{2}$ with $v_{i}\in V_{i}$ . If both $v_{1}$ and $v_{2}$ are non zero, then $w=v_{1}-v_{2}$ is different from $\pm v$ . It lies in $v+2\Lambda$ and satisfies $\|w\|=\|v\|$ , yielding a contradiction. So $v=v_{i}$ for $i\in\{1,2\}$ . In particular, $\pm v_{i}$ must be the only minimal vectors in $v_{i}+2\Lambda_{i}$ , so that $v\in\operatorname{Vor}(\Lambda_{i})$ .

Conversely, let for instance $v_{1}\in\operatorname{Vor}(\Lambda_{1})$ , and let $w\in v_{1}+2\Lambda$ . Let us write $w=v_{1}+2(u_{1}+u_{2})=(v_{1}+2u_{1})+2u_{2}$ for $u_{i}\in\Lambda_{i}$ . Since $v_{1}\in\operatorname{Vor}(\Lambda_{1})$ , we have $\|v_{1}+2u_{1}\|\geq\|v_{1}\|$ , with equality if and only if $v_{1}+2u_{1}=\pm v_{1}$ . Thus,

[TABLE]

with equality if and only if $u_{2}=0$ and $v_{1}+2u_{1}=\pm v_{1}$ , namely $w=\pm v_{1}$ . So $v_{1}\in\operatorname{Vor}(\Lambda)$ . ∎

Theorem 2.5.

Let $\Lambda$ be a lattice such that

[TABLE]

Then,

[TABLE]

Proof.

We again assume $\Lambda=\Lambda_{1}\perp\Lambda_{2}$ .

By Lemma 2.4, $\operatorname{Vor}(\Lambda)=\operatorname{Vor}(\Lambda_{1})\cup\operatorname{Vor}(\Lambda_{2})$ , and so $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda_{i}))$ is a subgraph of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ . Hence, $\chi(\Lambda)\geq\max\{\chi(\Lambda_{1}),\chi(\Lambda_{2})\}$ .

Conversely, let $k=\max\{\chi(\Lambda_{1}),\chi(\Lambda_{2})\}$ . By definition, for $i\in\{1,2\}$ , there is a proper coloring $c_{i}:\Lambda_{i}\to\mathbb{Z}/k\mathbb{Z}$ such that if $v_{i}-v^{\prime}_{i}\in\operatorname{Vor}(\Lambda_{i})$ , then $c_{i}(v_{i})\neq c_{i}(v^{\prime}_{i})$ . We shall show that

[TABLE]

is a proper coloring of $\Lambda$ . For this let $u,v\in\Lambda$ such that $v=u+w$ with $w\in\operatorname{Vor}(\Lambda)$ . Following Lemma 2.4, $w\in\operatorname{Vor}(\Lambda_{1})\cup\operatorname{Vor}(\Lambda_{2})$ . Assume for instance that $w=w_{1}\in\operatorname{Vor}(\Lambda_{1})$ . Then, we write $u=u_{1}+u_{2}$ with $u_{i}\in\Lambda_{i}$ , and

[TABLE]

So $c$ is a proper coloring of $\Lambda$ and $\chi(\Lambda)\leq k$ . ∎

3. On the chromatic number of lattices of Voronoi’s first kind

In this section we give lower and upper bounds for the chromatic number of lattices of Voronoi’s first kind. Lattices of Voronoi’s first kind form a nice class of lattices: All lattices in dimensions $2$ and $3$ belong to this class as well as $\mathsf{A}_{n}$ and $\mathsf{A}_{n}^{*}$ . Our lower and upper bounds coincide for all these cases. For dimension $4$ and greater the bounds can differ. We like to pose the question of computing the chromatic number of a lattice of Voronoi’s first kind as an open problem.

3.1. Definitions and first examples

Lattices of Voronoi’s first kind are treated in detail for example in [14]. Here we start by collecting some facts about them.

Definition 3.1.

A lattice $\Lambda$ is called a lattice of Voronoi’s first kind if it admits an obtuse superbasis: There exist lattice vectors $v_{0},v_{1},\ldots,v_{n}$ such that:

(1)

The set $\{v_{1},\ldots,v_{n}\}$ forms a basis of $\Lambda$ , 2. (2)

We have $v_{0}+v_{1}+\cdots+v_{n}=0$ , 3. (3)

For every $0\leq i<j\leq n$ , the vectors $v_{i}$ and $v_{j}$ enclose an obtuse angle, $v_{i}\cdot v_{j}\leq 0$ .

The lattices $\mathsf{A}_{n}$ and $\mathsf{A}_{n}^{*}$ are of Voronoi’s first kind. The lattice $\mathsf{A}_{n}$ possesses an obtuse superbasis. Let $e_{1},e_{2},\ldots,e_{n+1}$ be the canonical basis of $\mathbb{R}^{n+1}$ . For $1\leq i\leq n$ , let $v_{i}=e_{i}-e_{i+1}$ , and let $v_{0}=e_{n+1}-e_{1}$ . Then $\{v_{0},\ldots,v_{n}\}$ is an obtuse superbasis of $\mathsf{A}_{n}$ . The dual lattice $\mathsf{A}_{n}^{*}$ possesses a strictly obtuse superbasis. Let

[TABLE]

Then $\{v_{0},\ldots,v_{n}\}$ is an obtuse superbasis:

[TABLE]

If $\Lambda$ is a lattice of Voronoi’s first kind, then it is known (see [14]) that

[TABLE]

This immediately gives an upper bound for the chromatic number.

Lemma 3.2.

Let $\Lambda$ be a lattice of Voronoi’s first kind, with an obtuse superbasis $\{v_{0},\ldots,v_{n}\}$ . Then, $\chi(\Lambda)$ is at most $n+1$ .

Proof.

By definition $\{v_{1},\ldots,v_{n}\}$ is a basis of $\Lambda$ . Let us show that the linear map

[TABLE]

is a proper coloring of $\Lambda$ . Because of linearity it is enough to check that it does not vanish on the strict Voronoi vectors. Let $v_{I}$ be such a vector. If $0\in I$ we replace $v_{I}$ by $-v_{I}=v_{\{0,\ldots,n\}\setminus I}$ (by Definition 3.1 (2)) to make sure that $0\notin I$ . Since $I$ is nontrivial, $0<|I|<n+1$ . In other words, $c(v_{I})\neq 0$ . ∎

With this lemma it is easy to see that the chromatic numbers of $\mathsf{A}_{n}$ and of its dual $\mathsf{A}_{n}^{*}$ are both equal to $n+1$ . For $\mathsf{A}_{n}$ and for every $1\leq i<j\leq n$ , the vector $v_{\{i,i+1,\ldots,j\}}=e_{i}-e_{j+1}$ is a minimal vector, and thus is a strict Voronoi vector. So,

[TABLE]

is a clique in $\operatorname{Cayley}(\mathsf{A}_{n},\operatorname{Vor}(\mathsf{A}_{n}))$ and so $\chi(\mathsf{A}_{n})\geq n+1$ . For $\mathsf{A}_{n}^{*}$ we know (see [14]) that every $v_{I}$ is a strict Voronoi vector. Again, (3) is a clique, and $\chi(\mathsf{A}_{n}^{*})\geq n+1$ .

3.2. Interpretation in terms of graphs and more general results

In order to get a better understanding of the chromatic number of lattices of Voronoi’s first kind, we need to know which vectors $v_{I}$ are strict Voronoi vectors.

Let $\Lambda$ be a lattice of Voronoi’s first kind with superbasis $\{v_{0},\ldots,v_{n}\}$ .

Definition 3.3.

The Delaunay graph $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ is an undirected graph with vertex set $\{0,\ldots,n\}$ and where $i$ and $j$ are connected by an edge whenever $v_{i}\cdot v_{j}<0$ .

The combinatorics of the Delaunay graph $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ determines the Cayley graph $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ . Recall some standard terminology in graph theory. Let $G=(V,E)$ be a graph, a subset of the vertex set $U\subseteq V$ defines a cut by

[TABLE]

The strict Voronoi vectors of $\Lambda$ are the $v_{I}$ such that the cut $\delta(I)$ is minimal with respect to inclusion, see for example [20] or [53]. The minimal cuts are also known to be the ones such that, when removing the edges of the cut, the number of connected components in the graph increases by one.

A connected graph $G=(V,E)$ is called biconnected (a block) if it remains connected when we remove any of its vertices. One can decompose every connected graph into biconnected components (block decomposition). The set of edges $E$ can be uniquely written as a disjoint union $E=\bigcup_{i=1}^{m}E_{i}$ such that the subgraph $G_{i}$ of $G$ induced by $E_{i}$ is a maximal biconnected subgraph of $G$ . Let $G_{i}$ , with $i=1,\ldots,m$ , be the biconnected components of the Delaunay graph $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ . Then, there exist lattices $\Lambda_{i}$ of Voronoi’s first kind with obtuse superbasis $\mathcal{B}_{i}$ such that

[TABLE]

holds, see for example [45, Chapter 4, Chapter 5]. Then Theorem 2.5 and Lemma 3.2 yield the following upper bound for the chromatic number of $\Lambda$ .

Corollary 3.4.

Let $\Lambda$ be a lattice of Voronoi’s first kind with obtuse superbasis $\{v_{0},\ldots,v_{n}\}$ . Let $G_{i}$ , $1\leq i\leq m$ , be the biconnected components of the Delaunay graph $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ . Then,

[TABLE]

Now we go for lower bounds. Recall that a cycle in a graph is a collection of vertices $i_{1},\ldots,i_{l}$ such that $|\{i_{1},\ldots,i_{l}\}|=l$ and such that $i_{j}$ is connected to $i_{j+1}$ , where indices are computed modulo $l$ . Its length is equal to $l$ .

Theorem 3.5.

Let $\Lambda$ be a lattice of Voronoi’s first kind with obtuse superbasis $\{v_{0},\ldots,v_{n}\}$ . Then, the chromatic number of $\Lambda$ is at least the maximal length of a cycle in the Delaunay graph $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ .

Proof.

Up to a permutation of the indices, we may assume that $\{0,1,\ldots,\sigma-1\}$ is a cycle $C$ in $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ . We shall construct a clique of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ of size $\sigma$ .

For any $k$ in $\{0,\ldots,n\}$ , we say that $c\in C$ is a connector between $k$ and $C$ if there exists a path $\gamma=(k_{1}=k,k_{2},\ldots,k_{s}=c)$ in $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ such that $c$ is the only vertex on the path that belongs to $C$ .

Let $0\leq\ell\leq\sigma-1$ . We define the set $I_{\ell}$ as the subset of all vertices $k$ in $\{0,\ldots,n\}$ such that every connector from $k$ to $C$ is in $\{0,1,\ldots,\ell\}$ . In particular, for $\ell=\sigma-1$ , $I_{\sigma-1}=\{0,\ldots,n\}$ . An example is depicted in Figure 2. Then, if we define $u_{\ell}=v_{I_{\ell}}$ for all $0\leq\ell\leq\sigma-1$ , then for every $0\leq i<j\leq\sigma-1$ ,

[TABLE]

In order to show that the set $\{u_{1},u_{2},\ldots,u_{\sigma}\}$ is a clique in $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ , we need to prove that for every $0\leq i<j\leq\sigma-1$ , the vector $v_{I_{j}\setminus I_{i}}$ is in $\operatorname{Vor}(\Lambda)$ . Equivalently, since $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ is connected, we need to check that both $I_{j}\setminus I_{i}$ and its complementary in $\{0,\ldots,n\}$ induce connected subgraphs of $D(\Lambda,\{v_{0},\ldots,v_{n}\})$ .

Take $k$ in $I_{j}\setminus I_{i}$ . Since $k$ is not in $I_{i}$ , there is a connector between $k$ and $C$ which is not in $\{0,\ldots,i\}$ ; but since $k$ is in $I_{j}$ , this connector must be in $\{i+1,\ldots,j\}$ . So $k$ , as well as all the vertices in this path, are in $I_{j}\setminus I_{i}$ and are connected to $\{i+1,\ldots,j\}$ , which is obviously connected and included in $I_{j}\setminus I_{i}$ . Regarding the complementary set, take $k$ not in $I_{j}\setminus I_{i}$ . Then either $k$ is not in $I_{j}$ , and there is a connector between $k$ and $C$ in $\{j+1,\ldots,\sigma-1\}$ , or $k$ is in $I_{i}$ , and every connector between $k$ and $C$ is in $\{0,\ldots,i\}$ . Thus for every such $k$ , one can find a path made of vertices not in $I_{j}\setminus I_{i}$ , going to $\{j+1,\ldots,\sigma-1\}\cup\{0,\ldots,i\}$ .

∎

Theorem 3.6.

$\chi(\mathsf{A}_{n})=\chi(\mathsf{A}_{n}^{*})=n+1$ .

Proof.

For $\mathsf{A}_{n}^{*}$ the Delaunay graph $D(\mathsf{A}_{n}^{*},\{v_{0},\ldots,v_{n}\})$ is the complete graph $K_{n+1}$ . For $\mathsf{A}_{n}$ the Delaunay graph $D(\mathsf{A}_{n},\{v_{0},\ldots,v_{n}\})$ is a cycle of length $n+1$ . In both cases, our upper bound and lower bound coincide. ∎

Example 3.7.

In general, our upper bound differs from the lower bound. Both bounds can be attained: On the left of Figure 3, the longest cycle has size $4$ and one can find a coloring of the corresponding $5$ -dimensional lattice with $4$ colors, whereas the lattice associated with the graph depicted on the right of Figure 3, whose longest has size $5$ , has chromatic number $6$ . This can be seen by computing the chromatic number of a small subgraph of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ .

3.3. Application: the chromatic number of $3$ -dimensional lattices

Every lattice in dimensions $2$ and $3$ is of Voronoi’s first kind, see [14]. We can compute the chromatic number of these lattices by applying our bounds which coincide in these cases, see Table 2.

4. Sphere packing lower bounds

In this section we prove lower bounds for the chromatic number of a lattice by considering connections to the classical sphere packing problem.

A subset $\mathcal{P}$ of $\mathbb{R}^{n}$ defines a packing of unit spheres if the distance between all pairs of distinct points in $\mathcal{P}$ is at least $2$ . Define the center density of $\mathcal{P}$ as the number of points in $\mathcal{P}$ per unit volume, more precisely the (upper) center density of $\mathcal{P}$ is defined as

[TABLE]

where $[-R,R]^{n}$ is the regular $n$ -dimensional cube with side length $2R$ . The largest center density of a packing of unit spheres in $\mathbb{R}^{n}$ is

[TABLE]

Theorem 4.1.

Let $\Lambda$ be an $n$ -dimensional lattice which defines a packing of unit spheres. Let $\rho$ be a positive real number so that all strict Voronoi vectors of $\Lambda$ have length strictly less than $\rho$ . Then,

[TABLE]

Proof.

Suppose that the chromatic number of $\Lambda$ equals $k$ . Then one can decompose $\Lambda$ into $k$ color classes $C_{1},\ldots,C_{k}$ . We may assume that the first color class $C_{1}$ has the largest density among these color classes. In particular, inequality

[TABLE]

holds. Then for all $v,w\in C_{1}$ with $v\neq w$ we have $\|v-w\|\geq\rho$ . Hence, $\frac{2}{\rho}C_{1}$ defines a packing of unit spheres. So,

[TABLE]

and the claim of the theorem follows by combining the two inequalities above. ∎

The following lemma gives a lower bound for $\rho$ .

Lemma 4.2.

Let $\Lambda$ be an $n$ -dimensional lattice which defines a packing of unit spheres. If $v\in\Lambda\setminus\{0\}$ is not a strict Voronoi vector, then $\|v\|\geq\sqrt{8}$ .

Proof.

By Voronoi’s characterization of strict Voronoi vectors (1) there is a lattice vector $w\in v+2\Lambda$ with $w\neq\pm v$ and $\|w\|\leq\|v\|$ . We may assume that $v\cdot w\geq 0$ , otherwise we replace $w$ by its negative $-w$ . Define $u=\frac{1}{2}(v-w)\in\Lambda\setminus\{0\}$ . Then,

[TABLE]

So, $\|v\|\geq\sqrt{2}\|u\|\geq\sqrt{8}$ , since $\|u\|\geq 2$ . ∎

4.1. First application: Chromatic number of $\mathsf{E}_{8}$ and of the Leech lattice

Theorem 4.3.

The chromatic number of the Leech lattice $\Lambda_{24}$ equals $4096$ .

Proof.

It is known that the strict Voronoi vectors of $\Lambda_{24}$ are all vectors $v\in\Lambda_{24}$ with $v\cdot v\in\{4,6\}$ , see [13]. Dong, Li, Mason, and Norton showed [18, Theorem 4.1] that one can find an isometric copy of $\sqrt{2}\Lambda_{24}$ as a sublattice $\Gamma$ of $\Lambda_{24}$ ; in [43] Nebe and Parker classified all $16$ orbits of such sublattices under the action of the automorphism group of $\Lambda_{24}$ . Clearly, such a sublattice $\Gamma$ has index $2^{12}=4096$ and any nonzero vector $v$ in this sublattice satisfies $v\cdot v\geq 8$ . Hence, we can color $\Lambda_{24}$ by using the $4096$ cosets $v+\Gamma$ as color classes, with $v\in\Lambda_{24}$ . Thus, $\chi(\Lambda_{24})\leq 4096$ .

For the lower bound, we apply Theorem 4.1. The Leech lattice defines the densest sphere packing in dimension $24$ , see [12], $\delta(\Lambda_{24})=\delta_{\mathbb{R}^{24}}$ . We can apply Theorem 4.1 with $\rho=\sqrt{8}$ and get

[TABLE]

∎

Similarly, one can show $\chi(\mathsf{E}_{8})=16$ by using the fact that $\mathsf{E}_{8}$ is the densest sphere packing in dimension $8$ , see [55].

4.2. Second application: Exponential growth of the chromatic

number

In this section we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity.

For a lattice $\Lambda\subset\mathbb{R}^{n}$ , we denote by $\mu=\mu(\Lambda)$ the norm of its smallest nonzero vector. Then $\frac{2}{\mu}\Lambda$ defines a packing of unit spheres, and we extend the notation $\delta(\Lambda)$ for the center density of this packing by

[TABLE]

The best upper bound known for $\delta_{\mathbb{R}^{n}}$ is the Kabatiansky-Levenshtein bound, [30], which states

[TABLE]

where $V_{n}$ is the volume of the $n$ -dimensional unit ball. So by using Theorem 4.1 and Lemma 4.2, we get, for any lattice $\Lambda\subset\mathbb{R}^{n}$ ,

[TABLE]

Let us now recall Siegel’s mean value theorem (see [49]): For any lattice $\Lambda\subset\mathbb{R}^{n}$ and any $r>0$ , we denote by $N_{\Lambda}(r)$ the number of nonzero lattice vectors of $\Lambda$ in the open ball $B(r)$ having radius $r$ . The Siegel mean value theorem states that the expected value of $N_{\Lambda}(r)$ in a random lattice $\Lambda$ of volume $1$ is

[TABLE]

To compute the expectation one uses the Haar measure on $\mathrm{SL}_{n}(\mathbb{R})/\mathrm{SL}_{n}(\mathbb{Z})$ .

This equality implies two remarkable consequences: First, by choosing $r_{n}$ such that $\operatorname{vol}(B(r_{n}))=2$ , it proves the existence of a lattice $\Lambda_{n}$ with strictly less than two nonzero vectors in $B(r_{n})$ . Since such a vector would come with its opposite, the minimum of $\Lambda_{n}$ has to be at least $r_{n}$ , and therefore the density of $\Lambda_{n}$ satisfies

[TABLE]

which essentially is the Minkowski-Hlawka lower bound for lattice sphere packings (see [27]). The second consequence concerns the density of a random lattice: Let us fix $\varepsilon>0$ , and $r_{n}$ such that $\operatorname{vol}(B(r_{n}))=2\cdot(1+\varepsilon)^{-n}$ . Following the previous idea, whenever $N_{\Lambda_{n}}(r_{n})<2$ , the density of $\Lambda_{n}$ satisfies

[TABLE]

By using Siegel’s mean value theorem and Markov’s inequality, we prove that this happens with high probability when $n$ grows:

[TABLE]

if $n$ tends to infinity.

Combined with (4), these observations provide:

Theorem 4.4.

With high probability, the chromatic number of a random $n$ -dimensional lattice grows exponentially in $n$ . Moreover, there are $n$ -dimensional lattices $\Lambda_{n}$ with

[TABLE]

Note that the Minkowski-Hlawka lower bound has been improved, in such a way that the constant $2$ in the numerator could be replaced with some quasi-linear function in $n$ , see [48], [4], [52], [54]. Even though any random lattice should be dense, and consequently should have an exponential chromatic number, to date there is no efficient way to construct such a lattice: The only algorithms for this purpose run in exponential time with respect to the dimension (see [42]).

However, explicit examples of subexponential growth are known. For $n\geq 3$ the cut polytope $\operatorname{CUT}_{n}$ (see [16]) is an $n(n-1)/2$ -dimensional polytope which is a Delaunay polytope of lattice (see [17]) . The vertex-edge graph of $\operatorname{CUT}_{n}$ is the complete graph on $2^{n-1}$ vertices. Thus we get an infinite family of Delaunay polytope with chromatic number lower bounded by $2^{O(\sqrt{n})}$ .

We think it is an interesting question to construct explicit families of lattices whose chromatic number grows exponentially with the dimension.

5. Spectral lower bounds

In this section we derive a lower bound for the chromatic number of a lattice where we apply the generalization of Hoffman’s bound as developed by Bachoc, DeCorte, Oliveira, and Vallentin [2]. In Section 5.1 we recall some terminology and facts from [2]. Then we compute the spectral bound for the irreducible root lattices case by case in Section 5.2. Table 3 summarizes the results obtained.

5.1. Setup

For an $n$ -dimensional lattice $\Lambda\subseteq\mathbb{R}^{n}$ define the (complex) Hilbert space

[TABLE]

which has inner product

[TABLE]

The convolution of two elements $f,g\in\ell^{2}(\Lambda)$ is $f*g\in\ell^{2}(\Lambda)$ defined by

[TABLE]

Assume that $\mu\in\ell^{2}(\Lambda)$ is real-valued, that its support is contained in $\operatorname{Vor}(\Lambda)$ , and that it satisfies $\mu(v)=\mu(-v)$ for all $v\in\Lambda$ . Consider the convolution operator

[TABLE]

defined by

[TABLE]

In a certain sense, $A_{\mu}$ is a weighted adjacency operator of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ . It is easy to verify that $A_{\mu}$ is a bounded and self-adjoint operator. Its numerical range is

[TABLE]

The numerical range is known to be an interval in $\mathbb{R}$ . By

[TABLE]

we denote the endpoints of the interval $W(A_{\mu})$ .

We say that a subset $I\subseteq\Lambda$ is an independent set of the operator $A_{\mu}$ if $\langle A_{\mu}f,f\rangle=0$ for each $f\in\ell^{2}(\Lambda)$ which vanishes outside of $I$ . The chromatic number of $A_{\mu}$ is the smallest number $k$ so that one can partition $\Lambda$ into $k$ independent sets. By [2, Theorem 2.3] one has the following lower bound for $\chi(A_{\mu})$ :

[TABLE]

Since every independent set of $\operatorname{Cayley}(\Lambda,\operatorname{Vor}(\Lambda))$ is also an independent set of the operator $A_{\mu}$ we see that

[TABLE]

Therefore, we are interested in determining the parameters $m(A_{\mu})$ and $M(A_{\mu})$ and in choosing $\mu$ so that the bound becomes as large as possible.

For determining $m(A_{\mu})$ and $M(A_{\mu})$ for a given convolution operator $A_{\mu}$ we apply standard facts from Fourier analysis, in particular the Parseval identity, the theorem of Riesz-Fischer, and the fact that the Fourier transform of a convolution is a product.

The only non-standard item here is that in standard texts on Fourier analysis, see for example [19], the role of primal and dual spaces are interchanged. In order not to confuse the reader (and to some extend not to confuse the authors) we consider a new $n$ -dimensional lattice $\Gamma$ . In our context $\Gamma$ will play the role of the dual lattice $\Lambda^{*}$ . When $\Gamma=\Lambda^{*}$ , then $\Gamma^{*}=(\Lambda^{*})^{*}=\Lambda$ .

Consider the Hilbert space of square-integrable $\Gamma$ -periodic functions

[TABLE]

with inner product

[TABLE]

where we normalize the Lebesgue measure $dx$ so that the $n$ -dimensional volume of a fundamental domain $\operatorname{vol}(\mathbb{R}^{n}/\Gamma)$ equals one. The exponential functions

[TABLE]

form a complete orthonormal system for $L^{2}(\mathbb{R}^{n}/\Gamma)$ . We define the Fourier coefficient of $F$ at $v$ by

[TABLE]

By Parseval’s identity and by the Riesz-Fischer theorem the map

[TABLE]

is an isometry:

[TABLE]

We consider two functions $f,g\in\ell^{2}(\Gamma^{*})$ . By the isometry of $~{}\widehat{}~{}$ there are functions $F,G\in L^{2}(\mathbb{R}^{n}/\Gamma)$ with

[TABLE]

Furthermore,

[TABLE]

Back to the lattice $\Lambda=\Gamma^{*}$ and the convolution operator $A_{\mu}$ . For $\mu=\widehat{G}$ , $f=\widehat{F}\in\ell^{2}(\Lambda)$ , we have

[TABLE]

and

[TABLE]

Hence, by choosing two appropriate sequences (see [2, Section 3.1])) approximating the corresponding Dirac measures, we see

[TABLE]

We summarize our considerations in the following theorem.

Theorem 5.1.

Let $\Lambda\subseteq\mathbb{R}^{n}$ be an $n$ -dimensional lattice. Suppose that $\mu\in\ell^{2}(\Lambda)$ is real-valued, $\mu(v)=\mu(-v)$ for all $v\in\Lambda$ , and the support of $\mu$ is contained in $\operatorname{Vor}(\Lambda)$ . Then,

[TABLE]

If we uniformly choose

[TABLE]

then the bound in the previous theorem simplifies to the following generalization of the Hoffman bound for infinite graphs:

Corollary 5.2.

Let $\Lambda\subseteq\mathbb{R}^{n}$ be an $n$ -dimensional lattice. Then,

[TABLE]

5.2. Computing the spectral bound for irreducible root lattices

In this section we compute the lower bound given by Corollary 5.2 for each of the irreducible root lattices. By the classification of Witt these are the families of lattices $\mathsf{A}_{n}$ , $\mathsf{D}_{n}$ and the three sporadic lattices $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ , and $\mathsf{E}_{8}$ .

As recalled in Section 2.4, the set $\operatorname{Vor}(\Lambda)$ of strict Voronoi vectors for a root lattice $\Lambda$ is simply its set of roots. Now the latter constitutes a root system (sometimes known as a “crystallographic” root system, but these are the only ones which we will consider): see A.1; the root systems are themselves classified, [29, §11], and the ones which arise from root lattices are known as “simply laced” or “A-D-E” root systems. (We will review the simply laced irreducible root systems below along with the corresponding lattices.)

Computing the lower bound of Corollary 5.2 for an irreducible root lattice is therefore equivalent to finding the smallest value attained by the Fourier transform of a simply laced irreducible root system. Here, a finite set $\Phi\subseteq\mathbb{R}^{n}$ defines the function $\mathscr{F}_{\Phi}\colon x\mapsto\sum_{u\in\Phi}e^{2\pi iu\cdot x}$ on $\mathbb{R}^{n}$ which is the Fourier transform of the sum of delta measures concentrated on the elements of $\Phi$ . Let us emphasize that since $\Phi$ is a (crystallographic!) root system, this function is, in fact, a trigonometric polynomial; and since $\Phi$ is symmetric ( $\Phi=-\Phi$ ), it is real.

Now this reformulation affords a link with representation theory (the required facts of which are recalled in Appendix A). Namely, if $\Phi$ is a (reduced, but not necessarily simply laced) root system of rank $n$ then the function $n+\mathscr{F}_{\Phi}$ is “essentially” the character of the adjoint representation of the — say, simply connected — compact real Lie group $G_{\Phi}$ associated with $\Phi$ (namely $\mathrm{SL}_{n+1}$ in the case of $\mathsf{A}_{n}$ , or $\mathrm{Spin}_{2n}$ in the case of $\mathsf{D}_{n}$ , or one of the exceptional Lie groups $\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}$ in the case of the correspondingly named $\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}$ ); the precise statement and explanation of why the two problems are equivalent is given in Proposition A.13.

The problem of computing our spectral lower bound is therefore essentially that of computing the least value attained by the adjoint character of a simple compact real Lie group of type A-D-E. The values in question have been considered and computed by Serre in [50, Theorem 3’]: In Table 3 we state Serre’s result in the form in which it is useful for the main part of the paper.

Again, the equivalence of the result as stated here with that as stated in Serre’s note is provided by Proposition A.13.

Serre’s note does not contain a proof of the stated result111Serre writes: “The classical groups are easy enough, but $\mathsf{F}_{4}$ , $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ are not (especially $\mathsf{E}_{6}$ , which I owe to Alain Connes). I hope there is a better proof.”. In this paper we provide one for the A-D-E case. We treat $\mathsf{A}_{n}$ and $\mathsf{D}_{n}$ in Theorem 5.3 and Theorem 5.4 below, and we defer the case $\mathsf{E}_{n}$ to Appendix B. For $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ we suggest an alternative proof technique based on sums of squares and semidefinite optimization in Section 5.2.3. Using the link to the chromatic number of lattices, the case $\mathsf{E}_{6}$ turns out to be the easiest of the $\mathsf{E}_{n}$ cases. We treat it in Section 5.2.4.

5.2.1. $\mathsf{A}_{n}$

The irreducible root lattice $\mathsf{A}_{n}$ has $n(n+1)$ roots

[TABLE]

where $e_{r}$ denotes the $r$ -th standard unit vector in $\mathbb{R}^{n+1}$ .

Theorem 5.3.

The critical values of ${\mathcal{F}}_{\mathsf{A}_{n}}$ are $n(n+1)$ and $-(n+1)$ . In particular, $\chi(\mathsf{A}_{n})\geq n+1$ .

Proof.

Given $x\in\mathbb{R}^{n+1}$ such that $x_{0}+\cdots+x_{n}=0$ , we define $z_{0},\ldots,z_{n}$ by $z_{r}=e^{2\pi ix_{r}}$ , so that $z_{0}\cdots z_{n}=1$ . The sum

[TABLE]

is equal to

[TABLE]

Clearly $\left|\sum_{r}z_{r}\right|^{2}$ critical values are [math] (when $\sum_{i}z_{i}=0$ ) and $(n+1)^{2}$ (when all $z_{i}$ are identical). The critical values can be obtained when $z_{0}\cdots z_{n}=1$ , for example by the $(n+1)$ -th roots of unity222 For example we have $z_{r}=e^{2\pi i/(n+1)}$ for all $r$ , by letting $x_{1}=\cdots=x_{n}=1/(n+1)$ and $x_{0}=1/(n+1)-1=-n/(n+1)$ . More explicitly, we have a bijection between the set of all $x\in\mathbb{R}^{n+1}$ with $x_{0}+\cdots+x_{n}=0$ , modulo the lattice dual to $\mathsf{A}_{n}$ and the set of complex vectors $(z_{0},\ldots,z_{n})$ which all lie on the unit circle, and whose product equals 1, modulo $(\zeta,\ldots,\zeta)$ where $\zeta$ is some $(n+1)$ -th root of unity obtained by taking $(x_{0},\ldots,x_{n})$ to $z_{r}=e^{2\pi ix_{r}}$ . or by $z_{0}=\cdots=z_{n}=1$ . Then, the lower bound on $\chi(\mathsf{A}_{n})$ follows from Corollary 5.2. ∎

5.2.2. $\mathsf{D}_{n}$

The irreducible root lattice $\mathsf{D}_{n}$ has $2n(n-1)$ roots

[TABLE]

Theorem 5.4.

The critical values of ${\mathcal{F}}_{\mathsf{D}_{n}}$ are:

[TABLE]

In particular,

[TABLE]

and $\chi(\mathsf{D}_{n})\geq n$ for even $n$ , and $\chi(\mathsf{D}_{n})\geq n+1$ for odd $n$ .

Proof.

By the symmetries of the function we can restrict ourselves to $0\leq x_{i}\leq 1/2$ . We consider the sum

[TABLE]

where $W=\sum_{r=1}^{n}\cos(2\pi x_{r})$ . To find the critical values we compute the gradient of $S$ (as a function of the $x_{r}$ ), which is

[TABLE]

Let $x$ be a critical point. Define $S_{1}$ , $S_{-1}$ and $S_{o}$ the set of $i\in\{1,\dots,n\}$ such that $x_{i}=0$ , $x_{i}=1/2$ and $0<x_{i}<1/2$ so that $\cos(2\pi x_{i})=1$ or $\cos(2\pi x_{i})=-1$ or $\cos(2\pi x_{i})\in(-1,1)$ .

For $i\in S_{o}$ we have $W=\cos(2\pi x_{i})$ . Thus, $|W|<1$ if $S_{o}\not=\emptyset$ .

We define $n_{1}=|S_{1}|$ , $n_{-1}=|S_{-1}|$ and $n_{o}=|S_{o}|$ and we have $n_{1}+n_{-1}+n_{o}=n$ , and

[TABLE]

If $n_{o}=1$ we have

[TABLE]

If $n_{o}\not=1$ then the equation for $W$ (5) gives

[TABLE]

The value of the function is then

[TABLE]

The lower bound on $\chi(\mathsf{D}_{n})$ then follows from Corollary 5.2. ∎

5.2.3. $\mathsf{E}_{7}$ , $\mathsf{E}_{8}$ and sums of squares

We start by giving an alternative construction of the $\mathsf{E}_{8}$ lattice which is based on lifting the (extended) Hamming code $\mathcal{H}_{8}$ , which is the vector space over the finite field $\mathbb{F}_{2}$ (consisting of the elements [math] and $1$ ) spanned by the rows of the matrix

[TABLE]

It consists of $2^{4}=16$ code words:

[TABLE]

We can define the lattice $\mathsf{E}_{8}$ by the following lifting construction (which is usually called Construction A):

[TABLE]

Now it is immediate to see that $\mathsf{E}_{8}$ has $240$ shortest (nonzero) vectors:

[TABLE]

where $e_{1},\ldots,e_{8}$ are the standard basis vectors of $\mathbb{R}^{8}$ and where $\operatorname{wt}(c)=|\{i:c_{i}\neq 0\}|$ denotes the Hamming weight of a code word $c$ .

Observe that the lower bound $\chi(\mathsf{E}_{8})\geq 16$ is implied trough the spectral bound by the following inequality (Theorem B.3 gives a stronger result by providing a complete list of all critical values)

[TABLE]

for all $x\in\mathbb{R}^{8}$ .

To simplify the formula we apply a change of variables by setting $T(x)=S(\frac{\sqrt{2}}{2\pi}x)$ . Then,

[TABLE]

Applying the cosine addition formula multiple times we get

[TABLE]

Function $T$ is globally nonnegative if and only if the polynomial

[TABLE]

is nonnegative on the cube $t\in[-1,+1]^{8}$ . This nonnegativity can be verified algorithmically. By

[TABLE]

denote the cone of real polynomials in $n$ indeterminates of degree at most $d$ . One can verify membership in this cone by using semidefinite optimization, see for example [37]. We checked numerically (up to machine precision) that there are polynomials

[TABLE]

so that the following identity holds true:

[TABLE]

It is interesting to observe that using smaller degree did not work.

One can easily modify this proof technique for the case $\mathsf{E}_{7}$ to show $\chi(\mathsf{E}_{7})\geq 10$ . To define $\mathsf{E}_{7}$ we apply Construction A on the $[7,3,4]$ code $\mathcal{H}_{7}^{*}$ , which one obtains from $\mathcal{H}_{8}$ by deleting its first coordinate. Here one shows that the polynomial

[TABLE]

is nonnegative on the cube $t\in[-1,+1]^{7}$ again using sum of squares.

5.2.4. $\mathsf{E}_{6}$

The $\mathsf{E}_{6}$ lattice we can handle without computer as follows: In the proof of Theorem 6.3 we shall color $\mathsf{E}_{6}$ with nine colors. On the other hand, the Schläfli polytope is a Delaunay polytope of $\mathsf{E}_{6}$ whose vertex-edge graph—the Schläfli graph on 27 vertices having 216 edges—is a finite subgraph of $\operatorname{Cayley}(\mathsf{E}_{6},\operatorname{Vor}(\mathsf{E}_{6}))$ . It is known, see for example [5, Chapter 8, page 55] that the Hoffman bound of the Schläfli graph equals nine. It is also known, see [3, Section 10.1], that the Hoffman bound of an infinite edge transitive graph is at least the Hoffman bound of any of its finite subgraphs. Hence, the spectral bound of $\mathsf{E}_{6}$ equals nine.

6. The chromatic number of irreducible root lattices and their duals

In this section, we complete our study of the chromatic number of irreducible root lattices and their duals. The knowledge that we use about Delaunay polytopes of root lattices can be found in [16, Section 14.3]. Our claims regarding the sublattices that we consider and the colorings of certain small graphs can be conveniently checked with computer assistance, for example by using Magma [8] or Polyhedral [21].

When we cannot directly compute the chromatic number of a graph, we apply other methods, computationally easier, in order to get lower and upper bounds. A lower bound for the chromatic number of a graph $G=(V,E)$ is given by its fractional chromatic number: Denote by $\mathcal{I}_{G}$ the set of all independent sets of $G$ . The fractional chromatic number of $G$ is the solution of the following linear program:

[TABLE]

If $G$ affords symmetries, one can use them to reduce the number of variables of this linear program. Regarding upper bounds, given a number $k$ of colors and a graph $G$ , proving the existence of a $k$ -coloring of $G$ can be turned into a satisfiability problem, that can be solved for instance by using Minisat [24].

6.1. $\mathsf{D}_{n}$ and its dual

The half cube $\frac{1}{2}H_{n}$ , sometimes also called the parity polytope, is defined as

[TABLE]

It is one of the two Delaunay polytopes of the root lattice $\mathsf{D}_{n}$ .

Theorem 6.1.

For all $n\geq 4$ we have $\chi(\mathsf{D}_{n})=\chi(\frac{1}{2}H_{n})$ , where we consider the vertex-edge graph of the half cube.

Proof.

The inequality $\chi(\mathsf{D}_{n})\geq\chi(\frac{1}{2}H_{n})$ comes from the fact that $\frac{1}{2}H_{n}$ is a Delaunay polytope of $\chi(\mathsf{D}_{n})$ .

Let $c$ be a proper coloring of $\frac{1}{2}H_{n}$ . We extend it to $\mathsf{D}_{n}$ by giving to any $x\in\mathsf{D}_{n}$ the color $c(x\bmod 2)$ . Assume that two vectors $x_{1}$ and $x_{2}$ are adjacent in $\operatorname{Cayley}(\mathsf{D}_{n},\operatorname{Vor}(\mathsf{D}_{n}))$ . Since the relevant vectors of $\mathsf{D}_{n}$ are of the form $\pm e_{i}\pm e_{j}$ , the difference $x_{1}-x_{2}\bmod 2$ is also such a vector, so that $x_{1}\bmod 2$ and $x_{2}\bmod 2$ are adjacent in $\frac{1}{2}H_{n}$ , and $c(x_{1}\bmod 2)\neq c(x_{2}\bmod 2)$ . Hence, $\chi(\mathsf{D}_{n})\leq\chi(\frac{1}{2}H_{n})$ . ∎

Theorem 6.2.

For every $n\geq 4$ , the chromatic number of $\mathsf{D}^{*}_{n}$ is $4$ .

Proof.

The relevant vectors of the lattice $\mathsf{D}_{n}^{*}=\mathbb{Z}^{n}\cup\left((1/2,\ldots,1/2)+\mathbb{Z}^{n}\right)$ are the $2n$ vectors $\pm e_{i}$ and the $2^{n}$ vectors of the form $(\pm 1/2,\dots,\pm 1/2)$ . The four vectors [math], $(1,0,\ldots,0)$ , $(1/2,\ldots,1/2)$ and $(1/2,\ldots,1/2,-1/2)$ define a clique in $\operatorname{Cayley}(\mathsf{D}_{n}^{*},\operatorname{Vor}(\mathsf{D}_{n}^{*}))$ ; and the unique way to color $\mathsf{D}^{*}_{n}$ with four colors is by coloring each copy of $\mathbb{Z}^{n}$ with two different colors. ∎

6.2. $\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}$ and their duals

Theorem 6.3.

We have $\chi(\mathsf{E}_{6})=9$ , $\chi(\mathsf{E}_{7})=14$ and $\chi(\mathsf{E}_{8})=16$ .

Proof.

By Theorem 6.1 we know that $\chi(\mathsf{D}_{8})=8$ . So one can color the root lattice $\mathsf{E}_{8}=\mathsf{D}_{8}\cup((1/2,\ldots,1/2))+\mathsf{D}_{8})$ with $16$ colors. The lower bound from Theorem B.3 concludes the case of $\mathsf{E}_{8}$ .

For $\mathsf{E}_{7}$ there are two orbits of Delaunay polytopes. One of them is the Gosset polytope with $56$ vertices, whose vertex-edge graph has chromatic number $14$ , which shows that $\chi(\mathsf{E}_{7})\geq 14$ . Moreover, we have a lamination of $\mathsf{E}_{7}$ over the lattice $\mathsf{A}_{6}$ :

[TABLE]

If $v$ and $w$ belong to two layers which differ by an even index, then $v-w$ is not a relevant vector. Following Section 3, we know that $\chi(\mathsf{A}_{6})=7$ . Thus we can color the even layers by colors in $\{1,\dots 7\}$ and the odd layers by colors in $\{8,\dots,14\}$ . This proves that $\chi(\mathsf{E}_{7})=14$ .

The unique Delaunay polytope of $\mathsf{E}_{6}$ is the Schläfli polytope whose vertex-edge graph is the Schläfli graph with $27$ vertices. It is well known that its chromatic number is $9$ , so that $\chi(\mathsf{E}_{6})\geq 9$ . There is just one orbit of independent triples of vertices. Moreover, there are just two orbits of colorings with $9$ colors of the Schäfli graph: One orbit of size $160$ and another of size $40$ . Let us take a coloring from the orbit of size $40$ . It is composed of $9$ different triples of elements. For each such triple $\{v_{1},v_{2},v_{3}\}$ we consider the set of vectors $\{v_{1}-v_{2},v_{2}-v_{3},v_{3}-v_{1}\}$ . Since we have $9$ triples, this gives in total $27$ vectors. Those vectors span a sublattice $L$ of $\mathsf{E}_{6}$ of index $9$ . It turns out that none of the relevant vectors of $\mathsf{E}_{6}$ belongs to $L$ . Thus following Lemma 2.1, $\chi(\mathsf{E}_{6})\leq 9$ . ∎

Theorem 6.4.

The chromatic number of $\mathsf{E}_{n}^{*}$ is $16$ for $n=6,7$ .

Proof.

Upon rescaling to an integral lattice the norms of the vectors of $\mathsf{E}_{6}^{*}$ are $4$ , $6$ , $10$ and so on. The norms of the relevant vectors are $4$ and $6$ . We consider the $432$ vectors of norm $10$ and enumerate the sublattices of $\mathsf{E}_{6}^{*}$ of dimension $6$ spanned by those vectors that do not contain any relevant vector. We found $1393$ orbits of such lattices. Exactly one of them is of index $16$ which proves that $\chi(\mathsf{E}_{6}^{*})\leq 16$ .

The lower bound is obtained in the following way. We consider the graph formed by the origin [math] and the $126$ relevant vectors with two vectors adjacent if their difference is a relevant vector. The fractional chromatic number of this graph is $77/5$ . Thus $\chi(\mathsf{E}_{6}^{*})\geq\lceil 77/5\rceil=16$ .

The lower bound on the chromatic number of $\mathsf{E}_{7}^{*}$ is obtained by the same technique as for $\mathsf{E}_{6}^{*}$ . The upper bound is obtained in the following way: Consider the quotient $\mathsf{E}_{7}^{*}/4\mathsf{E}_{7}^{*}$ with $16384$ elements. One coloring with $16$ colors can be obtained by solving the satisfiability problem using Minisat. ∎

Acknowledgements

We thank Jean-Pierre Serre for his patient explanation of how to compute the critical points of the function $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{n}}$ . We are indebted to Christine Bachoc and Gabriele Nebe for their important contributions to Section 4. We thank the anonymous referee for helpful comments and suggestions.

This material is based upon work supported by the National Science Foundation under Grant No. DMS-1439786 while M.D.S. and P.M. were in residence at the Institute for Computational and Experimental Research in Mathematics in Providence, RI, during the “Point Configurations in Geometry, Physics and Computer Science” semester program. P.M. received support from the Tromsø Research Foundation grant 17_matte_CR. F.V. was partially supported by the SFB/TRR 191 “Symplectic Structures in Geometry, Algebra and Dynamics”. F.V. also gratefully acknowledges support by DFG grant VA 359/1-1. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie agreement number 764759.

Appendix A Recollections on compact Lie groups

In this section, we collect, for the benefit of the unfamiliar reader, without proof but with references, a few facts about semisimple compact Lie groups and representation theory. All of the following results are standard, although it is not easy to find them conveniently stated in a single place, so we hope this compendium will be helpful.

The main reason for this appendix insofar as the present paper is concerned is to explain the reason behind the reformulation which we give in theorem B.1 of Serre’s [50, Theorem 3’], namely the connection between the Fourier transform of a root system $\Phi$ and the character of the adjoint representation of the Lie groups associated with $\Phi$ : this is provided by A.13. We have, however, stated a few additional results which are not strictly necessary towards that goal but which, we hope, help give a clearer overall picture. The secondary reason for this appendix is to provide the necessary framework for appendix B (although the latter could, in principle, be reworded so as to eliminate all mentions of Lie groups just like we did for theorem B.1, we believe this would be unnecessarily contrived).

Remark A.1.

We have chosen to focus these recollections on semisimple compact real Lie groups, but the classification and representation theory of semisimple complex Lie groups is identical (Weyl’s “unitarian trick”, cf. [32, §6.2] and [26, §26]): we simply mention that the role of the tangent Lie algebra $\mathfrak{t}$ to a maximal torus in what follows is played, in the complex setting, by Cartan subalgebras $\mathfrak{h}$ of $\mathfrak{g}$ ([32, definition 6.32] and [26, §14.1 and appendix D]).

A.2. A compact (real) Lie group is a compact connected real smooth manifold $G$ , together with a group structure on $G$ such that the multiplication and inverse maps are smooth ( $C^{\infty}$ ). The tangent space $\mathfrak{g}$ at the identity $1$ of $G$ is then endowed with a linear action $\operatorname{Ad}$ of $G$ , called the adjoint representation of $G$ , defined by letting $\operatorname{Ad}(g)\colon\mathfrak{g}\to\mathfrak{g}$ (for $g\in G$ ) be the differential at $1$ of $u\mapsto gug^{-1}$ ; this in turns defines a linear map $\operatorname{ad}(x)\colon\mathfrak{g}\to\mathfrak{g}$ for $x\in\mathfrak{g}$ by letting $\operatorname{ad}\colon\mathfrak{g}\to L(\mathfrak{g},\mathfrak{g})$ (where $L(U,V)$ stands for the vector space of linear maps between two vector spaces $U$ and $V$ ) be the differential at $1$ of $\operatorname{Ad}\colon G\to L(\mathfrak{g},\mathfrak{g})$ itself (see [26, §8.1]): writing $[x,y]$ for $\operatorname{ad}(x)(y)$ , this gives $\mathfrak{g}$ the structure of a (real) Lie algebra (simply known as the Lie algebra of $G$ ).

We also recall the exponential map $\exp\colon\mathfrak{g}\to G$ , which takes $x\in\mathfrak{g}$ to the value at $1$ of the unique smooth group homomorphism $\mathbb{R}\to G$ (a.k.a. “ $1$ -parameter subgroup”) whose differential at the origin is $x$ . While $\exp$ is not a group homomorphism in general, it is one whenever $G$ is abelian (whenever the Lie bracket of $\mathfrak{g}$ vanishes; otherwise, the so-called Baker-Campbell-Hausdorff formula expresses the relation of $\exp(x)\,\exp(y)$ to an exponential). Rather than using the exponential, we will find it more convenient to use the function $\mathbf{e}\colon x\mapsto\exp(2\pi x)$ ; just as $\exp$ itself, the function $\mathbf{e}$ in question is surjective (cf. A.1 below).

The Killing form $B\colon\mathfrak{g}\times\mathfrak{g}\to\mathfrak{g}$ is the bilinear form $B(x,y)=\operatorname{tr}(\operatorname{ad}(x)\circ\operatorname{ad}(y))$ , which is $G$ -invariant; in the real compact case in which we placed ourselves, this form is negative semidefinite ([32, theorem 6.10] or [1, theorem 2.13]), and we say that $G$ or $\mathfrak{g}$ is semisimple when $B$ is nondegenerate ([26, prop. C10], [32, theorem 5.53]), i.e., negative definite in the real compact case.

As an example, the group $\mathit{SO}_{2n}$ of $(2n)\times(2n)$ real orthogonal matrices with determinant $+1$ is a compact real Lie group, whose Lie algebra $\mathfrak{so}_{2n}$ consists of antisymmetric $(2n)\times(2n)$ matrices, the Lie bracket $[x,y]$ being the usual $xy-yx$ , and the Killing form on $\mathfrak{so}_{2n}$ is given by $B(x,y)=2(n-1)\operatorname{tr}(xy)$ , so that $\mathfrak{so}_{2n}$ is semisimple if and only if $n\geq 2$ .

A.3. If $G$ is a compact Lie group with Lie algebra $\mathfrak{g}$ , the map taking a connected closed subgroup $H$ of $G$ to its Lie algebra seen as a subalgebra $\mathfrak{h}$ of $\mathfrak{g}$ (i.e., a vector subspace closed under the Lie bracket) is a bijection ([32, theorem 3.40]).

Two simply connected compact Lie groups are isomorphic if and only if their Lie algebras are isomorphic ([32, theorem 3.43]); thus, two compact Lie groups with isomorphic Lie algebras have isomorphic universal coverings: they are then said to be isogenous. (We note that, for the purposes of this paper, isogenous Lie groups are an irrelevant complication.) Beware, however, that the universal covering of a compact Lie group need not be compact as the case of tori shows.

Conversely, any real Lie algebra with a negative semidefinite Killing form is the Lie algebra of some compact Lie group (unique up to isogeny, by the previous paragraph). We return in A.13 to the question of which Lie groups are possible in the semisimple case.

A.4. If $G$ is a compact Lie group, a torus in $G$ is an abelian connected closed subgroup of $G$ , or equivalently, one whose Lie algebra is abelian (meaning that its Lie bracket is trivial). A maximal torus, of course, is a torus that is maximal for inclusion; by A.1, maximal tori of $G$ are in bijection with maximal abelian Lie subalgebras of the Lie algebra $\mathfrak{g}$ of $G$ .

As an example, a maximal torus in $\mathit{SO}_{2n}$ is given by the block diagonal matrices whose diagonal blocks are $2\times 2$ rotation matrices.

Crucial results by Cartan concerning maximal tori of compact Lie groups are ([10, theorems 16.4 and 16.5] or [1, theorem 2.15] or [33, corollaries 4.35 and 4.46]): (a) every element of $G$ belongs to some maximal torus, and (b) all maximal tori of $G$ are conjugate; in particular, each element of $G$ is conjugate to some element of any fixed maximal torus of $G$ .

The dimension of some (any) maximal torus $T$ in $G$ (or equivalently, of its Lie algebra) is known as the rank of $G$ . The quotient $N_{G}(T)/T$ of the normalizer of $T$ (in $G$ ) by $T$ itself is known as the Weyl group $W$ of $G$ : so a $W$ -orbit in $T$ is precisely a full set of $G$ -conjugate elements of $T$ , and the set of conjugacy classes in $G$ can be identified (as a set) with $T/W$ . We note that $W$ acts as a group of automorphisms of $T$ , so it also acts (linearly) on the Lie algebra $\mathfrak{t}$ of $T$ and (by inverse transpose) on the dual $\mathfrak{t}^{*}$ of $\mathfrak{t}$ .

A.5. If $T$ is an abstract torus, i.e., an abelian compact Lie group, and $\mathfrak{t}$ is its Lie algebra, the map $x\mapsto\mathbf{e}(x):=\exp(2\pi x)$ (in other words the differentiable group homomorphism $\mathfrak{t}\to T$ whose differential at [math] is $2\pi$ times the identity) defines a surjective homomorphism $\mathfrak{t}\to T$ , whose kernel is a discrete subgroup $\Gamma$ of $\mathfrak{t}$ . Thus, we can identify $T$ with $\mathfrak{t}/\Gamma$ (as a differentiable group), and functions on $T$ with $\Gamma$ -periodic functions on $\mathfrak{t}$ . We will call $\Gamma$ the period lattice of the torus $T$ .

A.6. If $G$ is a compact Lie group, a (finite-dimensional) representation of $G$ on a finite-dimensional complex vector space $V$ is a differentiable linear action of $G$ on $V$ , i.e., a differentiable group homomorphism $\rho\colon G\to\mathit{GL}(V)$ . The character of said representation is the map $g\mapsto\operatorname{tr}\rho(g)$ (a differentiable function on $G$ , invariant under conjugation). The representation is said to be irreducible when the only $G$ -invariant subspaces of $V$ are [math] and $V$ . It turns out that every representation of $G$ is a direct sum of irreducible representations ([32, theorem 4.40]); and a representation is characterized (up to isomorphism) by its character ([10, theorem 2.5] or [32, theorem 4.46]).

Furthermore, although we will not use this, it might be worth pointing out the Peter-Weyl theorem: the characters of the irreducible representations of $G$ form a Hilbert orthonormal basis for the closed subspace consisting of conjugation-invariant functions in the Hilbert space $L^{2}(G)$ of square-integrable functions on $G$ ([32, theorem 4.50]; incidentally, these functions are also eigenvalues of the Laplace-Beltrami operator on $G$ seen as a Riemannian manifold).

Among the representations of $G$ , the adjoint representation (defined in A.1 above as a map $\operatorname{Ad}\colon G\to\mathit{GL}(\mathfrak{g})$ , which we see as an action on the complexified vector space $\mathfrak{g}_{\mathbb{C}}:=\mathfrak{g}\otimes_{\mathbb{R}}\mathbb{C}$ ) is of particular importance; its character $g\mapsto\operatorname{tr}\operatorname{Ad}(g)$ is called the character of the adjoint representation, or simply the adjoint character, of $G$ ; the adjoint representation is irreducible if and only if $G$ is simple (this can be taken as a definition333A Lie group $G$ is called “simple” when it does not have nontrivial connected normal subgroups: this allows for a finite center (the term “quasisimple” might be more appropriate).).

A.7. Representation theory on a torus $T$ is well known: writing $T=\mathfrak{t}/\Gamma$ through $x\mapsto\mathbf{e}(x):=\exp(2\pi x)$ as in A.1, the irreducible characters of $T$ are of the form $\mathbf{e}(\lambda):\mathbf{e}(x)\mapsto\exp(2\pi i\lambda(x))$ with $\lambda$ ranging over the lattice $\Gamma^{*}$ dual to $\Gamma$ (in the vector space $\mathfrak{t}^{*}$ dual to $\mathfrak{t}$ ), which we call the character lattice of $T$ . The corresponding representations are all one-dimensional (acting by multiplication by the character just defined). In other words, the irreducible representations of $T$ are indexed by $\Gamma^{*}$ , and the orthonormal basis of characters of $T$ predicted by the Peter-Weyl theorem is the usual Fourier basis on $T=\mathfrak{t}/\Gamma$ . (As mentioned in A.1, we write $\mathbf{e}(\lambda)$ both for the function $T\to\mathbb{C}$ defined earlier, and for the $\Gamma$ -periodic function $\mathfrak{t}\to\mathbb{C}$ given by $x\mapsto\exp(2\pi i\lambda(x))$ .)

We note that the ring of linear combinations (over $\mathbb{Z}$ , resp. $\mathbb{C}$ ) of the $\mathbf{e}(\lambda)$ , or character ring (resp. character $\mathbb{C}$ -algebra) of $T$ (cf. A.15 below), is the group ring (resp. group $\mathbb{C}$ -algebra) of the lattice $\Gamma^{*}$ . (Choosing a basis for $\Gamma^{*}$ shows that this is a ring of Laurent polynomials.)

A.8. If now $G$ is a compact Lie group and $T$ is a maximal torus in $G$ with corresponding Lie algebras $\mathfrak{t}\subseteq\mathfrak{g}$ , given a representation $V$ of $G$ having character $\chi$ , we can restrict them to $T$ and consider the Fourier decomposition of $\chi|_{T}$ , i.e., its decomposition $\chi|_{T}=\sum_{\lambda\in\Gamma^{*}}m_{\lambda}\mathbf{e}(\lambda)$ in terms of the characters $\mathbf{e}(\lambda)$ (for $\lambda\in\Gamma^{*}$ ) defined in the previous paragraph: clearly $m_{\lambda}$ is the dimension of the subspace $V^{\lambda}$ of $V$ consisting of those $z\in V$ such that $\rho(u)(z)=\mathbf{e}(\lambda)(u)$ for each $u\in T$ . In particular, $m_{\lambda}\in\mathbb{N}$ . The $\lambda$ such that $m_{\lambda}>0$ are known as the weights of the representation $V$ (or of the character $\chi$ ), and we emphasize that they belong to $\Gamma^{*}$ ; the value $m_{\lambda}$ is know as the multiplicity of the weight $\lambda$ (in $V$ or in $\chi$ ), and the subspace $V^{\lambda}$ on which $T$ acts through $\mathbf{e}(\lambda)$ is known as the weight (eigen)space; we note that the weight space can be defined at the Lie algebra level as the set of $z$ such that $d\rho_{1}(x)(z)=i\lambda(x)\,z$ for all $x\in\mathfrak{t}$ (where $d\rho_{1}$ is the differential of $\rho$ at $1$ ; compare [32, definition 8.1]: we have added a factor $i$ here for convenience in the compact case, but it is a matter of convention).

Since, as explained in A.1 above, all maximal tori of $G$ are conjugate, the weights and multiplicities do not depend on the choice of $T$ ; furthermore, they are invariant under the action of the Weyl group $W$ . Seen as a function on $\mathfrak{t}$ , the character values are both $\Gamma$ -periodic and $W$ -invariant, so they are $\Gamma\rtimes W$ -invariant. (Let us mention here the paper [46], which can serve as link between the “Fourier” and “Lie group characters” points of view.)

A.9. We temporarily leave aside Lie groups to recall the following definitions and facts in relation with abstract root systems (see [29, §9.2], [32, §7.1] and [7, chap. VI]). A (reduced, crystallographic) root system is a set $\Phi$ of vectors in a finite dimensional real vector space $E$ such that (1) $\Phi$ is finite, does not contain [math], and spans $E$ , (2) for every $\alpha\in\Phi$ , there exists $\alpha^{\vee}$ in the dual space $E^{*}$ of $E$ such that $\alpha^{\vee}(\alpha)=2$ and such that the (symmetry) map $s_{\alpha}\colon x\mapsto x-\alpha^{\vee}(x)\,\alpha$ leaves $\Phi$ stable (it is easy to see that $\alpha^{\vee}$ is uniquely defined, cf. [7, chap. VI, §1, no1, lemme 1], so that the notation is legitimate), (3) for every $\alpha,\beta\in\Phi$ we have $\alpha^{\vee}(\beta)\in\mathbb{Z}$ , and (4) if $\alpha\in\Phi$ and $c\alpha\in\Phi$ then $c\in\{\pm 1\}$ . The elements of $\Phi$ are called roots, and the $\alpha^{\vee}$ are the coroots. The set $\Phi^{\vee}:=\{\alpha^{\vee}:\alpha\in\Phi\}$ of coroots is itself a root system (with $(\alpha^{\vee})^{\vee}=\alpha$ ), known as the dual root system to $\Phi$ . The group generated by the $s_{\alpha}$ is known as the Weyl group of $\Phi$ , and it is finite.

Two root systems $\Phi\subseteq E$ and $\Phi^{\prime}\subseteq E^{\prime}$ are said to be isomorphic when there is a linear isomorphism between $E$ and $E^{\prime}$ taking $\Phi$ to $\Phi^{\prime}$ . In this case, they have isomorphic dual systems and isomorphic Weyl groups.

The root system $\Phi\subseteq E$ is said to be reducible when it is the union (“sum”) of root systems $\Phi_{1},\Phi_{2}$ in $E_{1},E_{2}$ with $E=E_{1}\oplus E_{2}$ respectively, irreducible otherwise. Every root system can be written in a unique way as the sum of irreducible root systems, and any sum of root systems is a root systems.

Given a root system $\Phi$ in $E$ , there exists a Euclidean structure on $E$ such that every $s_{\alpha}$ (and consequently, every element of the Weyl group) is orthogonal; equivalently, a Euclidean structure which identifies $E$ with its dual $E^{*}$ so that each coroot $\alpha^{\vee}$ is proportional to the corresponding root $\alpha$ . Such a Euclidean structure is said to be compatible with $\Phi$ . (The definition of root systems is often written in a manner that preassumes the Euclidean structure: in this case, the coroot $\alpha^{\vee}$ associated to $\alpha$ is defined as $2\alpha/\|\alpha\|^{2}$ .) For $\Phi$ irreducible, this Euclidean structure is unique up to a multiplicative constant, i.e., up to the definition of the lengths of the roots; in the case of the simply laced root system (those in which every root has the same length), which concerns us in the present paper, the constant is generally chosen such that the squared root length is $2$ , so that $\Phi$ and $\Phi^{\vee}$ can be identified. Nevertheless, it might be useful for expositional clarity to keep the distinction between $\Phi$ and $\Phi^{\vee}$ and the greater generality afforded by the not necessarily simply laced root system, so we do not perform this identification (but the reader may choose to do so).

Associated with a root system $\Phi$ as above are four lattices: the lattice $Q:=\mathbb{Z}\Phi\subseteq E$ generated by $\Phi$ is known as the root lattice, the lattice $Q^{\vee}:=\mathbb{Z}\Phi^{\vee}\subseteq E^{*}$ generated by $\Phi^{\vee}$ is known as the coroot lattice; the lattice $P:=(Q^{\vee})^{*}$ (in $E$ ) dual to the coroot lattice is known as the weight lattice and contains the root lattice; and the lattice $P^{\vee}:=Q^{*}$ (in $E^{*}$ ) dual to the root lattice is known as the coweight lattice and contains the coroot lattice. The quotient of the weight lattice by the root lattice, or equivalently of the coweight lattice by the coroot lattice, is sometimes known as the fundamental group of $\Phi$ for reasons that will be clarified in A.13.

A.10. Continuing the exposition of root systems started in A.1, if $h$ is a linear form on $E$ such that $h(\alpha)\neq 0$ for each $\alpha\in\Phi$ , the roots such that $h(\alpha)>0$ are then known as the positive roots, and those such that $h(\alpha)<0$ as the negative roots relative to $h$ : a subset $\Phi_{+}:=\{\alpha\in\Phi:h(\alpha)>0\}$ which can be obtained in this manner is known as a choice of positive roots for $\Phi$ . The positive roots which cannot be written as sums of other positive roots are known as simple roots (for this choice of positive roots): it is then a fact that the simple roots form a basis of $E$ , and that every positive root is a linear combination of the simple roots with nonnegative integer coefficients (not all zero); so the choice of positive roots can be defined equivalently by the set of simple roots. The Weyl group acts simply transitively on the set of all choices of positive roots (or the set of all choices of simple roots).

The choice of positive roots also gives a choice of positive coroots, (defined from $h$ as above by identifying $E$ with $E^{*}$ , or by saying that the positive coroots are the coroots associated with positive roots). The dual basis $\varpi_{1},\ldots,\varpi_{n}$ to the set $\alpha^{\vee}_{1},\ldots,\alpha^{\vee}_{n}$ of simple coroots is known as the set of fundamental weights (for the choice of positive roots); symmetrically, the dual basis to the set of simple roots is known as the set of fundamental coweights.

The convex cone in $E$ generated by the fundamental weights, is known as the closed Weyl chamber in $E$ corresponding to the choice of positive roots: it is the dual cone to the positive coroot cone, in other words, it is defined by the inequalities $\alpha^{\vee}(x)\geq 0$ for all positive coroots (or equivalently, for all simple coroots) $\alpha^{\vee}$ ; the open Weyl chamber, defined by the inequalities $\alpha^{\vee}(x)>0$ , is the interior of the closed Weyl chamber (and the closed Weyl chamber is its closure). Dually, the cone in $E^{*}$ generated by the fundamental coweights, which is the dual cone to the positive root cone, is also known as the closed Weyl chamber (in $E^{*}$ ). The choice of a Weyl chamber is equivalent to a choice of positive roots: the Weyl group acts simply transitively on the set of Weyl chambers.

Given a choice of positive roots $\Phi_{+}\subseteq\Phi$ , there exists a unique $\beta\in\Phi_{+}$ such that $\beta+\alpha\not\in\Phi_{+}$ for all $\alpha\in\Phi_{+}$ : this is known as the highest root of $\Phi$ (relative to this choice of positive roots), and it belongs to the open Weyl chamber. The (integer) coefficients $m_{i}$ of $\beta$ on the basis of simple roots, i.e., the $m_{i}$ such that $\beta=\sum_{i=1}^{n}m_{i}\alpha_{i}$ where $\alpha_{1},\ldots,\alpha_{n}$ are the simple roots, often come up in formulae involving $G$ or its root system. It is often more convenient to define $\alpha_{0}=-\beta$ (the lowest root) and $m_{0}=1$ so that $\sum_{i=0}^{n}m_{i}\alpha_{i}=0$ .

A.11. Given a root system $\Phi$ and a choice of positive roots, we define the Dynkin diagram of $\Phi$ as the graph whose vertices (“nodes”) are the simple roots, two nodes $\alpha,\beta$ being connected by a single, double or triple edge, or by no edge at all, according as the angle between them is $2\pi/3$ , $3\pi/4$ or $5\pi/6$ , or $\pi/2$ for no edge at all, these being the only possible values; in the case of a double or triple edge, it is oriented by pointing from the simple root with the larger norm to that with the smaller norm. (These constructions rely on a Euclidean structure compatible with $\Phi$ , but are independent of the choice of such a structure.)

The Dynkin diagram of a root system $\Phi$ determines the latter up to isomorphism. Furthermore, all possible irreducible root systems can be classified, with the help of Dynkin diagrams. The simply laced Dynkin diagrams are shown on Figure 4.

A.12. We now return to the setup of a compact Lie group $G$ as in A.1, and we furthermore assume $G$ to be semisimple.

The nonzero weights of the adjoint representation of $G$ are known as the roots of $G$ (or of $\mathfrak{g}$ ), and each one occurs with multiplicity $1$ ; as for the zero weight space, it is the complexification of $\mathfrak{t}$ itself (in other words, its multiplicity is the rank of $G$ ). So, writing $\mathfrak{g}_{\mathbb{C}}^{\alpha}:=\{z\in\mathfrak{g}_{\mathbb{C}}:[x,z]=i\alpha(x)\,z\text{~{}for all~{}}x\in\mathfrak{t}\}$ for the weight space of $\alpha\in\Phi$ acting on $\mathfrak{g}_{\mathbb{C}}:=\mathfrak{g}\otimes_{\mathbb{R}}\mathbb{C}$ , we have the weight space decomposition $\mathfrak{g}_{\mathbb{C}}=\mathfrak{t}_{\mathbb{C}}\oplus\bigoplus_{\alpha\in\Phi}\mathfrak{g}_{\mathbb{C}}^{\alpha}$ (compare [33, formula (2.16)] and [32, theorem 6.38]).

The set $\Phi$ of these roots is an abstract (reduced, crystallographic) root system ([10, theorem 19.2] or [32, theorem 6.44]), whose Weyl group is that which we have already associated to $G$ (cf. A.1); it is irreducible if and only if $G$ is simple. Furthermore, this induces a bijection between the isomorphism classes of semisimple compact Lie algebras and root systems ([32, corollary 7.55] or [33, corollary 7.3]), or equivalently, isogeny classes of semisimple compact Lie groups or isomorphism classes of semisimple simply connected compact Lie groups ([26, §7.3]); we clarify in A.13 below the classification of groups inside an isogeny class.

Note that, as a set, $\Phi$ depends not only on $G$ but on the choice of the maximal torus $T$ used to define the weights (cf. A.1), or equivalently, not only on $\mathfrak{g}$ but also on $\mathfrak{t}$ : in fact, $\Phi$ is a subset of the dual $\mathfrak{t}^{*}$ of $\mathfrak{t}$ ; however, as an abstract root system, it does not depend on this choice.

The following proposition follows immediately from what has already been said:

Proposition A.13.

If $G$ is a semisimple compact Lie group with rank $n$ , then the range of values taken by the adjoint character $\operatorname{ch}_{\operatorname{ad}}$ of $G$ is precisely the range of the function $n+\mathscr{F}_{\Phi}$ where $\mathscr{F}_{\Phi}\colon x\mapsto\sum_{\alpha\in\Phi}e^{2\pi i\alpha(x)}$ is the Fourier transform of $\Phi$ .

More precisely, if $T$ is a maximal torus in $G$ with Lie algebra $\mathfrak{t}$ , and $u\in T$ is written $\exp(2\pi x)$ for $x\in\mathfrak{t}$ , then $\operatorname{ch}_{\operatorname{ad}}(u)=n+\mathscr{F}_{\Phi}(x)$ (and we have pointed out in A.1 that each element $g$ of $G$ is conjugate to an element $u$ of $T$ , which then obviously has $\operatorname{ch}_{\operatorname{ad}}(g)=\operatorname{ch}_{\operatorname{ad}}(u)$ ).

Proof.

As explained in 4, the weights of the adjoint representation are the elements of $\Phi$ each with multiplicity $1$ , and [math] with multiplicity $n$ , i.e., $\operatorname{ch}_{\operatorname{ad}}|_{T}=n\cdot\mathbf{e}(0)+\sum_{\alpha\in\Phi}\mathbf{e}(\alpha)$ , which is precisely the statement of the second paragraph. ∎

A.14. We briefly clarify the relation between isogenous (cf. A.1) compact Lie groups in the semisimple case (this subsection is required for completeness, but for the purposes of this paper we care only about the simply connected groups):

If $G$ is a semisimple compact Lie group, we have noted its root system $\Phi$ can be defined directly from its Lie algebra $\mathfrak{g}$ and that $\mathfrak{t}$ of a maximal torus $T$ of $G$ (which is the same as a maximal abelian subalgebra of $\mathfrak{g}$ , cf. A.1), namely as the set of nonzero $\alpha\in\mathfrak{t}^{*}$ (where $\mathfrak{t}^{*}$ is the dual vector space to $\mathfrak{t}$ ) such that $\mathfrak{g}_{\mathbb{C}}^{\alpha}:=\{z\in\mathfrak{g}\otimes_{\mathbb{R}}\mathbb{C}:[x,z]=i\alpha(x)\,z\text{~{}for all~{}}x\in\mathfrak{t}\}$ is nontrivial (cf. A.1). So the root lattice $Q:=\mathbb{Z}\Phi$ and weight lattice $P:=(\mathbb{Z}\Phi^{\vee})^{*}$ defined in A.1, inside $\mathfrak{t}^{*}$ , are defined at the Lie algebra level: they depend only on the isogeny class of $G$ . We have $Q\subseteq\Gamma^{*}\subseteq P$ or equivalently $Q^{\vee}\subseteq\Gamma\subseteq P^{\vee}$ where $Q^{\vee}$ is the coroot and $P^{\vee}$ the coweight lattice (the inclusion $Q\subseteq\Gamma^{*}$ follows from the fact that the weights of any representation of $G$ , as defined in A.1, belong to $\Gamma^{*}$ , and in particular the roots belong to $\Gamma^{*}$ ; the inclusion $Q^{\vee}\subseteq\Gamma$ follows from the fact that the coroots can also be defined at the Lie algebra level).

The classification of compact Lie groups having Lie algebra $\mathfrak{g}$ is then as follows: $G$ is uniquely defined by giving the lattice $\Gamma$ satisfying $Q\subseteq\Gamma^{*}\subseteq P$ , or equivalently $Q^{\vee}\subseteq\Gamma\subseteq P^{\vee}$ ; and conversely, for any such $\Gamma$ , there exists a unique corresponding Lie group $G$ ; furthermore, the fundamental group of $G$ is Abelian, finite and canonically isomorphic to $\Gamma/Q^{\vee}$ , and the center $Z(G)$ of $G$ is finite and canonically isomorphic to $P^{\vee}/\Gamma$ . ([44, chap. 4, §3, 6o, theorems 9 and 10]; see also [10, theorem 23.1] and [33, corollary 5.109].)

(In particular, the universal covering of a semisimple compact Lie group $G$ is still compact, and corresponds to taking $\Gamma$ equal to the coroot lattice $Q^{\vee}$ . At the other extreme, the centerless group $G/Z(G)$ corresponds to taking $\Gamma$ equal to the coweight lattice $P^{\vee}$ ; this is also known as the “adjoint” form, because it is the image of the adjoint representation $G\to\mathit{GL}(\mathfrak{g})$ . The fundamental group of the adjoint form, or equivalently the center of the universal covering, is defined at the Lie algebra level, and is $P^{\vee}/Q^{\vee}$ .)

Remark A.15.

If $G$ is a semisimple compact Lie group with maximal torus $T=\mathfrak{t}/\Gamma$ , we have already pointed out in A.1 that the set of conjugacy classes of $G$ can be identified (as a set) with $T/W$ , where $W$ is the Weyl group. Lifting to the Lie algebra $\mathfrak{t}$ of $T$ , it can be identified with $\mathfrak{t}/(\Gamma\rtimes W)$ . This point of view is particularly important when $G$ is simply connected ( $\Gamma$ is the coroot lattice) because then it can be shown that the “affine Weyl group” $\Gamma\rtimes W$ is an affine Coxeter group, having a fundamental domain, known as the Weyl alcove, which is the simplex whose vertices are [math] and the $\varpi_{i}^{\vee}/m_{i}$ , where the $\varpi_{i}^{\vee}$ are the fundamental coweights (cf. A.1) and $m_{i}$ are the coefficients of the highest root (cf. A.1). (See [31, chapter 11].)

A.16. We now briefly review the classification of irreducible representations of a semisimple compact Lie group as provided by “highest weight theory”.

As explained in A.1, the weights of a (finite-dimensional) representation $V$ of a semisimple compact Lie group $G$ (relative to the choice of a maximal torus $T\subseteq G$ ) are the $\lambda\in\Gamma^{*}$ such that $V^{\lambda}:=\{z\in V:(\forall u\in T)\,u\cdot z=\mathbf{e}(\lambda)(u)\}$ is nonzero, the multiplicity $m_{\lambda}$ being $\dim V^{\lambda}$ . Now fix a choice of positive roots of $G$ (cf. A.1): a highest weight of $V$ (or of its character, $\chi$ ) is a weight $\lambda$ such that $\lambda+\alpha$ is not a weight for any positive root $\alpha$ ; a dominant integral weight (for $G$ ) is a $\lambda\in\Gamma^{*}$ belonging to the closed Weyl chamber, i.e., such that $\lambda(\alpha^{\vee})\geq 0$ for each simple (or equivalently, positive) coroot $\alpha^{\vee}$ .

Highest weight theory tells us that ([33, theorem 5.5], [32, §8.3] or [44, chap. 4, §3, 7o, theorem 11] or [11, VI.1.7]):

•

every irreducible representation of $G$ has a unique highest weight, which is a dominant integral weight, and its multiplicity is $1$ ;

•

if $V$ is an irreducible representation of $G$ with highest weight $\lambda$ , then the set of weights of $V$ is the intersection of $\lambda+Q$ , where $Q$ is the root lattice, and of the convex hull of the orbit of $\lambda$ under the Weyl group;

•

two irreducible representations of $G$ are isomorphic if and only if they have the same highest weight (thus, we can speak of “the” irreducible representation with highest weight $\lambda$ );

•

every dominant integral weight is the highest weight of an irreducible representation of $G$ (it is unique by the previous point);

•

if $V$ and $V^{\prime}$ are irreducible representations of $G$ with highest weights $\lambda$ and $\lambda^{\prime}$ respectively, then $V\otimes V^{\prime}$ has highest weight $\lambda+\lambda^{\prime}$ and has a unique irreducible factor with that weight ([11, VI.2.8]).

The highest weight of the adjoint representation is the highest root ( $-\alpha_{0}$ ).

A.17. If $G$ is a semisimple compact Lie group and $T$ a maximal torus of $G$ , then the Weyl character formula ([32, §8.5], [33, theorems 5.75–5.77] or [11, VI.1.7]) expresses the value of the irreducible character $\chi_{\lambda}$ with highest weight $\lambda$ (i.e., the character of the irreducible representation with highest weight $\lambda$ ) as the ratio of two skew- $W$ -invariant polynomials on $T$ , namely

[TABLE]

where $W$ is the Weyl group and $\mathrm{sgn}\colon W\to\{\pm 1\}$ the group homomorphism taking the value $-1$ on each reflection $s_{\alpha}$ (i.e., $\mathrm{sgn}(w)$ is the determinant of $w$ acting on the Lie algebra $\mathfrak{t}$ of $T$ ); and the Weyl vector $\rho:=\frac{1}{2}\sum_{\alpha\in\Phi_{+}}\alpha$ is half the sum of the positive roots, which is also the sum $\sum_{i=1}^{n}\varpi_{i}$ of the fundamental weights. The denominator of the above expression can be factored using the Weyl denominator formula:

[TABLE]

A.18. Assuming that $G$ (still a semisimple compact Lie group) is simply connected (so that $\Gamma^{*}$ is the weight lattice, cf. A.13), the irreducible representations having the fundamental weights (cf. A.1) as highest weights are known as fundamental representations, and their characters as the fundamental characters of $G$ .

A.19. The character ring of a compact Lie group $G$ is the ring generated by the characters of $G$ (that is, the set of differences $\chi_{1}-\chi_{2}$ between two characters of $G$ ) for pointwise sum and product. Equivalently, if we define a virtual representation of $G$ to be the formal difference $V_{1}\ominus V_{2}$ of two (finite dimensional) representations, identifying $V_{1}\ominus V_{2}$ with $V^{\prime}_{1}\ominus V^{\prime}_{2}$ whenever $V_{1}\oplus V^{\prime}_{2}$ and $V^{\prime}_{1}\oplus V_{2}$ are isomorphic (“Grothendieck ring” construction), and if we define the (virtual) character of $V_{1}\ominus V_{2}$ to be $\chi_{1}-\chi_{2}$ where $\chi_{i}$ is the character of $V_{i}$ , the character ring can be defined as the set of virtual representations of $G$ with addition and multiplication being defined as the direct sum and tensor product (extended in the obvious fashion to virtual representations).

To put it differently, the character ring of $G$ consists of $\mathbb{Z}$ -linear combinations of the irreducible characters (or representations) of $G$ , the product being defined by the decomposition into irreducibles of a product of characters (tensor product of representations). One can similarly define the character $\mathbb{C}$ -algebra as the set of $\mathbb{C}$ -linear combinations of the irreducible characters (or representations) of $G$ .

Highest weight theory implies that: (1) for a semisimple compact Lie group $G$ with maximal torus $T$ , the character ring of $G$ is simply the invariant part under the Weyl group of the character ring of $T$ (the latter being the group ring of $\Gamma^{*}$ , cf. A.1), and (2) when $G$ is, additionally, simply connected, the character ring is isomorphic to the polynomial algebra, with coefficients in $\mathbb{Z}$ , over indeterminates corresponding to the fundamental representations. ([11, VI.2.1 and VI.2.11].) The corresponding statements also hold with complex coefficients instead of integers.

Remark A.20.

If $G$ is a semisimple simply connected compact Lie group with maximal torus $T=\mathfrak{t}/\Gamma$ , then the character $\mathbb{C}$ -algebra of $G$ can be identified (via restriction to $T$ ) with the set of $W$ -invariant trigonometric polynomials on $T$ (with complex coefficients), where $W$ is the Weyl group, or, lifting to $\mathfrak{t}$ , of $W$ -invariant (hence $\Gamma\rtimes W$ -invariant) combinations of the $\mathbf{e}(\lambda)$ for $\lambda\in\Gamma^{*}$ . Also note that such functions are entirely defined by their values on the Weyl alcove (cf. A.15).

If we are mostly interested in the character values on $T$ (they determine those on $G$ by A.1), and in this paper we are, the irreducible representations of $G$ are something of a needless complication: the character ring of a semisimple simply connected compact Lie group has a $\mathbb{Z}$ -basis consisting of the sums $\sum_{\lambda\in W\lambda_{0}}\mathbf{e}(\lambda)$ for $\lambda$ ranging over an orbit of the Weyl group $W$ acting on $\Gamma^{*}$ .

A.21. We have recalled in A.1 that every element $g$ of a compact Lie group $G$ belongs to a maximal torus $T$ ; when the torus in question is unique, the element $g$ is said to be regular. Assuming that $G$ is semisimple, this is equivalent ([10, theorem 22.3(ii)]) to saying that $g$ is not in the kernel of any $\mathbf{e}(\alpha)$ for root $\alpha\in\Phi$ (cf. A.1). Correspondingly, we say that an element $x$ of the Lie algebra $\mathfrak{g}$ of $G$ is regular when $\mathbf{e}(x)$ is regular, i.e., when $\alpha(x)\not\in\mathbb{Z}$ for all $\alpha\in\Phi$ (this means that $x$ is represented by an element in the interior of the Weyl alcove, cf. A.15; also compare [11, V.7.8]).

The following fact is crucial to the proof given in appendix B:

Proposition A.22.

Let $G$ be a semisimple simply connected compact Lie group. If $g\in G$ and $T$ is a maximal torus containing $g$ , then the following are equivalent:

•

the element $g$ is regular,

•

the differentials $d\operatorname{ch}_{1},\ldots,d\operatorname{ch}_{n}$ of the fundamental characters of $G$ (cf. A.15) are independent at $g$ ,

•

the differentials $d\operatorname{ch}_{1}|_{T},\ldots,d\operatorname{ch}_{n}|_{T}$ of the fundamental characters of $G$ restricted to $T$ are independent at $g$ .

In a more general context, the equivalence of the first two statements is due to Kostant ([35, theorem 0.1]) and Steinberg ([51, theorem 8.1]); however, since we are only considering compact Lie group, every element $g$ belongs to a maximal torus (i.e., is “semisimple”), making the proof of the equivalence considerably easier and giving the third statement as a byproduct (as detailed in [51, §8.2–8.6]).

A.23. We now briefly discuss how a subset of the nodes of the Dynkin diagram of a semisimple compact Lie group defines a Lie subgroup with the Dynkin diagram defined by the subset in question (i.e., the induced subgraph).

So let $G$ be a semisimple compact Lie group, fix a maximal torus $T$ in $G$ , and let $\mathfrak{g},\mathfrak{t}$ be the corresponding Lie algebras and $\Phi$ the root system of $G$ (cf. 4); choose a system of simple roots $\alpha_{1},\ldots,\alpha_{n}\in\Phi$ (where $n$ is the rank of $G$ ). Now if $I\subseteq\{1,\ldots,n\}$ , this defines a root system $\Phi_{I}\subseteq\Phi$ , sometimes known as the parabolic subsystem associated to $I$ , namely the set $\Phi_{I}:=\Phi\cap\bigoplus_{i\in I}\mathbb{Z}\alpha_{i}=\Phi\cap\bigoplus_{i\in I}\mathbb{R}\alpha_{i}$ “generated by” the $\alpha_{i}$ for $i\in I$ (see [31, §5.1] for a discussion, or [39, §12.1]), so that its Dynkin diagram consists of the nodes of that of $\Phi$ labeled by elements of $I$ .

We now fix such an $I$ and explain how to define a Lie subgroup of $G$ with root system $\Psi:=\Phi_{I}$ . See also [1, §7.3–7.4] for a more detailed and pedagogical account of this construction.

For $\alpha\in\Phi$ , let $\mathfrak{g}_{\mathbb{C}}^{\alpha}:=\{z\in\mathfrak{g}_{\mathbb{C}}:[x,z]=i\alpha(x)\,z\text{~{}for all~{}}x\in\mathfrak{t}\}$ inside $\mathfrak{g}_{\mathbb{C}}:=\mathfrak{g}\otimes_{\mathbb{R}}\mathbb{C}$ be the corresponding weight space. Then (see [33, corollary 5.94], or [39, proposition 12.6] in a different context) $\mathfrak{l}_{\mathbb{C}}:=\mathfrak{t}_{\mathbb{C}}\oplus\bigoplus_{\alpha\in\Psi}\mathfrak{g}_{\mathbb{C}}^{\alpha}$ is a (complex) Lie subalgebra of $\mathfrak{g}_{\mathbb{C}}$ (sometimes known as a “parabolic Levi factor”), which can be further factored as a Lie algebra direct sum (i.e., with trivial bracket between the summands) of its center $\mathfrak{z}(\mathfrak{l}_{\mathbb{C}})=\bigcap_{\alpha\in\Psi}\ker\alpha\subseteq\mathfrak{t}_{\mathbb{C}}$ and its semisimple subalgebra $\mathfrak{l}_{\mathbb{C}}^{\prime}=[\mathfrak{l}_{\mathbb{C}},\mathfrak{l}_{\mathbb{C}}]=\mathfrak{t}^{\prime}_{\mathbb{C}}\oplus\bigoplus_{\alpha\in\Psi}\mathfrak{g}_{\mathbb{C}}^{\alpha}$ where $\mathfrak{t}^{\prime}_{\mathbb{C}}$ is the complex subspace spanned by the coroots $\alpha^{\vee}$ for $\alpha\in\Psi$ (seen as elements of $\mathfrak{t}_{\mathbb{C}}$ ). Since $\mathfrak{l}_{\mathbb{C}}$ and $\mathfrak{l}^{\prime}_{\mathbb{C}}$ are stable under complex conjugation, they define (real!) Lie subalgebras $\mathfrak{l}:=\mathfrak{l}_{\mathbb{C}}\cap\mathfrak{g}$ and $\mathfrak{l}^{\prime}:=\mathfrak{l}^{\prime}_{\mathbb{C}}\cap\mathfrak{g}$ of $\mathfrak{g}$ , hence compact Lie subgroups $L$ , $L^{\prime}$ of $G$ having these Lie algebras (see [33, theorem 5.114]). Then $L^{\prime}$ is a semisimple Lie subgroup of $G$ having root system $\Psi$ and maximal torus $T^{\prime}$ with Lie algebra $\mathfrak{t}^{\prime}$ (as for $L$ , it is isogenous to the product of $L^{\prime}$ with a torus of rank $n-\#I$ , so that it has the same rank $n$ as $G$ ).

(In fact, the only properties of $\Psi$ used above are that it is a “closed” subsystem of $\Phi$ : see [39, definition 13.2 and theorem 13.6].)

In appendix B, we will use the above construction in the case where $I$ is the complement of a single node $\{i\}$ in the Dynkin diagram.

Appendix B Serre’s result on characters of compact Lie groups

In this section, we prove the $\mathsf{E}_{n}$ cases of [50, Theorem 3’]. The proof in the $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ cases has been kindly communicated to us by J.-P. Serre and only slightly adapted for symbolic computation with Sage (J.-P. Serre was able to perform the entire computation by hand and we have not attempted to reproduce this feat; any errors in the following expositions are, of course, entirely our own) and straightforwardly extended to compute all critical values of the adjoint character; in the $\mathsf{E}_{6}$ case, J.-P. Serre referred us to a proof devised by A. Connes, which we do not follow here, preferring instead a straightforward analogue of the $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ cases, at the cost of considerably more computing power (the $\mathsf{E}_{6}$ case does not seem doable by hand with the technique presented below).

Theorem B.1.

If $G$ is a semisimple compact Lie group of type $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ or $\mathsf{E}_{8}$ respectively, and $\operatorname{ch}_{\operatorname{ad}}$ its adjoint character (cf. A.1); then $\inf_{g\in G}\operatorname{ch}_{\operatorname{ad}}(g)$ is equal to $-3$ , $-7$ or $-8$ respectively.

(The proof for $\mathsf{A}_{n}$ and $\mathsf{D}_{n}$ has been given in 5.3 and 5.4 respectively.)

In each case, we divide the proof in two steps: the reduction step and the computation step, the second being itself subdivided into an elimination substep and a ruling-out substep.

We will call $G$ the simply connected semisimple compact Lie group of type $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ or $\mathsf{E}_{8}$ as the case may be, $T$ its maximal torus (cf. A.1) and $\mathfrak{t}$ the Lie algebra of the latter.

The reduction step uses the trick (†) explained below (and based essentially on A.22) to reduce the number of variables by observing that any critical point of $\operatorname{ch}_{\operatorname{ad}}$ must lie on certain linear subspaces of $\mathfrak{t}$ . The computation step then finds the critical values of some polynomial function $h$ of several variables $x_{1},\ldots,x_{r}$ by elimination theory: there are slight variations in each of the cases below, but broadly speaking, consider the ideal of $\mathbb{C}[x_{1},\ldots,x_{r},y]$ generated by $\frac{\partial h}{\partial x_{i}}$ and $y-h$ (defining the — often [math]-dimensional — algebraic variety of critical points of $h$ ) and use a Gröbner basis for some elimination order (that is, a monomial order such that $y<x_{1}^{i_{1}}\cdots x_{r}^{i_{r}}$ for all $i_{1},\ldots,i_{r}$ not all zero) to find the projection of this variety on the $y$ coordinate (represented by a polynomial in $y$ which one factors to find the actual values, which for some currently mysterious reason happen to be always rational); unfortunately, elimination theory considers all complex values of $x_{1},\ldots,x_{r}$ , so there are many spurious values, and one must then consider each computed value, or at least those that are smaller than the actual minimum, and rule them out by showing that, for some reason, they cannot be realized for real values $x_{1},\ldots,x_{r}$ (generally by noticing that some other element in the Gröbner basis does not have roots in the domain considered).

Let us now explain the idea of the reduction step in more detail. We generally follow appendix A for notation: for example, we call $\Phi$ the root system of $G$ .

The reduction trick (†) is as follows. Suppose $z\in\mathfrak{t}$ is a critical value of the adjoint character (which is a fundamental character in each of $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ and $\mathsf{E}_{8}$ ), or more generally that it is a critical value of any polynomial in the fundamental characters in which no fundamental character appears more than once; then the result of Kostant and Steinberg A.22 implies that $z$ is on a root hyperplane $\mathfrak{t}^{\prime}=\{\alpha=m\}$ (where $\alpha\in\Phi$ and $m\in\mathbb{Z}$ ); now the affine Weyl group $\Gamma\rtimes W$ (cf. A.15) acts transitively on the set of such root hyperplanes and preserves all character values, so we can assume that the $z$ lies on the root hyperplane $\{\alpha_{0}=0\}$ , where $-\alpha_{0}$ is the highest root, which (for $\mathsf{D}_{n}$ , $\mathsf{E}_{6}$ , $\mathsf{E}_{7}$ , $\mathsf{E}_{8}$ ) is also one of the fundamental weight, say $\varpi_{i}$ . (This $i$ can be read by taking the extended Dynkin diagram for $\Phi$ : it is the node to which the extender node attaches.) In other words, the coordinates of $z$ on the basis of coroots $\alpha^{\vee}_{1},\ldots,\alpha^{\vee}_{n}$ have a zero at coordinate $i$ (the one which is measured by $\varpi_{i}$ ): so $z$ lives, in fact, in the hyperplane generated by $\alpha^{\vee}_{j}$ for $j\neq i$ . Now, as explained in A.22, the linear subspace $\mathfrak{t}^{\prime}$ of $\mathfrak{t}$ generated by $\alpha^{\vee}_{j}$ for $j\in I$ (here $I:=\{1,\ldots,n\}\setminus\{i\}$ ) is, in a natural way, the Lie algebra of the maximal torus $T^{\prime}$ of a semisimple Lie subgroup $G^{\prime}$ of $G$ (denoted $L^{\prime}$ in A.22), whose root system $\Phi^{\prime}$ has a Dynkin diagram obtained from that of $\Phi$ by keeping only the roots labeled by an element of $I$ , so, in our case, by deleting node $i$ . So by restricting the character (initially the adjoint character of $G$ ) to $G^{\prime}$ , we are left with a character on a Lie group $G^{\prime}$ having a rank smaller by $1$ . Of course, the character restricted to the subgroup $G^{\prime}$ in question is no longer the adjoint character, but it can be expressed in terms of the fundamental characters of $G^{\prime}$ using standard tables of “branching rules” or a computer program like Sage444http://doc.sagemath.org/html/en/thematic_tutorials/lie/branching_rules.html (in fact, branching are typically given first for the restriction from $G$ to an intermediate subgroup $G\supseteq G_{1}\supseteq G^{\prime}$ , maximal in $G$ , and described through the removal of the node $i$ from the extended Dynkin diagram of $G$ by means of so-called “Borel-de Siebenthal theory: see [56, §8.10] or [31, chapter 12]; the restriction from $G_{1}$ to $G$ is then straightforward as $G$ is a factor of $G_{1}$ , and we will give both in what follows).

In what follows, we number the nodes of the Dynkin diagrams as in Bourbaki (cf. figure 4). We write $\operatorname{ch}_{i}^{G}$ for the $i$ -th fundamental representation of the simple group $G$ , and $\operatorname{ch}_{\operatorname{ad}}$ for the adjoint representation: thus, $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{6}}=\operatorname{ch}_{2}^{\mathsf{E}_{6}}$ and $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{7}}=\operatorname{ch}_{1}^{\mathsf{E}_{7}}$ and $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}=\operatorname{ch}_{8}^{\mathsf{E}_{8}}$ . More generally, we write $\operatorname{ch}_{\lambda}$ for the character with highest weight $\lambda$ (written as a combination of fundamental weights $\varpi_{i}$ ), so that $\operatorname{ch}_{i}$ is an abbreviation for $\operatorname{ch}_{\varpi_{i}}$ .

Theorem B.2.

The set of critical values of $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{6}}$ is $-3$ , $-2$ , $6$ , $14$ and $78$ . We therefore have $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{6}}=-9$ and $\chi({\mathsf{E}}_{6})\geq 9$ .

Proof.

If $z$ is a critical point of $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{6}}=\operatorname{ch}_{2}^{\mathsf{E}_{6}}$ , i.e., if the differential $d\operatorname{ch}_{2}^{\mathsf{E}_{6}}$ vanishes there, then by (†) we can assume that $z$ belongs to (the Lie algebra $\mathfrak{t}$ of the maximal torus of) the Lie subgroup $\mathsf{A}_{5}$ defined by removing node $2$ from the Dynkin diagram of $\mathsf{E}_{6}$ .

Now $\operatorname{ch}_{2}^{\mathsf{E}_{6}}|_{\mathsf{A}_{5}}=2\operatorname{ch}_{3}^{\mathsf{A}_{5}}+\operatorname{ch}_{1}^{\mathsf{A}_{5}}\,\operatorname{ch}_{5}^{\mathsf{A}_{5}}+2$ . In more detail, the branching rule for the maximal subgroup $\mathsf{A}_{1}\times\mathsf{A}_{5}$ of $\mathsf{E}_{6}$ gives: $\operatorname{ch}_{2}^{\mathsf{E}_{6}}|_{\mathsf{A}_{1}\times\mathsf{A}_{5}}=\operatorname{ch}_{\varpi_{1}}^{\mathsf{A}_{1}}\operatorname{ch}_{\varpi_{3}}^{\mathsf{A}_{5}}+\operatorname{ch}_{\varpi_{1}+\varpi_{5}}^{\mathsf{A}_{5}}+\operatorname{ch}_{2\varpi_{1}}^{\mathsf{A}_{1}}$ as witnessed by Sage:

sage: E6 = WeylCharacterRing("E6", style="coroots") sage: br = branching_rule(E6, "A1xA5","extended") sage: br.branch(E6(E6.fundamental_weights()[2])) A1xA5(1,0,0,1,0,0) + A1xA5(2,0,0,0,0,0) + A1xA5(0,1,0,0,0,1)

We then observe that

[TABLE]

and that

[TABLE]

so that

[TABLE]

Evaluating at the identity of $\mathsf{A}_{1}$ (where $\operatorname{ch}_{1}^{\mathsf{A}_{1}}=2$ ), we get

[TABLE]

as announced.

Now the fundamental characters $\operatorname{ch}_{i}$ of $\mathsf{A}_{5}$ are the elementary symmetric functions $\sigma_{i}$ of six variables $u_{0},\ldots,u_{5}$ ranging over the unit circle $\mathbb{U}:=\{u\in\mathbb{C}:|u|=1\}$ and constrained by $u_{0}u_{1}u_{2}u_{3}u_{4}u_{5}=1$ (the eigenvalues of the element of $\mathit{SU}_{6}$ ). This means that we are to compute the critical values of $h:=2\sigma_{3}+\sigma_{1}\sigma_{5}+2$ over $\{(u_{0},\ldots,u_{5})\in\mathbb{U}^{6}:u_{0}u_{1}u_{2}u_{3}u_{4}u_{5}=1\}$ (which is the maximal torus of $\mathsf{A}_{5}$ ).

This concludes the reduction step (which is simpler in the case of $\mathsf{E}_{6}$ than for $\mathsf{E}_{7},\mathsf{E}_{8}$ ), and we now proceed to the computation step (which, compared to $\mathsf{E}_{7},\mathsf{E}_{8}$ , has fewer cases to consider, but is computationally more challenging).

By elimination theory, we can compute the critical values for $u_{0},\ldots,u_{5}$ ranging over $\mathbb{C}^{6}$ subject to $u_{0}u_{1}u_{2}u_{3}u_{4}u_{5}=1$ : consider the ideal of $\mathbb{C}[u_{0},\ldots,u_{5},y]$ generated by $u_{i}\frac{\partial h}{\partial u_{i}}-u_{0}\frac{\partial h}{\partial u_{0}}$ for $i=1,\ldots,5$ (because saying that $dh$ is proportional to $d(u_{0}\cdots u_{5})$ means $u_{i}\frac{\partial h}{\partial u_{i}}=u_{0}\frac{\partial h}{\partial u_{0}}$ ) and also $y-h$ ; and perform elimination of the variables $u_{0},\ldots,u_{5}$ (by computing a Gröbner basis for a monomial order for which $y<u_{0}^{i_{0}}\cdots u_{5}^{i_{5}}$ for any $i_{0},\ldots,i_{5}$ not all zero) in this ideal to obtain the projection on the $y$ coordinate of the critical points.

By computing the Gröbner basis of the corresponding ideal we obtain that the resulting set of possible critical values is $-66$ , $-3$ , $-2$ , $6$ , $14$ and $78$ .

Now the critical value $-66$ cannot be attained on

[TABLE]

We add the inequality $y+66$ to the ideal and recompute a Gröbner basis. In this basis we have $\sigma_{5}^{3}=-6^{3}$ .

And this is impossible because $\sigma_{5}$ is the sum of the $u_{i}^{-1}$ , which can only take the value $6$ in absolute value provided all the $u_{i}$ are equal to one and the same $6$ -th root of unity $\zeta$ , in which case $\sigma_{1}=6\zeta$ and $\sigma_{5}=6\zeta^{-1}$ and $\sigma_{3}=20\zeta^{3}$ and by checking the possible $\zeta$ one notices that $h=2\sigma_{3}+\sigma_{1}\,\sigma_{5}+2$ does not, in fact, take the value $-66$ .

The value $-3$ , on the other hand, is attained, namely when three of the $u_{i}$ are equal to one primitive cube root of unity and the other three are equal to the other. So it is its minimum and so a critical value.

By further Gröbner basis computation we obtain the $78$ is attained only at $u_{i}=1$ . We also obtained that the critical value $14$ is attained only with four $u_{i}$ set at $-1$ and two $u_{i}$ set at $1$ . The critical value $6$ is attained only by fixing two $u_{i}$ at $1$ , two at $e^{i2\pi/3}$ and two at $e^{-2i\pi/3}$ . The critical value $-2$ corresponds to a manifold of dimension $2$ . In this manifold is the point with four $u_{i}$ set at $1$ and two $u_{i}$ set at $-1$ .

From the formula ${\mathcal{F}}_{\mathsf{E}_{6}}(x)=\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{6}}(x)-6$ we get $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{6}}(x)=-9$ and then using Corollary 5.2 $\chi(\mathsf{E}_{6})\geq 1-(-9/72)^{-1}=9$ . ∎

Theorem B.3.

The set of critical values of $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}$ is $-8$ , $-4$ , $-\frac{104}{27}$ , $-\frac{57}{16}$ , $-3$ , $-2$ , [math], $5$ , $24$ , $248$ . We therefore have $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{8}}=-16$ and $\chi({\mathsf{E}}_{8})\geq 16$ .

Proof.

If $z$ is a critical point of $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}=\operatorname{ch}_{8}^{\mathsf{E}_{8}}$ , i.e., if the differential $d\operatorname{ch}_{8}^{\mathsf{E}_{8}}$ vanishes there, then by (†) we can assume that $z$ belongs to (the Lie algebra $\mathfrak{t}$ of the maximal torus of) the Lie subgroup $\mathsf{E}_{7}$ defined by removing node $8$ from the Dynkin diagram of $\mathsf{E}_{8}$ .

Now

[TABLE]

Now since neither $\operatorname{ch}_{1}^{\mathsf{E}_{7}}$ nor $\operatorname{ch}_{7}^{\mathsf{E}_{7}}$ appear more than once (or with any exponent) in $\operatorname{ch}_{1}^{\mathsf{E}_{7}}+2\operatorname{ch}_{7}^{\mathsf{E}_{7}}+3$ , we can apply (†) again: at a point $z$ where the differential of this expression vanishes, the differentials of the fundamental characters are not independent, so $z$ belongs to (the Lie algebra $\mathfrak{t}$ of the maximal torus of) the Lie subgroup $\mathsf{D}_{6}$ defined by removing node $1$ from the Dynkin diagram of $\mathsf{E}_{7}$ .

We have

[TABLE]

and

[TABLE]

giving:

[TABLE]

Again, there are no multiple occurrences of the various $\operatorname{ch}_{i}^{\mathsf{D}_{6}}$ , so we can apply (†) one more time: at a point $z$ where the differential of this expression vanishes, the differentials of the fundamental characters are not independent, so $z$ belongs to (the Lie algebra $\mathfrak{t}$ of the maximal torus of) the Lie subgroup $\mathsf{D}_{4}\times\mathsf{A}_{1}$ defined by removing node $2$ from the Dynkin diagram of $\mathsf{D}_{6}$ .

Here the branching gets more complicated: We use

sage: WeylCharacterRing("D6").maximal_subgroups()

within Sage to find the correct rule for branching to $\mathsf{A}_{1}\times\mathsf{A}_{1}\times\mathsf{D}_{4}$ , and in principle the two $\mathsf{A}_{1}$ factors are not symmetric (although in the end it turns out that they are, up to a symmetry of $\mathsf{D}_{4}$ ) so one must use br.describe() to chop off the correct $\mathsf{A}_{1}$ factor (call it $\mathsf{A}_{1}^{\circ}$ in what follows). We find:

•

$\operatorname{ch}_{2}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{2}^{\mathsf{D}_{4}}+2\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{1}^{\mathsf{D}_{4}}+((\operatorname{ch}_{1}^{\mathsf{A}_{1}})^{2}-1)+3$ (from $\operatorname{ch}_{2}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}^{\circ}\times\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{2}^{\mathsf{D}_{4}}+\operatorname{ch}_{1}^{\mathsf{A}_{1}^{\circ}}\,\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{1}^{\mathsf{D}_{4}}+((\operatorname{ch}_{1}^{\mathsf{A}_{1}})^{2}-1)+((\operatorname{ch}_{1}^{\mathsf{A}_{1}^{\circ}})^{2}-1)$ ).

•

$\operatorname{ch}_{5}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{4}^{\mathsf{D}_{4}}+2\operatorname{ch}_{3}^{\mathsf{D}_{4}}$ (from $\operatorname{ch}_{5}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}^{\circ}\times\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{4}^{\mathsf{D}_{4}}+\operatorname{ch}_{1}^{\mathsf{A}_{1}^{\circ}}\,\operatorname{ch}_{3}^{\mathsf{D}_{4}}$ ).

•

$\operatorname{ch}_{6}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{3}^{\mathsf{D}_{4}}+2\operatorname{ch}_{4}^{\mathsf{D}_{4}}$ (from $\operatorname{ch}_{5}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}^{\circ}\times\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{A}_{1}}\,\operatorname{ch}_{3}^{\mathsf{D}_{4}}+\operatorname{ch}_{1}^{\mathsf{A}_{1}^{\circ}}\,\operatorname{ch}_{4}^{\mathsf{D}_{4}}$ ).

•

$\operatorname{ch}_{1}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{D}_{4}}+2\operatorname{ch}_{1}^{\mathsf{A}_{1}}$ (from $\operatorname{ch}_{1}^{\mathsf{D}_{6}}|_{\mathsf{A}_{1}^{\circ}\times\mathsf{A}_{1}\times\mathsf{D}_{4}}=\operatorname{ch}_{1}^{\mathsf{D}_{4}}+\operatorname{ch}_{1}^{\mathsf{A}_{1}^{\circ}}\,\operatorname{ch}_{1}^{\mathsf{A}_{1}}$ ).

Finally, we get:

[TABLE]

We can now apply the reduction trick (†) one last time, for the $\mathsf{D}_{4}$ factor: since $\operatorname{ch}_{1}^{\mathsf{A}_{1}}$ is obviously independent from the $\operatorname{ch}_{i}^{\mathsf{D}_{4}}$ , at a point $z$ where the differential of the expression $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}|_{\mathsf{A}_{1}\times\mathsf{D}_{4}}$ above vanishes, the differentials of the fundamental characters $\operatorname{ch}_{i}^{\mathsf{D}_{4}}$ are not independent, so $z$ belongs to (the Lie algebra $\mathfrak{t}$ of the maximal torus of) the Lie subgroup $\mathsf{A}_{1}\times(\mathsf{A}_{1})^{3}$ defined by removing node $2$ from the Dynkin diagram of $\mathsf{D}_{4}$ .

To compute this restriction, first examine the restriction of $\mathsf{D}_{4}$ to the maximal subgroup $\mathsf{A}_{1}\times\mathsf{A}_{1}\times\mathsf{A}_{1}\times\mathsf{A}_{1}$ of $\mathsf{D}_{4}$ (seen by extending the Dynkin diagram of $\mathsf{D}_{4}$ and removing the node connected to the four others): if we call $t_{1},\ldots,t_{4}$ the (single) fundamental characters of the various $\mathsf{A}_{1}$ factors, numbered in the same way as the nodes of the extended diagram of $\mathsf{D}_{4}$ from which they come (except that $t_{2}$ comes from the extending node), then $\operatorname{ch}_{1}^{\mathsf{D}_{4}}|_{(\mathsf{A}_{1})^{4}}=t_{1}t_{2}+t_{3}t_{4}$ and $\operatorname{ch}_{3}^{\mathsf{D}_{4}}|_{(\mathsf{A}_{1})^{4}}=t_{1}t_{4}+t_{2}t_{3}$ and $\operatorname{ch}_{4}^{\mathsf{D}_{4}}|_{(\mathsf{A}_{1})^{4}}=t_{1}t_{3}+t_{2}t_{4}$ and finally

[TABLE]

Finally, restricting $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}$ to $\mathsf{A}_{1}\times(\mathsf{A}_{1})^{3}$ (the first $\mathsf{A}_{1}$ factor being the factor $\mathsf{A}_{1}$ in $\mathsf{A}_{1}\times\mathsf{D}_{4}$ earlier and the other three coming from nodes $1,3,4$ of $\mathsf{D}_{4}$ as described in the previous paragraphs), we have

[TABLE]

where $s$ is the fundamental character from the first $\mathsf{A}_{1}$ factor and $\sigma_{i}$ are the elementary symmetric functions in the fundamental characters $t_{1},t_{3},t_{4}$ of the three other $\mathsf{A}_{1}$ factors.

We now need to find the critical values of this function

[TABLE]

There is one subtlety, however: “critical” means that for each $t_{i}$ , as well as for $s$ , we either have $\frac{\partial h}{\partial t_{i}}=0$ (resp. $\frac{\partial h}{\partial s}=0$ ) or $t_{i}=\pm 2$ (resp. $s=\pm 2$ ). Indeed, each $t_{i}$ (as well as $s$ ) is a character of $\mathsf{A}_{1}$ , so it is $u+u^{-1}$ for the two eigenvalues $u,u^{-1}$ of the element of $\mathit{SU}_{2}$ in question, so the critical values of $t_{i}$ itself are $\pm 2$ .

This concludes the reduction step for $\mathsf{E}_{8}$ . The computation step is then to use elimination theory, in each possible case depending on how many of the $t_{i}$ satisfy $\frac{\partial h}{\partial t_{i}}=0$ and how many satisfy $t_{i}=2$ and $t_{i}=-2$ , and similarly for $s$ , to find the corresponding values of $h=2\sigma_{3}+\sigma_{1}^{2}+2(s+1)\sigma_{2}+4(s+2)\sigma_{1}+s^{2}+8s+4$ .

For this, we must consider $10\times 3=30$ cases according to constraints placed on the $t_{i}$ (which can be set equal to $+2$ or to $-2$ or to satisfy $\frac{\partial h}{\partial t_{i}}=0$ , which we denote as “ $t_{i}=\partial$ ” for short) and on $s$ (similarly $s=+2$ or $s=-2$ or $s=\partial$ ). In each case, we consider the ideal of $\mathbb{C}[s,t_{1},t_{3},t_{4},y]$ generated by the $t_{i}-2$ or $t_{i}+2$ or $\frac{\partial h}{\partial t_{i}}$ as the case may be, and similarly for $s$ , and also $y-h$ ; and perform elimination of the variables $s,t_{1},t_{3},t_{4}$ (by computing a Gröbner basis for a monomial order for which $y<s^{j}t_{1}^{i_{1}}t_{3}^{i_{3}}t_{4}^{i_{4}}$ for any $j,i_{1},i_{3},i_{4}$ not all zero) in this ideal to obtain the projection on the $y$ coordinate of the critical points.

By considering all cases we find that the set of possible critical values is $-652$ , $-27$ , $-12$ , $-\frac{64}{7}$ , $-8$ , $-4$ , $-\frac{104}{27}$ , $-\frac{57}{16}$ , $-3$ , $-2$ , [math], $5$ , $24$ and $248$ . For each such value and each possible critical value we compute the manifold which turns out to be always [math]-dimensional. The points of those manifolds can be enumerated and we obtain the list of critical values by keeping only the values for which at least one of the point has $|t_{1}|$ , $|t_{3}|$ , $|t_{4}|$ , $|s|\leq 2$ .

From the formula ${\mathcal{F}}_{\mathsf{E}_{8}}(x)=\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{8}}(x)-8$ we get $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{8}}(x)=-16$ and then using Corollary 5.2 $\chi(\mathsf{E}_{8})\geq 1-(-16/240)^{-1}=16$ . ∎

Theorem B.4.

The set of critical values of $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{7}}$ is $-7$ , $-3$ , $-2$ , $1$ , $\frac{17}{5}$ , $5$ , $25$ , $133$ . We therefore have $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{7}}=-14$ and $\chi({\mathsf{E}}_{7})\geq 10$ .

Proof.

We have $\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{7}}=\operatorname{ch}_{1}^{\mathsf{E}_{7}}$ and all the reduction step has already been explained above in the $\mathsf{E}_{8}$ case: we get

[TABLE]

so that

[TABLE]

The computation step is then similar to $\mathsf{E}_{8}$ , except there is now less symmetry between the $t_{i}$ (one can only exchange $t_{1}$ and $t_{4}$ ): one must therefore distinguish $3\times 3\times 6$ cases.

By computing a Gröbner basis we obtain that the set of possible critical values is $-191$ , $-11$ , $-35/4$ , $-7$ , $-3$ , $-2$ , $1$ , $\frac{17}{5}$ , $5$ , $25$ and $133$ . For each case and critical value we compute the corresponding manifold and its complex points. In each case except one the manifold is [math]-dimensional and for two [math]-dimensional cases the computation of the points does not finish. Those $3$ problematic cases occur for the value $-2$ .

For the other cases we compute the points and keep only the values for which one of the points has $|t_{1}|$ , $|t_{3}|$ , $|t_{4}|$ , $|s|\leq 2$ . It turns out that the value $-2$ is attained by one of those points and so the $3$ problematic cases do not prevent us from concluding that $-2$ is a critical value.

From the formula ${\mathcal{F}}_{\mathsf{E}_{7}}(x)=\operatorname{ch}_{\operatorname{ad}}^{\mathsf{E}_{7}}(x)-7$ we get $\inf_{x}{\mathcal{F}}_{\mathsf{E}_{7}}(x)=-14$ and then using Corollary 5.2 $\chi(\mathsf{E}_{7})\geq 1-(-14/126)^{-1}=10$ . ∎

One possible extension of this work could be to consider the non-simply laced diagrams, that is $\mathsf{B}_{n}$ , $\mathsf{C}_{n}$ , $\mathsf{F}_{4}$ and $\mathsf{G}_{2}$ .

Bibliography56

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Arvanitoyeorgos, An Introduction to Lie Groups and the Geometry of Homogeneous Spaces , AMS, 2003.
2[2] C. Bachoc, P.E.B. De Corte, F.M. de Oliveira Filho, and F. Vallentin, Spectral bounds for the independence ratio and the chromatic number of an operator , Israel J. Math. 202 (2014), 227–254. ( https://arxiv.org/abs/1301.1054 )
3[3] C. Bachoc, G. Nebe, F.M. de Oliveira Filho, and F. Vallentin, Lower bounds for measurable chromatic numbers , Geom. Funct. Anal. 19 (2009), 645–661. ( https://arxiv.org/abs/0801.1059 )
4[4] K. Ball, A lower bound for the optimal density of lattice packings , Internat. Math. Res. Notices 10 (1992), 217–221.
5[5] N. Biggs, Algebraic Graph Theory , Cambridge University Press, 1974.
6[6] B. Bollobás, Modern Graph Theory , Springer, 1998.
7[7] N. Bourbaki, Groupes et algèbres de Lie , Chap. 4–6, Hermann, 1968.
8[8] W. Bosma, J. Cannon and C. Playoust, The Magma algebra system. I. The user language , J. Symbolic Comput. 24 (1997), 235–265.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Coloring the Voronoi tessellation of lattices

Abstract.

Key words and phrases:

1991 Mathematics Subject Classification:

Contents

1. Introduction

1.1. Determination of the chromatic number

1.2. Generic and extremal behaviour of the chromatic number

2. Background and first observations

2.1. Lattices, Voronoi cells, Voronoi vectors

2.2. Simple upper bounds for the chromatic number

Lemma 2.1**.**

Lemma 2.2**.**

2.3. Simple lower bounds for the chromatic number

Lemma 2.3**.**

2.4. Root lattices and their duals

2.5. The chromatic number of an orthogonal sum of lattices

Lemma 2.4**.**

Proof.

Theorem 2.5**.**

Proof.

3. On the chromatic number of lattices of Voronoi’s first kind

3.1. Definitions and first examples

Definition 3.1**.**

Lemma 3.2**.**

Proof.

3.2. Interpretation in terms of graphs and more general results

Definition 3.3**.**

Corollary 3.4**.**

Theorem 3.5**.**

Proof.

Theorem 3.6**.**

Proof.

Example 3.7**.**

3.3. Application: the chromatic number of 333-dimensional lattices

4. Sphere packing lower bounds

Theorem 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

4.1. First application: Chromatic number of E8\mathsf{E}_{8}E8​ and of the Leech lattice

Theorem 4.3**.**

Proof.

4.2. Second application: Exponential growth of the chromatic

Theorem 4.4**.**

5. Spectral lower bounds

5.1. Setup

Theorem 5.1**.**

Corollary 5.2**.**

5.2. Computing the spectral bound for irreducible root lattices

5.2.1. An\mathsf{A}_{n}An​

Theorem 5.3**.**

Proof.

5.2.2. Dn\mathsf{D}_{n}Dn​

Theorem 5.4**.**

Proof.

5.2.3. E7\mathsf{E}_{7}E7​, E8\mathsf{E}_{8}E8​ and sums of squares

5.2.4. E6\mathsf{E}_{6}E6​

6. The chromatic number of irreducible root lattices and their duals

6.1. Dn\mathsf{D}_{n}Dn​ and its dual

Theorem 6.1**.**

Proof.

Theorem 6.2**.**

Proof.

6.2. E6,E7,E8\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}E6​,E7​,E8​ and their duals

Theorem 6.3**.**

Proof.

Theorem 6.4**.**

Proof.

Acknowledgements

Appendix A Recollections on compact Lie groups

Remark A.1**.**

Proposition A.13**.**

Proof.

Lemma 2.1.

Lemma 2.2.

Lemma 2.3.

Lemma 2.4.

Theorem 2.5.

Definition 3.1.

Lemma 3.2.

Definition 3.3.

Corollary 3.4.

Theorem 3.5.

Theorem 3.6.

Example 3.7.

3.3. Application: the chromatic number of $3$ -dimensional lattices

Theorem 4.1.

Lemma 4.2.

4.1. First application: Chromatic number of $\mathsf{E}_{8}$ and of the Leech lattice

Theorem 4.3.

Theorem 4.4.

Theorem 5.1.

Corollary 5.2.

5.2.1. $\mathsf{A}_{n}$

Theorem 5.3.

5.2.2. $\mathsf{D}_{n}$

Theorem 5.4.

5.2.3. $\mathsf{E}_{7}$ , $\mathsf{E}_{8}$ and sums of squares

5.2.4. $\mathsf{E}_{6}$

6.1. $\mathsf{D}_{n}$ and its dual

Theorem 6.1.

Theorem 6.2.

6.2. $\mathsf{E}_{6},\mathsf{E}_{7},\mathsf{E}_{8}$ and their duals

Theorem 6.3.

Theorem 6.4.

Remark A.1.

Proposition A.13.

Remark A.15.

Remark A.20.

Proposition A.22.

Theorem B.1.

Theorem B.2.

Theorem B.3.

Theorem B.4.