Coloring the Voronoi tessellation of lattices
Mathieu Dutour Sikiri\'c, David Madore, Philippe Moustrou, Frank, Vallentin

TL;DR
This paper introduces the concept of the chromatic number for lattice Voronoi tessellations, computes it for various lattices, and explores its asymptotic behavior and spectral bounds, connecting geometry, algebra, and spectral graph theory.
Contribution
It defines the lattice chromatic number, computes it for key lattices, and establishes spectral bounds, advancing understanding of lattice coloring and its asymptotics.
Findings
Chromatic number computed for root lattices, duals, and Leech lattice.
Spectral lower bounds for lattice chromatic number established.
Asymptotic behavior of lattice chromatic number analyzed in high dimensions.
Abstract
In this paper we define the chromatic number of a lattice: It is the least number of colors one needs to color the interiors of the cells of the Voronoi tessellation of a lattice so that no two cells sharing a facet are of the same color. We compute the chromatic number of the root lattices, their duals, and of the Leech lattice, we consider the chromatic number of lattices of Voronoi's first kind, and we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity. We introduce a spectral lower bound for the chromatic number of lattices in spirit of Hoffman's bound for finite graphs. We compute this bound for the root lattices and relate it to the character theory of the corresponding Lie groups.
| lattice | |||||
|---|---|---|---|---|---|
| Voronoi | |||||
| cell | |||||
| cube | hexagonal | rhombic | elongated | truncated | |
| prism | dodecahedron | dodecahedron | octahedron | ||
| Delaunay | |||||
| graph | |||||
| chromatic | |||||
| number |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Coloring the Voronoi tessellation of lattices
Mathieu Dutour Sikirić
M. Dutour Sikirić, Rudjer Bosković Institute, Bijenička 54, 10000 Zagreb, Croatia
,
David A. Madore
D. Madore, Télécom Paris / LTCI, Département INFRES, 19 place Marguerite Perey, 91120 Palaiseau, France
,
Philippe Moustrou
P. Moustrou, Department of Mathematics and Statistics, UiT The Arctic University of Norway, N-9037 Tromsø, Norway
and
Frank Vallentin
F. Vallentin, Department Mathematik/Informatik, Abteilung Mathematik, Universität zu Köln, Weyertal 86–90, 50931 Köln, Germany
(Date: March 22, 2021)
Abstract.
In this paper we define the chromatic number of a lattice: It is the least number of colors one needs to color the interiors of the cells of the Voronoi tessellation of a lattice so that no two cells sharing a facet are of the same color.
We compute the chromatic number of the root lattices, their duals, and of the Leech lattice, we consider the chromatic number of lattices of Voronoi’s first kind, and we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity.
We introduce a spectral lower bound for the chromatic number of lattices in spirit of Hoffman’s bound for finite graphs. We compute this bound for the root lattices and relate it to the character theory of the corresponding Lie groups.
Key words and phrases:
lattice, Voronoi cell, chromatic number
1991 Mathematics Subject Classification:
05C15, 52C07
Contents
-
3 On the chromatic number of lattices of Voronoi’s first kind
-
3.2 Interpretation in terms of graphs and more general results
-
3.3 Application: the chromatic number of -dimensional lattices
-
4.1 First application: Chromatic number of and of the Leech lattice
-
4.2 Second application: Exponential growth of the chromatic number
-
5.2 Computing the spectral bound for irreducible root lattices
-
6 The chromatic number of irreducible root lattices and their duals
1. Introduction
Let be an -dimensional lattice in -dimensional Euclidean space. One can tessellate space by lattice translates of the lattice’ Voronoi cell, which is defined as
[TABLE]
By we denote the topological interior of . Now we consider translates , with , as colored tiles of an -dimensional mosaic in which one has infinitesimal small interstices between the mosaic tiles. How many colors does one need at least to get a colorful mosaic? In a colorful mosaic two neighboring tiles receive different colors. This defines the chromatic number of the lattice.
More formally, we can also define the chromatic number of a lattice in graph theoretical terms: Two distinct lattice translates of Voronoi cells and , with , are defining neighboring tiles whenever they share a facet, i.e. their intersection is a polytope of maximal dimension . The differences are called strict Voronoi vectors and the set of these vectors is denoted by . Now the chromatic number of equals the chromatic number of the Cayley graph on the additive group with generating set :
[TABLE]
Here, the set of vertices of the Cayley graph are all elements of and two vertices are adjacent whenever the difference lies in the set of strict Voronoi vectors . Note that the Cayley graph is an -regular infinite graph with .
In Section 2.1 we recall all the definitions and properties of lattices, their Voronoi cells, and the Voronoi vectors, which we need later.
The chromatic number of a lattice seems to be a natural parameter. However, to the best of the authors’ knowledge, see [40] and [25], has not been considered before. The aim of this paper is to start a systematic investigation of it. Then, the following questions immediately come to mind:
1.1. Determination of the chromatic number
What is the chromatic number of some interesting lattices? How to find lower and upper bounds? Is there an algorithm to determine for a given lattice ?
For instance, it is obvious that the chromatic number of the integer lattice is two, an optimal coloring is given by the black/white checkerboard pattern; see also Theorem 2.5.
We discuss simple lower and upper bounds for the chromatic number of a general lattice in Section 2.2 and in Section 2.3. For instance, we show that is at most .
All two- and three-dimensional lattices are of Voronoi’s first kind. We consider the chromatic number of this class of lattices in Section 3 where we compute the chromatic number of all three-dimensional lattices. It would be interesting to have a better understanding of the chromatic number of this class of lattices.
One of the most important classes of lattices are the root lattices. We recall the definitions and classification in Section 2.4. One main result of our paper is the determination of the chromatic number of all root lattices and their duals. Table 1 summarizes our results.
Note that we currently do not know the numerical value of . We only know it is equal to the chromatic number of the (finite) vertex-edge-graph of the half-cube polytope
[TABLE]
which at the moment is only known up to dimension : For , , , , and we have , , , , and (see [36]).
For the proof we use a generalization of a lower bound for the chromatic number of finite graphs originally due to Hoffman [28]. Hoffman’s bound is based on spectral considerations: Let be the adjacency matrix of a finite graph . Let be the smallest eigenvalue of and respectively let be the largest eigenvalue of , then
[TABLE]
Bachoc, DeCorte, Oliveira, and Vallentin [2] showed how to generalize the spectral bound (and its weighted variant due to Lovász [38]) from finite to infinite graphs. In Section 5.1 we review this generalization and specialize it to . Here, classical Fourier analysis is used. We show in Corollary 5.2 that
[TABLE]
holds.
In Section 5 we compute this bound for all irreducible root lattices. Surprisingly, the result of this computation can already be found in an Oberwolfach report by Serre [50] albeit in a different language and with a different motivation. In his report Serre computed all critical values of the characters of the adjoint representation of compact Lie groups. However, the report does not contain proofs. In Section 5.2 we provide proofs for the easy cases and . The cases , , and are much harder and we give Serre’s proof in Appendix B after recalling relevant facts about compact Lie groups in Appendix A. We sketch an alternative, computational proof, which is based on optimization, in particular using sum of squares for the cases and at the end of Section 5.2. The case is easier and does not require computer assistance.
Then, in Section 6, we construct several efficient colorings of irreducible root lattices.
It would be nice to know the chromatic number of more important lattices. Following the book [13] by Conway and Sloane the next candidates one should consider are the -dimensional Coxeter-Todd lattice and the -dimensional Barnes-Wall lattice . We expect that the spectral lower bound gives a close approximation to the chromatic number.
We show in Section 4.1 that the chromatic number of the Leech lattice in dimensions is . This is a consequence of the sphere packing optimality of . It would be nice to have an independent (spectral) proof of this fact.
Going back to general lattices: At the moment we do not know whether there is a finite algorithm to compute the chromatic number of a lattice which is given for example by a basis. Determining the strict Voronoi vectors—and thus the Cayley graph and the Voronoi cell —is possible by a finite algorithm, see for example [20], although it can occupy exponential space (and therefore needs exponential time). The theorem of de Bruijn and Erdős [9] implies that the chromatic number of is equal to the largest chromatic number of all finite subgraphs of . This shows that the decision problem: “Is ?” is at least semidecidable.
1.2. Generic and extremal behaviour of the chromatic number
What is of a random -dimensional lattice? How fast can grow depending on the dimension ?
In Section 4.2 we prove that the chromatic number of a generic -dimensional lattice grows exponentially with the dimension. There we show that there are -dimensional lattices with
[TABLE]
It would be very interesting to understand the extremal behaviour.
2. Background and first observations
2.1. Lattices, Voronoi cells, Voronoi vectors
A lattice is a discrete free -module in an -dimensional Euclidean space. If its rank is strictly lower than , then also defines a lattice in its linear span over . We implicitly identify these two lattices, and assume for the following definitions that is a full-rank lattice in . We denote by the dual lattice of :
[TABLE]
where denotes the standard Euclidean scalar product between and . A fundamental region of is a region such that for any , the volume of is [math], and . The volume of is defined as the volume of any of its fundamental region. A fundamental region of particular interest is the Voronoi cell of :
[TABLE]
A vector is called a strict Voronoi vector, or sometimes a “relevant” vector, if the intersection is a facet, a face of dimension , of . By a well-known characterization of Voronoi (see for example [13, Chapter 21, Theorem 10] or [15]) the set of these vectors is
[TABLE]
Now the chromatic number of equals the chromatic number of the Cayley graph on the additive group with generating set :
[TABLE]
Here, the set of vertices of the Cayley graph are all elements of and two vertices are adjacent whenever the difference lies in the set of strict Voronoi vectors .
2.2. Simple upper bounds for the chromatic number
One can color a lattice periodically by using translates of one of its sublattices which does not contain Voronoi vectors. More precisely it is enough to color the vertices of the graph with
[TABLE]
This immediately gives the following upper bound on :
Lemma 2.1**.**
Let be a sublattice of with . Then, is at most .
Sometimes, we can improve this bound by coloring the vertices of the graph greedily. This shows, see for example [6, Chapter V.1], that
[TABLE]
where is the largest degree of a vertex in .
Now we take . Lemma 2.1 implies that . If the number of Voronoi vectors is not maximal, if , then we can improve this bound by using (2):
Lemma 2.2**.**
The chromatic number of is at most .
For generic lattices, the number of Voronoi vectors is , so that Lemma 2.2 also gives an upper bound of for the chromatic number of .
2.3. Simple lower bounds for the chromatic number
For a general graph one has for every induced subgraph of . In particular, when choosing to be a largest complete subgraph of , we have , where is the clique number of .
Canonical finite induced subgraphs of are the vertex-edge graphs of Delaunay polytopes of . A Delaunay polytope of the lattice is defined as follows: Let be a vertex of the Voronoi cell . Consider all vectors so that is contained in all the translates . Then, the convex hull of is a Delaunay polytope of . Clearly, all edges of lie in .
Lemma 2.3**.**
The chromatic number of a lattice is at least the chromatic number of the vertex-edge graph of any Delaunay polytope of .
2.4. Root lattices and their duals
One of the most important classes of lattices are the root lattices. Assume that is an even lattice, i.e. we have for all . Lattice vectors with are called root vectors, or simply roots. A root lattice is an even lattice which is spanned by roots. Root lattices have been classified by Witt in 1941, see for example [22, Section 1.4], and they are orthogonal direct sums of the irreducible root lattices , , , , and . The strict Voronoi vectors of root lattices are precisely the root vectors [13, Chapter 21, §3.A] (and in fact, this condition that only the shortest nonzero vectors are relevant characterizes root lattices, see [47]). Also, the combinatorial description of the Voronoi cells of root lattices is well known: it is described in more detail in [41]. Here we recall the definitions of the irreducible root lattices and their duals.
Living inside the hyperplane (the coordinates being here numbered [math] through ), the irreducible root lattice is defined as
[TABLE]
The dual lattice naturally lives in the vector space dual to , which can be identified with the quotient of by the diagonal line . But we can identify with itself by choosing the representatives of modulo the diagonal which belong to .
For every , the irreducible root lattice is defined as
[TABLE]
and its dual lattice equals
[TABLE]
The root lattice can be constructed as the union of two translates of the lattice :
[TABLE]
The lattice is unimodular, i.e. . The lattice (resp. ) can be defined as a -dimensional (resp. -dimensional) sublattice of :
[TABLE]
and
[TABLE]
Then, if we define
[TABLE]
the dual lattices of and are
[TABLE]
2.5. The chromatic number of an orthogonal sum of lattices
Eichler [23] showed that one can decompose every lattice as a pairwise orthogonal sum of indecomposable lattices and that this decomposition is unique up to permutation of the summand; Kneser [34] gave a constructive and much simpler proof of Eichler’s result.
We prove that the chromatic number of a lattice is the maximum of the chromatic numbers of its orthogonal summands.
This reduces in particular the study of the chromatic number of root lattices to the irreducible root lattices , , , , and .
Lemma 2.4**.**
Let be a lattice which can be written as the orthogonal direct sum of lattices :
[TABLE]
so that every lattice vector can be uniquely decomposed as with and is orthogonal to whenever . Then,
[TABLE]
Proof.
By induction, we may assume that .
Every we can write as with . If both and are non zero, then is different from . It lies in and satisfies , yielding a contradiction. So for . In particular, must be the only minimal vectors in , so that .
Conversely, let for instance , and let . Let us write for . Since , we have , with equality if and only if . Thus,
[TABLE]
with equality if and only if and , namely . So . ∎
Theorem 2.5**.**
Let be a lattice such that
[TABLE]
Then,
[TABLE]
Proof.
We again assume .
By Lemma 2.4, , and so is a subgraph of . Hence, .
Conversely, let . By definition, for , there is a proper coloring such that if , then . We shall show that
[TABLE]
is a proper coloring of . For this let such that with . Following Lemma 2.4, . Assume for instance that . Then, we write with , and
[TABLE]
So is a proper coloring of and . ∎
3. On the chromatic number of lattices of Voronoi’s first kind
In this section we give lower and upper bounds for the chromatic number of lattices of Voronoi’s first kind. Lattices of Voronoi’s first kind form a nice class of lattices: All lattices in dimensions and belong to this class as well as and . Our lower and upper bounds coincide for all these cases. For dimension and greater the bounds can differ. We like to pose the question of computing the chromatic number of a lattice of Voronoi’s first kind as an open problem.
3.1. Definitions and first examples
Lattices of Voronoi’s first kind are treated in detail for example in [14]. Here we start by collecting some facts about them.
Definition 3.1**.**
A lattice is called a lattice of Voronoi’s first kind if it admits an obtuse superbasis: There exist lattice vectors such that:
- (1)
The set forms a basis of , 2. (2)
We have , 3. (3)
For every , the vectors and enclose an obtuse angle, .
The lattices and are of Voronoi’s first kind. The lattice possesses an obtuse superbasis. Let be the canonical basis of . For , let , and let . Then is an obtuse superbasis of . The dual lattice possesses a strictly obtuse superbasis. Let
[TABLE]
Then is an obtuse superbasis:
[TABLE]
If is a lattice of Voronoi’s first kind, then it is known (see [14]) that
[TABLE]
This immediately gives an upper bound for the chromatic number.
Lemma 3.2**.**
Let be a lattice of Voronoi’s first kind, with an obtuse superbasis . Then, is at most .
Proof.
By definition is a basis of . Let us show that the linear map
[TABLE]
is a proper coloring of . Because of linearity it is enough to check that it does not vanish on the strict Voronoi vectors. Let be such a vector. If we replace by (by Definition 3.1 (2)) to make sure that . Since is nontrivial, . In other words, . ∎
With this lemma it is easy to see that the chromatic numbers of and of its dual are both equal to . For and for every , the vector is a minimal vector, and thus is a strict Voronoi vector. So,
[TABLE]
is a clique in and so . For we know (see [14]) that every is a strict Voronoi vector. Again, (3) is a clique, and .
3.2. Interpretation in terms of graphs and more general results
In order to get a better understanding of the chromatic number of lattices of Voronoi’s first kind, we need to know which vectors are strict Voronoi vectors.
Let be a lattice of Voronoi’s first kind with superbasis .
Definition 3.3**.**
The Delaunay graph is an undirected graph with vertex set and where and are connected by an edge whenever .
The combinatorics of the Delaunay graph determines the Cayley graph . Recall some standard terminology in graph theory. Let be a graph, a subset of the vertex set defines a cut by
[TABLE]
The strict Voronoi vectors of are the such that the cut is minimal with respect to inclusion, see for example [20] or [53]. The minimal cuts are also known to be the ones such that, when removing the edges of the cut, the number of connected components in the graph increases by one.
A connected graph is called biconnected (a block) if it remains connected when we remove any of its vertices. One can decompose every connected graph into biconnected components (block decomposition). The set of edges can be uniquely written as a disjoint union such that the subgraph of induced by is a maximal biconnected subgraph of . Let , with , be the biconnected components of the Delaunay graph . Then, there exist lattices of Voronoi’s first kind with obtuse superbasis such that
[TABLE]
holds, see for example [45, Chapter 4, Chapter 5]. Then Theorem 2.5 and Lemma 3.2 yield the following upper bound for the chromatic number of .
Corollary 3.4**.**
Let be a lattice of Voronoi’s first kind with obtuse superbasis . Let , , be the biconnected components of the Delaunay graph . Then,
[TABLE]
Now we go for lower bounds. Recall that a cycle in a graph is a collection of vertices such that and such that is connected to , where indices are computed modulo . Its length is equal to .
Theorem 3.5**.**
Let be a lattice of Voronoi’s first kind with obtuse superbasis . Then, the chromatic number of is at least the maximal length of a cycle in the Delaunay graph .
Proof.
Up to a permutation of the indices, we may assume that is a cycle in . We shall construct a clique of of size .
For any in , we say that is a connector between and if there exists a path in such that is the only vertex on the path that belongs to .
Let . We define the set as the subset of all vertices in such that every connector from to is in . In particular, for , . An example is depicted in Figure 2. Then, if we define for all , then for every ,
[TABLE]
In order to show that the set is a clique in , we need to prove that for every , the vector is in . Equivalently, since is connected, we need to check that both and its complementary in induce connected subgraphs of .
Take in . Since is not in , there is a connector between and which is not in ; but since is in , this connector must be in . So , as well as all the vertices in this path, are in and are connected to , which is obviously connected and included in . Regarding the complementary set, take not in . Then either is not in , and there is a connector between and in , or is in , and every connector between and is in . Thus for every such , one can find a path made of vertices not in , going to .
∎
Theorem 3.6**.**
.
Proof.
For the Delaunay graph is the complete graph . For the Delaunay graph is a cycle of length . In both cases, our upper bound and lower bound coincide. ∎
Example 3.7**.**
In general, our upper bound differs from the lower bound. Both bounds can be attained: On the left of Figure 3, the longest cycle has size and one can find a coloring of the corresponding -dimensional lattice with colors, whereas the lattice associated with the graph depicted on the right of Figure 3, whose longest has size , has chromatic number . This can be seen by computing the chromatic number of a small subgraph of .
3.3. Application: the chromatic number of -dimensional lattices
Every lattice in dimensions and is of Voronoi’s first kind, see [14]. We can compute the chromatic number of these lattices by applying our bounds which coincide in these cases, see Table 2.
4. Sphere packing lower bounds
In this section we prove lower bounds for the chromatic number of a lattice by considering connections to the classical sphere packing problem.
A subset of defines a packing of unit spheres if the distance between all pairs of distinct points in is at least . Define the center density of as the number of points in per unit volume, more precisely the (upper) center density of is defined as
[TABLE]
where is the regular -dimensional cube with side length . The largest center density of a packing of unit spheres in is
[TABLE]
Theorem 4.1**.**
Let be an -dimensional lattice which defines a packing of unit spheres. Let be a positive real number so that all strict Voronoi vectors of have length strictly less than . Then,
[TABLE]
Proof.
Suppose that the chromatic number of equals . Then one can decompose into color classes . We may assume that the first color class has the largest density among these color classes. In particular, inequality
[TABLE]
holds. Then for all with we have . Hence, defines a packing of unit spheres. So,
[TABLE]
and the claim of the theorem follows by combining the two inequalities above. ∎
The following lemma gives a lower bound for .
Lemma 4.2**.**
Let be an -dimensional lattice which defines a packing of unit spheres. If is not a strict Voronoi vector, then .
Proof.
By Voronoi’s characterization of strict Voronoi vectors (1) there is a lattice vector with and . We may assume that , otherwise we replace by its negative . Define . Then,
[TABLE]
So, , since . ∎
4.1. First application: Chromatic number of and of the Leech lattice
Theorem 4.3**.**
The chromatic number of the Leech lattice equals .
Proof.
It is known that the strict Voronoi vectors of are all vectors with , see [13]. Dong, Li, Mason, and Norton showed [18, Theorem 4.1] that one can find an isometric copy of as a sublattice of ; in [43] Nebe and Parker classified all orbits of such sublattices under the action of the automorphism group of . Clearly, such a sublattice has index and any nonzero vector in this sublattice satisfies . Hence, we can color by using the cosets as color classes, with . Thus, .
For the lower bound, we apply Theorem 4.1. The Leech lattice defines the densest sphere packing in dimension , see [12], . We can apply Theorem 4.1 with and get
[TABLE]
∎
Similarly, one can show by using the fact that is the densest sphere packing in dimension , see [55].
4.2. Second application: Exponential growth of the chromatic
number
In this section we investigate the asymptotic behaviour of the chromatic number of lattices when the dimension tends to infinity.
For a lattice , we denote by the norm of its smallest nonzero vector. Then defines a packing of unit spheres, and we extend the notation for the center density of this packing by
[TABLE]
The best upper bound known for is the Kabatiansky-Levenshtein bound, [30], which states
[TABLE]
where is the volume of the -dimensional unit ball. So by using Theorem 4.1 and Lemma 4.2, we get, for any lattice ,
[TABLE]
Let us now recall Siegel’s mean value theorem (see [49]): For any lattice and any , we denote by the number of nonzero lattice vectors of in the open ball having radius . The Siegel mean value theorem states that the expected value of in a random lattice of volume is
[TABLE]
To compute the expectation one uses the Haar measure on .
This equality implies two remarkable consequences: First, by choosing such that , it proves the existence of a lattice with strictly less than two nonzero vectors in . Since such a vector would come with its opposite, the minimum of has to be at least , and therefore the density of satisfies
[TABLE]
which essentially is the Minkowski-Hlawka lower bound for lattice sphere packings (see [27]). The second consequence concerns the density of a random lattice: Let us fix , and such that . Following the previous idea, whenever , the density of satisfies
[TABLE]
By using Siegel’s mean value theorem and Markov’s inequality, we prove that this happens with high probability when grows:
[TABLE]
if tends to infinity.
Combined with (4), these observations provide:
Theorem 4.4**.**
With high probability, the chromatic number of a random -dimensional lattice grows exponentially in . Moreover, there are -dimensional lattices with
[TABLE]
Note that the Minkowski-Hlawka lower bound has been improved, in such a way that the constant in the numerator could be replaced with some quasi-linear function in , see [48], [4], [52], [54]. Even though any random lattice should be dense, and consequently should have an exponential chromatic number, to date there is no efficient way to construct such a lattice: The only algorithms for this purpose run in exponential time with respect to the dimension (see [42]).
However, explicit examples of subexponential growth are known. For the cut polytope (see [16]) is an -dimensional polytope which is a Delaunay polytope of lattice (see [17]) . The vertex-edge graph of is the complete graph on vertices. Thus we get an infinite family of Delaunay polytope with chromatic number lower bounded by .
We think it is an interesting question to construct explicit families of lattices whose chromatic number grows exponentially with the dimension.
5. Spectral lower bounds
In this section we derive a lower bound for the chromatic number of a lattice where we apply the generalization of Hoffman’s bound as developed by Bachoc, DeCorte, Oliveira, and Vallentin [2]. In Section 5.1 we recall some terminology and facts from [2]. Then we compute the spectral bound for the irreducible root lattices case by case in Section 5.2. Table 3 summarizes the results obtained.
5.1. Setup
For an -dimensional lattice define the (complex) Hilbert space
[TABLE]
which has inner product
[TABLE]
The convolution of two elements is defined by
[TABLE]
Assume that is real-valued, that its support is contained in , and that it satisfies for all . Consider the convolution operator
[TABLE]
defined by
[TABLE]
In a certain sense, is a weighted adjacency operator of . It is easy to verify that is a bounded and self-adjoint operator. Its numerical range is
[TABLE]
The numerical range is known to be an interval in . By
[TABLE]
we denote the endpoints of the interval .
We say that a subset is an independent set of the operator if for each which vanishes outside of . The chromatic number of is the smallest number so that one can partition into independent sets. By [2, Theorem 2.3] one has the following lower bound for :
[TABLE]
Since every independent set of is also an independent set of the operator we see that
[TABLE]
Therefore, we are interested in determining the parameters and and in choosing so that the bound becomes as large as possible.
For determining and for a given convolution operator we apply standard facts from Fourier analysis, in particular the Parseval identity, the theorem of Riesz-Fischer, and the fact that the Fourier transform of a convolution is a product.
The only non-standard item here is that in standard texts on Fourier analysis, see for example [19], the role of primal and dual spaces are interchanged. In order not to confuse the reader (and to some extend not to confuse the authors) we consider a new -dimensional lattice . In our context will play the role of the dual lattice . When , then .
Consider the Hilbert space of square-integrable -periodic functions
[TABLE]
with inner product
[TABLE]
where we normalize the Lebesgue measure so that the -dimensional volume of a fundamental domain equals one. The exponential functions
[TABLE]
form a complete orthonormal system for . We define the Fourier coefficient of at by
[TABLE]
By Parseval’s identity and by the Riesz-Fischer theorem the map
[TABLE]
is an isometry:
[TABLE]
We consider two functions . By the isometry of there are functions with
[TABLE]
Furthermore,
[TABLE]
Back to the lattice and the convolution operator . For , , we have
[TABLE]
and
[TABLE]
Hence, by choosing two appropriate sequences (see [2, Section 3.1])) approximating the corresponding Dirac measures, we see
[TABLE]
We summarize our considerations in the following theorem.
Theorem 5.1**.**
Let be an -dimensional lattice. Suppose that is real-valued, for all , and the support of is contained in . Then,
[TABLE]
If we uniformly choose
[TABLE]
then the bound in the previous theorem simplifies to the following generalization of the Hoffman bound for infinite graphs:
Corollary 5.2**.**
Let be an -dimensional lattice. Then,
[TABLE]
5.2. Computing the spectral bound for irreducible root lattices
In this section we compute the lower bound given by Corollary 5.2 for each of the irreducible root lattices. By the classification of Witt these are the families of lattices , and the three sporadic lattices , , and .
As recalled in Section 2.4, the set of strict Voronoi vectors for a root lattice is simply its set of roots. Now the latter constitutes a root system (sometimes known as a “crystallographic” root system, but these are the only ones which we will consider): see A.1; the root systems are themselves classified, [29, §11], and the ones which arise from root lattices are known as “simply laced” or “A-D-E” root systems. (We will review the simply laced irreducible root systems below along with the corresponding lattices.)
Computing the lower bound of Corollary 5.2 for an irreducible root lattice is therefore equivalent to finding the smallest value attained by the Fourier transform of a simply laced irreducible root system. Here, a finite set defines the function on which is the Fourier transform of the sum of delta measures concentrated on the elements of . Let us emphasize that since is a (crystallographic!) root system, this function is, in fact, a trigonometric polynomial; and since is symmetric (), it is real.
Now this reformulation affords a link with representation theory (the required facts of which are recalled in Appendix A). Namely, if is a (reduced, but not necessarily simply laced) root system of rank then the function is “essentially” the character of the adjoint representation of the — say, simply connected — compact real Lie group associated with (namely in the case of , or in the case of , or one of the exceptional Lie groups in the case of the correspondingly named ); the precise statement and explanation of why the two problems are equivalent is given in Proposition A.13.
The problem of computing our spectral lower bound is therefore essentially that of computing the least value attained by the adjoint character of a simple compact real Lie group of type A-D-E. The values in question have been considered and computed by Serre in [50, Theorem 3’]: In Table 3 we state Serre’s result in the form in which it is useful for the main part of the paper.
Again, the equivalence of the result as stated here with that as stated in Serre’s note is provided by Proposition A.13.
Serre’s note does not contain a proof of the stated result111Serre writes: “The classical groups are easy enough, but , , and are not (especially , which I owe to Alain Connes). I hope there is a better proof.”. In this paper we provide one for the A-D-E case. We treat and in Theorem 5.3 and Theorem 5.4 below, and we defer the case to Appendix B. For and we suggest an alternative proof technique based on sums of squares and semidefinite optimization in Section 5.2.3. Using the link to the chromatic number of lattices, the case turns out to be the easiest of the cases. We treat it in Section 5.2.4.
5.2.1.
The irreducible root lattice has roots
[TABLE]
where denotes the -th standard unit vector in .
Theorem 5.3**.**
The critical values of are and . In particular, .
Proof.
Given such that , we define by , so that . The sum
[TABLE]
is equal to
[TABLE]
Clearly critical values are [math] (when ) and (when all are identical). The critical values can be obtained when , for example by the -th roots of unity222 For example we have for all , by letting and . More explicitly, we have a bijection between the set of all with , modulo the lattice dual to and the set of complex vectors which all lie on the unit circle, and whose product equals 1, modulo where is some -th root of unity obtained by taking to . or by . Then, the lower bound on follows from Corollary 5.2. ∎
5.2.2.
The irreducible root lattice has roots
[TABLE]
Theorem 5.4**.**
The critical values of are:
[TABLE]
In particular,
[TABLE]
and for even , and for odd .
Proof.
By the symmetries of the function we can restrict ourselves to . We consider the sum
[TABLE]
where . To find the critical values we compute the gradient of (as a function of the ), which is
[TABLE]
Let be a critical point. Define , and the set of such that , and so that or or .
For we have . Thus, if .
We define , and and we have , and
[TABLE]
If we have
[TABLE]
If then the equation for (5) gives
[TABLE]
The value of the function is then
[TABLE]
The lower bound on then follows from Corollary 5.2. ∎
5.2.3. , and sums of squares
We start by giving an alternative construction of the lattice which is based on lifting the (extended) Hamming code , which is the vector space over the finite field (consisting of the elements [math] and ) spanned by the rows of the matrix
[TABLE]
It consists of code words:
[TABLE]
We can define the lattice by the following lifting construction (which is usually called Construction A):
[TABLE]
Now it is immediate to see that has shortest (nonzero) vectors:
[TABLE]
where are the standard basis vectors of and where denotes the Hamming weight of a code word .
Observe that the lower bound is implied trough the spectral bound by the following inequality (Theorem B.3 gives a stronger result by providing a complete list of all critical values)
[TABLE]
for all .
To simplify the formula we apply a change of variables by setting . Then,
[TABLE]
Applying the cosine addition formula multiple times we get
[TABLE]
Function is globally nonnegative if and only if the polynomial
[TABLE]
is nonnegative on the cube . This nonnegativity can be verified algorithmically. By
[TABLE]
denote the cone of real polynomials in indeterminates of degree at most . One can verify membership in this cone by using semidefinite optimization, see for example [37]. We checked numerically (up to machine precision) that there are polynomials
[TABLE]
so that the following identity holds true:
[TABLE]
It is interesting to observe that using smaller degree did not work.
One can easily modify this proof technique for the case to show . To define we apply Construction A on the code , which one obtains from by deleting its first coordinate. Here one shows that the polynomial
[TABLE]
is nonnegative on the cube again using sum of squares.
5.2.4.
The lattice we can handle without computer as follows: In the proof of Theorem 6.3 we shall color with nine colors. On the other hand, the Schläfli polytope is a Delaunay polytope of whose vertex-edge graph—the Schläfli graph on 27 vertices having 216 edges—is a finite subgraph of . It is known, see for example [5, Chapter 8, page 55] that the Hoffman bound of the Schläfli graph equals nine. It is also known, see [3, Section 10.1], that the Hoffman bound of an infinite edge transitive graph is at least the Hoffman bound of any of its finite subgraphs. Hence, the spectral bound of equals nine.
6. The chromatic number of irreducible root lattices and their duals
In this section, we complete our study of the chromatic number of irreducible root lattices and their duals. The knowledge that we use about Delaunay polytopes of root lattices can be found in [16, Section 14.3]. Our claims regarding the sublattices that we consider and the colorings of certain small graphs can be conveniently checked with computer assistance, for example by using Magma [8] or Polyhedral [21].
When we cannot directly compute the chromatic number of a graph, we apply other methods, computationally easier, in order to get lower and upper bounds. A lower bound for the chromatic number of a graph is given by its fractional chromatic number: Denote by the set of all independent sets of . The fractional chromatic number of is the solution of the following linear program:
[TABLE]
If affords symmetries, one can use them to reduce the number of variables of this linear program. Regarding upper bounds, given a number of colors and a graph , proving the existence of a -coloring of can be turned into a satisfiability problem, that can be solved for instance by using Minisat [24].
6.1. and its dual
The half cube , sometimes also called the parity polytope, is defined as
[TABLE]
It is one of the two Delaunay polytopes of the root lattice .
Theorem 6.1**.**
For all we have , where we consider the vertex-edge graph of the half cube.
Proof.
The inequality comes from the fact that is a Delaunay polytope of .
Let be a proper coloring of . We extend it to by giving to any the color . Assume that two vectors and are adjacent in . Since the relevant vectors of are of the form , the difference is also such a vector, so that and are adjacent in , and . Hence, . ∎
Theorem 6.2**.**
For every , the chromatic number of is .
Proof.
The relevant vectors of the lattice are the vectors and the vectors of the form . The four vectors [math], , and define a clique in ; and the unique way to color with four colors is by coloring each copy of with two different colors. ∎
6.2. and their duals
Theorem 6.3**.**
We have , and .
Proof.
By Theorem 6.1 we know that . So one can color the root lattice with colors. The lower bound from Theorem B.3 concludes the case of .
For there are two orbits of Delaunay polytopes. One of them is the Gosset polytope with vertices, whose vertex-edge graph has chromatic number , which shows that . Moreover, we have a lamination of over the lattice :
[TABLE]
If and belong to two layers which differ by an even index, then is not a relevant vector. Following Section 3, we know that . Thus we can color the even layers by colors in and the odd layers by colors in . This proves that .
The unique Delaunay polytope of is the Schläfli polytope whose vertex-edge graph is the Schläfli graph with vertices. It is well known that its chromatic number is , so that . There is just one orbit of independent triples of vertices. Moreover, there are just two orbits of colorings with colors of the Schäfli graph: One orbit of size and another of size . Let us take a coloring from the orbit of size . It is composed of different triples of elements. For each such triple we consider the set of vectors . Since we have triples, this gives in total vectors. Those vectors span a sublattice of of index . It turns out that none of the relevant vectors of belongs to . Thus following Lemma 2.1, . ∎
Theorem 6.4**.**
The chromatic number of is for .
Proof.
Upon rescaling to an integral lattice the norms of the vectors of are , , and so on. The norms of the relevant vectors are and . We consider the vectors of norm and enumerate the sublattices of of dimension spanned by those vectors that do not contain any relevant vector. We found orbits of such lattices. Exactly one of them is of index which proves that .
The lower bound is obtained in the following way. We consider the graph formed by the origin [math] and the relevant vectors with two vectors adjacent if their difference is a relevant vector. The fractional chromatic number of this graph is . Thus .
The lower bound on the chromatic number of is obtained by the same technique as for . The upper bound is obtained in the following way: Consider the quotient with elements. One coloring with colors can be obtained by solving the satisfiability problem using Minisat. ∎
Acknowledgements
We thank Jean-Pierre Serre for his patient explanation of how to compute the critical points of the function . We are indebted to Christine Bachoc and Gabriele Nebe for their important contributions to Section 4. We thank the anonymous referee for helpful comments and suggestions.
This material is based upon work supported by the National Science Foundation under Grant No. DMS-1439786 while M.D.S. and P.M. were in residence at the Institute for Computational and Experimental Research in Mathematics in Providence, RI, during the “Point Configurations in Geometry, Physics and Computer Science” semester program. P.M. received support from the Tromsø Research Foundation grant 17_matte_CR. F.V. was partially supported by the SFB/TRR 191 “Symplectic Structures in Geometry, Algebra and Dynamics”. F.V. also gratefully acknowledges support by DFG grant VA 359/1-1. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie agreement number 764759.
Appendix A Recollections on compact Lie groups
In this section, we collect, for the benefit of the unfamiliar reader, without proof but with references, a few facts about semisimple compact Lie groups and representation theory. All of the following results are standard, although it is not easy to find them conveniently stated in a single place, so we hope this compendium will be helpful.
The main reason for this appendix insofar as the present paper is concerned is to explain the reason behind the reformulation which we give in theorem B.1 of Serre’s [50, Theorem 3’], namely the connection between the Fourier transform of a root system and the character of the adjoint representation of the Lie groups associated with : this is provided by A.13. We have, however, stated a few additional results which are not strictly necessary towards that goal but which, we hope, help give a clearer overall picture. The secondary reason for this appendix is to provide the necessary framework for appendix B (although the latter could, in principle, be reworded so as to eliminate all mentions of Lie groups just like we did for theorem B.1, we believe this would be unnecessarily contrived).
Remark A.1**.**
We have chosen to focus these recollections on semisimple compact real Lie groups, but the classification and representation theory of semisimple complex Lie groups is identical (Weyl’s “unitarian trick”, cf. [32, §6.2] and [26, §26]): we simply mention that the role of the tangent Lie algebra to a maximal torus in what follows is played, in the complex setting, by Cartan subalgebras of ([32, definition 6.32] and [26, §14.1 and appendix D]).
A.2. A compact (real) Lie group is a compact connected real smooth manifold , together with a group structure on such that the multiplication and inverse maps are smooth (). The tangent space at the identity of is then endowed with a linear action of , called the adjoint representation of , defined by letting (for ) be the differential at of ; this in turns defines a linear map for by letting (where stands for the vector space of linear maps between two vector spaces and ) be the differential at of itself (see [26, §8.1]): writing for , this gives the structure of a (real) Lie algebra (simply known as the Lie algebra of ).
We also recall the exponential map , which takes to the value at of the unique smooth group homomorphism (a.k.a. “-parameter subgroup”) whose differential at the origin is . While is not a group homomorphism in general, it is one whenever is abelian (whenever the Lie bracket of vanishes; otherwise, the so-called Baker-Campbell-Hausdorff formula expresses the relation of to an exponential). Rather than using the exponential, we will find it more convenient to use the function ; just as itself, the function in question is surjective (cf. A.1 below).
The Killing form is the bilinear form , which is -invariant; in the real compact case in which we placed ourselves, this form is negative semidefinite ([32, theorem 6.10] or [1, theorem 2.13]), and we say that or is semisimple when is nondegenerate ([26, prop. C10], [32, theorem 5.53]), i.e., negative definite in the real compact case.
As an example, the group of real orthogonal matrices with determinant is a compact real Lie group, whose Lie algebra consists of antisymmetric matrices, the Lie bracket being the usual , and the Killing form on is given by , so that is semisimple if and only if .
A.3. If is a compact Lie group with Lie algebra , the map taking a connected closed subgroup of to its Lie algebra seen as a subalgebra of (i.e., a vector subspace closed under the Lie bracket) is a bijection ([32, theorem 3.40]).
Two simply connected compact Lie groups are isomorphic if and only if their Lie algebras are isomorphic ([32, theorem 3.43]); thus, two compact Lie groups with isomorphic Lie algebras have isomorphic universal coverings: they are then said to be isogenous. (We note that, for the purposes of this paper, isogenous Lie groups are an irrelevant complication.) Beware, however, that the universal covering of a compact Lie group need not be compact as the case of tori shows.
Conversely, any real Lie algebra with a negative semidefinite Killing form is the Lie algebra of some compact Lie group (unique up to isogeny, by the previous paragraph). We return in A.13 to the question of which Lie groups are possible in the semisimple case.
A.4. If is a compact Lie group, a torus in is an abelian connected closed subgroup of , or equivalently, one whose Lie algebra is abelian (meaning that its Lie bracket is trivial). A maximal torus, of course, is a torus that is maximal for inclusion; by A.1, maximal tori of are in bijection with maximal abelian Lie subalgebras of the Lie algebra of .
As an example, a maximal torus in is given by the block diagonal matrices whose diagonal blocks are rotation matrices.
Crucial results by Cartan concerning maximal tori of compact Lie groups are ([10, theorems 16.4 and 16.5] or [1, theorem 2.15] or [33, corollaries 4.35 and 4.46]): (a) every element of belongs to some maximal torus, and (b) all maximal tori of are conjugate; in particular, each element of is conjugate to some element of any fixed maximal torus of .
The dimension of some (any) maximal torus in (or equivalently, of its Lie algebra) is known as the rank of . The quotient of the normalizer of (in ) by itself is known as the Weyl group of : so a -orbit in is precisely a full set of -conjugate elements of , and the set of conjugacy classes in can be identified (as a set) with . We note that acts as a group of automorphisms of , so it also acts (linearly) on the Lie algebra of and (by inverse transpose) on the dual of .
A.5. If is an abstract torus, i.e., an abelian compact Lie group, and is its Lie algebra, the map (in other words the differentiable group homomorphism whose differential at [math] is times the identity) defines a surjective homomorphism , whose kernel is a discrete subgroup of . Thus, we can identify with (as a differentiable group), and functions on with -periodic functions on . We will call the period lattice of the torus .
A.6. If is a compact Lie group, a (finite-dimensional) representation of on a finite-dimensional complex vector space is a differentiable linear action of on , i.e., a differentiable group homomorphism . The character of said representation is the map (a differentiable function on , invariant under conjugation). The representation is said to be irreducible when the only -invariant subspaces of are [math] and . It turns out that every representation of is a direct sum of irreducible representations ([32, theorem 4.40]); and a representation is characterized (up to isomorphism) by its character ([10, theorem 2.5] or [32, theorem 4.46]).
Furthermore, although we will not use this, it might be worth pointing out the Peter-Weyl theorem: the characters of the irreducible representations of form a Hilbert orthonormal basis for the closed subspace consisting of conjugation-invariant functions in the Hilbert space of square-integrable functions on ([32, theorem 4.50]; incidentally, these functions are also eigenvalues of the Laplace-Beltrami operator on seen as a Riemannian manifold).
Among the representations of , the adjoint representation (defined in A.1 above as a map , which we see as an action on the complexified vector space ) is of particular importance; its character is called the character of the adjoint representation, or simply the adjoint character, of ; the adjoint representation is irreducible if and only if is simple (this can be taken as a definition333A Lie group is called “simple” when it does not have nontrivial connected normal subgroups: this allows for a finite center (the term “quasisimple” might be more appropriate).).
A.7. Representation theory on a torus is well known: writing through as in A.1, the irreducible characters of are of the form with ranging over the lattice dual to (in the vector space dual to ), which we call the character lattice of . The corresponding representations are all one-dimensional (acting by multiplication by the character just defined). In other words, the irreducible representations of are indexed by , and the orthonormal basis of characters of predicted by the Peter-Weyl theorem is the usual Fourier basis on . (As mentioned in A.1, we write both for the function defined earlier, and for the -periodic function given by .)
We note that the ring of linear combinations (over , resp. ) of the , or character ring (resp. character -algebra) of (cf. A.15 below), is the group ring (resp. group -algebra) of the lattice . (Choosing a basis for shows that this is a ring of Laurent polynomials.)
A.8. If now is a compact Lie group and is a maximal torus in with corresponding Lie algebras , given a representation of having character , we can restrict them to and consider the Fourier decomposition of , i.e., its decomposition in terms of the characters (for ) defined in the previous paragraph: clearly is the dimension of the subspace of consisting of those such that for each . In particular, . The such that are known as the weights of the representation (or of the character ), and we emphasize that they belong to ; the value is know as the multiplicity of the weight (in or in ), and the subspace on which acts through is known as the weight (eigen)space; we note that the weight space can be defined at the Lie algebra level as the set of such that for all (where is the differential of at ; compare [32, definition 8.1]: we have added a factor here for convenience in the compact case, but it is a matter of convention).
Since, as explained in A.1 above, all maximal tori of are conjugate, the weights and multiplicities do not depend on the choice of ; furthermore, they are invariant under the action of the Weyl group . Seen as a function on , the character values are both -periodic and -invariant, so they are -invariant. (Let us mention here the paper [46], which can serve as link between the “Fourier” and “Lie group characters” points of view.)
A.9. We temporarily leave aside Lie groups to recall the following definitions and facts in relation with abstract root systems (see [29, §9.2], [32, §7.1] and [7, chap. VI]). A (reduced, crystallographic) root system is a set of vectors in a finite dimensional real vector space such that (1) is finite, does not contain [math], and spans , (2) for every , there exists in the dual space of such that and such that the (symmetry) map leaves stable (it is easy to see that is uniquely defined, cf. [7, chap. VI, §1, no1, lemme 1], so that the notation is legitimate), (3) for every we have , and (4) if and then . The elements of are called roots, and the are the coroots. The set of coroots is itself a root system (with ), known as the dual root system to . The group generated by the is known as the Weyl group of , and it is finite.
Two root systems and are said to be isomorphic when there is a linear isomorphism between and taking to . In this case, they have isomorphic dual systems and isomorphic Weyl groups.
The root system is said to be reducible when it is the union (“sum”) of root systems in with respectively, irreducible otherwise. Every root system can be written in a unique way as the sum of irreducible root systems, and any sum of root systems is a root systems.
Given a root system in , there exists a Euclidean structure on such that every (and consequently, every element of the Weyl group) is orthogonal; equivalently, a Euclidean structure which identifies with its dual so that each coroot is proportional to the corresponding root . Such a Euclidean structure is said to be compatible with . (The definition of root systems is often written in a manner that preassumes the Euclidean structure: in this case, the coroot associated to is defined as .) For irreducible, this Euclidean structure is unique up to a multiplicative constant, i.e., up to the definition of the lengths of the roots; in the case of the simply laced root system (those in which every root has the same length), which concerns us in the present paper, the constant is generally chosen such that the squared root length is , so that and can be identified. Nevertheless, it might be useful for expositional clarity to keep the distinction between and and the greater generality afforded by the not necessarily simply laced root system, so we do not perform this identification (but the reader may choose to do so).
Associated with a root system as above are four lattices: the lattice generated by is known as the root lattice, the lattice generated by is known as the coroot lattice; the lattice (in ) dual to the coroot lattice is known as the weight lattice and contains the root lattice; and the lattice (in ) dual to the root lattice is known as the coweight lattice and contains the coroot lattice. The quotient of the weight lattice by the root lattice, or equivalently of the coweight lattice by the coroot lattice, is sometimes known as the fundamental group of for reasons that will be clarified in A.13.
A.10. Continuing the exposition of root systems started in A.1, if is a linear form on such that for each , the roots such that are then known as the positive roots, and those such that as the negative roots relative to : a subset which can be obtained in this manner is known as a choice of positive roots for . The positive roots which cannot be written as sums of other positive roots are known as simple roots (for this choice of positive roots): it is then a fact that the simple roots form a basis of , and that every positive root is a linear combination of the simple roots with nonnegative integer coefficients (not all zero); so the choice of positive roots can be defined equivalently by the set of simple roots. The Weyl group acts simply transitively on the set of all choices of positive roots (or the set of all choices of simple roots).
The choice of positive roots also gives a choice of positive coroots, (defined from as above by identifying with , or by saying that the positive coroots are the coroots associated with positive roots). The dual basis to the set of simple coroots is known as the set of fundamental weights (for the choice of positive roots); symmetrically, the dual basis to the set of simple roots is known as the set of fundamental coweights.
The convex cone in generated by the fundamental weights, is known as the closed Weyl chamber in corresponding to the choice of positive roots: it is the dual cone to the positive coroot cone, in other words, it is defined by the inequalities for all positive coroots (or equivalently, for all simple coroots) ; the open Weyl chamber, defined by the inequalities , is the interior of the closed Weyl chamber (and the closed Weyl chamber is its closure). Dually, the cone in generated by the fundamental coweights, which is the dual cone to the positive root cone, is also known as the closed Weyl chamber (in ). The choice of a Weyl chamber is equivalent to a choice of positive roots: the Weyl group acts simply transitively on the set of Weyl chambers.
Given a choice of positive roots , there exists a unique such that for all : this is known as the highest root of (relative to this choice of positive roots), and it belongs to the open Weyl chamber. The (integer) coefficients of on the basis of simple roots, i.e., the such that where are the simple roots, often come up in formulae involving or its root system. It is often more convenient to define (the lowest root) and so that .
A.11. Given a root system and a choice of positive roots, we define the Dynkin diagram of as the graph whose vertices (“nodes”) are the simple roots, two nodes being connected by a single, double or triple edge, or by no edge at all, according as the angle between them is , or , or for no edge at all, these being the only possible values; in the case of a double or triple edge, it is oriented by pointing from the simple root with the larger norm to that with the smaller norm. (These constructions rely on a Euclidean structure compatible with , but are independent of the choice of such a structure.)
The Dynkin diagram of a root system determines the latter up to isomorphism. Furthermore, all possible irreducible root systems can be classified, with the help of Dynkin diagrams. The simply laced Dynkin diagrams are shown on Figure 4.
A.12. We now return to the setup of a compact Lie group as in A.1, and we furthermore assume to be semisimple.
The nonzero weights of the adjoint representation of are known as the roots of (or of ), and each one occurs with multiplicity ; as for the zero weight space, it is the complexification of itself (in other words, its multiplicity is the rank of ). So, writing for the weight space of acting on , we have the weight space decomposition (compare [33, formula (2.16)] and [32, theorem 6.38]).
The set of these roots is an abstract (reduced, crystallographic) root system ([10, theorem 19.2] or [32, theorem 6.44]), whose Weyl group is that which we have already associated to (cf. A.1); it is irreducible if and only if is simple. Furthermore, this induces a bijection between the isomorphism classes of semisimple compact Lie algebras and root systems ([32, corollary 7.55] or [33, corollary 7.3]), or equivalently, isogeny classes of semisimple compact Lie groups or isomorphism classes of semisimple simply connected compact Lie groups ([26, §7.3]); we clarify in A.13 below the classification of groups inside an isogeny class.
Note that, as a set, depends not only on but on the choice of the maximal torus used to define the weights (cf. A.1), or equivalently, not only on but also on : in fact, is a subset of the dual of ; however, as an abstract root system, it does not depend on this choice.
The following proposition follows immediately from what has already been said:
Proposition A.13**.**
If is a semisimple compact Lie group with rank , then the range of values taken by the adjoint character of is precisely the range of the function where is the Fourier transform of .
More precisely, if is a maximal torus in with Lie algebra , and is written for , then (and we have pointed out in A.1 that each element of is conjugate to an element of , which then obviously has ).
Proof.
As explained in 4, the weights of the adjoint representation are the elements of each with multiplicity , and [math] with multiplicity , i.e., , which is precisely the statement of the second paragraph. ∎
A.14. We briefly clarify the relation between isogenous (cf. A.1) compact Lie groups in the semisimple case (this subsection is required for completeness, but for the purposes of this paper we care only about the simply connected groups):
If is a semisimple compact Lie group, we have noted its root system can be defined directly from its Lie algebra and that of a maximal torus of (which is the same as a maximal abelian subalgebra of , cf. A.1), namely as the set of nonzero (where is the dual vector space to ) such that is nontrivial (cf. A.1). So the root lattice and weight lattice defined in A.1, inside , are defined at the Lie algebra level: they depend only on the isogeny class of . We have or equivalently where is the coroot and the coweight lattice (the inclusion follows from the fact that the weights of any representation of , as defined in A.1, belong to , and in particular the roots belong to ; the inclusion follows from the fact that the coroots can also be defined at the Lie algebra level).
The classification of compact Lie groups having Lie algebra is then as follows: is uniquely defined by giving the lattice satisfying , or equivalently ; and conversely, for any such , there exists a unique corresponding Lie group ; furthermore, the fundamental group of is Abelian, finite and canonically isomorphic to , and the center of is finite and canonically isomorphic to . ([44, chap. 4, §3, 6o, theorems 9 and 10]; see also [10, theorem 23.1] and [33, corollary 5.109].)
(In particular, the universal covering of a semisimple compact Lie group is still compact, and corresponds to taking equal to the coroot lattice . At the other extreme, the centerless group corresponds to taking equal to the coweight lattice ; this is also known as the “adjoint” form, because it is the image of the adjoint representation . The fundamental group of the adjoint form, or equivalently the center of the universal covering, is defined at the Lie algebra level, and is .)
Remark A.15**.**
If is a semisimple compact Lie group with maximal torus , we have already pointed out in A.1 that the set of conjugacy classes of can be identified (as a set) with , where is the Weyl group. Lifting to the Lie algebra of , it can be identified with . This point of view is particularly important when is simply connected ( is the coroot lattice) because then it can be shown that the “affine Weyl group” is an affine Coxeter group, having a fundamental domain, known as the Weyl alcove, which is the simplex whose vertices are [math] and the , where the are the fundamental coweights (cf. A.1) and are the coefficients of the highest root (cf. A.1). (See [31, chapter 11].)
A.16. We now briefly review the classification of irreducible representations of a semisimple compact Lie group as provided by “highest weight theory”.
As explained in A.1, the weights of a (finite-dimensional) representation of a semisimple compact Lie group (relative to the choice of a maximal torus ) are the such that is nonzero, the multiplicity being . Now fix a choice of positive roots of (cf. A.1): a highest weight of (or of its character, ) is a weight such that is not a weight for any positive root ; a dominant integral weight (for ) is a belonging to the closed Weyl chamber, i.e., such that for each simple (or equivalently, positive) coroot .
Highest weight theory tells us that ([33, theorem 5.5], [32, §8.3] or [44, chap. 4, §3, 7o, theorem 11] or [11, VI.1.7]):
- •
every irreducible representation of has a unique highest weight, which is a dominant integral weight, and its multiplicity is ;
- •
if is an irreducible representation of with highest weight , then the set of weights of is the intersection of , where is the root lattice, and of the convex hull of the orbit of under the Weyl group;
- •
two irreducible representations of are isomorphic if and only if they have the same highest weight (thus, we can speak of “the” irreducible representation with highest weight );
- •
every dominant integral weight is the highest weight of an irreducible representation of (it is unique by the previous point);
- •
if and are irreducible representations of with highest weights and respectively, then has highest weight and has a unique irreducible factor with that weight ([11, VI.2.8]).
The highest weight of the adjoint representation is the highest root ().
A.17. If is a semisimple compact Lie group and a maximal torus of , then the Weyl character formula ([32, §8.5], [33, theorems 5.75–5.77] or [11, VI.1.7]) expresses the value of the irreducible character with highest weight (i.e., the character of the irreducible representation with highest weight ) as the ratio of two skew--invariant polynomials on , namely
[TABLE]
where is the Weyl group and the group homomorphism taking the value on each reflection (i.e., is the determinant of acting on the Lie algebra of ); and the Weyl vector is half the sum of the positive roots, which is also the sum of the fundamental weights. The denominator of the above expression can be factored using the Weyl denominator formula:
[TABLE]
A.18. Assuming that (still a semisimple compact Lie group) is simply connected (so that is the weight lattice, cf. A.13), the irreducible representations having the fundamental weights (cf. A.1) as highest weights are known as fundamental representations, and their characters as the fundamental characters of .
A.19. The character ring of a compact Lie group is the ring generated by the characters of (that is, the set of differences between two characters of ) for pointwise sum and product. Equivalently, if we define a virtual representation of to be the formal difference of two (finite dimensional) representations, identifying with whenever and are isomorphic (“Grothendieck ring” construction), and if we define the (virtual) character of to be where is the character of , the character ring can be defined as the set of virtual representations of with addition and multiplication being defined as the direct sum and tensor product (extended in the obvious fashion to virtual representations).
To put it differently, the character ring of consists of -linear combinations of the irreducible characters (or representations) of , the product being defined by the decomposition into irreducibles of a product of characters (tensor product of representations). One can similarly define the character -algebra as the set of -linear combinations of the irreducible characters (or representations) of .
Highest weight theory implies that: (1) for a semisimple compact Lie group with maximal torus , the character ring of is simply the invariant part under the Weyl group of the character ring of (the latter being the group ring of , cf. A.1), and (2) when is, additionally, simply connected, the character ring is isomorphic to the polynomial algebra, with coefficients in , over indeterminates corresponding to the fundamental representations. ([11, VI.2.1 and VI.2.11].) The corresponding statements also hold with complex coefficients instead of integers.
Remark A.20**.**
If is a semisimple simply connected compact Lie group with maximal torus , then the character -algebra of can be identified (via restriction to ) with the set of -invariant trigonometric polynomials on (with complex coefficients), where is the Weyl group, or, lifting to , of -invariant (hence -invariant) combinations of the for . Also note that such functions are entirely defined by their values on the Weyl alcove (cf. A.15).
If we are mostly interested in the character values on (they determine those on by A.1), and in this paper we are, the irreducible representations of are something of a needless complication: the character ring of a semisimple simply connected compact Lie group has a -basis consisting of the sums for ranging over an orbit of the Weyl group acting on .
A.21. We have recalled in A.1 that every element of a compact Lie group belongs to a maximal torus ; when the torus in question is unique, the element is said to be regular. Assuming that is semisimple, this is equivalent ([10, theorem 22.3(ii)]) to saying that is not in the kernel of any for root (cf. A.1). Correspondingly, we say that an element of the Lie algebra of is regular when is regular, i.e., when for all (this means that is represented by an element in the interior of the Weyl alcove, cf. A.15; also compare [11, V.7.8]).
The following fact is crucial to the proof given in appendix B:
Proposition A.22**.**
Let be a semisimple simply connected compact Lie group. If and is a maximal torus containing , then the following are equivalent:
- •
the element is regular,
- •
the differentials of the fundamental characters of (cf. A.15) are independent at ,
- •
the differentials of the fundamental characters of restricted to are independent at .
In a more general context, the equivalence of the first two statements is due to Kostant ([35, theorem 0.1]) and Steinberg ([51, theorem 8.1]); however, since we are only considering compact Lie group, every element belongs to a maximal torus (i.e., is “semisimple”), making the proof of the equivalence considerably easier and giving the third statement as a byproduct (as detailed in [51, §8.2–8.6]).
A.23. We now briefly discuss how a subset of the nodes of the Dynkin diagram of a semisimple compact Lie group defines a Lie subgroup with the Dynkin diagram defined by the subset in question (i.e., the induced subgraph).
So let be a semisimple compact Lie group, fix a maximal torus in , and let be the corresponding Lie algebras and the root system of (cf. 4); choose a system of simple roots (where is the rank of ). Now if , this defines a root system , sometimes known as the parabolic subsystem associated to , namely the set “generated by” the for (see [31, §5.1] for a discussion, or [39, §12.1]), so that its Dynkin diagram consists of the nodes of that of labeled by elements of .
We now fix such an and explain how to define a Lie subgroup of with root system . See also [1, §7.3–7.4] for a more detailed and pedagogical account of this construction.
For , let inside be the corresponding weight space. Then (see [33, corollary 5.94], or [39, proposition 12.6] in a different context) is a (complex) Lie subalgebra of (sometimes known as a “parabolic Levi factor”), which can be further factored as a Lie algebra direct sum (i.e., with trivial bracket between the summands) of its center and its semisimple subalgebra where is the complex subspace spanned by the coroots for (seen as elements of ). Since and are stable under complex conjugation, they define (real!) Lie subalgebras and of , hence compact Lie subgroups , of having these Lie algebras (see [33, theorem 5.114]). Then is a semisimple Lie subgroup of having root system and maximal torus with Lie algebra (as for , it is isogenous to the product of with a torus of rank , so that it has the same rank as ).
(In fact, the only properties of used above are that it is a “closed” subsystem of : see [39, definition 13.2 and theorem 13.6].)
In appendix B, we will use the above construction in the case where is the complement of a single node in the Dynkin diagram.
Appendix B Serre’s result on characters of compact Lie groups
In this section, we prove the cases of [50, Theorem 3’]. The proof in the and cases has been kindly communicated to us by J.-P. Serre and only slightly adapted for symbolic computation with Sage (J.-P. Serre was able to perform the entire computation by hand and we have not attempted to reproduce this feat; any errors in the following expositions are, of course, entirely our own) and straightforwardly extended to compute all critical values of the adjoint character; in the case, J.-P. Serre referred us to a proof devised by A. Connes, which we do not follow here, preferring instead a straightforward analogue of the and cases, at the cost of considerably more computing power (the case does not seem doable by hand with the technique presented below).
Theorem B.1**.**
If is a semisimple compact Lie group of type , or respectively, and its adjoint character (cf. A.1); then is equal to , or respectively.
(The proof for and has been given in 5.3 and 5.4 respectively.)
In each case, we divide the proof in two steps: the reduction step and the computation step, the second being itself subdivided into an elimination substep and a ruling-out substep.
We will call the simply connected semisimple compact Lie group of type , or as the case may be, its maximal torus (cf. A.1) and the Lie algebra of the latter.
The reduction step uses the trick (†) explained below (and based essentially on A.22) to reduce the number of variables by observing that any critical point of must lie on certain linear subspaces of . The computation step then finds the critical values of some polynomial function of several variables by elimination theory: there are slight variations in each of the cases below, but broadly speaking, consider the ideal of generated by and (defining the — often [math]-dimensional — algebraic variety of critical points of ) and use a Gröbner basis for some elimination order (that is, a monomial order such that for all not all zero) to find the projection of this variety on the coordinate (represented by a polynomial in which one factors to find the actual values, which for some currently mysterious reason happen to be always rational); unfortunately, elimination theory considers all complex values of , so there are many spurious values, and one must then consider each computed value, or at least those that are smaller than the actual minimum, and rule them out by showing that, for some reason, they cannot be realized for real values (generally by noticing that some other element in the Gröbner basis does not have roots in the domain considered).
Let us now explain the idea of the reduction step in more detail. We generally follow appendix A for notation: for example, we call the root system of .
The reduction trick (†) is as follows. Suppose is a critical value of the adjoint character (which is a fundamental character in each of , and ), or more generally that it is a critical value of any polynomial in the fundamental characters in which no fundamental character appears more than once; then the result of Kostant and Steinberg A.22 implies that is on a root hyperplane (where and ); now the affine Weyl group (cf. A.15) acts transitively on the set of such root hyperplanes and preserves all character values, so we can assume that the lies on the root hyperplane , where is the highest root, which (for , , , ) is also one of the fundamental weight, say . (This can be read by taking the extended Dynkin diagram for : it is the node to which the extender node attaches.) In other words, the coordinates of on the basis of coroots have a zero at coordinate (the one which is measured by ): so lives, in fact, in the hyperplane generated by for . Now, as explained in A.22, the linear subspace of generated by for (here ) is, in a natural way, the Lie algebra of the maximal torus of a semisimple Lie subgroup of (denoted in A.22), whose root system has a Dynkin diagram obtained from that of by keeping only the roots labeled by an element of , so, in our case, by deleting node . So by restricting the character (initially the adjoint character of ) to , we are left with a character on a Lie group having a rank smaller by . Of course, the character restricted to the subgroup in question is no longer the adjoint character, but it can be expressed in terms of the fundamental characters of using standard tables of “branching rules” or a computer program like Sage444http://doc.sagemath.org/html/en/thematic_tutorials/lie/branching_rules.html (in fact, branching are typically given first for the restriction from to an intermediate subgroup , maximal in , and described through the removal of the node from the extended Dynkin diagram of by means of so-called “Borel-de Siebenthal theory: see [56, §8.10] or [31, chapter 12]; the restriction from to is then straightforward as is a factor of , and we will give both in what follows).
In what follows, we number the nodes of the Dynkin diagrams as in Bourbaki (cf. figure 4). We write for the -th fundamental representation of the simple group , and for the adjoint representation: thus, and and . More generally, we write for the character with highest weight (written as a combination of fundamental weights ), so that is an abbreviation for .
Theorem B.2**.**
The set of critical values of is , , , and . We therefore have and .
Proof.
If is a critical point of , i.e., if the differential vanishes there, then by (†) we can assume that belongs to (the Lie algebra of the maximal torus of) the Lie subgroup defined by removing node from the Dynkin diagram of .
Now . In more detail, the branching rule for the maximal subgroup of gives: as witnessed by Sage:
sage: E6 = WeylCharacterRing("E6", style="coroots") sage: br = branching_rule(E6, "A1xA5","extended") sage: br.branch(E6(E6.fundamental_weights()[2])) A1xA5(1,0,0,1,0,0) + A1xA5(2,0,0,0,0,0) + A1xA5(0,1,0,0,0,1)
We then observe that
[TABLE]
and that
[TABLE]
so that
[TABLE]
Evaluating at the identity of (where ), we get
[TABLE]
as announced.
Now the fundamental characters of are the elementary symmetric functions of six variables ranging over the unit circle and constrained by (the eigenvalues of the element of ). This means that we are to compute the critical values of over (which is the maximal torus of ).
This concludes the reduction step (which is simpler in the case of than for ), and we now proceed to the computation step (which, compared to , has fewer cases to consider, but is computationally more challenging).
By elimination theory, we can compute the critical values for ranging over subject to : consider the ideal of generated by for (because saying that is proportional to means ) and also ; and perform elimination of the variables (by computing a Gröbner basis for a monomial order for which for any not all zero) in this ideal to obtain the projection on the coordinate of the critical points.
By computing the Gröbner basis of the corresponding ideal we obtain that the resulting set of possible critical values is , , , , and .
Now the critical value cannot be attained on
[TABLE]
We add the inequality to the ideal and recompute a Gröbner basis. In this basis we have .
And this is impossible because is the sum of the , which can only take the value in absolute value provided all the are equal to one and the same -th root of unity , in which case and and and by checking the possible one notices that does not, in fact, take the value .
The value , on the other hand, is attained, namely when three of the are equal to one primitive cube root of unity and the other three are equal to the other. So it is its minimum and so a critical value.
By further Gröbner basis computation we obtain the is attained only at . We also obtained that the critical value is attained only with four set at and two set at . The critical value is attained only by fixing two at , two at and two at . The critical value corresponds to a manifold of dimension . In this manifold is the point with four set at and two set at .
From the formula we get and then using Corollary 5.2 . ∎
Theorem B.3**.**
The set of critical values of is , , , , , , [math], , , . We therefore have and .
Proof.
If is a critical point of , i.e., if the differential vanishes there, then by (†) we can assume that belongs to (the Lie algebra of the maximal torus of) the Lie subgroup defined by removing node from the Dynkin diagram of .
Now
[TABLE]
Now since neither nor appear more than once (or with any exponent) in , we can apply (†) again: at a point where the differential of this expression vanishes, the differentials of the fundamental characters are not independent, so belongs to (the Lie algebra of the maximal torus of) the Lie subgroup defined by removing node from the Dynkin diagram of .
We have
[TABLE]
and
[TABLE]
giving:
[TABLE]
Again, there are no multiple occurrences of the various , so we can apply (†) one more time: at a point where the differential of this expression vanishes, the differentials of the fundamental characters are not independent, so belongs to (the Lie algebra of the maximal torus of) the Lie subgroup defined by removing node from the Dynkin diagram of .
Here the branching gets more complicated: We use
sage: WeylCharacterRing("D6").maximal_subgroups()
within Sage to find the correct rule for branching to , and in principle the two factors are not symmetric (although in the end it turns out that they are, up to a symmetry of ) so one must use br.describe() to chop off the correct factor (call it in what follows). We find:
- •
(from ).
- •
(from ).
- •
(from ).
- •
(from ).
Finally, we get:
[TABLE]
We can now apply the reduction trick (†) one last time, for the factor: since is obviously independent from the , at a point where the differential of the expression above vanishes, the differentials of the fundamental characters are not independent, so belongs to (the Lie algebra of the maximal torus of) the Lie subgroup defined by removing node from the Dynkin diagram of .
To compute this restriction, first examine the restriction of to the maximal subgroup of (seen by extending the Dynkin diagram of and removing the node connected to the four others): if we call the (single) fundamental characters of the various factors, numbered in the same way as the nodes of the extended diagram of from which they come (except that comes from the extending node), then and and and finally
[TABLE]
Finally, restricting to (the first factor being the factor in earlier and the other three coming from nodes of as described in the previous paragraphs), we have
[TABLE]
where is the fundamental character from the first factor and are the elementary symmetric functions in the fundamental characters of the three other factors.
We now need to find the critical values of this function
[TABLE]
There is one subtlety, however: “critical” means that for each , as well as for , we either have (resp. ) or (resp. ). Indeed, each (as well as ) is a character of , so it is for the two eigenvalues of the element of in question, so the critical values of itself are .
This concludes the reduction step for . The computation step is then to use elimination theory, in each possible case depending on how many of the satisfy and how many satisfy and , and similarly for , to find the corresponding values of .
For this, we must consider cases according to constraints placed on the (which can be set equal to or to or to satisfy , which we denote as “” for short) and on (similarly or or ). In each case, we consider the ideal of generated by the or or as the case may be, and similarly for , and also ; and perform elimination of the variables (by computing a Gröbner basis for a monomial order for which for any not all zero) in this ideal to obtain the projection on the coordinate of the critical points.
By considering all cases we find that the set of possible critical values is , , , , , , , , , , [math], , and . For each such value and each possible critical value we compute the manifold which turns out to be always [math]-dimensional. The points of those manifolds can be enumerated and we obtain the list of critical values by keeping only the values for which at least one of the point has , , , .
From the formula we get and then using Corollary 5.2 . ∎
Theorem B.4**.**
The set of critical values of is , , , , , , , . We therefore have and .
Proof.
We have and all the reduction step has already been explained above in the case: we get
[TABLE]
so that
[TABLE]
The computation step is then similar to , except there is now less symmetry between the (one can only exchange and ): one must therefore distinguish cases.
By computing a Gröbner basis we obtain that the set of possible critical values is , , , , , , , , , and . For each case and critical value we compute the corresponding manifold and its complex points. In each case except one the manifold is [math]-dimensional and for two [math]-dimensional cases the computation of the points does not finish. Those problematic cases occur for the value .
For the other cases we compute the points and keep only the values for which one of the points has , , , . It turns out that the value is attained by one of those points and so the problematic cases do not prevent us from concluding that is a critical value.
From the formula we get and then using Corollary 5.2 . ∎
One possible extension of this work could be to consider the non-simply laced diagrams, that is , , and .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Arvanitoyeorgos, An Introduction to Lie Groups and the Geometry of Homogeneous Spaces , AMS, 2003.
- 2[2] C. Bachoc, P.E.B. De Corte, F.M. de Oliveira Filho, and F. Vallentin, Spectral bounds for the independence ratio and the chromatic number of an operator , Israel J. Math. 202 (2014), 227–254. ( https://arxiv.org/abs/1301.1054 )
- 3[3] C. Bachoc, G. Nebe, F.M. de Oliveira Filho, and F. Vallentin, Lower bounds for measurable chromatic numbers , Geom. Funct. Anal. 19 (2009), 645–661. ( https://arxiv.org/abs/0801.1059 )
- 4[4] K. Ball, A lower bound for the optimal density of lattice packings , Internat. Math. Res. Notices 10 (1992), 217–221.
- 5[5] N. Biggs, Algebraic Graph Theory , Cambridge University Press, 1974.
- 6[6] B. Bollobás, Modern Graph Theory , Springer, 1998.
- 7[7] N. Bourbaki, Groupes et algèbres de Lie , Chap. 4–6, Hermann, 1968.
- 8[8] W. Bosma, J. Cannon and C. Playoust, The Magma algebra system. I. The user language , J. Symbolic Comput. 24 (1997), 235–265.
