A new local invariant and simpler proof of Kepler's conjecture and the least action principle on the crystalformation of dense type
Wu-Yi Hsiang

TL;DR
This paper introduces a new local invariant for sphere packing in three dimensions, providing a simplified proof of Kepler's conjecture and related principles by characterizing optimal packings as f.c.c. or h.c.p.
Contribution
It defines a novel local density invariant using only a single layer of spheres, leading to simplified proofs of Kepler's conjecture and crystal formation principles.
Findings
Optimal local packings are f.c.c. or h.c.p.
New local invariant characterizes dense packings uniquely.
Simplified proofs of Kepler's conjecture and crystal formation principles.
Abstract
A new locally averaged density for sphere packing in R^3 is defined by a proper combination of the local cell (Voronoi cell) and Delaunay decompositions (\S 1.2.2), using only a single layer of surrounding spheres. Local packings attaining the optimal estimate of such a local invariant must be either the f.c.c. or h.c.p. local packings (Theorem I). The main purpose of this paper is to provide a clean-cut proof of this strong uniqueness result via geometric invariant theory. This result also leads to simple proofs of Kepler's conjecture on sphere packing, least action principle of crystal formation of dense type, and optimal packings with containers (Theorems II-IV). This work provides a much simplified alternative to the author's previous work on Kepler's conjecture and least action principle of crystal formation of dense type which involved a local invariant defined by double layer of…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25
Figure 26
Figure 27
Figure 28
Figure 29
Figure 30
Figure 31
Figure 32
Figure 33
Figure 34
Figure 35
Figure 36
Figure 37
Figure 38
Figure 39
Figure 40Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Phase Equilibria and Thermodynamics · Mathematics and Applications
A new local invariant and simpler proof of Kepler’s conjecture
and the least action principle on the crystal formation of dense type
Wu-Yi Hsiang
Department of Mathematics, University of California, Berkeley, CA 94720
Department of Mathematics, HK University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
1 Introduction
1.1 Three kinds of sphere packings, various kinds of densities and problems of their optimalities
Basically, there are three different kinds of packings of spheres of identical size, namely
- (i)
packings with containers; 2. (ii)
finite packings without container, e.g. crystals; 3. (iii)
infinite packings with the whole space as the container.
For example, putting marbles into a jar, oranges into a box or soybeans into a silo are daily-life examples of the first kind; while pieces of crystals of gold, silver, lead etc. are Nature-created examples of the second kind. On the other hand, those infinite packings such as the f.c.c. packing, hexagonal close packings [Bar] and lattice packings are, in fact, just some mathematical models serving as the limiting situations of the first and the second kinds as their sizes tend to infinity.
In the study of sphere packings, the central problems are the problems of optimalities on various kinds of densities. Of course, it is necessary to first give a precise definition of the kind of density before studying the problem of its optimality.
Let be a packing into a given container. It is quite obvious that the density of such a packing should be defined as follows, namely
[TABLE]
while the optimal density of packing -spheres into is given by
[TABLE]
where runs through all packings of -spheres into . Note that will certainly depend on the shape of and the relative size between and -sphere. Anyhow, this motivates us to study
[TABLE]
where denotes the -times magnification of , and the following formulation of Kepler’s conjecture on sphere packings:
Kepler’s Conjecture** (2nd version).**
For a large class of , e.g. those with piecewise smooth , should always be equal to .
Problem of least action principle on the crystal formation of dense type (cf. [Hsi])
The physical shape of atoms can be regarded as microscopic spheres while a small piece of crystal of a monatomic element, such as gold and silver etc., often consists of billions of trillions of such microscopic spheres aggregated into a specific type of regular arrangement, exhibiting fascinating geometric regularity and remarkable precision. For example, the crystal structures of forty-eight chemical elements are of hexagonal close packing type which have the highest known density of . Thus, it is quite natural to pose the following type of “uniqueness” problem, namely
“How to properly define the density of packings of the second kind so that the above geometric regularity is actually the consequence of density optimality, which will be referred to as the least action principle of crystal formation of the dense type.”
The new local invariant of §1.2 will play the key-role of providing such a proper definition as well as a far-reaching localization for the proof of such a theorem (cf. Theorem III, §2.1).
1.2 A simple local invariant and the definition of global densities of packings of the second and third kinds
In his booklet of 1611 [Kep], Kepler had already introduced the concept of local cell, nowadays often referred to as Voronoi cell, which assigns a surrounding convex polyhedron to each given sphere in , consisting of those points that are as close to the center of , say , as to that of others, say , namely
[TABLE]
where is the -side of the perpendicular bisector of . We shall always assume that are bounded for every in , thus the above intersection can always be reduced to that of a finite, irreducible intersection, namely
[TABLE]
Note that an index belongs to the above minimal set when and only when and share a common face and such a pair are defined to be neighbors of each other. Anyhow, local cell and neighbor are the most basic concepts on the geometry of sphere packings that all the other important ones are based upon.
1.2.1 Local cell decomposition and its dual decomposition
Let be a given infinite packing with all of its local cells of bounded type, each of them containing a single sphere of . We shall denote them simply by and call it the local cell decomposition of . Moreover, the above decomposition has a natural dual decomposition into convex polyhedra with centers of spheres of as their vertices, namely, the Delaunay decomposition. The duality between such a pair of fundamental decompositions associated to a given can be summarized as follows:
- (i)
The set of vertices of the D-decomposition are . 2. (ii)
The set of edges of the D-decomposition are those linking the centers of neighboring pairs those common faces of situated on the perpendicular bisector of . 3. (iii)
The set of faces of the D-decomposition are those polygons spanned by the centers of those local cells with a common edge { their common edge, perpendicular to the face at its circumcenter }. 4. (iv)
The set of convex polyhedra spanned by the centers of those local cells with a common vertex { their common vertex which is exactly the circumcenter of its corresponding D-cell } .
1.2.2 A new kind of locally averaged density
(cf. §1.4 of [Hsi] for another kind.)
Set (resp. ) to be the indices sets of the L-cells (resp. D-cells) of the above dual pair of decompositions associated to a given , and set
[TABLE]
Definition**.**
To each D-cell , set
[TABLE]
and call it the density of the D-cell . To each , the locally averaged density of at is defined to be
[TABLE]
Note that, for each given , there are only a rather small number of with non-zero and .
Remarks**.**
- (i)
In retrospect, the cluster of spheres centered at the vertices of a given can be regarded as the subcluster of of the most localized kind. Thus, can be regarded as a kind of ultimate localization of the concept of densities associated to a given . 2. (ii)
Note that is, itself, a weighted average of that makes use of the dual pair of decompositions. Of course, its usefulness will only be determined by the ultimate test of whether it can provide a better result in the study of global optimalities of sphere packings, (cf. §2).
1.2.3 Relative density and global densities of the second and third kind
Let be an infinite packing and be one of its finite subpackings.
Definition (relative density).
The relative density of in , denoted by , is defined to be
[TABLE]
Definition (intrinsic density).
Let be a given finite packing without container. The intrinsic density of , denoted by , is defined to be
[TABLE]
where runs through all possible extensions of .
Definition (global density of infinite packings).
[TABLE]
where runs through all possible exhaustion sequences of .
Example 1.1**.**
Let be a hexagonal close packing. Then
[TABLE]
and hence
[TABLE]
for all finite subpackings in , and is also equal to .
Proof.
The local cluster of D-cells that occur in (1.8) consists of octuple regular 2-tetrahedra and sextuple regular 2-octahedra. Therefore their volumes (resp. total solid angles) are given by
[TABLE]
and hence
[TABLE]
where . Thus
[TABLE]
∎
1.3 Fundamental problem and fundamental theorem of sphere packings
Note that a pair of spheres with their center distance less than are automatically neighbors of each other (i.e. their local cells must share a common face whatever the arrangements of the others). Thus, it is natural to introduce the following definition of clusters of spheres:
Definition**.**
A finite packing of -spheres is called a cluster if any pair of them can always be linked by a chain with consecutive center distances less than .
Remark**.**
A single sphere is, of course, regarded as a special case of cluster.
Fundamental Problem of Sphere Packings**.**
Set to be the optimal intrinsic density of all possible -clusters, namely
[TABLE]
where runs through all possible clusters of spheres. What is equal to? and what are the geometric structures of those -clusters together with their tightest surroundings with ?
In the beginning case of , is just the optimal locally averaged density. The above fundamental problem naturally leads to the proof of the following fundamental theorem, namely
Theorem I**.**
The optimal locally averaged density is equal to and when and only when the local packing is isometric to either that of the f.c.c. or the h.c.p. packing, which surrounds with twelve touching neighbors with their touching points as indicated in Figure 1.
2 Major theorems on global optimalities of sphere packings and their proofs via Theorem I
In this section, we shall state and deduce the major results on the optimalities of global densities of sphere packings as consequences of Theorem I.
2.1 Statements on the optimalities of the global densities of sphere packings (i.e. of the third, second, and first kinds)
Theorem II** (Kepler’s conjecture, 1st version).**
The optimal global density of infinite sphere packings is equal to , namely
[TABLE]
Theorem III** (Least action principle of crystal formation of dense type).**
[TABLE]
and when and only when the -cluster together with its tightest surrounding is a piecewise hexagonal close packing, namely, an assemblage of pieces of subclusters of hexagonal close packings.
Theorem IV** (Kepler’s conjecture, second version).**
For all kinds of containers with piecewise smooth boundaries ,
[TABLE]
2.2 Deductions of Theorems II, III, and IV via Theorem I: A far-reaching localization on global optimalities of sphere packings
Proposition 2.1**.**
Theorem I implies Theorem II.
Proof.
Let be an infinite packing and be one of its exhaustion sequences of finite subpackings. Then, by Theorem I
[TABLE]
Therefore
[TABLE]
and hence
[TABLE]
∎
Proposition 2.2**.**
Theorem I implies Theorem III.
Proof.
Let be a given -cluster and be one of those extensions of . Then, by Theorem I,
[TABLE]
and equality holds when and only when (resp. ) is the same as that of the f.c.c. or the h.c.p. Therefore
[TABLE]
and the above equality holds when and only when all of are either that of the f.c.c. or that of the h.c.p. Hence and when and only when all of are either that of the f.c.c. or that of the h.c.p.; it follows from the cluster condition that such a collection of local cells are glued together along their common faces, and such a gluing is possible only when all the local pieces of
[TABLE]
constitute subclusters of certain hexagonal close packings. ∎
Proposition 2.3**.**
Theorem I implies Theorem IV.
Proof.
Let be a given container with piecewise smooth boundary . Therefore, for sufficiently large , is locally almost flat everywhere except those corner or edge points. Let be a packing of unit spheres into . The same kind of local cell decomposition can be generalized to such a setting of . We shall call a sphere an interior (resp. boundary) sphere if has no face belonging to (resp. otherwise). Set (resp. ) to be the subset of interior (resp. boundary) spheres of . However, the corresponding dual decomposition of into D-cells certainly needs some kind of modification. Note that the same kind of D-cells construction still applies up until those D-cells containing some faces solely spanned by centers of spheres in . Set to be the union of such D-cells and to be the complementary region of in , namely
[TABLE]
which constitute a collar region lying between and . For the sake of simplicity, we shall regard the whole as a single D-cell and setting
[TABLE]
thus completing the D-decomposition of with respect to the given . Using such a pair of L-decomposition and D-decomposition, we shall again define , , and by the same kind of weighted average as that of (1.7) and (1.8).
Now, let us proceed to analyze and then to estimate which is geometrically a “total measurement” of the “boundary effect” for . It is quite natural to make use of the almost local flatness of to provide the following upper bound estimate on via the method of localization.
For the purpose of such a localized upper bound estimate, one may assume without loss of generality that is, actually, locally flat instead of just locally almost flat. Thus, the local geometry of arrangement of along can be represented by that of arranging spheres on top of a “table” (i.e. a piece of plane), and moreover, such local arrangements can also be regarded as the half of their corresponding reflectionally symmetric local arrangements, thus enabling us to analyze the localized densities of those D-cells of the latter.
Example 2.1**.**
The average density of a star cluster of such D-cells is at most equal to , and it is equal to when and only when and its surrounding sextuple neighbors forms a close hexagonal cluster of spheres touching the table.
Proof.
Each D-cell of such a star cluster is an upright triangular prism as indicated in Figure 2.
It is easy to show that the density of such a D-cell is at most equal to and it is equal to when and only when are all equal to 1 and is a regular triangle of side length 2.
Therefore, it follows from the above localized density estimate that is at most equal to and hence, by Theorem I
[TABLE]
On the other hand, to any , there exists sufficiently large such that the volume of the following subregion of , namely
[TABLE]
already exceeds -times of . Let be the subpacking of hexagonal close packing consisting of those spheres with their centers lying inside of . Then, it is easy to show that
[TABLE]
This proves that . ∎
∎
3 A concise summary on basics of solid geometry - the geometric invariant theory of the physical space
Solid geometry studies the properties of the physical space, the space that we and everything else of the universe are situated inside. In its modern setting and the most effective and advantageous formulation, it is the geometric invariant theory of the physical space. In this section, we shall provide a concise summary on the basics of geometric invariant theory of the physical space (often referred to as the Euclidean 3-space in mathematical terminology), which will supply the fundamental geometric ideas as well as basic useful techniques along our journey of proving Theorem I.
3.1 Vector algebra, the basic part of linearizable geometric invariant theory of the space
The most basic symmetric property of the space is that it is reflectionally symmetric with respect to any given plane . The totality of all such reflectional symmetries generates a fundamental transformation group on the space , say denoted by , which is the group of isometries of (often referred to as the Euclidean group), while the solid geometry studies the invariant theory of this fundamental transformation group.
- (1)
The translation subgroup of : Let be the reflectional symmetry with respect to a given plane in . If , then the composition leaves every common perpendicular line invariant (i.e. mapping onto itself) and pushes its points along by a distance twice of the distance between and . Thus, it is called a translation. It is well-known that the subset of all translations form a commutative subgroup of , say denoted by , which is algebraically isomorphic to , and moreover, it is an invariant (i.e. normal) subgroup of . 2. (2)
The orthogonal subgroups of : Let be a given point in (or rather, a chosen base point in ). Then, the subgroup of generated by the collection of reflectional symmetries with respect to those planes containing , namely, , is exactly the isotropy (i.e. stability) subgroup of fixing , i.e.,
[TABLE]
which will be, henceforth, referred to as the orthogonal subgroup fixing . 3. (3)
From the viewpoint of the geometric transformation group of acting on as isometries, one has the following generalities, namely:
- (i)
The translation group acts simple transitively on , while the group , itself, acts simple transitively on the set of all orthonormal frames, say denoted by , in particular, the subgroup acts simple transitively on the subset of orthonormal frames based at . Therefore, is geometrically a flat homogeneous Riemannian space with as the isometry group,
[TABLE]
Algebraically, one has the following diagram of homomorphisms,
G(V)$$\left.G(V)\middle/T\right.$$G_{p_{0}}$$T$$\subset$$\cong$$\cong$$O(3)
and moreover, in terms of modern concept of principal bundles, is an important example of principal bundles, namely
G_{p_{0}}$$G(V)$$V$$\mathcal{F}_{p_{0}}$$\cong$$\cong$$\mathcal{F}(V) 4. (4)
The vector algebra: Geometrically, a translation is a motion of the whole space in a given direction (i.e. the common perpendicular direction of ) by a given distance (i.e. ), which can be visualized by the equivalence class of directional intervals , thus will be called a vector and denoted by a bold-face lower case latin letter such as etc. or by exhibiting just one of its equivalence classes of directional intervals such as . Such quantities with both directions and distances will be the most basic kind of geometric quantity which will be, henceforth, referred to as vectors, and furthermore, we shall change the notations to denote the translation group by , instead of , and the commutative group operation as addition “+”. Algebraically, is naturally endowed with the following three kinds of multiplications, namely
- (i)
the scalar multiplication, , which is the algebraic representation of homothety magnification; 2. (ii)
the inner product, , which is a remarkable synthesis of length-angle and the generalized Pythagoras Theorem; 3. (iii)
the outer product, , which encodes concisely the multi-linearity of the oriented area and the oriented volume.
In summary, together with the above three kinds of multiplication constitutes a systematic, complete algebraization of the basic geometric structure of the space, in which the basic theorems and formulas of quantitative solid geometry have been transformed into powerful distributive laws of their respective multiplications (i.e. multi-linearity). In short, the vector algebra provides a simple, wholesome algebraic system which accomplishes the algebraization as well as linearization of the basic foundation of geometric invariant theory in excellence.
Remarks**.**
- (i)
Of course, the geometric invariants of the space can not be algebraized entirely. For example, the totality of solid angle (i.e. the area of the unit sphere) is equal to , the monumental contribution of Archimedes, is a transcendental invariant which can only be understood via integration. 2. (ii)
is a commutative invariant subgroup of , thus having the adjoint -action reduces to an orthogonal action of , while the above three kinds of products are -invariant and bilinear.
3.2 Basic formulas of spherical trigonometry
The study of spherical trigonometry has a long history, at least dating back to antiquity, due to its importance in quantitative astronomy and in solid geometry. The proof of the fundamental area formula of spheres (cf. §3.2.1) by Archimedes is a monumental achievement of Greek geometry, a glorious milestone in the civilization of rational mind.
3.2.1 Basic properties and basic theorems of spherical geometry
A given spherical surface, , is reflectionally symmetric with respect to those planes, , containing its center . Thus, intrinsically speaking, it is reflectionally symmetric with respect to those great circles . Therefore, the geometry of such a spherical surface has the same kind of reflection symmetries as that of a Euclidean plane, and hence it also has the same kind of congruence conditions of triangles such as SAS, ASA, SSS etc. and also the same kind of “isosceles triangle theorem” together with many of its consequences.
Note that two spheres of equal radius are translationally congruent to each other, while two concentric spheres are just homothetically different by a magnification. Thus, in the study of spherical geometry, it suffices to study the normalized model of the unit sphere . First of all, the most important theorem of spherical geometry is the fundamental theorem of Archimedes, namely
Archimedes Theorem:
The total area of the unit sphere is equal to .
Corollary 3.2.1**.**
The area of a spherical triangle is equal to the excess of angle, namely
[TABLE]
which is a kind of localization of the above theorem.
As usual in spherical trigonometry, the angles (resp. side-lengths) of will simply be denoted by (resp. ). Let , , . Then , etc. Set
[TABLE]
Then, one has
[TABLE]
Therefore, one has very simple proofs of both the spherical sine law and the spherical cosine law as a straightforward application of vector algebra, namely:
Spherical sine law:
[TABLE]
Spherical cosine law:
[TABLE]
Note that Corollary 3.2.1 is a rather beautiful area formula of AAA-type. It would be useful to derive area formulas of SSS-type and SAS-type, similar to the Heron’s formula and in the case of Euclidean triangles, making use of the above two laws to express in terms of . This idea naturally leads to the following:
Area formulas of SSS-type and SAS-type:
[TABLE]
where .
Proof.
Direct substitution of
[TABLE]
into the well-known formula of
[TABLE]
and algebraic simplification will show that
[TABLE]
Therefore,
[TABLE]
∎
Remark**.**
Area is the most fundamental geometric invariant of spherical triangles. Thus, the three types of area formulas will play their central roles in the entire geometric invariant theory of spherical trigonometry.
3.2.2 Basic geometric invariants of spherical triangles and basic formulas of spherical trigonometry
First of all, spherical triangles are an important class of basic geometric objects which have quite a few basic geometric invariants, for example side-lengths, angles, area, circumradius, inradius, determinant, etc. Moreover, in the study of various kinds of problems of solid geometry, such as the sphere packing problem that we treat in this paper, the key geometric invariants that will naturally emerge are often expressible in terms of those basic geometric invariants of spherical triangles, such as the locally averaged density of sphere packings that we discussed in §2. In fact, the important and useful part of the geometric invariant theory of spherical triangles lies in the intricate system of relations among their rich family of invariants.
Notations and basic geometric invariants of spherical triangles:
To a given spherical triangle , the Euclidean triangle , the 1-isosceles tetrahedron with as the vertices and the portion of the solid angle cone bounded by the tangent planes at , namely
[TABLE]
are a triple of geometric objects in the space canonically associated with it. Therefore, their geometric invariants should also be considered as that of itself.
As indicated in Figure 3, and
[TABLE]
and it passes through the circumcenters (resp. ) of (resp. ). Thus the circumcentric subdivisions of (resp. ) corresponds to each other under radial projection.
Notations: We shall use the following system of notations:
- (i)
The circumradius of (resp. ) will be denoted by (resp. ), while their inradius will be denoted by (resp. ); the half-sidelengths of will be denoted by and the central angles at (resp. ) will be denoted by which is in fact equal to the inner angles of . 2. (ii)
Set to be the oriented distances between and its three sides, and to be the oriented side-angle of the circumcentric subdivision. 3. (iii)
Set to be the spherical triangles of (i.e. solid angles at ), (resp. ) to be the areas of (resp. ) and
[TABLE]
Spherical trigonometric formulas:
The intricate system of formulas relating various kinds of geometric invariants of spherical triangles constitutes a set of important, useful techniques of solid geometry. The following is just a concise summary of those often useful ones. All of them can be deduced by means of the basic formulas of §(3.2.1) or sometimes by a direct application of vector algebra. Mostly, it is their clean-cut statements and usefulness that’s important, interesting and sometimes quite novel.
- (1)
Let us begin with the special case of right-angle spherical triangles (i.e. ). Their trigonometric formulas become particularly simple and hence much easier to use. However, via the canonical circumcentric subdivision of a general one, such extremely simple formulas can be used to study that of the general spherical trigonometry.
- (i)
[TABLE]
and moreover, 2. (ii)
[TABLE]
and the following special form of area formulas 3. (iii)
[TABLE] 2. (2)
Trigonometric formulas of geometric invariants of the circumcentric subdivision of (resp. ).
- (i)
[TABLE] 2. (ii)
The SSS (resp. AAA and SAS) data of , etc. are given as follows, namely
[TABLE]
and moreover,
[TABLE] 3. (iii)
[TABLE]
in the case that contains its circumcenter.
Considering the sum of distances between and the three sides in the first equation of (3.22), one has:
[TABLE]
Substituting (ii), (iii), (iv) into the last equation of (i), one gets
[TABLE]
The second equation in (3.22) can be proved by the following vector algebra computations, namely: As indicated in Figure 3, is the non-overlapping union of a triple of cones 111In our terminology, a cone can have any flat base; it doesn’t need to be circular. with as their common vertex and with , , as their respective bases whose area together with direction can be represented in terms of vector algebra by:
[TABLE]
respectively, while . Therefore
[TABLE]
while the last step is a matter of vector algebraic computations and simplifications. 3. (3)
Half angle formulas and incentric subdivision:
Set . Then, by cosine law,
[TABLE]
Therefore, one has
[TABLE]
and via the geometry of incentric subdivision,
[TABLE]
Remark: Roughly speaking, there are two types of invariants of spherical triangles, namely, those partial individual ones such as lengths, angles, , and etc. and those wholesome and symmetric ones such as area , , , , etc. The simplest and also the most basic invariants of individual type are or or , while , , , and naturally emerge as those most important wholesome invariants.
Example 3.1**.**
Let be the -isosceles triangle with as its top angle, , (cf. Figure 4). Then
[TABLE]
Example 3.2**.**
Let , , be half of the spherical rectangle as indicated in Figure 4 (i.e. ). Then
[TABLE]
3.2.3 Geometric invariants of spherical quadrilaterals
Geometrically, a spherical quadrilateral can be subdivided into a pair of spherical triangles by one of its diagonals. Therefore its geometric invariants can always be expressed in terms of that of its pair of triangles, thus expressible in terms of the basic invariants of such a pair of triangles with a common edge, in particular, the cosine of the other diagonal. Algebraically, let be the quadruple of unit vectors of the vertices of a given spherical quadrilateral . The sextuple of cross inner products (i.e. the cosines of side-lengths and diagonal lengths) of course consists of a complete set of congruence invariants but with one functional relation, thus making any quintuple subset already consisting of a complete set of congruence invariants. In fact, this is exactly the fundamental result on the relations among unit vectors for . Anyhow, it will be a useful tool in the analysis of spherical configurations to have a kind of simple algebraic formula to express any one of the above sextuple of cross inner products in terms of the other quintuple.
A simple method and an advantageous relation for basic invariants of spherical quadrilaterals
Suppose is the one that we would like to compute in terms of the other quintuple of cross inner product of . Then and are a pair of spherical triangles with as their common edge, while their orientations may be the same or opposite to each other. Set and to be their determinants. Then, obviously
[TABLE]
while
[TABLE]
[TABLE]
[TABLE]
Therefore, it is quite simple to use the above equations to solve in terms of the others.
3.3 Area estimates and area preserving deformations
3.3.1 Some corollaries of the area formulas
In the study of spherical geometry, the area is the most important invariant, while the Archimedes Theorem and the area formulas (i.e. A.A.A., S.S.S. and S.A.S.) of triangles are the fundamental theorems and powerful tools. In this subsection, we shall derive some useful corollaries of the area formulas.
Corollary 3.3.1**.**
For with given and fixed ,
[TABLE]
and equality holds when and only when .
Proof.
Set and . Then,
[TABLE]
and the above inequality follows directly from the realness of . Moreover, the pair of roots of the above quadratic equation actually correspond to the pair of opposite angles of the centrosymmetric quadrilateral with side lengths , namely, the pair of roots . Thus, the equality holds when and only when and the intersection point of the pair of diagonals is actually the circumcenter. ∎
Remark**.**
It follows from the S.A.S. formula of that
[TABLE]
Corollary 3.3.2**.**
Consider as a function of (i.e. the S.A.S. data of ). (resp. ). Then
[TABLE]
where .
Proof.
Set and fixed (i.e. regarding and as constants). Then
[TABLE]
Therefore, by differentiation w.r.t , one has
[TABLE]
while the differentiation of the cosine law gives that
[TABLE]
thus having
[TABLE]
∎
Corollary 3.3.3**.**
The areas of spherical quadrilaterals with as given side-lengths have a unique maximum at the cocircular one. Set and being the cutting diagonal. Then that of the cocircular one is given by
[TABLE]
Proof.
Set (resp. ) to be the triangles with (resp. ) as their side-lengths and (resp. ) to be that of .
Then the area, , of such quadrilaterals are given by:
[TABLE]
and by Corollary 3.3.2
[TABLE]
On the other hand, set and to be the tangent planes at vertices , and , . Then, it is not difficult to show that
[TABLE]
thus having
[TABLE]
Therefore when and only when , cocircular. Set to be the unique solution of . Then
[TABLE]
∎
Recall that, in the case of plane geometry, the well known S.S.S. area formula of triangles has a beautiful generalization to that of cocircular quadrilaterals, namely
[TABLE]
Therefore, it is interesting to seek a version of the above formula in the realm of absolute geometry. Thus
Corollary 3.3.4**.**
Let be the cocircular spherical quadrilateral with as its side-lengths. Set
[TABLE]
Then
[TABLE]
Proof.
By Corollary 3.3.3,
[TABLE]
in which is given by the specific formula of Corollary 3.3.3. It is an interesting computation of trigonometric algebra that
[TABLE]
∎
Corollary 3.3.5**.**
Let (resp. ) be a spherical triangle (resp. cocircular polygon) containing its circumcenter with side-lengths at least equal to . Then its area (resp. ) is at least equal to that of an equilateral one, namely
[TABLE]
3.3.2 Area-preserving deformation and ()-representation of the -level surface
For fixed (or equivalently ) the family of such triangles with side-lengths ordered as and parametrized by constitutes a basic type of area-preserving deformations, characterized by the property of also fixing its largest angle . Geometrically, the congruence classes of spherical triangles with a given area constitutes a 2-dimensional subset of the moduli-space of congruence classes, which will be referred to as a -level surface. Note that the area is naturally the most important, basic geometric invariant, while area-wise estimates of various kinds of geometric invariants such as , etc., and the geometry of various kinds of area-preserving deformations naturally constitutes a useful system of basic techniques of solid geometry. Moreover, it will be advantageous to provide a suitable organization of simple kinds of area-preserving deformations such as the above one fixing an angle and the Lexell’s deformations fixing a side length (cf. Example 2.1.3, p 59 [Hsi]).
The -representation of -level surface
Note that already constitutes a complete set of congruence invariants for the family of spherical triangles containing their circumcenters and with edge-lengths of at least . For the purpose of this paper, it suffices to consider the range of up to 0.97.
For a given value of , it is convenient to parametrize the -level surface by , thus representing it as a domain in the -plane. It is natural to subdivide into two cases, namely:
- •
Case 1: ,
- •
Case 2: and at most equal to 0.97
Case 1:
Note that the special case of only consists of a single point (i.e. ). Thus, we shall assume that and at most equal to . For such a given , there exists a unique -isosceles (resp. equilateral) triangle with their areas equal to , and moreover, a continuous family of isosceles triangles of area linking them in between, namely, with as their -coordinates where
[TABLE]
Now, beginning with such an isosceles triangle, say denoted by , with and given by (3.36), one has the area-preserving deformation keeping the fixed, while increasing up until either its shortest side-length already reaches the lower-bound of , or it becomes an isosceles triangle with as its base angles. Therefore, the domain of representation of such a -level surface is as indicated in Figure 6-(i), where represents the unique -isosceles triangle with the given area and -base, namely
[TABLE]
Case 2:
Set to be the side-length of the spherical square with its half area , namely
[TABLE]
Therefore, the only difference between Case 2 and Case 1 is that is bounded above by . Thus, the domain of representation for Case 2 is as indicated in Figure 6-(ii).
Note that the boundary of the above -domain consists of the following segments, namely
- (i)
The horizontal segment: representing those -isosceles with and given by (3.36), 2. (ii)
The slant segment: representing those with as their shortest side-length, the deformation along it is exactly the Lexell’s deformation fixing the side (cf. §3.3.5) 3. (iii)
The curved segment: Representing those isosceles with their base angles larger than their top angles. 4. (iv)
The vertical segment: In the case of and , one has an additional vertical segment: , representing those with , each of them is the half of a spherical rectangle with area and side-lengths of at least , while the corner point is exactly the with . (cf. Example 3.2)
3.3.3 Basic geometric invariants of isosceles spherical triangles
Let be an -isosceles spherical triangle with given area . Set , to be the length of its base, and to be its height. Then
[TABLE]
thus enabling us to solve as a root of the above quadratic equation, a simple function of and .
Next let us compute those wholesome basic geometric invariants of such a , namely, as follows:
[TABLE]
where , and .
3.3.4 Geometric analysis of the -deformation
For a given and fixed pair of (resp. ), set to be the unique isosceles triangle, namely
[TABLE]
Let be the triangle with as its S.A.S. data, and . We shall call it the -deformation of . Set
[TABLE]
We shall proceed to compute those basic geometric invariants of in terms of and that of which we derived in the previous part.
[TABLE]
Remark**.**
In our shorthand notation , these relations are summarized as:
[TABLE]
[TABLE]
The expression for requires expressions for . We first derive an expression for :
[TABLE]
Finally, let us proceed to compute as follows:
[TABLE]
Therefore,
[TABLE]
where
[TABLE]
Thus, straightforward substitutions will show that
[TABLE]
Taking a power series expansion in , one obtains to first order:
[TABLE]
3.3.5 Lexell’s deformation and some specific kinds of area-preserving deformations
Example 3.3**.**
(Lexell’s deformations (cf. Example 2.13, p. 59 of [Hsi])). As indicated in Figure 7, is an -“rectangle” with 2 as its area, is the center of the Lexell’s circle, i.e. passing , and , (the antipode of , ). Set its radius to be and
[TABLE]
Then
[TABLE]
Example 3.4**.**
(The computation of ). Note that where the S.A.S. data of such a is given by , and given by (3.36).
Therefore,
[TABLE]
Example 3.5**.**
(The computation of .
Note that where are those isosceles triangles with given and the given by (3.36) as their base angles. Set (resp. ) to be the equal side-length (resp. base-length and top-angle) of such a . Then
[TABLE]
Therefore,
[TABLE]
and hence
[TABLE]
Thus, for
[TABLE]
where
[TABLE]
3.3.6 Remarks on the behavior of
- (i)
For each given pair of , it follows from the analysis of §3.3.4 that is an increasing function of with as its minimum, while the above computation provides an estimate of its maximum (i.e. with only a rather small increment of -order. 2. (ii)
For each given area , is the unique minimum of . Thus the values of with small only have very small second order increments above the minimum value of . 3. (iii)
See Figure 9 for the graphs of as functions of with given values of .
- (iv)
In order to present a simplified, over-all picture of the result of this section on the geometric analysis of , we plot the graph of the following four critical functions of in Figure 10, namely
[TABLE]
4 A concise review of some highlights on the geometry of Type-I spherical configurations
A spherical configuration with twelve vertices and edge lengths of at least will be, henceforth, referred to as a Type-I (spherical) configuration. The moduli space of congruence classes of Type-I configurations constitute a real semi-algebraic set of twenty-one dimension which has quite a few interesting properties such as those theorems of §7.1 in [Hsi]. In this section, we shall review some highlights of such special results on the geometry of Type-I configurations which will play a useful role in the proof of Theorem I.
Geometrically, to each given Type-I configuration , there exists a unique Type-I local packing containing twelve touching neighbors with as the touching points together with the tightest extension of additional neighbors, if any, thus achieving the highest locally averaged density which shall be defined to be the associated locally averaged density of such a Type-I configuration. Anyhow, one has a function of locally averaged density defined on the moduli space of Type-I configurations, namely
[TABLE]
while the proof of Theorem I for the the major and most critical case of Type-I local packings amounts to prove that the above function has the f.c.c. and the h.c.p. configurations as the unique two maximal points with . Thus, it is quite natural that our review should begin with the following geometric characterization of these two outstanding Type-I configurations, namely, the f.c.c. and the h.c.p.
4.1 On some special features and the geometric characterizations of the f.c.c. and the h.c.p.
Let be a given Type-I configuration. Then is already -saturated, or equivalently, the circumradii of those faces of are all less than . Therefore the areas of a triangular (resp. quadrilateral) faces of a Type-I configuration are at least equal to
[TABLE]
while
[TABLE]
Hence, a type-I configuration can have at most six quadrilaterals and the f.c.c. and the h.c.p as indicated in Figure 1 are the only two such configurations with six quadrilaterals, say Type-I configurations of 6-type.
4.1.1
Note that the local star configurations of the f.c.c. are all of the type of at every point, while that of the h.c.p. has six local stars of the -type and another sextuple of the type. It is a remarkable fact that the mere occurrence of a local star of -type in a Type-I configuration, in fact, already characterizes these two outstanding 6-type ones, namely
Lemma 1**.**
Suppose that is a Type-I configuration with a star of -type. Then is either the f.c.c. or the h.c.p.
Proof.
Let be such a point of with the above kind of star structure. Set to be its boundary vertices in cyclic order, while are antipodal. Then, the complementary region of , namely
[TABLE]
can be represented via the stereographic projection with as the north pole, as indicated in Figure 11,
where the symmetric center of is exactly the south pole and are respectively the antipodal points of .
We shall use spherical trigonometry to analyze the possibilities of placing a quintuple of -separated points inside of . First of all, it is easy to check that
- (i)
, 2. (ii)
or
are such possibilities, while the addition of (i) (resp. (ii)) extends the star configuration to the f.c.c. (resp. h.c.p.) configuration. Thus, the proof of Lemma 1 amounts to showing that they are the only possibilities.
Let be a point on the -circular arc centered at between and . As indicated in Figure 12, are a quadruple of points along such that
[TABLE]
are equal to , namely, the spherical triangles of
[TABLE]
are all -isosceles. Set to be their base angles and to be their base lengths, respectively.
Then, one has
[TABLE]
and moreover,
[TABLE]
and hence
[TABLE]
Set to be the top angle of the isosceles , namely
[TABLE]
It is quite straightforward to use the above set of equations to compute as a function of and to show that when and only when , while it is strictly smaller otherwise. With the above critical analysis at hand, it is quite simple to prove that there are no other possibilities of placing a quintuple of -separated points inside of . ∎
Corollary 4.1.1**.**
Suppose that a Type-I configuration contains a 6-star whose pair of small triangles are separated by a pair of buckled quadrilaterals. Then must be just such a triangulation of the f.c.c. or the h.c.p.
Proof.
If the quadruple radial edges of the pair of small triangles are all equal to the minimal length of , then such a 6-star must be a triangulation of of Lemma 1 and must be either the f.c.c. or the h.c.p. On the other hand, any small amounts of edge-excesses in the four radial edges will make the complementary region of such a 6-star carve away critical strips from that of Lemma 1; even just a single tiny such edge-excess makes its complementary region incapable of accommodating a quintuple of -separated points, contradicting the assumption of being contained in the Type-I configuration. ∎
4.1.2 On the deformations of 6-Type-I configurations (i.e. 6-Type-I configurations)
Let be a small deformation of either the f.c.c. or the h.c.p. within , and be the small deformation of the sextuple -squares of the f.c.c. (or h.c.p.). Set to be the difference between and the smallest angle of and , which will be regarded as a kind of measurement of the size of such a deformation. Note that the area of is at least equal to
[TABLE]
Then, it follows from the area estimate that the edge-lengths of its octuple small triangles must be all equal to modulo . Therefore, up to congruence, the deformation of each small triangle is just a small rotation modulo , and moreover, it is easy to see that the sizes of those rotations of the octuple small triangles are, in fact, equal to each other modulo . Note that such a deformation would be impossible for the h.c.p. because it has adjacent pairs of small triangles. On the other hand, all small deformations of the f.c.c. are of the combinatorial type of icosahedra (cf. Lemma 7.1 of [Hsi] for more precise results on such a ).
4.2 Geometry of non-icosahedron Type-I configurations and -Type-I configurations
Let be a triangulation of a Type-I configuration , namely, by adding one of the pair of diagonals of its quadrilateral faces, if any. Then, by the Euler’s formula, always has twenty triangles, thirty edges and twelve vertices of degrees 4, 5, or 6. We shall, henceforth, call those with uniform degree 5 at each vertex Type-I icosahedra, while the other will be referred to as non-icosahedral Type-I configurations. For example, the f.c.c., the h.c.p. and the 5-Type-I configurations of Example 4.2 all have both icosahedral and non-icosahedral triangulations, depending on the way of choosing diagonal cuttings of the -faces. In fact, if has at least one quadrilateral face, then one (or both) of its diagonal cuttings will cause to become non-icosahedral. Anyhow, we would like to study the geometric structure of those genuine non-icosahedral Type-I configurations.
Suppose that is a given non-icosahedral Type-I configuration which is not such a triangulation of the f.c.c. or the h.c.p. Then it must contain a 6-star not of the kind of Corollary of Lemma 1, say of the other kind, namely with the pair of small triangles adjacent to each other.
4.2.1 On the geometry of complementary regions of those 6-stars of the other kind that are capable of accommodating a quintuple of -separated points
Let us first take a look at some examples of such 6-stars and their complementary regions:
Example 4.1**.**
6-star of :
First of all, it is the only 6-star of such kind with a quadruple of -edges (or with a pair of -equilateral triangles). Secondly, it occurs as the local stars at a sextuple of points in the h.c.p. configuration, as well as that of some vertices of those 5-Type-I configurations with at least a pair of adjacent in their angular distributions at the poles, such as . Again, set the central point of such a -star to be the north pole and using the stereographic projection to represent the complementary region of , as indicated in Figure 13.
Example 4.2**.**
Let be a 5-Type-I with as the angular distribution at the poles which is the assemblage of quintuple lune clusters as indicated in Figure 4. Then the star at is a 6-star as indicated in Figure 14, which will be, henceforth, denoted by
In the starting case of , , (i.e. the same as that of Example 4.1 whose complementary region is just the same as (cf. Figure 13). As (resp. ) gets slightly larger than , it is not difficult to check that the complementary region of will be a smaller subset of that has carved away a corresponding narrow strip along the left (resp. right) side of , as indicated by the dotted lines in Figure 13. Therefore, the geometric possibilities of arranging a quintuple of -separated points in their complementary regions are just subsets of that of , while such arrangements exactly correspond to the possibilities of extending to Type-I configurations. For example, in the case of , the arrangements of (resp. ) correspond to the extension of the h.c.p. (resp. the 5-Type-I with as the angular distributions at the poles). On the other hand, in the case of smallest complementary region of with , it is not difficult to check that such a complementary region only accommodates a unique such a quintuple which corresponds to the extension of 5-Type-I with the angular distribution of . This kind of geometric analysis naturally leads to the proof of the following lemma, namely
4.2.2 Structure of non-icosahedra Type-I configurations
Lemma 2**.**
A non-icosahedral Type-I configuration is either just such a triangulation of the f.c.c. (resp. the h.c.p.), or it is a small deformation of the 5-Type-I configuration, namely, of the kind of 5-Type-I.
Proof.
Let be a non-icosahedral Type-I which is not a triangulation of either the f.c.c. or the h.c.p. By the corollary of Lemma 1, it contains a 6-star which is a small deformation of such as and their small deformations.
(1) Among all possibilities of such 6-stars, is the unique one with quadruple radial edges of the minimal length of which has a larger complementary region (i.e. ) than that of the others. Therefore, one naturally expects that will have more room of variations of arrangements of such quintuples. Thus, let us first analyze the range of variations of such accommodations inside of . Note that is a sharp corner point of . It is quite simple to show that any such arrangements must include a point in a very small vicinity of . Anyhow, it is advantageous to use the polar coordinates with as the pole to analyze the geometry of . Then the triples (resp. ) lie on two longitudes with angular separation of , while the triples (resp. are situated on the latitudes of distance (resp. ) to the pole. First of all, it is easy to see that the arrangement of corresponds to the extension to a 5-Type-I with angular distribution of . Moreover, can also accommodate similar kinds of such quintuples which will produce the extensions to 5-Type-I with angular distribution of . In particular, we shall denote the the quintuple corresponding to that of , , by . Just for the sake of simplicity of presentation, we shall regard the others as deformations of such a specific 5-Type-I configuration, and then proceed to estimate the limitations on the sizes of such deformations. Set , and to be the base length of the -isosceles with as its base angle, i.e. . Let be any quintuple of -separated points inside of . Then the geometric confinement of implies that the angles of (resp. ) are at most equal to 0.1286, and hence the distances between (resp, ) are at least equal to . Therefore, one has the following limitations on the sizes of variations at , namely, the latitudinal differences at each point are at most equal to 0.0073 and the longitudinal differences at are at most equal to 0.065.
(2) Now, let us proceed to study the geometric structures of those 6-stars whose complementary regions can still accommodate a quintuple of -separated points. It is not difficult to see that such a 6-star will be a small deformation of Example 4.2, namely, having a pair of small adjacent triangles and a pair of sharing a short edge, such as those small deformations of Example 4.2. As indicated in Figure 14, we shall denote the boundary vertices of such as 6-star by , in cyclic order, and their corresponding edge lengths by , while are those short edge-lengths (i.e. at most only slightly longer than ). Let us first study the crucial case that the sextuple of boundary edges are all of the minimal length of . For example, in case that , such 6-stars are actually small deformations of of Example 4.2. Set
[TABLE]
to be the small edge-excesses. Then, the same kind of geometric analysis of the complementary region as the above will show that
- (i)
in case , 2. (ii)
in case , 3. (iii)
in general,
Therefore, it follows readily that is necessarily a rather small deformation of a 5-Type-I configuration. ∎
4.2.3 A remarkable characterization of 5-Type-I
Just for the sake of simplicity of terminology in this paper, those Type-I configurations having quintuple of ’s and a pair of small 5-stars will be, henceforth, simply referred to as 5-Type-I’s. Geometrically, they are those small deformations or close deformations of the 5-Type-I, meaning that those somewhat further deformations but still quite close to those 5-Type-I. It is easy to check that such 5-Type-I always have some 5-stars which are small or close deformations of .
The following lemma provides a remarkable characterization of 5-Type-I, namely
Lemma 3**.**
Let be a Type-I configuration which contains a small deformation of . Then must be a 5-Type-I.
Proof.
We may assume without loss of generality of the proof that is an icosahedron (cf. Lemma 2). Conceptually, this can also be regarded as the other case of Lemma 2; and technically, the basic method of the proof here is also quite similar to that of Lemma 2, i.e. the analysis of the geometry of accommodating sextuple of -separated points inside the complementary region of such a small deformation of .
(1) Therefore, one naturally begins the proof by analyzing the simplest but also most crucial case of . Again, set the center of such a star to be the north pole, and use the stereographic projection to represent the complementary region of such a star. It is, as indicated in Figure 13, that of with an additional -sector . Now, in the geometric setting of Type-I icosahedra extensions, we are studying the problem of analyzing the possibilities of its opposite 5-star (i.e. those with the sextuple vertices lying inside of . In this particular case of , the tightest extension, which makes the opposite star the largest, turns out to be , namely, with as its center and as its boundary vertices, while the others are just its small deformations. The key-step to prove the second assertion is to show that any opposite star must also include a pair of vertices in the vicinity of and respectively, using the following simple technique of replacement subset of R in , namely set it to be
[TABLE]
where and are the open -disc centered at and respectively. Then we may assume that the -separated sextuple in includes itself or includes a point outside of it, say as indicated in Figure 13 by or .
Next, it is straightforward to check that, in the case of , , the opposite star of one of its tight extensions is , while the others are just its small deformations.
(2) In summary, the tight extensions of is a 5-Type-I with the angular distribution of at the poles, in particular, it has an opposite pair of 5-stars with no radial edge-excesses and with antipodal centers, and furthermore, with perfect longitudinal alignment. Therefore, in the case of a small deformation of , its tight extension is a small deformation of the 5-Type-I with as the angular distribution, and hence, it has a pair of opposite 5-star with very small amount of radial edge-excesses, almost antipodal centers and small amount of angular non-alignments, in short it is a 5-Type-I. This proves that is also a -Type-I. ∎
4.3 On the geometry of Type-I icosahedra
Recall that a Type-I configuration whose twelve stars are all of 5-type will be called a Type-I icosahedron. Let us first review some generalities on the geometry of Type-I icosahedra.
4.3.1 Some generalities
(i) Lemma 2 shows that Type-I non-icosahedra are, necessarily, rather small deformations of 5-Type-I. Therefore, the great majority of Type-I configurations are icosahedra, which also include the f.c.c., the h.c.p. and those 5-Type-I (i.e. with icosahedral triangulations.)
(ii) Let be a given Type-I icosahedron and be its twelve 5-stars, be the area of . Then
[TABLE]
because every triangle of belongs to the triple of at its vertices, thus having
[TABLE]
namely, the averaged value of the twelve areas of the stars of a Type-I icosahedron is equal to . However, the area distribution of Type-I icosahedra can be quite different for different kinds of icosahedra. For example, both the f.c.c. and the h.c.p. icosahedra have uniform area distributions of for their twelve stars; those Type-I icosahedra in the close vicinity of the f.c.c. also have almost uniform area distributions with areas close to for their twelve stars. However, those 5-Type-I icosahedra always have a pair of stars with close to the minimal area of 5-star (i.e. at the poles) and ten stars of areas at least equal to .
(iii) Complementary regions of 5-stars and the opposite 5-star of a given 5-star in : Let be a given Type-I icosahedron and be one of its 5-stars. Then, the complementary region of can be defined just the same way as in the case of 6-stars and will again be denoted by . For a given Type-I icosahedron , to every 5-star, say , there is a unique opposite star, say , whose vertices are situated inside of which will be, henceforth, referred to as the opposite star of . For example, the pair of small 5-stars of a 5-Type-I (resp. 5-Type-I) icosahedra are opposite stars of each other.
Example 4.3**.**
Set the label of vertices of a given 5-Type-I icosahedron to be with at the poles and having the same longitudes. Then and are opposite stars of each other. Set to be the angular distribution of . Then consists of a pair of small triangles and a triple of half rectangles, namely -isosceles with top angles of and , 2 and another . Therefore, their areas are at least equal to and equal to when and only when . We shall denote such a 5-star by . Then, the complementary region of is given by that of with an additional circular sector. For example, the complementary region of is as indicated in Figure 13.
(iv) Small 5-stars and their complementary regions: Let be the radial edge lengths in excess of of a given 5-star, the total amount of such excesses will be, henceforth, referred to as the total radial edge-excess of the given 5-star and will be denoted by . Just for the sake of definitiveness in presentation, we introduce the following quantitative definition of small 5-stars, namely
Definition**.**
A 5-star is defined to be a small 5-star if its total radial edge-excess is at most equal to 0.104.
Example 4.4**.**
Among all 5-stars of a given total radial edge-excess, say , the following specific one will have the minimal area, as well as the maximum averaged density, namely, its radial edge-excess is concentrated in one and its boundary edge-excess is also concentrated in one of its -isosceles.
4.4 Geometry of Type-I icosahedra with rather lopsided area distributions of stars
Let be the area distribution in a given Type-I icosahedron . Note that the averaged value of such an area distribution is always equal to , while its individual values can be as small as () and as large as ( + 0.822). Geometrically, 5-stars with rather small total amount of radial edge-excesses, say denoted by , are those stars with areas quite close to (). In this subsection, we shall call those 5-stars with at most equal to 0.104 small stars, and on the other hand, call those with their areas exceeding ( + 0.6) stars with rather large areas; while Type-I icosahedra with both small stars and stars with rather large areas will be referred to as Type-I icosahedra with rather lopsided area distributions. Let us begin with some specific examples of such icosahedra.
4.4.1 Tight Extensions of small stars
Let be a small star centered at the north pole . Set (resp. , and ) to be its boundary vertices (resp. radial edge-lengths, central angles and boundary edge-lengths) and to be the vertices of its complementary region . Set to be the radial edge-excesses. Note that
[TABLE]
will ensure that are always -separated, and hence, one may extend such a to Type-I icosahedra simply by adding an opposite star with as the boundary vertices, while its center can vary in the vicinity of the south pole. Anyhow, we shall call such extensions tight extensions of . It is easy to see that such extensions achieve the maximal area of the opposite star of which will be henceforth referred as the tight extensions of the small star whose geometric structures are uniquely determined, modulo the positions of their centers, and in particular, the geometric structure of the subconfiguration of the union of the sextuplet of stars centered at .
Geometrically, the set of (resp. ) already constitutes a complete set of geometric invariants of , namely, that of the SAS (resp. SSS) type, while following quintuples of lengths, namely
[TABLE]
are the collection of lengths besides those edges of such subconfigurations of its tight extensions. For example, the additional ten spherical triangles of such a subconfiguration consists of quintuples of -isosceles with (resp. ) as their base-lengths, while the weighted average of is also equal to the further weighted average of that of the above quintuples of -isosceles (i.e. with and as their base-lengths) and with weights . The first step of analyzing geometric invariants of such a tight extension is, of course, to express in terms of the SAS data of , which has the additional advantage of . Anyhow, it is quite straightforward to first compute by the cosine law and then use the determinant formula for quadrilateral relations (cf section 3.2.3) to compute and . We only include here the following rather specific simple examples.
Example 4.5**.**
In the special “starting” case of 5-stars with uniform -radial edges and with as the center of its opposite star, the above computations become much simpler than otherwise, namely
[TABLE]
Based upon the above basic invariants of SSS type for triangles of of the tight extension of such a special 5-star, it is straightforward to compute other geometric invariants of such an , in particular the () as an explicit function of the angular distribution . Furthermore, it is not difficult to show that such a function of will be minimal in the case of uniform angular distribution and maximal in the case of most lopsided angular distributions.
Example 4.6**.**
Let be a small star whose radial (resp. boundary) edge-excesses are concentrated in a single radial edge(resp. in the base of an -isosceles) such as the St() indicated in Figure 15-(ii) with or the other assemblage of the same quintuple of triangles.
Note that there are only small differences between the tight extension of , and that of the other, while their are almost the same. Therefore, we shall only exhibit that of the former and compute its as a function of in the following. Set to be the only radial edge with excess. Then, one has of given as follows
[TABLE]
and moreover,
[TABLE]
Set to be the top angle of the -isosceles with as the base-length. Then,
[TABLE]
while
[TABLE]
Now, with the above set of basic geometric invariants of the tight extension of , , at hand, and assuming to be the center of its opposite star, it is straightforward to compute its as a function of , thus having computer graphic to such a function as indicated in Figure 16.
Example 4.7**.**
Another specific type of Type-I icosahedra with an opposite pair of small stars and accommodating a large star with higher
Let be an icosahedron containing a star with minimal area, say , (i.e. with uniform -radial edges and the central angular distribution of quadruple and ), while its opposite star, say , is a small star consisting a pair of -equilateral quadrilaterals and another -isosceles and moreover, the triple of its vertices “opposite” to the base edge of its -isosceles are situated on the boundary of the complementary region of . As indicated in Figure 17, the geometry of such a Type-I icosahedron can be advantageously analyzed in the setting of the kind of spherical cartesian representation with the equator corresponding to as the -axis, better exhibiting the geometry of the collar region connecting the pair of small stars, i.e. the complementary of :
As indicated in Figure 17, all those unmarked segments of the collar region of are of length , which consists of quintuple of with as their cutting diagonals respectively. In particular and are -equilaterals, whose geometry is determined by just one of their angles. Note that the outer angles of the collar region at are all equal to while that of and are given by
[TABLE]
Set . Then the angles of (resp. ) at are given by (resp. ), . Therefore the angles of (resp. ) at are given by
[TABLE]
Therefore, the outer angle at is given by
[TABLE]
for the range . It is easy to check that is a monotonically increasing function with , which is just the of the pair of angles of at .
Note that the geometry of such an is completely determined by the chosen of , while , and hence, by the chosen parameter of and . Therefore, from now on, we shall specify such an Type-I icosahedron as and proceed to compute those edge lengths of the collar region other than the -ones.
- (i)
First of all, one has
[TABLE]
And the outer angles of the collar region at , say denoted by (), are simply given as follows:
[TABLE]
where
[TABLE]
and
[TABLE]
- (ii)
Next let us compute (resp. ) which are respectively the base lengths of the -isosceles and whose top angles are given by
[TABLE]
- (iii)
Finally, it is straightforward to compute the remaining edge lengths of , namely, by applying cosine law to , and .
With such explicit formulas of the geometric structure of at hand, it is straightforward to use computer to provide numerical computations of the following two averaged densities, namely
[TABLE]
as well as , thus obtaining their graphs and that of the weighted ratio between and with weights given by , exhibited in Figure 18.
5 Techniques of collective area-wise estimates for clusters of triangles
In this section, we shall develop some pertinent techniques of collective areawise estimates for the weighted averages of over suitable clusters of triangles, such as -pairs, lune clusters or star clusters, which will provide the kind of powerful analytical tools to achieve a clean cut proof of Theorem I for the major case of Type-I local packings jointly with the geometric insights of §4.
Let be a Type-I configuration and be the tightest Type-I local packing with as the touching points of the twelve touching neighbors. Then, the locally averaged density is defined to be the specific weighted average of the triangular invariants of , while the proof of Theorem I for the most important case of Type-I local packings amount to proof that has the f.c.c. and the h.c.p. as the unique two maximum points with .
Geometrically, the f.c.c. and the h.c.p. are the two outstanding singular points and those 5-Type-I’s constitutes an outstanding singular subvariety in the moduli space , while the h.c.p. is an isolated point, the f.c.c. is a cusp point with 1-dimensional tangent cone and with a substantial neighborhood consisting of 6-Type-I’s; and moreover the 5-Type-I’s have a quite extensive neighborhood consisting of 5-Type-I’s. From the point of view of optimal estimation of , the 6-Type-I’s and the 5-Type-I’s are outstanding because, as it turns out, their are in fact higher than that of the others.
5.1 Area-wise estimates for the proof of Theorem I in the subsets of 6-Type-I’s and 5-Type-I’s
(1) The Estimate of () for 6-Type-I’s
A 6-Type-I configuration consists of octuple of small triangles and sextuple of , it follows readily from the ()-analysis of §3.3.2 that the weighted average of () for the two halves of a is always substantially less than that of () (by a linear factor of the angular decrement of ), while that of the small triangles are, of course, less than (). Therefore, () of a 6-Type-I’s is always less than . in fact, by a decrement of linear proportion to the deformation. Thus, the f.c.c. is a local maximum of cusp-type of the ()-function.
(2) The Estimate of () for 5-Type-I’s
First of all, let us provide an optimal estimate of the restriction of () to the subvariety of 5-Type-I’s , which is a function of the angular distributions at the poles. A 5-Type-I is the assemblage of quintuple of lune clusters , . Set
[TABLE]
Then it is straightforward to check that , is a convex function of Area(). Therefore is minimal in the case of uniform angular distribution (i.e. ) on the one hand and on the other hand it is maximal in the case of the most lopsided angular distribution, namely, with quadruple of and , where is equal to .
Next, let us consider the upper bound estimate of for those small or even just close deformations of 5-Type-I’s, namely, those 5-Type-I’s, which are, again, assemblages of lune clusters, each of them consisting of a and a pair of small triangles, say denoted by . Set
[TABLE]
Then, a direct application of the ()-estimates to the pairs of small triangles and will again provide the following upper bound estimate , namely
[TABLE]
and equality holds only when (resp. ). Therefore, the upper bound estimate of for 5-Type-I’s will also be the upper bound estimate of for 5-Type-I’s.
5.2 Area-wise estimates for 5-star clusters of Type-I icosahedra
Note that, except some of those small deformations of 5-Type-I, all the others are Type-I icosahedra. Therefore, the remaining cases of the proof of Theorem I for Type-I local packings are that of Type-I icosahedra, whose can be expressed as the following weighted average of .
[TABLE]
where
[TABLE]
Therefore, one naturally expects that collective area-wise estimates of for 5-stars of Type-I icosahedra will be a powerful technique for such a proof. Anyhow, this naturally leads to the discovery of Lemma 4 and Lemma 4*′*. This will be the major topic of this section.
5.2.1 Extremal Triangles and extremal stars
In the study of the problem of collective areawise estimates for stars in a Type-I icosahedra, it is natural to embark our journey by the exploration of possible candidates of “extremal stars”, achieving the optimal for such stars with a given total area . Recall that the clean-cut organization of area preserving deformations of spherical triangles and the ()-analysis of §3.3.2 not only show that the area-wise maximalities of (resp. ) for those extremal shapes (i.e. for areas and for areas up to 0.92), but also provide effective estimates for others in terms of ()-shape invariants. Anyhow, this individual estimates naturally leads to the construction (or rather, discovery) of the following examples, namely
Example 5.1**.**
Both and , as indicated in Figure 15, are given by the following explicit formulas as functions of , namely
[TABLE]
where (resp. ) are given by (3.30) with explicit functions of (resp. ).
Example 5.2**.**
Both and are given by the following explicit formulas as functions of , namely
[TABLE]
where (resp. ) are given by (3.30) (resp. (3.31)).
Lemma 4**.**
Let be a 5-star with edge lengths of at least , containing their circumcenters and (resp. and at most equal to ). Set (resp. ) to be the one as indicated in Figure 15-(ii) (resp. (i)) with the same area of . Then
[TABLE]
and equality holds when and only when consists of the same collection of quintuple triangles as that of (resp. ).
Remarks**.**
- (i)
It is not difficult to check that are realizable as stars of Type-I icosahedra only for . 2. (ii)
Lemma 4 proves that those realizable ones of Examples 5.1 and 5.2 are indeed extremal 5-stars. 3. (iii)
Thus, straightforward parametric graphing based upon the above set of formulas with as the parameter produces the graphs of (resp. as functions of (resp. ), as indicated by the two branches of graphs in Figure 19, matching at the distinguish peak point of (). One of the main topic of this section will be the proof of this important lemma.
5.2.2 Generalities on the geometry of such 5-stars and some basic geometric ideas
Let be such a 5-star, (resp. ), , be its quintuples of radial edges (resp. central angles, boundary edges) in cyclic orders. Thus already constitutes the S.A.S. congruence invariants of , and hence together with already constitute a complete set of congruence invariants for such a 5-star, while others such as and etc. can all be computed in terms of (resp. and ) in excess of (resp. and ), the set of radial edge-excesses (resp. boundary edge-excesses and area excesses) of such a , while the correlations between the above triple sets of excesses as well as their manifestations both on densities, weights and their weighted average will be the crucial geometric understandings for the proof of Lemma 4.
Roughly and intuitively speaking, one naturally expects that the collective area-wise optimality of should be the result of suitable combination of individual area-wise optima (i.e. and ). Set to be the function of area which records the maximal density for each given . As indicated in Figure 5, its graph has a prominent peak at with the value of , which subdivides the function into two branches of convex functions, namely, that of and .
Next let us analyze some of the outstanding geometric features of (resp. ), henceforth referred as extremal stars. First of all, each of them is an assemblage of those extremal triangles which makes the specific weighted average of with respect to its area distribution as high as possible. For example, in the special but also the most critical case of (i.e. equal to the average value of twelve stars), the area distribution of consists of a pair of and a triple of with as their (cf. §5.2.3). Anyhow, this makes the proof of Lemma 4 for the special case of rather simple.
Moreover, it is not difficult to see that (resp. ) have the most lopsided distributions both in their radial edge-excesses and their boundary edge-excesses. For example, both of them are concentrate in one for .
Some simple area preserving deformations of 5-stars
We mention here two kinds of simple area preserving deformations of star configurations, which will be helpful for area-wise estimation of geometric invariants of stars in a way quite similar to that of single triangle in area-wise estimates of individual spherical triangle.
- (i)
For certain star configurations, the center-vertex still has some kind of degree of freedom to shift. Then, the center shifting will, of course, leave the total area of such a star unchanged, while it is quite straightforward to compute the gradient vector of given geometric invariants such as and of this kind of area preserving deformations. 2. (ii)
Lexell’s deformation and a simple kind of area preserving deformation of star configurations
As indicated in Figure 20, let be such a 5-star with and longer than . Then one may use the Lexell’s deformation of fixing to deform it to another 5-star preserving the total area, up until one of becomes .
Using the parameter representation of §3.3.5 for Lexell deformations, it is quite straightforward to apply Taylor’s approximation up to second order for or other kind geometric invariants of star configurations to determine their minima (resp. maxima) for the intended range. Anyhow, such simple area preserving deformations often provide useful reduction for area-wise estimates of geometric invariants of stars such as the proof of Lemma 4.
5.2.3 A kind of measurement on the lopsidedness of area-distributions of a 5-star
Just for the purpose of facilitating the proof of Lemma 4, we shall introduce the following measurement and terminology:
Set to be the function of area which records the weight of those extremal triangles as a function of their areas, namely
[TABLE]
Let , be the area distribution of a given . Set
[TABLE]
which will be regarded as the quantitative measurement of the lopsidedness of .
Remarks**.**
- (i)
in case that the quintuple of of are all of or type, . 2. (ii)
Note that (resp. ) not only have the maximal density (i.e. ) among such triangles with the same area, but they also have the maximal . 3. (iii)
The proof of Lemma 4 will be divided in two cases according to is at most equal to that of (resp. ) or otherwise, namely
Case A: (resp. )
Case B: (resp. )
On the Correlation between and
Let be such a given 5 -star with as its quintuple of triangles, be its area distribution and (resp. ) be the extremal 5-star with the same area of , namely (resp. ) equal to (resp. ). Set and regard it as a kind of measurement of the lopsidedness of the area distribution of . Note that (resp. ) is exactly equal to (resp. ), and
[TABLE]
while equalities hold when and only when is extremal, i.e. either or for a certain depending on whether is at most equal to or exceeding .
One has the following remarkable correlation between and , namely
[TABLE]
Proof.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
and hence, (5.12) follows from (5.14). ∎
Sublemma: Each summand of the RHS of (5.12) is always non-negative and it is equal to zero when and only when is extremal.
Proof.
By (5.9), if is extremal, then the -th summand of the RHS of (5.12) is equal to zero because both of its two terms are equal to zero. On the other hand, if is non-extremal, then both () and () are positive. Thus, the -th summand will be obviously positive in the case that () is non-negative. On the other hand, in the case that () is negative, is always many times larger than . Hence, it is not difficult to show that the -th summand is, again, positive. ∎
Corollary 5.2.1**.**
If is at most equal to that of the extremal star with the same area, then is at most equal to that of the extremal star, and it is equal to when and only when consists of the same collection of triangles as that the extremal star (cf. Examples 5.1 and 5.2).
Remarks**.**
- (i)
Suppose that is larger than that of the extremal star with the same area. Then must contain at least one non-extremal triangle, because it is quite simple to check that the assemblages of extremal triangles of those extremal stars are the only such assemblages of extremal triangles with the highest for the same area. 2. (ii)
The above corollary already reduces the proof of Lemma 4 to the case that exceeds that of the extremal star with the same area, which amounts to provide a lower bound estimate of the RHS of (5.12) exceeding
[TABLE]
5.2.4 The proof of Lemma 4
Proof.
In this subsection, we shall proceed to provide a proof of Lemma 4, based upon the -analysis and the optimal estimates of (resp. ) of §3.3, together with the geometric insights of §5.2.2 and §5.2.3. For example, the sublemma of §5.2.3 shows that provides a simple and advantageous upper bound of , which will be optimal when and only when the quintuple of are all extremal ones. In particular, such a simple upper bound solely in terms of area distribution already proves Lemma 4 for the case are at most equal to that of the extremal star with the same total area . Just for this subsection, we shall denote an extremal star by and set
[TABLE]
Note that, for a given value of the total area , the area distribution of the extremal star of total area is already rather lopsided (i.e. with rather high among that of stars with the same ). Here, the very particular shapes of the graphs of (resp. ) will, of course, play an important role in the proof of Lemma 4, especially, the peak of in the vicinity of . Mainly for this reason we shall subdivide the proof of Lemma 4 into the following cases, namely
Case 1: and Case 2:
The Proof of Case 1: ()
First of all, in the range of lying between () and (), the area distribution of have a pair of very close to the minimum together with a triple of rather close to . Due to the special features of and , it is not difficult to see that is almost as large as possible for with the same , and moreover, any other of total area with at least equal to (or even slightly smaller than ) must be consisting of such a pair and a triple of , such as . Furthermore, the -analysis of shows that those with only slightly smaller than can only be a quite small deformation of the extremal triangle with the same area.
Now, suppose that is such a 5-star of Type-I icosahedron with . Then, it follows from (5.12) that the triple of with in the vicinity of must be all just small deformations of extremal triangles of the same area. Therefore must be a small deformation of , or the other way of assemblage in the case of , and hence must be, in fact, equal to (i.e. of total area ).
This proves Case 1.
The Proof of Case 2: ()
For 5-stars with less than , the extremal stars are or its other way of assemblage; their radial (resp. boundary) edge-excesses are concentrated in a single radial (resp. boundary) edge, while its quintuple of triangles consists of a pair of , a pair of and another ,
[TABLE]
Note that as and are always larger than , while their difference () is steadily decreasing from () and becoming equal to zero at . Anyhow, let us start with the proof of the seemingly rather special but actually more critical subcase that the radial edge excesses are concentrated in a single radial edge.
(1) The proof of the subcase 1 of only one radial edge excesses, i.e. with quadruple -radial edges.
Set to be the radial edge of such a which may be longer than . First of all, the very special beginning case of follows easily from the convexity of , , as a function of . Therefore, we shall from now on assume that and . Let be the extremal star with total area . Then it is easy to show that
[TABLE]
and equality holds when and only when contains two , namely, is itself extremal. Again, by the convexity of as a function of , it is easy to see that , for a given value of less than , the more critical geometrical situations are those with containing a pair of ; and moreover, by the application of the Lexell’s deformations of such stars, starting at the one with up until that one of become , it is straightforward to check that such a Lexell’s deformation is density increasing, thus reducing to the study of the following specific family of stars, namely, with quadruple of extremal triangles including a pair of and one -isosceles with the only larger radial edge as its base, say as indicated in Figure 21.
Geometrically, such a is a specific kind of area preserving deformation of with to be the only non-extremal which makes longer and smaller than . These kind of stars are completely determined by and whose area and can be easily computed by explicit formula in terms of and , say denoted by and . Therefore, straightforward computation by the methods of implicit differentiation will show that the initial value of , i.e. , is the maximal value of along the above kind of area preserving deformations. This proves Lemma 4 for such a subcase.
(2) The proof of the subcase 2 of two non-zero radial edge-excesses:
Let be such a star with exceeding . Then, one of the non-zero radial excesses must be very small. For otherwise, it is easy to show that is at most equal to . Therefore, such a star can be regarded as a very small deformation of that of subcase 1, and moreover, it is straightforward to check that such small deformations are density decreasing, thus proving less than .
(3) The proof of remaining cases:
We may assume without loss of generality of the proof that such a has a triple of non-zero radial edge-excesses. For otherwise, we may simply replace it by a density increasing deformation of shifting its center. And moreover, the smallest non-zero radial edge-excesses must be very small. For otherwise, will be at most . Therefore, such a star is, again, just a very small density decreasing deformation of a star of subcase 2, and hence .
This completes the proof of Lemma 4. ∎
5.2.5 Extension of Lemmas 3 and 4 to -stars of areas exceeding : Lemma 4′
- (1)
Note that both Lemma 3 and Lemma 4 essentially only cover the case of -stars in Type-I icosahedra with areas at most equal to . In this subsection, we shall proceed to study their extensions beyond that upper limit. The main result here will be stated as Lemma 4′. Recall that, for the range of areas lying between and , Lemma 4 proves that the optimal stars (i.e. the ones with optimal for such a given area ) are uniquely realized by , ; while Lemma 3 proves that those Type-I configurations containing small deformations of such extremal stars are necessarily of -type, namely, small deformations of -Type-I. In particular, the -Type-I with angular distribution of quadruple and is, in fact, the unique one containing . Actually, if one ignores the realizability condition, it is not difficult to show that , , will also be the unique extremal star for areas considerably larger than . However, the problem here is that they are no longer realizable as stars of Type-I configurations. Therefore, the geometric problem here is a kind of tight combination of optimality and realizability and what we are seeking should be a kind of joint generalization of Lemmas 3 and 4 for the case of areas larger than .
- (2)
Let us begin our study in the case of areas only slightly larger than that of , say up to . Intuitively, one expects that such stars (i.e. realizable and with almost optimal for such a given area) should have quite similar geometric structures as that of , namely, with a pair of small almost extremal triangles and a triple of small deformations of half rectangles. Moreover, we may assume without loss of generality that such a star has at least a pair of -radial edges, for otherwise, one may simply reduce to such a star by a density increasing deformation of center-shifting. Therefore, such a star should have an adjacent pair of small triangles, each with at least one -radial edge, together with a triple of small deformations of half rectangles, or more precisely, one small deformation of half rectangle and another , in order to minimizing their -decrements to achieve higher among those realizable ones. Note that the realizability condition of such a star can be conveniently reformulated as follows, namely, there exists a star including the pair of small triangles, such as in Example 4.7, whose complementary region accommodates another star adjacent to the one we constructed. Anyhow, this is the basic geometric idea that naturally leads to the construction of Example 4.7 as well as the formulation of the following lemma, namely
Lemma 4′: Let be a Type-I icosahedron containing a star with at least almost as high as the optimal one of the family of Example 4.7 with the same area. Then must be a small deformation of some of Example 4.7 containing a star of the same area with optimal .
Proof.
(1) First of all, we may assume without loss of generality for the purpose of proof, that such a star in contains at least a pair of -radial edges, simply by a reduction of density increasing center shifting. Thus, we shall first divide the proof of this lemma into two major cases, namely
Case I: Such a star contains an adjacent pair of -radial edges.
Case II: Such a star only contains a separate pair of -radial edges.
We shall denote the pair of small triangles of such a star by and , and in the case of I, is the one with both of its radial edges equal to (or the smaller one if such a star containing a triple of -radial edges), while the boundary edges of will be denoted by .
Moreover, we shall further subdivide Case I into the following subcases, namely
Case I0: are the same as that of Example 4.7, namely and , ,
Case I1: but is not ,
Case I2: At least one of is longer than .
(2) The proof of Case I0:
As it turns out, the proof of this very special subcase is the most critical and revealing on the geometric insight of how the areawise optimality and realizability interplay with each other. Suppose that is such a Type-I icosahedron and is the larger star with at least almost as high as that of the family of Example 4.7 with the same area.
- (i)
Let us first analyze the geometric situation that the star containing , say again denoted by , is the same as that of Example 4.7, namely, with uniform -radial edges and the most lopsided angular distribution. Then the opposite star of in , say again denoted by , will, of course, be a star whose vertices, say again denoted by , are inside of its complementary region, as indicated in Figure 17 by the region below those -circular arcs tangent to at , which can be represented via the stereographic projection as indicated in Figure 22.
Note that the geometry of in is already determined by the positions of and , while that of inside of mainly providing an example of realization, as far as and is concerned. Therefore, one may choose some special kind of just for the purpose of proving Case I0. In particular, it is not difficult to show that there exists such a with situated on and . Set to be the parameter determined by as in Figure 22 (cf. Example 4.7). For the purpose of proving this lemma, we shall study the following special kind of extension problem, namely, for a given position of , inside of in the vicinity of , what are those extensions of an with rather large and of areawise higher .
Set to be the top angle of the -isosceles and to be its base angle, namely
[TABLE]
If is at most equal to , then there exist one of Example 4.7 with , , as its vertices, while such an must have its outside of that of the one of Example 4.7, if it is not coincide with its vertex. Moreover, we may assume without loss of generality for the proof that . For otherwise, we may perform an density increasing Lexell’s deformation to to reduce to such a , namely, and one of Example 4.7 have identical , while the only difference is that its is slightly larger than that of the comparing one of Example 4.7. Therefore, it is quite straightforward to check that its is, in fact, smaller than that of in some of Example 4.7 with the same area.
- (ii)
Next, let us consider the other possibility that exceeds . Then the kind of tight extensions of Example 4.7 with as their vertices are no longer available. Set to be the -circle passing through and to be the point on in the vicinity of . It is easy to check that and we may assume that is actually the center of . Therefore it is again straightforward to check that of is again smaller than that one of Example 4.7 with the same area. This proves the subcase of Case I0 that in has uniform -radial edges.
- (iii)
Now, let us consider the remaining subcase of Case I0 that contains some radial edges longer than . First of all, such radial edge-excesses, even just a very small total amount will already make the complementary region considerably smaller than that of the previous subcase, thus making some of those critical extensions of with the same no longer available. Therefore, it is not difficult to follow the same kind of geometric analysis to provide the kind of areawise estimates of as that of the previous subcases.
(3) The proof of Case I1
This case is naturally consisting of two subcases according to the angle of at is less than or larger than .
- (i)
Subcase 1: , :
Let us first consider the geometric analysis for the situation of that again has no radial edge-excesses. Then, the same kind geometric construction similar to that of Example 4.7, in particular to the specific case that the central angles of consisting of , say denoted by Example .
Note that one of the critical fact for such an is that in is shorter than that of Case I0 (i.e. ). Therefore, of in such of Example will always be lower than that of Example 4.7 with the same area. Anyhow, the same kind of comparison areawise estimate of will always provide a smaller upper bound on .
- (ii)
Subcase 2: , :
In this subcase, the in such an is necessarily to have some radial edge-excesses depending on the size of . Therefore, on the one hand, the radial edge in is now longer than , which is advantageous for producing larger , but on the other hand the minimal amount of total radial edge-excesses of makes the size of its complementary region substantially smaller than that of the minimal star in Case I0, which is rather disadvantageous for producing larger with higher density. Anyhow, the same kind of geometric analysis as that of Case I0 will show that the disadvantage far outweighing the advantage. Therefore, essentially the same kind of comparison upper bound areawise estimate will again show that of in this subcase will always be lower than that of the Case I0.
(4) The proofs of Case I2 as well as Case II
Let us first discuss the geometric situation of Case I2. Note that are the build in radial edges of . Therefore in this case, always has some radial edge-excesses. On the other case of Case II, has an adjacent pair of -boundary edges. Thus, unless in the very special geometric situation of which has already been included in Case I1, the of such an of Case II always has both the disadvantage of having radial edge-excesses and . Anyhow, it is quite clear that the same kind of proof for the previous cases will also provide simple proofs of Case I2 and Case II.
This completes the proof of Lemma 4′.
∎
Remarks**.**
- (i)
For the purpose of proving Theorem I for Type-I local packings, Lemma 4′ already provides an easy to use upper bound areawise estimate for of -stars of areas up to , namely, that of those of of Examples 4.7. 2. (ii)
The proof of Lemma 4′ also shows that, for those realizable -stars with areas exceeding , they can not be the assemblages of a pair of almost extremal small triangles and a triple of small deformations of half rectangles, because such -stars with areas exceeding are no longer realizable. Therefore, it follows from the -estimates that of those realizable -stars with areas larger than must be considerably lower than that of of Examples 4.7. In particular, it is not difficult to show that the following areawise slope of , namely
[TABLE]
must be smaller than that of in Examples 4.7 with maximal area.
6 The proof of Theorem I for the case of Type-I local packings
Geometrically speaking, the moduli space of congruence classes of Type-I configurations, say denoted by , constitutes a 21-dimensional semi-algebraic set on which the specific geometric invariant is defined to be the following weighted average
[TABLE]
First of all, the pair of 6-ones (i.e. the f.c.c. and the h.c.p.) and the family of 5-ones constitute two particular kinds of outstanding examples, namely, a pair of singular points and a 4-dimensional singular subvariety with of the f.c.c. and the h.c.p. equal to while that of the latter are slightly lower than . Of course, one of the critical, important steps in the proof of Theorem I for the Type-I case should, naturally, be the establishment of suitable neighborhoods of the 6-ones (resp. the 5-ones) in that they are indeed the unique maxima (resp. a kind of ridge subvariety) in such neighborhoods, which consists of those -Type-I’s (resp. -Type-I’s).
Thus, we shall divide the proof of Theorem I for the case of Type-I local packings into the following three subcases, namely {labeling}cases
-Type-I’s.
-Type-I’s.
Others
while the proofs of Case I1 (resp. Case I2) have already been given in §5.1 (resp. §5.2) as a kind of demonstrations of the advantages of collective areawise estimates for -clusters and lune-clusters. Therefore it suffices to prove the remaining case of Case I3.
The proof of Case I3:
Note that all non-icosahedra Type-I’s are belonging to Cases I1 or I2 and hence Case I3 only consists of Type-I icosahedra. Therefore, one has
[TABLE]
and hence, it suffices to prove that
[TABLE]
by a direct application of Lemmas 4 and 4′.
For a given Type-I -star with ), set
[TABLE]
which will be referred to as the weighted slope of . It follows from Lemma 4, Lemma 4′ and the remarks following Lemma 4′, one has the following upper bound of , namely
(i) for ,
[TABLE]
(ii) for ,
[TABLE]
and moreover, .
Applying the above simple estimates to the LHS of (6.3), one has the following estimates of its positive terms (resp. its negative terms) as follows, namely
- (i)
The sum of positive terms is bounded above by
[TABLE]
- (ii)
The sum of negative terms is bounded above by
[TABLE]
On the other hand, it is quite obvious that the total area defects is equal to the total area excesses because the average of areas of the twelve stars is always equal to . Therefore, the proof of Case I3 follows simply from the above straightforward estimate, thanks to Lemma 4 and Lemma 4′.
This completes the proof of Theorem I for the case of Type-I local packings.
7 The proof of Theorem I for Case II: non-Type-I local packings
In this section, we shall proceed to complete the proof of Theorem I for those local packings other than those Type-I local packings, which will be simply referred to as Case II.
7.1 Some pertinent generalities and a brief overview
First of all, non-Type-I local packings consist of all those local packings which have at most eleven touching neighbors, thus encompassing great amount of probabilities and varieties. However, what we’re are going to prove will be that their locally averaged density, , are always less than ! Of course, such a proof should be quite simple for those such as local packings with neighbors all of them having certain amount of buckling heights. Therefore, the real challenge in the proof of Case II is not just to have a proof, but rather, to achieve a clean-cut simple proof. Anyhow, let us begin here with some pertinent generalities.
7.1.1 Some generalities on non-Type-I local packings
(1) A concept of close neighbors and reduced local packings
Technically, it is quite clear that almost touching neighbors and touching neighbors will play just the same role in their effect towards the or . Intuitively, the subset of a given local packing consisting those neighbors with small buckling heights, , will play a major role in the estimation of . This motivates the following definition of close neighbors and reduced local packings.
Definition**.**
A neighbor with at most equal to 0.07 will be called a close neighbor; the sub-local packing of consisting of its close neighbors will be henceforth referred to as the reduced sub-packing of and denoted by
Remarks**.**
- (i)
Of course, there are local packings with no close neighbor at all; such local packings will have no reduced sub-packing, but it should be very easy to estimate their to be much less than . 2. (ii)
For the purpose of proving Case II of Theorem I, we may assume without loss of generalities of the proof that , actually, have quite a few close neighbors, and we shall regard as an extension of its reduced sub-packing, i.e. . 3. (iii)
A given reduced local packing will be called a saturated reduced local packing if it can not be extended to another local packing with additional close neighbors. Let be a saturated reduced local packing, an extension will be referred to as a tightest extension of if already achieves the maximality among all possible extensions of and is defined to be the of its tightest extensions.
(2) Some pertinent generalities
For a given saturated reduced local packing , set
[TABLE]
to be the direction profile of and the set of buckling heights of , namely
[TABLE]
Then, will be referred as the associated spherical configuration of .
Let be a given saturated reduced local packing and be one of its tightest extension and (resp. ) be the directional profiles of (resp. ), (resp. ) be their associated spherical configurations. Set to be that of , if any, and to be the connected components of the sub-graph in spanned by . To each , if any, the union of those stars of will be referred to as the big hole in associated to , namely
[TABLE]
while will be referred to as the base of the mountain range with those peaks in , which is, of course, union of triangles in . Just for the convenience of areawise estimation, we shall assign the densities of those triangles, say , , all equal to the weighted average of that of faces of inside of . Then, it is not difficult to apply the -analysis to provide areawise estimates for such clusters of triangles of , thus proving a kind of averaged areawise optimal estimate, namely, bounded above by that of the special case in which the buckling heights at those vertices of are all equal to [math]. For example, in the simplest case of quadrilateral big holes, their averaged densities are bounded above by that of the extremal triangles with half of their area.
7.1.2 The basic ideas and crucial steps of the proof of Theorem I for the remaining case of non-Type-I local packings (i.e. Case II)
In fact, the major reason that makes a simple clean-cut proof of Theorem I for Case II, at all, possible is the simple idea of using strategy of proof by contradiction, namely, proving the non-existence of a non-Type-I local packing with . Moreover, the following are two crucial steps along the journey of such a proof by contradiction.
Proposition 7.1**.**
Let be a non-Type-I local packing with . Then contains at least twelve close neighbors.
Proposition 7.2**.**
Let be a non-Type-I local packing with . Then is an icosahedron.
Therefore, we shall begin with the proofs of Proposition 7.1 and Proposition 7.2 again by the method of proof by contradiction.
(1) The proof of Proposition 7.1
Suppose that is a (non-Type-I) local packing with at most eleven close neighbors. Let us first consider the most critical case that it actually has eleven close neighbors. Then consists of eighteen triangles, say . Note that the average of is always equal to which is already slightly larger than . Therefore, it is quite straightforward to apply the triangular areawise estimate to show that of such an must be considerably smaller than , contradicting to the assumption that . Moreover, it is quite clear that the same kind of triangular areawise estimate will show that must be smaller than by a bigger margin if . This completes the proof of Proposition 7.1.
(2) The proof of Proposition 7.2
Suppose the contrary that there exists a non-Type-I local packing with at least twelve close neighbors, with and having at least one -star, say denoted by . Then, the condition of implies that the total amount of buckling heights of those vertices of must be quite small, say at most . For otherwise, there must be some -stars with considerably higher than to counter balance , and moreover, such -stars can only be either adjacent or overlapping to . Therefore, must be of the -type and must be a small deformation of -type, and hence, it is quite easy to show that is smaller than with substantial margin, contradicting to the assumption that .
On the other hand, the complementary region of the -star with at most total buckling heights is a small deformation of that of a Type-I -star we have already analyzed in the proof of Lemma 2. Therefore, the condition of again implies that can not have more than twelve close neighbors and moreover, must be of -type or -type. Thus, straightforward application of the collective areawise estimates of -pairs and lune clusters, generalized to the geometric situation with small amount of buckling heights, will produce an upper bound of less than , again contradicting to the assumption that . This completes the proof of Proposition 7.2.
7.2 The proof of Theorem I for Case II
Thanks to Proposition 7.1 and Proposition 7.2, the proof of Theorem I for Case II can now be reduced to the proof of the following upper bound, namely
[TABLE]
for the remaining case of non-Type-I local packings with icosahedra . Similar to the proof for the case of Type-I icosahedra (cf. §6), we shall also subdivide the proof of such an estimate into the following subcases, namely {labeling}cases
with of -type.
with of -type.
Others
Note that the same upper bound for areawise estimates of clearly still hold for the more general geometric situations of -pairs (resp. lune clusters) in . Therefore, Case II1 (resp. Case II2) can again be readily proved by the same kind of areawise estimates for their -pairs (resp. lune clusters).
Finally, let us proceed to prove Case II3 by means of areawise estimates of for -star-clusters. It is easy to see that the upper bound estimate of Lemma 4 still holds without modification for the more general situation of -star-clusters in . However, for the range of exceeding , Lemma 4′ certainly needs some kind of proper modification for the more general geometric situation of -star-clusters of , because the realizability condition as stars in Type-I configuration is considerably stronger than that of , or in other words, for a given total area exceeding , certain kind of -star of that total area and with their slightly or even considerably higher than that of the optimal ones in Example 4.7 can not be extended to Type-I configurations, but still can be extended to . The pertinent remark here is that the of such will always be lower than the of those in Example 4.7 containing with the same area and optimal , whose are already lower than with a comfortable margin. Therefore, just for the purpose of proving “” for Case II3, one may assume without loss of generality that the of stars with such large total areas, if any, are bounded above by that we used in the proof of Case I3, simply because the of those configurations with stars of large total areas and with higher than such upper bound are already substantially smaller than . Hence, Case II3 can again be proved by the same estimates as that of Case I3.
This completes the proof of Case II3.
8 Concluding remarks, retrospects and prospects
8.1 Retrospects
First of all, the sphere packing problem should be, more naturally, regarded as a problem of geometric understanding on crystal formations of dense type (cf. Theorem III, §2), a problem suggested by the nature, on the remarkable geometric feature of such crystals for a majority of monatomic elements, rather than whatever is commonly attributed to be the seeking of a proof of Kepler’s conjecture.
In retrospect, both conceptually as well as technically, it is a kind of problem on in depth understanding the underlying profound interplays between the wonderful symmetry of the physical space and the least action principle (i.e. optimality) of crystal formation of dense type as formulated in [Hsi], roughly speaking, a kind of philosophical belief that the remarkable geometric precision and regularity of such crystals should be the consequence of optimality in packing density. Anyhow, this naturally leads to the proof of Theorem III (cf. §2).
Note that one needs firstly to give a proper definition of the concept of global density for packing of the second kind and then proceed to study its optimality such as the one we give in (§1.2.3). Such a concept of global density, no matter how one chooses to define it will be inevitably rather complicated and very difficult to compute in general. Thus, it is extremely remarkable that the global density in (§1.2.3) actually has a clean-cut geometric structure for its optimality (cf. Theorem III). Here, one will naturally wonder about: What is the underlying geometric reason for such an almost magical reality? And how to prove it?
In retrospect, the local geometric structure of those crystals of dense type actually already reveals the clue of Nature’s secret, namely, the whole structure of such immense aggregates are always the repetition of only two kinds of local packings, i.e. the f.c.c. and the h.c.p., and such a reality of nature strongly suggests that global optimality of the second kind of packings should be the consequence of a kind of local optimization everywhere! Thus, the entire journey starting with the proper definition of global density and ending up with the proof of such a least action principle of crystal formation of dense type amounts to find the proper local invariant (i.e. the locally averaged density ) and the proof of its optimal estimate with the f.c.c. and the h.c.p. as the unique two of optimality, namely, Theorem I. This is exactly what we are presenting in this paper in the setting of geometric invariant theory of space.
In summary and from the overall point of view of solid geometry (the geometry of our physical space), the sphere packing problem is an outstanding natural problem which, on the one hand, tests and challenges our depth of geometric understanding and techniques, and on the other hand, it also inspires and leads to further improvements of geometric understanding. Indeed, it is a wonderful geometric problem of the nature that certainly deserve a clean-cut proof in the classical tradition of solid geometry. I am glad that I have finally succeeded in paying my due respect to this problem in the proper setting of geometric invariant theory of space.
8.2 Prospects
Note that the techniques we developed in §3 and §4 on geometric invariant theory and spherical geometry are naturally applicable to the study of many other problems in solid geometry, while we are only focusing on their applications towards the proof of Theorem I. Of course, the broader prospect would naturally be further developing solid geometry in the setting of geometric invariant theory of the space by studying many other natural geometric problems, especially those problems naturally arise in the study of physics, such as the problem of sphere packing and crystal formations in the realm of solid state physics.
However, due to the limited scope of this paper, we shall mention here two specific such problems still in the realm of sphere packings.
Problem 1:
What should be the optimal shape for large scale sphere packings of the first kind, namely, with container?
Problem 2:
Problem of sphere packings with two sizes, say of radii and respectively.
Acknowledgements: This is the final version of a manuscript that has been refined over time with many helpful discussions. In particular I would like to acknowledge the great help and many inspiring discussions with Lee-Ping Wang, which led to important improvement and substantial simplification of the proof of Theorem I. I also would like to thank Eldar Straume, Lars Sydney and Wing-Lung Lee for their generous help and in-depth discussions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Bar] Barlow, Nature , 1883.
- 2[Hsi] Wu-Yi Hsiang, Least action principle of crystal formation of dense packing type and Kepler’s conjecture , Nankai Tracts in Mathematics vol. 3, World Scientific, 2001.
- 3[Kep] Johannes Kepler, On the Six-Cornered Snowflakes , Oxford Univ. Press, 1966 (English translation of latin booklet of 1611).
