Polarization and Greedy Energy on the Sphere
Dmitriy Bilyk, Michelle Mastrianni, Ryan W. Matzke, Stefan, Steinerberger

TL;DR
This paper studies a greedy algorithm for placing points on a sphere to minimize Riesz energy, showing it achieves near-optimal energy behavior and providing bounds on discrepancy, with numerical evidence of high uniformity.
Contribution
It proves that greedy sequences on the sphere attain optimal second-order Riesz energy behavior for 0<s<d and relates polarization to energy and discrepancy.
Findings
Greedy sequences achieve optimal second-order Riesz energy for 0<s<d.
Second-order polarization term is of order N^{s/d} for 0<s<d.
Upper bounds on L^2 spherical cap discrepancy are established, with numerical evidence of low discrepancy.
Abstract
We investigate the behavior of a greedy sequence on the sphere defined so that at each step the point that minimizes the Riesz -energy is added to the existing set of points. We show that for , the greedy sequence achieves optimal second-order behavior for the Riesz -energy (up to constants). In order to obtain this result, we prove that the second-order term of the maximal polarization with Riesz -kernels is of order in the same range . Furthermore, using the Stolarsky principle relating the -discrepancy of a point set with the pairwise sum of distances (Riesz energy with ), we also obtain a simple upper bound on the -spherical cap discrepancy of the greedy sequence and give numerical examples that indicate that the true discrepancy is much lower.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration
Polarization and greedy energy on the sphere
Dmitriy Bilyk and Michelle Mastrianni and
Ryan W. Matzke and Stefan Steinerberger
School of Mathematics, University of Minnesota, Minneapolis, MN 55455, USA
School of Mathematics, University of Minnesota, Minneapolis, MN 55455, USA
Department of Mathematics, Vanderbilt University, Nashville, TN 37212, USA
Department of Mathematics, University of Washington, Seattle, WA 98195, USA
Abstract.
We investigate the behavior of a greedy sequence on the sphere defined so that at each step the point that minimizes the Riesz -energy is added to the existing set of points. We show that for , the greedy sequence achieves optimal second-order behavior for the Riesz -energy (up to constants). In order to obtain this result, we prove that the second-order term of the maximal polarization with Riesz -kernels is of order in the same range . Furthermore, using the Stolarsky principle relating the -discrepancy of a point set with the pairwise sum of distances (Riesz energy with ), we also obtain a simple upper bound on the -spherical cap discrepancy of the greedy sequence and give numerical examples that indicate that the true discrepancy is much lower.
Key words and phrases:
Spherical cap discrepancy, greedy sequences, Stolarsky principle.
2010 Mathematics Subject Classification:
52A40, 52C99
1. Introduction and background
We consider the classical problem of distributing points on the sphere as regularly as possible. Any such discussion requires a notion of regularity. We recall several such notions below.
1.1. Energy and polarization
Let denote a set of points in the sphere. For a symmetric, lower semi-continuous kernel and a point set , we define the discrete energy as
[TABLE]
Of particular importance are the Riesz -kernels:
[TABLE]
Another quantity closely related to energy is polarization, defined as
[TABLE]
For a fixed , we define the minimal energy and maximal (constrained) polarization, respectively, by
[TABLE]
Both energy (see e.g. [10, 11, 25, 49, 53]) and polarization (see e.g. [22, 42, 43, 70, 80, 79, 85]), especially in the case of Riesz kernels, have been actively studied over the past few decades: we refer the reader to the excellent recent book [20] for a comprehensive exposition. The second-order asymptotic behavior for the optimal Riesz -energy is well understood (see the discussion in Section 4.2) for . However, results for second-order asymptotics of maximal polarization with Riesz -kernels have not been obtained before (except when ). We prove these optimal bounds in Theorem 2.1 for the singular potential-theoretic range and then use this result to demonstrate that, in the same range, the greedy sequence obtains optimal second-order asymptotic behavior (up to constants) for Riesz -energy (Theorem 2.2) and, for infinitely many , polarization (Corollary 2.3).
In this paper we only consider the range . This is natural for the Riesz -energy on the -dimensional manifold , since for minimizing energy no longer leads to uniform distribution (indeed, optimal configurations are achieved by placing half of the points at each of two opposite poles [16]), while for the kernels are hypersingular.
1.2. Spherical cap discrepancy
For any and any , we define
[TABLE]
to be a spherical cap. The spherical cap discrepancy of a set is then defined as
[TABLE]
where is the surface measure on normalized so that . A fundamental result of Beck [6, 7] states that
[TABLE]
where the implicit constants depend on the dimension . The upper bound in (1.3) follows from a probabilistic construction (jittered sampling). There are many probabilistic [1, 3, 4, 8, 9, 17, 28, 45] and deterministic [1, 41, 44, 45, 47, 56, 66, 67, 69, 86] constructions that have been investigated. Most of these explicit point sets have only been proven to have discrepancy of the order no better than , although there is numerical evidence that some of them might be almost optimal. Instead of the spherical cap discrepancy, we will in this paper work with the spherical cap discrepancy
[TABLE]
The -spherical cap discrepancy is smaller than the spherical cap discrepancy though the difference is, for many examples, at most logarithmic. The -discrepancy is connected to energy, particularly to the Riesz energy with (sum of distances) via the Stolarsky invariance principle [91], which we state in §5.
2. Greedy sequences and main results
One of the main purposes of this paper is to discuss a simple greedy construction of sequences that turns out to simultaneously
- (i)
achieve optimal Riesz -energy, up to the second-order behavior, when , 2. (ii)
satisfy good distribution properties in the sense of .
Greedy sequences have been studied in the context of energy [29, 52, 61, 62, 63, 64, 65, 73, 74, 82, 84, 95] and discrepancy [29, 57, 87, 88, 89, 90]. It is well understood that greedy sequences on tend to be fairly structured [71], but this cannot be expected in higher dimensions. However, we show that greedy sequences on exhibit an unusual amount of regularity: they produce nearly optimal Riesz energy for and their -spherical cap discrepancy satisfies favorable bounds (while numerics in §6 suggests they might even be optimal).
Generally, constructing a sequence with good discrepancy or energy properties is significantly harder than constructing point sets for arbitrary since one is forced to keep previously chosen points. In some situations (e.g. discrepancy with respect to axis-parallel boxes on the torus [59, 72]) even the optimal discrepancy bounds are slightly different for sequences and point sets. Most known constructions of good point distributions on the sphere are not sequences. This makes the greedy sequence even more valuable.
2.1. Construction of the greedy sequence
We will work with an infinite sequence instead of a single set. This is more difficult since points, once placed, remain fixed. We start with a completely arbitrary initial set of points . The construction is then greedy: at each step we add the point that minimizes the Riesz -energy with respect to the existing set of points. More formally, given a set , we pick
[TABLE]
In other words, is just the point where the polarization is achieved for the set , i.e.
[TABLE]
If the minimum is not assumed at a unique point, then any such point may be chosen. Note that if , this construction is equivalent to maximizing the sum of (Euclidean) distances to the points in the existing set. When , i.e. , and (or more generally, on any compact ), such sequences are called Leja sequences in recognition of the work of Leja [60] although they were introduced by Edrei [39] earlier. Simultaneously, Górski studied the case of (the Newtonian kernel) for compact sets [46]. For (i.e. the Green kernel) and compact , the corresponding greedy sequences are known as Leja–Górski sequences, which have been studied in [48, 74]. Leja points have applications in Stochastic Analysis [68] and Approximation/Interpolation Theory [12, 30, 31, 33, 38, 54, 78, 92], and various numerical methods have been developed to approximate such points (see, e.g. [5, 21, 34]).
As a general remark on the behavior of greedy sequences in any dimension, we note that greedy sequences defined above (when initialized with a single point) have the property that every other point in the sequence is the antipodal point to the last one placed. This is a generalization of [63, Theorem 2.1], the proof is identical.
Proposition 2.1**.**
Let . Let be a greedy sequence on for some initial point . Then for all , .
More generally, this result holds if we replace with any kernel with strictly decreasing.
2.2. Optimal second-order polarization estimates
For , we note that the kernel is integrable on in each variable and depends only on the distance between and . This suggests defining the constant
[TABLE]
with denoting, as before, the normalized uniform measure on . Observe that the constant is negative for .
One would expect that polarization is maximized when the points are distributed as uniformly as possible. In that case, one could replace summation by integration in (1.2) and expect the maximal Riesz polarization to behave roughly as , and this is indeed correct, see e.g. [64]. We prove more delicate polarization estimates with optimal (up to constants) second-order terms for .
Theorem 2.1**.**
For , there exist positive constants such that the following hold for all .
For ,
[TABLE]
For ,
[TABLE]
For ,
[TABLE]
The lower bounds in the Theorem above (proved in §4.2) follow from known optimal energy estimates (see Theorem 4.1), but the upper bounds are novel and rely on the estimates of the generalized discrepancy [13, 14, 15], see §4.1.
Note that even in the previously studied case of the circle (), our result is new when . This in fact leads us to the following characterization of asymptotics for polarization on the circle:
Corollary 2.1**.**
For , ,
[TABLE]
For ,
[TABLE]
When , there are so that for sufficiently large,
[TABLE]
An extensive discussion of the one-dimensional case is presented in §3.
As suggested by these sharp bounds in the one-dimensional case, as well as the asymptotics for the hypersingular case , which were studied in [2, 18, 19, 40, 51, 50], we conjecture that the upper bounds for polarization given in Theorem 2.1 are sharp for in all dimensions , i.e. the second term is of the order .
2.3. Riesz energy of greedy sequences
While Theorem 2.1 is interesting in its own right, we also use it as a tool to obtain second-order energy estimates for the greedy sequences (see §4.3), which are of optimal order in the range .
Theorem 2.2**.**
Let , , and let be a greedy sequence for as defined in (2.1) with fixed initial point . For , denote the first elements of the sequence by . Then there exist positive constants such that the following holds for .
For ,
[TABLE]
For ,
[TABLE]
For ,
[TABLE]
The results of Theorem 2.2 are new for all . The case of was entirely handled in [63, Theorems 4.1 and 5.2] and the case of has been studied in [64, 65]. In the case of the circle , i.e. , more precise information about the Riesz energy of greedy sequences is known, see Theorem 3.2.
The lower bounds in Theorem 2.2 are well-known optimal estimates for minimal Riesz -energies discussed in §4.2, while the upper bounds for greedy sequences are the case of the following more general result about greedy sequences with an arbitrary number of initial points, which we prove in §4.3.
Theorem 2.3**.**
Let , , and let be the greedy sequence of points on with respect to as defined in (2.1) with fixed arbitrary . Let denote the set of the first points in this sequence. Then there exists a positive constant such that for , the following hold.
For ,
[TABLE]
For if and if ,
[TABLE]
For and ,
[TABLE]
For and ,
[TABLE]
These are the first second-order upper bounds for the energy in the literature for the case when the greedy sequence is initialized with arbitrary points, rather than a single point. The Riesz energy of greedy sequences initialized by one point has been studied before in a general setting, see e.g. [61, 64, 84], where the first-order term was obtained, along with the second-order estimates in the case , see [63, 65]. In fact, for these results give much stronger bounds on the Riesz energy of greedy sequences on the circle , but the proofs rely strongly on the structural properties of the greedy sequences (see Theorem 3.2 and the discussion in §3.2), which do not generalize to . Hence, the general results of Theorem 2.3 are interesting (although probably not sharp for ) even in the one-dimensional case).
2.4. Polarization of greedy sequences
Theorem 2.3 demonstrates, in particular, that for any greedy sequence with initial data points
[TABLE]
Combining this with Theorem 4.2.2 and Corollaries 14.6.5 and 14.6.7 from [20], we have that any such greedy sequence is uniformly distributed and therefore yields optimal first-order asymptotics for polarization (which was shown for in [64, Thm 2.1] and [84, Lemma 3.1]).
Corollary 2.2**.**
Let , , and be the greedy sequence of points on with respect to as defined in (2.1) with fixed arbitrary . Let denote the set of the first points in this sequence. Then the sequence of point sets is uniformly distributed on , and
[TABLE]
Moreover, Theorem 2.2 states that greedy sequences on achieve Riesz energy that is asymptotically optimal to the second-order (at least up to constants) in the potential-theoretic case . This may be interpreted as a way of measuring regularity of a set of points, we would expect the greedy sequence to behave fairly regularly also with respect to other notions of uniformity. We prove that greedy points also achieve almost maximal polarization (up to the second-order term) most of the time in the sense of asymptotic density (in particular, for infinitely many ).
Corollary 2.3**.**
Let , , and be a greedy sequence for . Let denote the set of the first points in this sequence. For every , there exists such that
[TABLE]
This is optimal up to the value of since Theorem 2.1 implies the unconditional bound The proof gives a quantitative description of how depends on and in terms of the constant arising in (2.9), we refer to the proof in §4.4 for details.
2.5. A uniform -discrepancy bound
We now concentrate on Riesz energy with or, equivalently, on the problem of selecting in such a way that the sum of distances is maximized. The problem has particular geometric significance in light of the Stolarsky invariance principle (formally stated in §5) which states that maximizing the sum of distances is the same as minimizing the spherical cap discrepancy. On , there are a large number of deterministic constructions [1, 41, 44, 45, 47, 56, 66, 67, 69, 86] and some of them are known to achieve a spherical cap discrepancy as small as . The seminal results of Beck imply that optimal constructions should be as small as (and there is numerical evidence that some of these sequences of point sets achieve it). Our contribution to this question is two-fold:
- (1)
we show that a large class of recursively defined sequences, containing all greedy sequences, achieve an spherical cap discrepancy of order with an explicit small constant, and 2. (2)
we provide some numerical evidence that greedy sequences achieve a rate that is either or very close to it, see §6.
The result will apply to a broader class of sequences: we note the trivial inequality
[TABLE]
where we note that . The subsequent theorem applies to all sequences where the next element is always chosen in such a way that the inequality above is satisfied, i.e. rather than maximizing the sum, we choose any point where the sum exceeds its mean value (and, in particular, the greedy constructions always satisfy the inequality).
Theorem 2.4**.**
Let be an arbitrary initial set and suppose that for all , the set is extended to a sequence satisfying
[TABLE]
Let denote the set of the first points in this sequence. Then, for any ,
[TABLE]
On , this expression simplifies to
[TABLE]
Observe that, due to the Stolarsky principle (Theorem 5.1), when , the right-hand side this inequality exactly matches the first-order term in the upper bound in (2.10) in Theorem 2.2. In fact, inequality (2.10) is even stronger, since it includes a negative second-order term (similar conclusions for all follow from (2.13)). However, this theorem still provides new information since the bounds here apply to a much wider class of sequences than just purely greedy sequences.
We first note that it applies to a fairly large family of sequences and gives a uniform bound for all of them. This bound is generically tight. One might be inclined to believe that the greedy sequence is better behaved, which is suggested by the case of Theorem 2.2.
Theorem 2.1, in the case , can be rewritten as saying that there is some such that for any -point set
[TABLE]
While this is sharp for and we believe it to be sharp for higher dimensions, it cannot be sharp for a greedy sequence “on average”, which becomes evident from the following result proved in §5.1 (recall that ).
Proposition 2.2**.**
Let be a greedy sequence on with initial points. For sufficiently large, depending on ,
[TABLE]
If greedy sequences indeed have good discrepancy properties (as suggested by numerics) and if , then the upper bound in Proposition 2.2 has to be asymptotically tight. The gap between and is a quantitative way of measuring our lack of understanding of the underlying dynamics.
The proof of Proposition 2.2 can be easily adapted, with different constants, to other -Riesz kernels and to all dimensions (as well as for a wide range of more general kernels). We choose to state it just for and for the sake of simplicity of exposition.
Remark 2.5*.*
The spherical cap discrepancy can also be interpreted as the worst-case error for numeric integration (with equal weights) for the Sobolev space , see [24], and our results can be translated to this setting. Greedy sequences in the context of worst-case error for numeric integration for reproducing kernel Hilbert spaces have been studied, for example, in [83], where a similar bound (although with an additional optimization over weights) was obtained.
2.6. Outline
The outline of the paper is as follows. In §3, we collect a number of results about both polarization and greedy sequences in the special case of the circle , presenting the proof of Corollary 2.1 as well as the analogues of Theorem 2.2 and Corollary 2.3 for . In §4 we present the proofs of some of the main results for : Theorem 2.1 in §4.1–4.2, Theorems 2.2 and 2.3 in §4.3, and Corollary 2.3 in §4.4. In §5 we prove the -discrepancy bound (Theorem 2.4) and Proposition 2.2 and discuss some similar known results. We conclude with discussion of numerical properties of greedy sequences in §6, and provide some auxiliary results in §7.
3. Polarization and Greedy Sequences on
The case of in Theorem 2.1 and Corollary 2.2 has mostly been settled in the literature previously, as the unique properties of the circle yield convenient examples of point sets that maximize polarization and greedy energy. In this section we collect known results on the circle for polarization and greedy energy and note where our Theorem 2.1 and Corollary 2.2 fill in some gaps or differ.
3.1. Polarization on the circle and proof of Corollary 2.1
In [51, Theorem 1] it is shown that equally spaced points on , denoted , are optimal for polarization for any kernel such that is decreasing and convex on . This includes the Riesz kernels for , but not . However, for , still has optimal asymptotic behavior, and likely maximizes polarization as well.
For the range , the points that minimize the discrete potential with respect to are the midpoints of an arch between two th roots of unity. Because the midpoints of an arch between two th roots of unity are themselves th roots of unity, there is a natural expression for the polarization on in terms of the corresponding energies for and equally spaced points:
[TABLE]
There is an explicit formula for the energy of equally spaced points on .
Theorem 3.1** (Theorem 1.1 in [26]).**
Let , and let be any nonnegative integer such that . If is a configuration of equally spaced points on , then
[TABLE]
where is the classical Riemann zeta function and are the coefficients in the expansion
[TABLE]
Combining Theorem 3.1 with (3.1), we have that on for , ,
[TABLE]
This second order term also explicitly computed in [62, Proposition 4.1] and [63, Lemma 3.10]. In [26, page 623], the authors also show that
[TABLE]
which, when combined with (3.1), gives us that
[TABLE]
Both (3.2) and (3.4) yield a lower bound on optimal polarization , which matches the upper bounds given in Theorem 2.1 for . In the case , optimality of roots of unity, [51, Theorem 1] shows that , completing the proof of (2.6) and (2.7).
The proof of the upper bounds in Theorem 2.1 (which is presented in §4.1 and holds for all ) covers the range in which the roots of unity are not known to be optimal, proving (2.8). Thus this case of Theorem 2.1 is new even in the one-dimensional case. This completes the proof of polarization estimates for , i.e. Corollary 2.1.
3.2. Energy for greedy sequences
The behavior of greedy sequences on is also well-studied. For , it is known that any greedy sequence is in fact a classical van der Corput sequence [35] (see [12, Theorem 5], [63, Lemma 3.7], [65, Sec 1.2], [64, Lemmas 4.1 and 4.2], and also [95, Example 2], which is perhaps the earliest observation of this kind, but just for ). A similar result was shown for a large class of kernels (whenever , and is a bounded, continuous, decreasing, convex function of the geodesic distance) in [71, Thm 2.1]. This has made explicit computations for bounds of the asymptotic behavior of the greedy Riesz energies possible (see Theorems 1.1, 1.2, and 1.5 in [65] and Theorems 3.16, 3.17, 3.18 in [63]). Here we collect the these results, in less detail.
Theorem 3.2**.**
On the circle , for and , if is the first elements of a greedy sequence, then
[TABLE]
The order (of the second-order term) in each case cannot be improved.
We note that for , this is an improvement on the upper bounds achieved in Theorem 2.2, suggesting that there is likely room for improvement on the upper bounds in higher dimensions in this range.
Moreover, according to Theorem 4.1, which states that the optimal second-order term for the Riesz -energy on is , we see that for , point sets produced via the greedy algorithm have optimal asymptotic behavior on , whereas for , the greedy algorithm does not produce point sets with optimal Riesz energy, which leads us to the open question of whether these results also hold true for higher dimensions.
At the same time, at least when , these bounds are actually sharp for sequences. Indeed, the case of in Theorem 3.2 and the Stolarsky invariance principle (Theorem 5.1) imply that the -spherical caps discrepancy satisfies . But spherical caps on the circle are just intervals, hence, this is just the classical periodic -discrepancy of one-dimensional sequences, for which is known to be the optimal order, as shown in [75] (see also [55, 72]) with the ideas going back to the seminal results of Roth [81]. In fact, as mentioned earlier, in this case, the greedy sequence with one initial point is just the van der Corput sequence, whose discrepancy is well studied [32, 57, 76].
On the other hand, Theorem 4.1 shows that the optimal second-order term for the energy in the case and is , and therefore, the optimal -discrepancy is (by the Stolarsky principle), which is easily seen to be achieved by equally spaced points on the circle. This highlights an important difference between the behavior of -point sets and infinite sequences.
3.3. Polarization for greedy sequences
As discussed earlier, the fact that any greedy sequence is uniformly distributed suggests that they may behave well for different measures of uniformity. For polarization, the asymptotic behavior of a greedy sequence for Riesz kernels on the circle was shown in [63, Theorem 3.11] and [62, Theorems 1.1 and 1.4] (the case of hypersingular Riesz kernels was also handled in [62] and the case of in [63, Theorem 4.1 and 5.2]).
Theorem 3.3**.**
On the circle , for and , if is the first elements of a greedy sequence, then
[TABLE]
The order (of the second-order term) in each case cannot be improved.
Comparing this to Corollary 2.1, we see that the greedy sequences on the circle have good second-order asymptotic behavior for , but not for .
4. The Proof of Theorems 2.1 and 2.2
We proceed with the proof of Theorem 2.1 for . We associate the Riesz -kernel with a function of the inner product between the points and by observing that and setting , where:
[TABLE]
Further, to facilitate our proof of the upper bound for polarization, we also introduce non-singular approximations to the functions . For , take
[TABLE]
4.1. Proof of Theorem 2.1: upper bound
For , let . For any , we have the Gegenbauer expansion
[TABLE]
where are the standard Gegenbauer polynomials (see e.g. [36] for details) and
[TABLE]
Note that for all and , the function is continuous, and therefore in . On the other hand, only when , which would lead to certain technical complications in the proofs when . At the same time for all , in other words, is integrable on the sphere, i.e. , for (this is the potential-theoretic case).
We shall need the information about the behavior of the Gegenbauer coefficients of the Riesz kernels as well as their approximations . We postpone a detailed analysis to the Appendix and only mention the most relevant information here: for , the coefficients and are non-negative and decreasing, and is of the order (Corollary 7.2).
We start with the case and will then explain the adjustments needed in the range . Starting with an arbitrary and trivially estimating the integral by the average, we observe that
[TABLE]
where we have used that due to rotational invariance the integral is independent of . Therefore,
[TABLE]
and we can thus estimate this expression from below by the -average:
[TABLE]
where is the generalized -discrepancy of with respect to the function which is studied in [13, 14, 15] and well-defined for . Observe the similarity of this notion to the classical -spherical cap discrepancy (1.4). Indeed, taking , one obtains the inner integral in the definition (1.4) of . It has been shown [13, Theorem 4.2] that
[TABLE]
for some constants , . Since for and Gegenbauer coefficients , , are positive and decreasing (Corollary 7.2), we find that
[TABLE]
where we have used the asymptotics of the Gegenbauer coefficients . Taking the supremum over proves the upper bound in Theorem 2.1 for .
Turning to the case , we can repeat the argument above verbatim for the kernel , using the fact that the Gegenbauer coefficients of are positive and decreasing (Corollary 7.1) and the fact that ), to find that
[TABLE]
Taking the limit as which is justified by Lemma 7.3 and by the Lebesgue dominated convergence theorem, since . Using the asymptotics of from Corollary 7.2, one obtains
[TABLE]
which proves the required bound for the range .
4.2. Proof of Theorem 2.1: lower bound
We collect some known results from which the lower bound of Theorem 2.1 follows immediately for . The first relates the maximal polarization to the minimal energy.
Proposition 4.1** (Proposition 14.1.1, [20]).**
For all and kernels , we have
[TABLE]
Optimal second-order asymptotics for the discrete Riesz -energy in the range have been computed by various authors [23, 27, 58, 77, 93, 94], see also Theorems 6.4.5, 6.4.6, and 6.4.7 in [20]. These bounds are as follows.
Theorem 4.1**.**
For , , , there exist positive constants , such that for ,
[TABLE]
If , then
[TABLE]
Combining Theorem 4.1 and Proposition 4.1, we deduce the following bounds.
Corollary 4.1**.**
For , , , and ,
[TABLE]
where is as in Theorem 4.1.
If , then
[TABLE]
Since the constant term is essential when , this gives the lower bound of Theorem 2.1 for the case .
4.3. Greedy energy: proof of Theorems 2.2 and 2.3
The lower bounds in Theorem 2.2 are just the general lower bounds for minimal Riesz energies presented in Theorem 4.1 above, so we concentrate on the upper bound of Theorem 2.3.
Proof of Theorem 2.3.
Let denote the set of the first points of the greedy sequence with respect to with initial points. Observe that, by construction, for
[TABLE]
Therefore the discrete energy of satisfies
[TABLE]
Using the upper bounds from Theorem 2.1 one obtains
[TABLE]
We have that for
[TABLE]
and since , our claim now follows. Note that the case is only relevant when given that . ∎
4.4. Proof of Corollary 2.3
Let be the greedy sequence of points on with respect to as defined in (2.1). Arguing as in the proof of Theorem 2.2,
[TABLE]
We know from Theorem 2.2 that
[TABLE]
We also recall that
[TABLE]
We can now introduce, for and , the set
[TABLE]
Collecting all these estimates, we see, for some ,
[TABLE]
From this we deduce that
[TABLE]
which is less than for large enough. This proves Corollary 2.3.
5. -discrepancy: Proof of Theorem 2.4
We turn to showing our main result on the -spherical cap discrepancy of the greedy sequence. The relevance of the greedy construction to discrepancy is based on the following classical result [91] that relates the sum of pairwise Euclidean distances (in other words, the Riesz energy with ) to the -discrepancy.
Theorem 5.1** (Stolarsky Invariance Principle).**
For ,
[TABLE]
where the constant is given by
[TABLE]
Note that the construction of the greedy sequence in (2.1) when aims to maximize the pairwise sum of Euclidean distances at every step and thus minimize the -discrepancy at every step by the Stolarsky invariance principle. As will become clear in our proof of the upper bound below, it is not of tremendous importance that one picks the maximum to show such an upper bound – one really only cares about having a large value in the sum. Obviously, the larger the value the better (and thus the maximum is optimal at that step) but taking values close to the maximum should suffice (and does in practice).
Proof of Theorem 2.4.
Let us assume is given and, for all ,
[TABLE]
We have the trivial bound
[TABLE]
and, for , that
[TABLE]
Iterating this inequality, we infer, for all , that
[TABLE]
This now implies that for the first elements
[TABLE]
from which we deduce that
[TABLE]
In the case of , we have
[TABLE]
from which we deduce
[TABLE]
∎
Note that the result is sharp (up to lower order terms) for sequences satisfying
[TABLE]
It is clear that many such sequences exist: all involved functions are continuous so there are always points where they attain their average value.
Remark 5.2*.*
The above result, in a different form, has more or less appeared in the literature previously: for example, the result follows directly from Theorem 3.1 (and Remark 3.2) in [63]. The purpose of reproving it here is to state the result in the form of an -discrepancy bound and to give more explicit constants.
5.1. Proof of Proposition 2.2
We conclude with a proof of Proposition 2.2. To establish a nontrivial lower bound, we show that the average growth of any consecutive pairs of points is bounded from below. Suppose . Recall that . Fix such that
[TABLE]
Assume that . In this case it is easy to see that
[TABLE]
Indeed, the average value of the function is and its maximum is , therefore, denoting the set above by , we have
[TABLE]
which implies that . Thus every hemisphere contains points satisfying the inequality .
By restricting to the hemisphere centered at we conclude that
[TABLE]
Combining this with the definition of , we see that
[TABLE]
If , one can see immediately that (5.1) holds with the right-hand side of . Therefore, the sum in (2.18) grows at least linearly in , which leads to the lower bound in Proposition 2.2. The upper bound follows from the Stolarsky principle which implies
[TABLE]
and thus, for any set of points ,
[TABLE]
For a greedy sequence, we can write
[TABLE]
and thus
[TABLE]
Note that this upper bound (with a worse constant) could be obtained directly from the lower bound in (2.4) of Theorem 2.1. We also remark that the same arguments would also work in higher dimensions (with different constants).
6. Numerical Examples and Comments
This section contains some basic numerical examples and general comments. First we note that, in the greedy construction, finding the exact point
[TABLE]
is computationally nontrivial.
If the success of the greedy construction were to depend very strongly on finding exact maxima in each step, it would not be a very useful construction to begin with (indeed, as also indicated by the proof of Theorem 2.4, luckily this does not seem to be the case). Numerical experiments suggest the exact opposite: these types of sequences tend to be incredibly robust and seem to lead to good results even if sometimes adversarial points are added manually. Throughout this section, we consider approximate sequences obtained as follows: given , we consider 100 random points and add the one maximizing the sum of distances among those. Figure 1 shows an example of a set of points on obtained from a single individual point. We observe that the sequence looks somewhat random but avoids clusters of points more so than an actually random sequence would.
When plotting the spherical cap discrepancy of this sequence one observes, numerically, that the discrepancy seems to be relatively close to . Note that our construction of the sequence was based on the approximation by random points and it stands to reason that Figure 2 serves as an upper bound of the true behavior of a greedy sequence. The greedy method is agnostic to what happened in the past and works well with any arbitrary initial set. We illustrate this with a simple example where we first take 250 points uniformly at random and then compute another 250 points greedily.
The result is striking: for the first 250 elements, we see that is approximately constant (as expected for random points). After that there is a pronounced decay. Perhaps even more striking is that is first increasing (roughly at rate as we expect) and then quickly decreases and returns to a constant slightly above (see also Fig. 2).
One could wonder about higher dimensions: one would expect optimal sequences to behave as on as well as on . The same basic numerical experiment leads to results compatible with this interpretation. We emphasize that, due to increasing computational cost, these experiments were carried out only for rather small values of . At this scale, small powers of or logarithms are not always easy to detect. Moreover, the effectiveness our numerical procedure of picking 100 points at random and then adding the one with the largest distance depends on the profile of the function: if large deviations occur on sets of small measure (an effective that could conceivably become more pronounced in higher dimensions), then this method will lose effectiveness. Regardless, we believe these preliminary results to be rather interesting and hope they will inspire subsequent work on the regularity of greedy sequences on .
We conclude with a conjecture which collects our observations in this section and speculates that the greedy sequence has nearly optimal spherical cap discrepancy.
Conjecture**.**
Let be the first elements of the greedy sequence with respect to as defined in (2.1). Then, for some ,
[TABLE]
7. Appendix
7.1. Gegenbauer coefficients
In this section we compute the Gegenbauer coefficients of the functions (the Riesz kernel) and (its approximation) which were defined in §4.1. Various computations for such coefficients for some ranges of can be found scattered in the literature (e.g. [23, 37]). However, since we need finer properties of these coefficients (asymptotics, monotonicity, positivity) in the full range, we present detailed computations here. We proceed first with finding the Gegenbauer coefficients for the functions .
Lemma 7.1**.**
For ,
[TABLE]
where is the ordinary hypergeometric function.
Proof.
On multiple occasions, we will use the identity
[TABLE]
We shall also use the Rodrigues formula
[TABLE]
Using (7.2) with (4.3), we have, for and
[TABLE]
giving us the first expression in the claim. For the second one,
[TABLE]
∎
Corollary 7.1**.**
For , , the Gegenbauer coefficients are positive and decreasing in .
Proof.
For and , we see that all the summands of
[TABLE]
are positive, so . We also see that for ,
[TABLE]
so the summands in (7.3) are decreasing as a function of . Thus, is indeed a decreasing function in , for and any . ∎
We know consider the Gegenbauer coefficients of the Riesz kernels themselves.
Lemma 7.2**.**
For ,
[TABLE]
Proof.
We again use the Rodrigues formula along with integration by parts to find, for , and ,
[TABLE]
For and , we have a similar result:
[TABLE]
∎
The coefficients for are thus clearly positive, and a comparison of consecutive coefficients in all cases also shows that is also decreasing as a function of . A quick asymptotic analysis give us the following:
Corollary 7.2**.**
For , , is positive and decreasing as a function of . Moreover
[TABLE]
We note that in the case that , we actually have the Chebyshev polynomials of the first type instead of the Gegenbauer polynomials. These are given by and, for ,
[TABLE]
Through this limit, one can quickly find that Lemmas 7.1 and 7.2 as well as Corollaries 7.2 and 7.1 still hold in this case.
7.2. Limits for polarization
We define, for any symmetric, lower semi-continuous kernel and finite Borel measure , the potential
[TABLE]
We define the polarization of to be
[TABLE]
Lemma 7.3**.**
Let be a symmetric, lower semi-continuous kernel. Let be a finite Borel measure on and be a sequence of symmetric, continuous kernels, increasing pointwise in to . Then
[TABLE]
Proof.
By the Monotone Convergence Theorem, we have that for all ,
[TABLE]
For , let and let . Thus
[TABLE]
For all , we see that
[TABLE]
so is an increasing sequence, bounded from above by .
Since is compact, there is a convergent subsequence of , with limit point . Now, for each such that we know
[TABLE]
so for all
[TABLE]
Thus, by continuity, for ,
[TABLE]
Thus, by the Monotone Convergence Theorem, we have
[TABLE]
Our claim now follows from (7.10), (7.11), and the fact that . ∎
Taking gives a discrete version of the result. We are unaware of a proof like this for any other polarization problems, and would like to point out that it should hold on arbitrary compact metric measure spaces as well.
8. Acknowledgements
D. Bilyk and M. Mastrianni have been supported by the NSF grant DMS-2054606. R.W. Matzke was supported by the Austrian Science Fund FWF project F5503 part of the Special Research Program (SFB) “Quasi-Monte Carlo Methods: Theory and Applications” and NSF Postdoctoral Fellowship Grant 2202877. S. Steinerberger is supported by the NSF (DMS-2123224) and the Alfred P. Sloan Foundation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Aistleitner, J.S. Brauchart, J. Dick. Point Sets on the Sphere 𝕊 2 superscript 𝕊 2 \mathbb{S}^{2} with Small Spherical Cap Discrepancy . Discrete Comput. Geom. 48 , 990–1024 (2012).
- 2[2] G. Ambrus, K.M. Ball, T. Erdelyi. Chebyshev constants for the unit circle . Bull. Lond. Math. Soc. 45 (2), 236–248 (2013).
- 3[3] K. Alishahi, M. Zamani. The spherical ensemble and uniform distribution of points on the sphere . Electron. J. Probab. 20 (23), 1–22 (2015).
- 4[4] D. Armentano, C. Beltrán, M. Shub. Minimizing the discrete logarithmic energy on the sphere: the role of random polynomials . Trans. Amer. Math. Soc. 363 (6), 2955–2965 (2011).
- 5[5] J. Baglama, D. Calvetti, L. Reichel. Fast Leja points . Electron. Trans. Numer. Anal. 7 , 124–140 (1998).
- 6[6] J. Beck. Sums of distances between points on a sphere– an application of the theory of irregularities of distribution to discrete geometry . Mathematika 31 , 33–41 (1984).
- 7[7] J. Beck. Some upper bounds in the theory of irregularities of distribution . Acta Arith. 43 , 115–130 (1984).
- 8[8] D. Bellhouse. Area estimation by point-counting techniques . Biometrics 37 (2), 303–312 (1981).
