The convexification effect of Minkowski summation
Matthieu Fradelizi, Mokshay Madiman, Arnaud Marsiglietti, Artem, Zvavitch

TL;DR
This paper investigates how Minkowski sums of compact sets approach convexity, analyzing various non-convexity measures, disproving some conjectures, and establishing new inequalities and properties related to convexification.
Contribution
It clarifies relationships between different non-convexity indices and reveals non-monotonic behavior of volume deficit in high dimensions, providing new inequalities and counterexamples.
Findings
Volume deficit of Minkowski sums does not always decrease monotonically in high dimensions.
Schneider's non-convexity index is strongly monotone along Minkowski sum sequences.
Hausdorff distance and effective standard deviation become monotone after a certain number of summands.
Abstract
Let us define for a compact set the sequence It was independently proved by Shapley, Folkman and Starr (1969) and by Emerson and Greenleaf (1969) that approaches the convex hull of in the Hausdorff distance induced by the Euclidean norm as goes to . We explore in this survey how exactly approaches the convex hull of , and more generally, how a Minkowski sum of possibly different compact sets approaches convexity, as measured by various indices of non-convexity. The non-convexity indices considered include the Hausdorff distance induced by any norm on , the volume deficit (the difference of volumes), a non-convexity index introduced by Schneider (1975), and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The convexification effect of Minkowski summation
Matthieu Fradelizi, Mokshay Madiman, Arnaud Marsiglietti and Artem Zvavitch supported in part by the Agence Nationale de la Recherche, project GeMeCoD (ANR 2011 BS01 007 01).supported in part by the U.S. National Science Foundation through grants DMS-1409504 (CAREER) and CCF-1346564.supported in part by the Institute for Mathematics and its Applications with funds provided by the National Science Foundation. supported in part by the U.S. National Science Foundation Grant DMS-1101636, Simons Foundation and the Bézout Labex.
Abstract
Let us define for a compact set the sequence
[TABLE]
It was independently proved by Shapley, Folkman and Starr (1969) and by Emerson and Greenleaf (1969) that approaches the convex hull of in the Hausdorff distance induced by the Euclidean norm as goes to . We explore in this survey how exactly approaches the convex hull of , and more generally, how a Minkowski sum of possibly different compact sets approaches convexity, as measured by various indices of non-convexity. The non-convexity indices considered include the Hausdorff distance induced by any norm on , the volume deficit (the difference of volumes), a non-convexity index introduced by Schneider (1975), and the effective standard deviation or inner radius. After first clarifying the interrelationships between these various indices of non-convexity, which were previously either unknown or scattered in the literature, we show that the volume deficit of does not monotonically decrease to 0 in dimension 12 or above, thus falsifying a conjecture of Bobkov et al. (2011), even though their conjecture is proved to be true in dimension 1 and for certain sets with special structure. On the other hand, Schneider’s index possesses a strong monotonicity property along the sequence , and both the Hausdorff distance and effective standard deviation are eventually monotone (once exceeds ). Along the way, we obtain new inequalities for the volume of the Minkowski sum of compact sets (showing that this is fractionally superadditive but not supermodular in general, but is indeed supermodular when the sets are convex), falsify a conjecture of Dyn and Farkhi (2004), demonstrate applications of our results to combinatorial discrepancy theory, and suggest some questions worthy of further investigation.
2010 Mathematics Subject Classification. Primary 60E15 11B13; Secondary 94A17 60F15.
Keywords. Sumsets, Brunn-Minkowski, convex hull, inner radius, Hausdorff distance, discrepancy.
Contents
-
3.1 Monotonicity of volume deficit in dimension one and for Cartesian products
-
4.1 A refined superadditivity of the volume for compact sets
-
5.1 The refined monotonicity of Schneider’s non-convexity index
-
7 The behavior of the Hausdorff distance from the convex hull
1 Introduction
Minkowski summation is a basic and ubiquitous operation on sets. Indeed, the Minkowski sum of sets and makes sense as long as and are subsets of an ambient set in which the operation + is defined. In particular, this notion makes sense in any group, and there are multiple fields of mathematics that are preoccupied with studying what exactly this operation does. For example, much of classical additive combinatorics studies the cardinality of Minkowski sums (called sumsets in this context) of finite subsets of a group and their interaction with additive structure of the concerned sets, while the study of the Lebesgue measure of Minkowski sums in is central to much of convex geometry and geometric functional analysis. In this survey paper, which also contains a number of original results, our goal is to understand better the qualitative effect of Minkowski summation in – specifically, the “convexifying” effect that it has. Somewhat surprisingly, while the existence of such an effect has long been known, several rather basic questions about its nature do not seem to have been addressed, and we undertake to fill the gap.
The fact that Minkowski summation produces sets that look “more convex” is easy to visualize by drawing a non-convex set111The simplest nontrivial example is three non-collinear points in the plane, so that is the original set of vertices of a triangle together with those convex combinations of the vertices formed by rational coefficients with denominator . in the plane and its self-averages defined by
[TABLE]
This intuition was first made precise in the late 1960’s independently222Both the papers of Starr [79] and Emerson and Greenleaf [33] were submitted in 1967 and published in 1969, but in very different communities (economics and algebra); so it is not surprising that the authors of these papers were unaware of each other. Perhaps more surprising is that the relationship between these papers does not seem to have ever been noticed in the almost 5 decades since. The fact that converges to the convex hull of , at an rate in the Hausdorff metric when dimension is fixed, should perhaps properly be called the Emerson-Folkman-Greenleaf-Shapley-Starr theorem, but in keeping with the old mathematical tradition of not worrying too much about names of theorems (cf., Arnold’s principle), we will simply use the nomenclature that has become standard. by Starr [79] (see also [80]), who credited Shapley and Folkman for the main result, and by Emerson and Greenleaf [33]. Denoting by the convex hull of , by the -dimensional Euclidean ball of radius , and by the Hausdorff distance between a set and its convex hull, it follows from the Shapley-Folkman-Starr theorem that if are compact sets in contained inside some ball, then
[TABLE]
By considering , one concludes that d(A(k))=O\big{(}\frac{\sqrt{n}}{k}\big{)}. In other words, when is a compact subset of for fixed dimension , converges in Hausdorff distance to as , at rate at least .
Our geometric intuition would suggest that in some sense, as increases, the set is getting progressively more convex, or in other words, that the convergence of to is, in some sense, monotone. The main goal of this paper is to examine this intuition, and explore whether it can be made rigorous.
One motivation for our goal of exploring monotonicity in the Shapley-Folkman-Starr theorem is that it was the key tool allowing Starr [79] to prove that in an economy with a sufficiently large number of traders, there are (under some natural conditions) configurations arbitrarily close to equilibrium even without making any convexity assumptions on preferences of the traders; thus investigations of monotonicity in this theorem speak to the question of whether these quasi-equilibrium configurations in fact get “closer” to a true equilibrium as the number of traders increases. A related result is the core convergence result of Anderson [3], which states under very general conditions that the discrepancy between a core allocation and the corresponding competitive equilibrium price vector in a pure exchange economy becomes arbitrarily small as the number of agents gets large. These results are central results in mathematical economics, and continue to attract attention (see, e.g., [70]).
Our original motivation, however, came from a conjecture made by Bobkov, Madiman and Wang [21]. To state it, let us introduce the volume deficit of a compact set in : , where denotes the Lebesgue measure in .
Conjecture 1.1** (Bobkov-Madiman-Wang [21]).**
Let be a compact set in for some , and let be defined as in (1). Then the sequence is non-increasing in , or equivalently, is non-decreasing.
In fact, the authors of [21] proposed a number of related conjectures, of which Conjecture 1.1 is the weakest. Indeed, they conjectured a monotonicity property in a probabilistic limit theorem, namely the law of large numbers for random sets due to Z. Artstein and Vitale [6]; when this conjectured monotonicity property of [21] is restricted to deterministic (i.e., non-random) sets, one obtains Conjecture 1.1. They showed in turn that this conjectured monotonicity property in the law of large numbers for random sets is implied by the following volume inequality for Minkowski sums. For being an integer, we set .
Conjecture 1.2** (Bobkov-Madiman-Wang [21]).**
Let , be integers and let be compact sets in . Then
[TABLE]
Apart from the fact that Conjecture 1.2 implies Conjecture 1.1 (which can be seen simply by applying the former to , where is a fixed compact set), Conjecture 1.2 is particularly interesting because of its close connections to an important inequality in Geometry, namely the Brunn-Minkowski inequality, and a fundamental inequality in Information Theory, namely the entropy power inequality. Since the conjectures in [21] were largely motivated by these connections, we now briefly explain them.
The Brunn-Minkowski inequality (or strictly speaking, the Brunn-Minkowski-Lyusternik inequality) states that for all compact sets in ,
[TABLE]
It is, of course, a cornerstone of Convex Geometry, and has beautiful relations to many areas of Mathematics (see, e.g., [38, 72]). The case of Conjecture 1.2 is exactly the Brunn-Minkowski inequality (3). Whereas Conjecture 1.2 yields the monotonicity described in Conjecture 1.1, the Brunn-Minkowski inequality only allows one to deduce that the subsequence is non-decreasing (one may also deduce this fact from the trivial inclusion ).
The entropy power inequality states that for all independent random vectors in ,
[TABLE]
where
[TABLE]
denotes the entropy power of . Let us recall that the entropy of a random vector with density function (with respect to Lebesgue measure ) is if the integral exists and otherwise (see, e.g., [29]). As a consequence, one may deduce that for independent and identically distributed random vectors , , the sequence
[TABLE]
is non-decreasing. S. Artstein, Ball, Barthe and Naor [4] generalized the entropy power inequality (4) by proving that for any independent random vectors ,
[TABLE]
In particular, if all in the above inequality are identically distributed, then one may deduce that the sequence
[TABLE]
is non-decreasing. This fact is usually referred to as “the monotonicity of entropy in the Central Limit Theorem”, since the sequence of entropies of these normalized sums converges to that of a Gaussian distribution as shown earlier by Barron [13]. Later, simpler proofs of the inequality (5) were given by [49, 86]; more general inequalities were developed in [50, 75, 51].
There is a formal resemblance between inequalities (4) and (3) that was noticed in a pioneering work of Costa and Cover [28] and later explained by Dembo, Cover and Thomas [30] (see also [82, 87] for other aspects of this connection). In the last decade, several further developments have been made that link Information Theory to the Brunn-Minkowski theory, including entropy analogues of the Blaschke-Santaló inequality [48], the reverse Brunn-Minkowski inequality [19, 20], the Rogers-Shephard inequality [22, 53] and the Busemann inequality [10]. Indeed, volume inequalities and entropy inequalities (and also certain small ball inequalities [56]) can be unified using the framework of Rényi entropies; this framework and the relevant literature is surveyed in [55]. On the other hand, natural analogues in the Brunn-Minkowski theory of Fisher information inequalities hold sometimes but not always [35, 7, 37]. In particular, it is now well understood that the functional in the geometry of compact subsets of , and the functional in probability are analogous to each other in many (but not all) ways. Thus, for example, the monotonicity property desired in Conjecture 1.1 is in a sense analogous to the monotonicity property in the Central Limit Theorem implied by inequality (5), and Conjecture 1.2 from [21] generalizes the Brunn-Minkowski inequality (3) exactly as inequality (5) generalizes the entropy power inequality (4).
The starting point of this work was the observation that although Conjecture 1.2 holds for certain special classes of sets (namely, one dimensional compact sets, convex sets and their Cartesian product, as shown in subsection 3.1), both Conjecture 1.1 and Conjecture 1.2 fail to hold in general even for moderately high dimension (Theorem 3.4 constructs a counterexample in dimension 12). These results, which consider the question of the monotonicity of are stated and proved in Section 3. We also discuss there the question of when one has convergence of to 0, and at what rate, drawing on the work of the [33] (which seems not to be well known in the contemporary literature on convexity).
Section 4 is devoted to developing some new volume inequalities for Minkowski sums. In particular, we observe in Theorem 4.1 that if the exponents of in Conjecture 1.2 are removed, then the modified inequality is true for general compact sets (though unfortunately one can no longer directly relate this to a law of large numbers for sets). Furthermore, in the case of convex sets, Theorem 4.5 proves an even stronger fact, namely that the volume of the Minkowski sum of convex sets is supermodular. Various other facts surrounding these observations are also discussed in Section 4.
Even though the conjecture about becoming progressively more convex in the sense of is false thanks to Theorem 3.4, one can ask the same question when we measure the extent of non-convexity using functionals other than . In Section 2, we survey the existing literature on measures of non-convexity of sets, also making some possibly new observations about these various measures and the relations between them. The functionals we consider include a non-convexity index introduced by Schneider [71], the notion of inner radius introduced by Starr [79] (and studied in an equivalent form as the effective standard deviation by Cassels [25], though the equivalence was only understood later by Wegmann [89]), and the Hausdorff distance to the convex hull, which we already introduced when describing the Shapley-Folkman-Starr theorem. We also consider the generalized Hausdorff distance corresponding to using a non-Euclidean norm whose unit ball is the convex body . The rest of the paper is devoted to the examination of whether becomes progressively more convex as increases, when measured through these other functionals.
In Section 5, we develop the main positive result of this paper, Theorem 5.3, which shows that is monotonically (strictly) decreasing in , unless is already convex. Various other properties of Schneider’s non-convexity index and its behavior for Minkowski sums are also established here, including the optimal convergence rate for . We remark that even the question of convergence of to 0 does not seem to have been explored in the literature.
Section 6 considers the behavior of (or equivalently ). For this sequence, we show that monotonicity holds in dimensions 1 and 2, and in general dimension, monotonicity holds eventually (in particular, once exceeds ). The convergence rate of to 0 was already established in Starr’s original paper [79]; we review the classical proof of Cassels [25] of this result.
Section 7 considers the question of monotonicity of , as well as its generalizations when we consider equipped with norms other than the Euclidean norm (indeed, following [12], we even consider so-called “nonsymmetric norms”). Again here, we show that monotonicity holds in dimensions 1 and 2, and in general dimension, monotonicity holds eventually (in particular, once exceeds ). In fact, more general inequalities are proved that hold for Minkowski sums of different sets. The convergence rate of to 0 was already established in Starr’s original paper [79]; we review both a classical proof, and also provide a new very simple proof of a rate result that is suboptimal in dimension for the Euclidean norm but sharp in both dimension and number of summands given that it holds for arbitrary norms. In 2004 Dyn and Farkhi [32] conjectured that We show that this conjecture is false in , .
In Section 8, we show that a number of results from combinatorial discrepancy theory can be seen as consequences of the convexifying effect of Minkowski summation. In particular, we obtain a new bound on the discrepancy for finite-dimensional Banach spaces in terms of the Banach-Mazur distance of the space from a Euclidean one.
Finally, in Section 9, we make various additional remarks, including on notions of non-convexity not considered in this paper.
Acknowledgments. Franck Barthe had independently observed that Conjecture 1.2 holds in dimension 1, using the same proof, by 2011. We are indebted to Fedor Nazarov for valuable discussions, in particular for the help in the construction of the counterexamples in Theorem 3.4 and Theorem 7.3. We would like to thank Victor Grinberg for many enlightening discussions on the connections with discrepancy theory, which were an enormous help with putting Section 8 together. We also thank Franck Barthe, Dario Cordero-Erausquin, Uri Grupel, Bo’az Klartag, Joseph Lehec, Paul-Marie Samson, Sreekar Vadlamani, and Murali Vemuri for interesting discussions. Some of the original results developed in this work were announced in [36]; we are grateful to Gilles Pisier for curating that announcement. Finally we are grateful to the anonymous referee for a careful reading of the paper and constructive comments.
2 Measures of non-convexity
2.1 Preliminaries and Definitions
Throughout this paper, we only deal with compact sets, since several of the measures of non-convexity we consider can have rather unpleasant behavior if we do not make this assumption.
The convex hull operation interacts nicely with Minkowski summation.
Lemma 2.1**.**
Let be nonempty subsets of . Then,
[TABLE]
Proof.
Let . Then , where , , , and , . Thus, . Hence . The other inclusion is clear.
Lemma 2.1 will be used throughout the paper without necessarily referring to it. A useful consequence of Lemma 2.1 is the following remark.
Remark 2.2**.**
If is convex then
[TABLE]
The Shapley-Folkman lemma, which is closely related to the classical Carathéodory theorem, is key to our development.
Lemma 2.3** (Shapley-Folkman).**
Let be nonempty subsets of , with . Let . Then there exists a set of cardinality at most such that
[TABLE]
Proof.
We present below a proof taken from Proposition 5.7.1 of [18]. Let . Then
[TABLE]
where , , and . Let us consider the following vectors of ,
[TABLE]
Notice that . Using Carathéodory’s theorem in the positive cone generated by in , one has
[TABLE]
for some nonnegative scalars where at most of them are non zero. This implies that and that , for all . Thus for each , there exists such that . But at most scalars are positive. Hence there are at most additional that are positive. One deduces that there are at least indices such that for some , and thus for . For these indices, one has . The other inclusion is clear.
The Shapley-Folkman lemma may alternatively be written as the statement that, for ,
[TABLE]
where denotes the cardinality of . When all the sets involved are identical, and , this reduces to the identity
[TABLE]
It should be noted that the Shapley-Folkman lemma is in the center of a rich vein of investigation in convex analysis and its applications. As explained by Z. Artstein [5], It may be seen as a discrete manifestation of a key lemma about extreme points that is related to a number of “bang-bang” type results. It also plays an important role in the theory of vector-valued measures; for example, it can be used as an ingredient in the proof of Lyapunov’s theorem on the range of vector measures (see [46], [31] and references therein).
For a compact set in , denote by
[TABLE]
the radius of the smallest ball containing . By Jung’s theorem [45], this parameter is close to the diameter, namely one has
[TABLE]
where is the Euclidean diameter of . We also denote by
[TABLE]
the inradius of , i.e. the radius of a largest Euclidean ball included in . There are several ways of measuring non-convexity of a set:
The Hausdorff distance from the convex hull is perhaps the most obvious measure to consider:
[TABLE]
A variant of this is to consider the Hausdorff distance when the ambient metric space is equipped with a norm different from the Euclidean norm. If is the closed unit ball of this norm (i.e., any symmetric333We always use “symmetric” to mean centrally symmetric, i.e., if and only if ., compact, convex set with nonempty interior), we define
[TABLE]
In fact, the quantity (8) makes sense for any compact convex set containing 0 in its interior – then it is sometimes called the Hausdorff distance with respect to a “nonsymmetric norm”. 2. 2.
Another natural measure of non-convexity is the “volume deficit”:
[TABLE]
Of course, this notion is interesting only when . There are many variants of this that one could consider, such as , or relative versions such as that are automatically bounded. 3. 3.
The “inner radius” of a compact set was defined by Starr [79] as follows:
[TABLE] 4. 4.
The “effective standard deviation” was defined by Cassels [25]. For a random vector in , let be the trace of its covariance matrix. Then the effective standard deviation of a compact set of is
[TABLE]
Let us notice the equivalent geometric definition of :
[TABLE] 5. 5.
In analogy with the effective standard deviation, we define the “effective absolute deviation” by
[TABLE] 6. 6.
Another non-convexity measure was defined by Cassels [25] as follows:
[TABLE]
where . 7. 7.
The “non-convexity index” was defined by Schneider [71] as follows:
[TABLE]
2.2 Basic properties of non-convexity measures
All of these functionals are 0 when is a convex set; this justifies calling them “measures of non-convexity”. In fact, we have the following stronger statement since we restrict our attention to compact sets.
Lemma 2.4**.**
Let be a compact set in . Then:
* if and only if is convex.* 2. 2.
* if and only if is convex.* 3. 3.
* if and only if is convex.* 4. 4.
* if and only if is convex.* 5. 5.
* if and only if is convex.* 6. 6.
* if and only if is convex.* 7. 7.
Under the additional assumption that has nonempty interior, if and only if is convex.
Proof.
Directly from the definition of we get that if is convex (just select ). Now assume that , then is a sequence of compact convex sets, converging in Hausdorff metric to , thus must be convex. Notice that this observation is due to Schneider [71].
The assertion about follows immediately from the definition and the limiting argument similar to the above one.
If is convex then, clearly , indeed we can always take with . Next, if , then using Theorem 2.15 below we have thus and therefore is convex.
The statements about , and can be deduced from the definitions, but they will also follow immediately from the Theorem 2.15 below.
Assume that is convex, then and . Next, assume that . Assume, towards a contradiction, that . Then there exists and such that . Since is convex and has nonempty interior, there exists a ball and one has
[TABLE]
which contradicts .
The following lemmata capture some basic properties of all these measures of non-convexity (note that we need not separately discuss , and henceforth owing to Theorem 2.15). The first lemma concerns the behavior of these functionals on scaling of the argument set.
Lemma 2.5**.**
Let be a compact subset of , , and .
. In fact, is affine-invariant. 2. 2.
. 3. 3.
. 4. 4.
. In fact, if , where is an invertible linear transformation and , then .
Proof.
To see that is affine-invariant, we first notice that . Moreover writing , where is an invertible linear transformation and , we get that
[TABLE]
which is convex if and only if is convex.
It is easy to see from the definitions that , and are translation-invariant, and that and are 1-homogeneous and is -homogeneous with respect to dilation.
The next lemma concerns the monotonicity of non-convexity measures with respect to the inclusion relation.
Lemma 2.6**.**
Let be compact sets in such that and . Then:
. 2. 2.
. 3. 3.
. 4. 4.
.
Proof.
For the first part, observe that if ,
[TABLE]
where in the last equation we used that is convex and Remark 2.2. Hence all relations in the above display must be equalities, and must be convex, which means .
For the second part, observe that
[TABLE]
For the third part, observe that
[TABLE]
Hence .
For the fourth part, observe that
[TABLE]
As a consequence of Lemma 2.6, we deduce that is monotone along the subsequence of powers of 2, when measured through all these measures of non-convexity.
Finally we discuss topological aspects of these non-convexity functionals, specifically, whether they have continuity properties with respect to the topology on the class of compact sets induced by Hausdorff distance.
Lemma 2.7**.**
Suppose , where all the sets involved are compact subsets of . Then:
, i.e., is continuous. 2. 2.
, i.e., is lower semicontinuous. 3. 3.
, i.e., is lower semicontinuous. 4. 4.
, i.e., is lower semicontinuous.
Proof.
Let us first observe that for any compact sets
[TABLE]
by applying the convex hull operation to the inclusions and , and invoking Lemma 2.1. Thus implies .
- Observe that by the triangle inequality for the Hausdorff metric, we have the inequality
[TABLE]
Using (9) one deduces that . Changing the role of and , we get
[TABLE]
This proves the continuity of .
- Recall that, with respect to the Hausdorff distance, the volume is upper semicontinuous on the class of compact sets (see, e.g., [73, Theorem 12.3.6]) and continuous on the class of compact convex sets (see, e.g., [72, Theorem 1.8.20]). Thus
[TABLE]
and
[TABLE]
so that subtracting the former from the latter yields the desired semicontinuity of .
- Observe that by definition,
[TABLE]
where . Note that from Theorem 2.10 below due to Schneider [71] one has , thus there exists a convergent subsequence and
[TABLE]
Thus , which is the desired semicontinuity of .
- Using we get that is bounded and thus is bounded and there is a convergent subsequence . Our goal is to show that . Let . Then there exits such that . From the definition of we get that there exists such that and . We can select a convergent subsequence , where is compact (see [72, Theorem 1.8.4]), then and and therefore . Thus .
We emphasize that the semicontinuity assertions in Lemma 2.7 are not continuity assertions for a reason and even adding the assumption of nestedness of the sets would not help.
Example 2.8**.**
Schneider [71] observed that is not continuous with respect to the Hausdorff distance, even if restricted to the compact sets with nonempty interior. His example consists of taking a triangle in the plane, and replacing one of its edges by the two segments which join the endpoints of the edge to an interior point (see Figure 1). More precisely, let , , and . Then . But one has since is convex. Moreover one can notice that . Indeed on one hand , which implies that , on the other hand for every the point , thus . Notice also that . Indeed hence and the opposite inequality is not difficult to see since the supremum in the definition of is attained at the point .
Example 2.9**.**
To see that there is no continuity for , consider a sequence of discrete nested sets converging in to , more precisely: .
2.3 Special properties of Schneider’s index
All these functionals other than can be unbounded. The boundedness of follows for the following nice inequality due to Schneider [71].
Theorem 2.10**.**
[71]** For any subset of ,
[TABLE]
Proof.
Applying the Shapley-Folkman lemma (Lemma 2.3) to , where is a fixed compact set, one deduces that . Thus .
Schneider [71] showed that if and only if consists of affinely independent points. Schneider also showed that if is unbounded or connected, one has the sharp bound .
Let us note some alternative representations of Schneider’s non-convexity index. First, we would like to remind the definition of the Minkowski functional of a compact convex set containing zero:
[TABLE]
with the usual convention that if . Note that and is a norm if is symmetric with non empty interior.
For any compact set , define
[TABLE]
and observe that
[TABLE]
Hence, we can express
[TABLE]
Rewriting this yet another way, we see that if , then for each , there exists and such that
[TABLE]
or equivalently, . In other words, where , which can be written as using the Minkowski functional. Thus
[TABLE]
This representation is nice since it allows for comparison with the representation of in the same form but with replaced by the Euclidean unit ball.
Remark 2.11**.**
Schneider [71] observed that there are many closed unbounded sets that satisfy , but are not convex. Examples he gave include the set of integers in , or a parabola in the plane. This makes it very clear that if we are to use as a measure of non-convexity, we should restrict attention to compact sets.
2.4 Unconditional relationships
It is natural to ask how these various measures of non-convexity are related. First we note that and are equivalent. To prove this we would like to present an elementary but useful observation:
Lemma 2.12**.**
Let be an arbitrary convex body containing [math] in its interior. Consider a convex body such that and . Then for any compact set ,
[TABLE]
and
[TABLE]
Proof.
Notice that
[TABLE]
Hence, . In addition, one has
[TABLE]
Hence, .
The next lemma follows immediately from Lemma 2.12:
Lemma 2.13**.**
Let be an arbitrary convex body containing 0 in its interior. For any compact set , one has
[TABLE]
where are such that .
It is also interesting to note a special property of :
Lemma 2.14**.**
Let be a compact set in . If , then
[TABLE]
If , then
[TABLE]
Proof.
If , then . But,
[TABLE]
where we used the fact that by definition of , is convex. Hence, .
If , in addition to the above argument, we also have
[TABLE]
Hence, .
Note that the inequality in the above lemma cannot be reversed even with the cost of an additional multiplicative constant. Indeed, take the sets from Example 2.8, then but tends to [math].
Observe that and have some similarity in definition. Let us introduce the point-wise definitions of above notions: Consider , define
- •
More generally, if is a compact convex set in containing the origin,
- •
- •
- •
- •
w_{A}(x)=\inf\bigg{\{}\sum p_{i}|a_{i}-x|:\,x=\sum p_{i}a_{i};p_{i}>0;\sum p_{i}=1,a_{i}\in A\bigg{\}}.
- •
where
[TABLE]
Below we present a Theorem due to Wegmann [89] which proves that and are equal for compact sets and that they are equal also to under an additional assumption. For the sake of completeness we will present the proof of Wegmann [89] which is simplified here for the case of compact sets.
Theorem 2.15** (Wegmann [89]).**
Let be a compact set in , then
[TABLE]
Moreover if , for some in the relative interior of , then .
Proof.
- First observe that by easy arguments; in fact, this relation holds point-wise, i.e. .
Indeed the first inequality follows directly from the definitions, because .
To prove the second inequality consider any convex decomposition of , i.e. , with . Without loss of generality we may assume that for all . Then
[TABLE]
because (indeed, ).
The third inequality immediately follows from the Cauchy-Schwarz inequality.
To prove the fourth inequality let be such that . Let be such that and . Let be the center of the smallest Euclidean ball containing . Notice that the minimum of is reached for , thus
[TABLE]
and we take infimum over all to finish the proof of the inequality.
- Consider . To prove the theorem we will first show that . After this we will show that and finally we will prove if is in the relative interior of and maximizes , among then .
2.1) Let us prove that . Assume first that is an interior point of . Let us define the compact convex set by
[TABLE]
Next we define the function by , note that
[TABLE]
Note that is a boundary point of hence there exists a support hyperplane of at . Since is an interior point of , the hyperplane cannot be vertical because a vertical support plane would separate from boundary points of and thus separate from boundary points of . Thus there exist and such that . Since one has
[TABLE]
and
[TABLE]
By definition of , there exists and , such that and
[TABLE]
From the convexity of we get that , for any ; indeed we note that
[TABLE]
Thus for all . Let and . Note that for any we have
[TABLE]
thus . Define
[TABLE]
Notice that for any
[TABLE]
with equality if , in particular, , for every . Consider the point such that
[TABLE]
Then one has , which means is the projection of the point on the convex set . This implies that, for every , one has , thus
[TABLE]
We get and
[TABLE]
Using that and , we conclude from the definition of that
[TABLE]
If is a boundary point of , then using the boundary structure of the polytope (see [72, Theorem 2.1.2, p. 75 and Remark 3, p. 78]) belongs to the relative interior of an exposed face of . By the definition of the notion of exposed face (see [72, p. 75]) we get that if for and with , then . Thus
[TABLE]
If then and thus all proposed inequalities are trivial, otherwise we can reproduce the above argument for instead of .
2.2) Now we will prove that . Consider and defined in (11) and (12). Using that , for every and (13), we get , for all . We will need to consider two cases
If , then from the above thus
[TABLE] 2. 2.
If , then there exists , thus . So, from (13) we have
[TABLE]
so it is enough to prove , where . Let be the face of containing in its relative interior. Thus we can use the approach from (14) and reproduce the above argument for instead of , in the end of which we will again get two cases (as above), in the first case we get . In the second case, there exists such that and we again reduce the dimension of the set under consideration. Repeating this argument we will arrive to the dimension in which the proof can be completed by verifying that (indeed, in this case , and , thus ) and thus .
2.3) Finally, assume , where is in the relative interior of . We may assume that is -dimensional (otherwise we would work in the affine subspace generated by ). Then using (13) we get that and for all , thus
[TABLE]
for all . So for all , this means that the minimal distance between and is reached at . Notice that if then must belong to , which contradicts our hypothesis. Thus and , and we can use (15) to conclude that .
Remark 2.16**.**
The method used in the proof of Theorem 2.15 is reminiscent of the classical approach to Voronoi diagrams and Delaunay triangulation (see, e.g., [62, section 5.7]). Moreover the point constructed above is exactly the center of the ball circumscribed to the simplex of the Delaunay triangulation to which the point belongs.
Next we present a different proof of from Theorem 2.15, which essentially uses Remark 2.16 and is more geometric. The proof will be deduced from the following proposition that better describes the geometric properties of the function .
Proposition 2.17**.**
Let be a compact set in and .
Then there exists an integer , affinely independent points and real numbers such that , and
[TABLE] 2. 2.
Let . Then there exists and , such that , for all and
[TABLE]
Moreover , for all . 3. 3.
For every there exists such that , and
[TABLE]
Proof.
1. Recall that
[TABLE]
Following the standard proof of Carathéodory’s theorem, we will show that for any decomposition of in the form , with being affinely dependent, the quantity is not minimal. Thus the infimum in the definition of may be reduced to affinely independent decompositions of , thus with points. Hence the infimum is taken on a compact set and is reached.
So let and assume that the sequence is affinely dependent then there exists a sequence of real numbers , not all zeros, such that and . We note that (by multiplying, if needed, all by ) we may also assume that
[TABLE]
And there is some such that . Consider such that
[TABLE]
Next, using that we get
[TABLE]
where for all and , so we reduce the number of elements in sequence . Thus, the only thing left is to show that
[TABLE]
Using that , the above is equivalent to , which is exactly (16). Therefore, we may assume that infimum in the definition of is is reached on affinely independent points and is actually a minimum. Hence, there exists an integer , affinely independent points and real numbers such that , and
2. One has , with and thus is in the relative interior of . Since are affinely independent, is a -dimensional simplex and there exists and , such that . Then , for all . Thus , for all . Hence
[TABLE]
Assume now that there is such that . Notice that we can select such that . Indeed, consider and note that the orthogonal projection of on is equal to and thus
[TABLE]
Thus, there exists , with such that , where for and . Moreover, since is in the relative interior of one has . Then
[TABLE]
which contradicts the minimality of the sequence .
3. Let , then there exists such that . Consider another sequence , with , and . Using the fact that we get, as in 2. that , for some . Note that , because is in the relative interior of . Thus
[TABLE]
The minimality of the sequence with respect to implies that for any other convex combination , , we get Thus
[TABLE]
Using that and the fact that we get
[TABLE]
which is exactly what we need to finish the proof. Indeed, again
[TABLE]
and thus and is a minimizing sequence for .
Now we are ready to use the above proposition to show that . For every let be the simplex obtained from Proposition 2.17 and let and denote the center and the radius of the circumscribed ball of . Then
[TABLE]
where denotes the projection of onto the convex set , i.e. the nearest point to from . For every , one has thus , hence
[TABLE]
Thus is contained in the ball of radius . Hence which finishes the proof of .
The above relationships (summarized in Table 1) are the only unconditional relationships that exist between these notions in general dimension. To see this, we list below some examples that show why no other relationships can hold in general.
Example 2.18**.**
By Lemma 2.5, we can scale a non convex set to get examples where is fixed but and converge to 0, for example, take ; or to get examples where goes to 0 but are fixed and diverges, for example take .
Example 2.19**.**
*An example where , but is bounded away from 0 is given by a right triangle from which a piece is shaved off leaving a protruding edge, see Figure 2. *
Example 2.20**.**
An example where but both and are bounded away from 0 is given by taking a 3-point set with 2 of the points getting arbitrarily closer but staying away from the third, see Figure 3.
Example 2.21**.**
An example where and but can be found in Figure 4.
2.5 Conditional relationships
There are some more relationships between different notions of non-convexity that emerge if we impose some natural conditions on the sequence of sets (such as ruling out escape to infinity, or vanishing to almost nothing).
A first observation of this type is that Hausdorff distance to convexity is dominated by Schneider’s index of non-convexity if is contained in a ball of known radius.
Lemma 2.22**.**
For any compact set ,
[TABLE]
Proof.
By translation invariance, we may assume that . Then , and it follows that
[TABLE]
Hence .
This bound is useful only if is smaller than , because we already know that .
In dimension 1, all of the non-convexity measures are tightly connected.
Lemma 2.23**.**
Let be a compact set in . Then
[TABLE]
Proof.
We already know that . Let us prove that . From the definition of and , we have
[TABLE]
Thus we only need to show that for every , there exists such that
[TABLE]
By compactness there exists , with achieving the infimum in the left hand side. Then we only need to choose in the right hand side to conclude that . In addition, we get thus .
Now we prove that . From Lemma 2.22, we have . Let us prove that . By an affine transform, we may reduce to the case where , thus and . Notice that and denote . By the definition of , one has . Thus using that and , we get
[TABLE]
we conclude that and thus .
Notice that the inequality on of Lemma 2.23 cannot be reversed as shown by Example 2.9. The next lemma provides a connection between and in .
Lemma 2.24**.**
For any compact set ,
[TABLE]
Proof.
Consider the point in that realizes the maximum in the definition of (it exists since is closed). Then, for every , one has . By definition,
[TABLE]
Hence,
[TABLE]
for some and . Since , one deduces that . Thus,
[TABLE]
But,
[TABLE]
It follows that
[TABLE]
As shown by Wegmann (cf. Theorem 2.15), if is closed then . We conclude that
[TABLE]
Our next result says that the only reason for which we can find examples where the volume deficit goes to 0, but the Hausdorff distance from convexity does not, is because we allow the sets either to shrink to something of zero volume, or run off to infinity.
Theorem 2.25**.**
Let be a compact set in with nonempty interior. Then
[TABLE]
Proof.
From the definition of there exists such that . Thus . Let us denote . From the definition of , there exists such that . Hence
[TABLE]
Let be the intersection point of the sphere centered at and the segment and let be the radius of the -dimensional sphere . Then and . Thus
[TABLE]
Observe that the first term on the right side in inequality (20) is just a dimension-dependent constant, while the second term depends only on the ratio of the radius of the smallest Euclidean ball containing to that of the largest Euclidean ball inside it.
The next lemma enables to compare the inradius, the outer radius and the volume of convex sets. Such estimates were studied in [24], [69] where, in some cases, optimal inequalities were proved in dimension 2 and 3.
Lemma 2.26**.**
Let be a convex body in . Then
[TABLE]
Proof.
From the definition of , there exists such that . Without loss of generality, we may assume that and that , which means that is the Euclidean ball of maximal radius inside . This implies that [math] must be in the convex hull of the contact points of and , because if it is not, then there exists an hyperplane separating [math] from these contact points and one may construct a larger Euclidean ball inside . Hence from Caratheodory, there exists and contact points so that and . Since , there exists such that . Thus for every , hence there exists such that . Hence
[TABLE]
Moreover thus
[TABLE]
Passing to volumes and using that , we get
[TABLE]
An immediate corollary of the above theorem and lemma is the following.
Corollary 2.27**.**
Let be a compact set in . Then
[TABLE]
where is an absolute constant depending on only. Thus for any sequence of compact sets in such that and , the convergence implies that .
From the preceding discussion, it is clear that is a much weaker statement than either or .
Example 2.28**.**
Consider a unit square with a set of points in the neighboring unit square, where the set of points becomes more dense as (see Figure 5). This example shows that the convergence in the Hausdorff sense is weaker than convergence in the volume deficit sense even when the volume of the sequence of sets is bounded away from 0.
The following example shows that convergence in does not imply convergence in nor :
Example 2.29**.**
Consider the set in the plane.
Note that the Example 2.29 also shows that convergence in does not imply convergence in nor . The following example shows that convergence in does not imply convergence in :
Example 2.30**.**
Consider the set in the plane, the union of the Euclidean ball and two points close to it and close to each other (see Figure 6). Then we have by applying the same argument as in Example 2.8 to the point . But for , we see that because of the roundness of the ball, one has , when grows.
3 The behavior of volume deficit
In this section we study the volume deficit. Recall its definition: for compact in ,
[TABLE]
3.1 Monotonicity of volume deficit in dimension one and for Cartesian products
In this section, we observe that Conjecture 1.2 holds in dimension one and also for products of one-dimensional compact sets. In fact, more generally, we prove that Conjecture 1.2 passes to Cartesian products.
Theorem 3.1**.**
Conjecture 1.2 holds in dimension one. In other words, if is an integer and are compact sets in , then
[TABLE]
Proof.
We adapt a proof of Gyarmati, Matolcsi and Ruzsa [42, Theorem 1.4] who established the same kind of inequality for finite subsets of the integers and cardinality instead of volume. The proof is based on set inclusions. Let . Set and for , let , ,
[TABLE]
, and . For all , one has
[TABLE]
Since , the above union is a disjoint union. Thus for
[TABLE]
Notice that and , thus adding the above inequalities we obtain
[TABLE]
We have thus established Conjecture 1.2 in dimension 1.
Remark 3.2**.**
As mentioned in the proof, Gyarmati, Matolcsi and Ruzsa [42] earlier obtained a discrete version of Theorem 3.1 for cardinalities of sums of subsets of the integers. There are also interesting upper bounds on cardinalities of sumsets in the discrete setting that have similar combinatorial structure, see, e.g., [42, 8, 54] and references therein. Furthermore, as discussed in the introduction for the continuous domain, there are also discrete entropy analogues of these cardinality inequalities, explored in depth in [68, 83, 52, 8, 54, 43, 88, 59, 58] and references therein. We do not discuss discrete analogues further in this paper.
Now we prove that Conjecture 1.2 passes to Cartesian products.
Theorem 3.3**.**
Let and be integers. Let . For and , let be some compact sets in . Assume that for any the compact sets satisfy Conjecture 1.2. For , let . Then Conjecture 1.2 holds for .
Proof.
Let and let then let us prove that
[TABLE]
For all , one has
[TABLE]
For , denote , and for , denote . Then, using Minkowski’s inequality for (see, for example, Theorem 10 in [44]), we deduce that
[TABLE]
Using that for any the compact sets satisfy Conjecture 1.2, we obtain
[TABLE]
Thus
[TABLE]
From Theorems 3.1 and 3.3, and the fact that Conjecture 1.2 holds for convex sets, we deduce that Conjecture 1.2 holds for Cartesian products of one-dimensional compact sets and convex sets.
3.2 A counterexample in dimension
In contrast to the positive results for compact product sets, both the conjectures of Bobkov, Madiman and Wang [21] fail in general for even moderately high dimension.
Theorem 3.4**.**
For every , there exists such that for every there is a compact set such that . Moreover, one may take
[TABLE]
In particular, one has , whence Conjectures 1.1 and Conjecture 1.2 are false in for .
Proof.
Let be fixed and let be defined as in the statement of Theorem 3.4 so that
[TABLE]
and , for a certain . Let be linear subspaces of of dimension orthogonal to each other such that . Set , where for every , is a convex body in . Notice that for every ,
[TABLE]
where we used the convexity of each to write the Minkowski sum of copies of as . Thus
[TABLE]
and
[TABLE]
The hypothesis on enables us to conclude that . Now for , we define . For every , one has , thus . Therefore , which establishes that gives a counterexample in .
The sequence is increasing and . Hence, Conjecture 1.1 is false for .
Remark 3.5**.**
It is instructive to visualize the counterexample for , which is done in Figure 7 by representing each of the two orthogonal copies of by a line.
It was shown by Bobkov, Madiman and Wang **[21]** that Conjecture 1.2 is true for convex sets. The constructed counterexample is a union of convex sets and is symmetric and star-shaped. 3. 3.
Notice that in the above example one has . By adding to a ball with sufficiently small radius, one obtains a counterexample satisfying and . 4. 4.
The counterexample also implies that Conjecture 1.1 in **[21]**, which suggests a fractional version of Young’s inequality for convolution with sharp constant, is false. It is still possible that it may be true for a restricted class of functions (like the log-concave functions). 5. 5.
Conjectures 1.2 and 1.1 are still open in dimension .
3.3 Convergence rates for
The asymptotic behavior of has been extensively studied by Emerson and Greenleaf [33]. In analyzing , the following lemma about convergence of to 0 in Hausdorff distance is useful.
Lemma 3.6**.**
If is a compact set in ,
[TABLE]
Proof.
Using invariance of (22) under the shifts of , we may assume that ,
[TABLE]
Using (see Theorem 5.5 in Section 5), as well as (see Theorem 2.10), we deduce that
[TABLE]
To conclude, we note that since , one has for every . Hence, . Finally, we obtain
[TABLE]
Note that Lemma 3.6 is similar but weaker than the Shapley-Folkman-Starr theorem discussed in the introduction, and which we will prove in Section 7.4. Lemma 3.6 was contained in [33], but with an extra factor of 2.
One clearly needs assumption beyond compactness to have asymptotic vanishing of . Indeed, a simple counterexample would be a finite set of points, for which always remains at and fails to converge to 0. Once such an assumption is made, however, one has the following result.
Theorem 3.7**.**
[33]** Let be a compact set in with nonempty interior. Then
[TABLE]
for some constant possibly depending on .
Proof.
By translation-invariance, we may assume that for some . Then , and by taking , we have
[TABLE]
Hence using (22) we get
[TABLE]
so that by taking the volume we have
[TABLE]
and
[TABLE]
4 Volume inequalities for Minkowski sums
4.1 A refined superadditivity of the volume for compact sets
In this section, we observe that if the exponents of in Conjecture 1.2 are removed, then the modified inequality is true (though unfortunately one can no longer directly relate this to a law of large numbers for sets).
Theorem 4.1**.**
Let , be integers and let be compact sets in . Then
[TABLE]
Proof.
We use arguments similar to the proof of Theorem 3.1. Indeed, let us define the sets and in the same way as in the proof of Theorem 3.1. Let be any fixed unit vector and let us define , , , and . Then, the same inclusions hold true and thus we obtain
[TABLE]
Applying Theorem 4.1 to yields the following positive result.
Corollary 4.2**.**
Let be a compact set in and be defined as in (1). Then
[TABLE]
In the following proposition, we improve Corollary 4.2 under additional assumptions on the set , for .
Proposition 4.3**.**
Let be a compact subset of and be defined as in (1). If there exists a hyperplane such that , where denotes the orthogonal projection of onto , then
[TABLE]
Proof.
By assumption, . Thus, for every , . Indeed, one has . Thus, . Hence,
[TABLE]
It follows by the Bonnesen inequality (concave Brunn-Minkowski inequality, see [23, 67]) that for every ,
[TABLE]
Remark 4.4**.**
By considering the set and the Dirac measure at , one has
[TABLE]
Hence Conjecture 1.1 does not hold in general for log-concave measures in dimension 1. 2. 2.
If is countable, then for every , , thus the sequence is constant and equal to [math]. 3. 3.
If there exists such that , then for every , . Indeed,
[TABLE]
It follows that . We conclude by induction. Thus, in this case, the sequence is stationary to , for . 4. 4.
It is natural to ask if the refined superadditivity of volume can be strengthened to fractional superadditivity as defined in Definition 4.11 below. While this appears to be a difficult question in general, it was shown recently in **[14]** that fractional superadditivity is true in the case of compact subsets of .
4.2 Supermodularity of volume for convex sets
If we restrict to convex sets, an even stronger inequality is true from which we can deduce Theorem 4.1 for convex sets.
Theorem 4.5**.**
Let . For compact convex subsets of , one has
[TABLE]
We first observe that Theorem 4.5 is actually equivalent to a formal strengthening of it, namely Theorem 4.7 below. Let us first recall the notion of a supermodular set function.
Definition 4.6**.**
A set function is supermodular if
[TABLE]
for all subsets of .
Theorem 4.7**.**
Let be compact convex subsets of , and define
[TABLE]
for each . Then is a supermodular set function.
Theorem 4.7 implies Theorem 4.5, namely
[TABLE]
for compact convex subsets of , since the latter is a special case of Theorem 4.7 when . To see the reverse, apply the inequality (28) to
[TABLE]
Our proof of Theorem 4.5 combines a property of determinants that seems to have been first explicitly observed by Ghassemi and Madiman [51] with a use of optimal transport inspired by Alesker, Dar and Milman [1]. Let us prepare the ground by stating these results.
Lemma 4.8**.**
[51]** Let and be positive-semidefinite matrices. Then
[TABLE]
We state the deep result of [1] directly for sets instead of for two sets as in [1] (the proof is essentially the same, with obvious modifications).
Theorem 4.9** (Alesker-Dar-Milman [1]).**
Let be open, convex sets with for each . Then there exist -diffeomorphisms preserving Lebesgue measure, such that
[TABLE]
for any .
*Proof of Theorem 4.5.*By adding a small multiple of the Euclidean ball and then using the continuity of as , we may assume that each of the satisfy . Then choose such that with , so that
[TABLE]
using Theorem 4.9. Applying a change of coordinates using the diffeomorphism ,
[TABLE]
where the inequality follows from Lemma 4.8, and the last equality is obtained by making multiple appropriate coordinate changes. Using Theorem 4.9 again,
[TABLE]
For the purposes of discussion below, it is useful to collect some well known facts from the theory of supermodular set functions. Observe that if is supermodular and , then considering disjoint and in (26) implies that is superadditive. In fact, a more general structural result is true. To describe it, we need some terminology.
Definition 4.10**.**
Given a collection of subsets of , a function , is called a fractional partition, if for each , we have .
The reason for the terminology is that this notion extends the familiar notions of a partition of sets (whose indicator function can be defined precisely as in Definition 4.10 but with range restriction to ) by allowing fractional values. An important example of a fractional partition of is the collection of all subsets of size , together with the coefficients .
Definition 4.11**.**
A function is fractionally superadditive if for any fractional partition ,
[TABLE]
The following theorem has a long history and is implicit in results from cooperative game theory in the 1960’s but to our knowledge, it was first explicitly stated by Moulin Ollagnier and Pinchon [65].
Theorem 4.12**.**
[65]** If is supermodular and , then is fractionally superadditive.
A survey of the history of Theorem 4.12, along with various strengthenings of it and their proofs, and discussion of several applications, can be found in [57]. If are compact convex sets and as defined in (27), then and Theorem 4.7 says that is supermodular, whence Theorem 4.12 immediately implies that is fractionally superadditive.
Corollary 4.13**.**
Let be compact convex subsets of and let be any fractional partition using a collection of subsets of . Then
[TABLE]
Corollary 4.13 implies that for each ,
[TABLE]
Let us discuss whether these inequalities contain anything novel. On the one hand, if we consider the case of inequality (29), the resulting inequality is not new and in fact implied by the Brunn-Minkowski inequality:
[TABLE]
On the other hand, applying the inequality (29) to yields precisely Theorem 4.1 for convex sets , i.e.,
[TABLE]
Let us compare this with what is obtainable from the refined Brunn-Minkowski inequality for convex sets proved in [21], which says that
[TABLE]
Denote the right hand sides of (30) and (31) by and . Also set
[TABLE]
and write , so that and . Here, for , . In other words,
[TABLE]
Let us consider for illustration. Then we have
[TABLE]
which ranges between and , since \|c\|_{2}/\|c\|_{1}\in[k^{-{\mathchoice{\textstyle\frac{1}{2}}{\frac{1}{2}}{\hbox{\tiny\frac{1}{2}}}{\hbox{\tiny\frac{1}{2}}}}},1]. In particular, neither bound is uniformly better; so the inequality (29) and Corollary 4.13 do indeed have some potentially useful content.
Motivated by the results of this section, it is natural to ask if the volume of Minkowski sums is supermodular even without the convexity assumption on the sets involved, as this would strengthen Theorem 4.1. In fact, this is not the case.
Proposition 4.14**.**
There exist compact sets such that
[TABLE]
Proof.
Consider and . Then,
[TABLE]
On the other hand, the desired inequality is true in dimension 1 if the set is convex. More generally, in dimension 1, one has the following result.
Proposition 4.15**.**
If are compact, then
[TABLE]
Proof.
Assume, as one typically does in the proof of the one-dimensional Brunn-Minkowski inequality, that . (We can do this without loss of generality since translation does not affect volumes.) This implies that , whence
[TABLE]
Hence
[TABLE]
We will show that , which together with the preceding inequality yields the desired conclusion .
To see that , consider . One may write , with , and . Since one has and one deduces that and thus . This completes the proof.
Remark 4.16**.**
One may wonder if Proposition 4.15 extends to higher dimension. More particularly, we do not know if the supermodularity inequality
[TABLE]
holds true in the case where is convex and and are any compact sets. 2. 2.
It is also natural to ask in view of the results of this section whether the fractional superadditivity (2) of for convex sets proved in **[21]** follows from a more general supermodularity property, i.e., whether
[TABLE]
for convex sets . It follows from results of **[51]** that such a result does not hold (their counterexample to the determinant version of (32) corresponds in our context to choosing ellipsoids in ). Another simple explicit counterexample is the following: Let , , and , with . Then,
[TABLE]
[TABLE]
Hence,
[TABLE]
For small enough, this yields a counterexample to (32). 3. 3.
It is shown in **[51]** that the entropy analogue of Theorem 4.5 does not hold, i.e., there exist independent real-valued random variables with log-concave distributions such that
[TABLE]
5 The behavior of Schneider’s non-convexity index
In this section we study Schneider’s non-convexity index. Recall its definition: for compact in ,
[TABLE]
5.1 The refined monotonicity of Schneider’s non-convexity index
In this section, our main result is that Schneider’s non-convexity index satisfies a strong kind of monotonicity in any dimension.
We state the main theorem of this section, and will subsequently deduce corollaries asserting monotonicity in the Shapley-Folkman-Starr theorem from it.
Theorem 5.1**.**
Let and let be subsets of . Then
[TABLE]
Proof.
Let us denote . Then
[TABLE]
Since the opposite inclusion is clear, we deduce that is convex, which means that
Notice that the same kind of proof also shows that if and are convex then is also convex. Moreover, Theorem 5.1 has an equivalent formulation for subsets of , say : if with , then
[TABLE]
To see this, apply Theorem 5.1 to
[TABLE]
From the inequality (33), the following corollary, expressed in a more symmetric fashion, immediately follows.
Corollary 5.2**.**
Let and be integers and let be sets in . Then
[TABLE]
The case of Corollary 5.2 follows directly from the definition of and was observed by Schneider in [71]. Applying Corollary 5.2 for , where is a fixed subset of , and using the scaling invariance of , one deduces that the sequence is non-increasing. In fact, for identical sets, we prove something even stronger in the following theorem.
Theorem 5.3**.**
Let be a subset of and be an integer. Then
[TABLE]
Proof.
Denote . Since , from the definition of , one knows that . Using that , one has
[TABLE]
Since the other inclusion is trivial, we deduce that is convex which proves that
[TABLE]
Remark 5.4**.**
We do not know if is fractionally subadditive; for example, we do not know if . We know it with a better constant if , as a consequence of Theorem 5.3. We also know it if we take a large enough number of sets; this is a consequence of the Shapley-Folkman lemma (Lemma 2.3). 2. 2.
The Schneider index (as well as any other measure of non-convexity) cannot be submodular. This is because, if we consider , , then but , hence
[TABLE]
5.2 Convergence rates for Schneider’s non-convexity index
We were unable to find any examination in the literature of rates, or indeed, even of sufficient conditions for convergence as measured by .
Let us discuss convergence in the Shapley-Folkman-Starr theorem using the Schneider non-convexity index. In dimension 1, we can get an bound on by using the close relation (18) between and in this case. In general dimension, the same bound also holds: by applying Theorem 5.3 inductively, we get the following theorem.
Theorem 5.5**.**
Let be a compact set in . Then
[TABLE]
In particular, as .
Let us observe that the rate of convergence cannot be improved, either for or for . To see this simply consider the case where . Then consists of the equispaced points , where , and for every .
6 The behavior of the effective standard deviation
In this section we study the effective standard deviation . Recall its definition: for compact in ,
[TABLE]
6.1 Subadditivity of
Cassels [25] showed that is subadditive.
Theorem 6.1** ([25]).**
Let be compact sets in . Then,
[TABLE]
Proof.
Recall that , where
[TABLE]
and Thus
[TABLE]
And one has
[TABLE]
For and one has
[TABLE]
and
[TABLE]
Thus
[TABLE]
Taking the supremum in and , we conclude.
Observe that we may interpret the proof probabilistically. Indeed, a key point in the proof is the identity (34), which is just the fact that the variance of a sum of independent random variables is the sum of the individual variances (written out explicitly for readability).
6.2 Strong fractional subadditivity for large
In this section, we prove that the effective standard deviation satisfies a strong fractional subadditivity when considering sufficient large numbers of sets.
Theorem 6.2**.**
Let be compact sets in , with . Then,
[TABLE]
Proof.
Let , where . By using the Shapley-Folkman lemma (Lemma 2.3), there exists a set of at most indexes such that
[TABLE]
Let . In particular, we have
[TABLE]
Hence, by definition of the convex hull,
[TABLE]
where , , and . Thus, by denoting , we have
[TABLE]
Taking supremum over all , we deduce that
[TABLE]
Since this is true for every , we deduce that
[TABLE]
Taking the supremum over all set of cardinality at most yields
[TABLE]
We conclude by taking the supremum over all .
An immediate consequence of Theorem 6.2 is that if , then
[TABLE]
By iterating this fact as many times as possible (i.e., as long as the number of sets is at least ), we obtain the following corollary.
Corollary 6.3**.**
Let be compact sets in , with . Then,
[TABLE]
In the case where , we can repeat the above argument with to prove that in this case,
[TABLE]
where is the Schneider non-convexity index of . Since , and when is connected, we deduce the following monotonicity property for the effective standard deviation.
Corollary 6.4**.**
In dimension 1 and 2, the sequence is non-increasing for every compact set . 2. 2.
In dimension 3, the sequence is non-increasing for every compact and connected set .
Remark 6.5**.**
It follows from the above study that if a compact set satisfies , then the sequence is non-increasing. One can see that if a compact set contains the boundary of its convex hull, then ; for such set , the sequence is non-increasing.
6.3 Convergence rates for
It is classical that one has convergence in at good rates.
Theorem 6.6** ([25]).**
Let be compact sets in . Then
[TABLE]
Proof.
Firstly, by using subadditivity of (Theorem 6.1), one has
[TABLE]
Hence, .
If , we can improve this bound using Corollary 6.3, which gives us
[TABLE]
again using subadditivity of for the second inequality.
By considering , one obtains the following convergence rate.
Corollary 6.7**.**
Let be a compact set in . Then,
[TABLE]
7 The behavior of the Hausdorff distance from the convex hull
In this section we study the Hausdorff distance from the convex hull. Recall its definition: for being a compact convex set containing 0 in its interior and compact in ,
[TABLE]
7.1 Some basic properties of the Hausdorff distance
The Hausdorff distance is subadditive.
Theorem 7.1**.**
Let be compact sets in , and be an arbitrary convex body containing 0 in its interior. Then
[TABLE]
Proof.
The convexity of implies that
[TABLE]
but since and by definition, we have
[TABLE]
We can provide a slight further strengthening of Theorem 7.1 when dealing with Minkowski sums of more than 2 sets, by following an argument similar to that used for Schneider’s non-convexity index.
Theorem 7.2**.**
Let be compact sets in , and be an arbitrary convex body containing 0 in its interior. Then
[TABLE]
Proof.
Notice that
[TABLE]
In particular, Theorem 7.2 implies that
[TABLE]
and, when the sets are the same,
[TABLE]
While not proving monotonicity of , the inequality (35) does provide a bound on extent of non-monotonicity in the sequence in general dimension.
7.2 The Dyn–Farkhi conjecture
Dyn and Farkhi [32] conjectured that
[TABLE]
The next theorem shows that the above conjecture is false in for .
Theorem 7.3**.**
Let . The inequality
[TABLE]
holds for all compact sets if and only if .
Proof.
We have already seen that the inequality holds for and thus the inequality holds when . Let be such that the inequality holds for all compact sets and . Let , where and are intervals such that and , and is a large number to be selected. Let , where and are intervals such that and . Note that belongs to both and . It is easy to see, using two dimensional considerations that
[TABLE]
Next we notice that
[TABLE]
Thus, the points in can be parametrized by
[TABLE]
where . We note that and
[TABLE]
Note that if , this tends to . So assuming that the inequality
[TABLE]
holds implies that thus .
Remark 7.4**.**
Note that the above example is also valid if we consider , metric instead of the metric. Indeed and we may compute distance from to as
[TABLE]
If , then to minimize the above, we must, again, select to be close to zero, and thus the distance is at least . This shows that if the inequality
[TABLE]
holds for all , then . 2. 2.
As shown by Wegmann **[89]**, if the set is such that the supremum in the definition of is achieved at a point in the relative interior of , then . Thus Theorem 6.1 implies the following statement: If are compact sets in such that the supremum in the definition of is achieved at a point in the relative interior of , and likewise for , then
[TABLE] 3. 3.
We emphasize that the conjecture is still open in the case . In this case, the Dyn-Farkhi conjecture is equivalent to
[TABLE]
If is the best constant such that for all compact sets in dimension , then one has
[TABLE]
for . This can be seen from the example where is a set of vertices of a regular simplex in , . For this example, it is not difficult to see that , where is the center of mass of and . Then, one easily concludes that
[TABLE]
Thus we get , while the Dyn-Farkhi conjecture amounts to . 4. 4.
Notice that there is another interpretation of as the largest empty circle of , i.e., the radius of the circle of largest radius, centered at a point in and containing no point of in its interior (see **[74]**, where the relevance of this notion for planning new store locations and toxic waste dump locations is explained). Indeed this radius is equal to
[TABLE]
7.3 Strong fractional subadditivity for large
In this section, similarly as for the effective standard deviation , we prove that the Hausdorff distance from the convex hull satisfies a strong fractional subadditivity when considering sufficient large numbers of sets.
Theorem 7.5**.**
Let be an arbitrary convex body containing 0 in its interior. Let be compact sets in , with . Then,
[TABLE]
Proof.
Let . By using the Shapley-Folkman lemma (Lemma 2.3), there exists a set of cardinality at most such that
[TABLE]
Let . In particular, we have
[TABLE]
Thus,
[TABLE]
for some , , and some , where . Hence,
[TABLE]
Taking supremum over all , we deduce that
[TABLE]
Since this is true for every , we deduce that
[TABLE]
Taking the supremum over all set of cardinality at most yields
[TABLE]
We conclude by taking the supremum over all .
In the case where , we can use the above argument to prove that for ,
[TABLE]
where is the Schneider non-convexity index of . Since , and when is connected, we deduce the following monotonicity property for the Hausdorff distance to the convex hull.
Corollary 7.6**.**
Let be an arbitrary convex body containing 0 in its interior. Then,
In dimension 1 and 2, the sequence is non-increasing for every compact set . 2. 2.
In dimension 3, the sequence is non-increasing for every compact and connected set .
Remark 7.7**.**
It follows from the above study that if a compact set satisfies , then the sequence is non-increasing. One can see that if a compact set contains the boundary of its convex hull, then ; for such set , the sequence is non-increasing.
It is useful to also record a simplified version of Theorem 7.5.
Corollary 7.8**.**
Let be an arbitrary convex body containing 0 in its interior. Let be compact sets in , with . Then,
[TABLE]
Proof.
By Theorem 7.5, provided , we have in particular
[TABLE]
Iterating the same argument as long as possible, we have that
[TABLE]
which is the first desired inequality. Applying the subadditivity property of (namely, Theorem 7.1), we immediately have the second desired inequality.
While Corollary 7.8 does not seem to have been explicitly written down before, it seems to have been first discovered by V. Grinberg (personal communication).
7.4 Convergence rates for
Let us first note that having proved convergence rates for , we automatically inherit convergence rates for as a consequence of Lemma 2.13, Theorem 2.15 and Corollary 6.7.
Corollary 7.9**.**
Let be an arbitrary convex body containing 0 in its interior. For any compact set ,
[TABLE]
where is such that .
For Euclidean norm (i.e., ), this goes back to [79, 25].
Although we have a strong convergence result for as a consequence of that for , we give below another estimate of in terms of , instead of .
Theorem 7.10**.**
For any compact set ,
[TABLE]
Proof.
As a consequence of Theorem 7.1, we always have . Now consider , and notice that
[TABLE]
Hence , or equivalently, .
Using the fact that for every compact set , we deduce that
[TABLE]
8 Connections to discrepancy theory
The ideas in this section have close connections to the area known sometimes as “discrepancy theory”, which has arisen independently in the theory of Banach spaces, combinatorics, and computer science. It should be emphasized that there are two distinct but related areas that go by the name of discrepancy theory. The first, discussed in this section and sometimes called “combinatorial discrepancy theory” for clarity, was likely originally motivated by questions related to absolute versus unconditional versus conditional convergence for series in Banach spaces. The second, sometimes called “geometric discrepancy theory” for clarity, is related to how well a finite set of points can approximate a uniform distribution on (say) a cube in . Our discussion here concerns the former; the interested reader may consult [85] for more on the latter. When looked at deeper, however, combinatorial discrepancy theory is also related to the ability to discretely approximate “continuous” objects. For example, a famous result of Spencer [77] says that given any collection of subsets of , it is possible to color the elements of with two colors (say, red and blue) such that
[TABLE]
for each , where is the set of red elements. As explained for example by Srivastava [78]
In other words, it is possible to partition into two subsets so that this partition is very close to balanced on each one of the test sets . Note that a “continuous” partition which splits each element exactly in half will be exactly balanced on each ; the content of Spencer’s theorem is that we can get very close to this ideal situation with an actual, discrete partition which respects the wholeness of each element.
Indeed, Srivastava also explains how the recent celebrated results of Marcus, Spielman and Srivastava [60, 61] that resulted in the solution of the Kadison-Singer conjecture may be seen from a discrepancy point of view.
For any -dimensional Banach space with norm , define the functional
[TABLE]
In other words, answers the question: for any choice of unit vectors in , how small are we guaranteed to be able to make the signed sum of the unit vectors by appropriately choosing signs? The question of what can be said about the numbers was first asked444See [47, p. 496] where this question is stated as one in a collection of then-unsolved problems. by A. Dvoretzky in 1963. Let us note that the same definition also makes sense when is a nonsymmetric norm (i.e., satisfies for , positive-definiteness and the triangle inequality), and we will discuss it in this more general setting.
It is a central result of discrepancy theory [41, 12] that when has dimension , it always holds555The fact that appears to be folklore and the first explicit mention of it we could find is in [41]. that . To make the connection to our results, we observe that this fact actually follows from Corollary 7.8.
Theorem 8.1**.**
Suppose , where is a convex body in containing 0 in its interior (i.e., the unit ball of a non-symmetric norm ), and suppose and for each . Then there exist vectors () such that
[TABLE]
In particular, if is symmetric, then by choosing , with , one immediately has for .
Proof.
We simply observe that since , there exists a point such that
[TABLE]
where the last inequality follows from Corollary 7.8. Moreover, using that for each , and is convex, we get . Thus by Lemmata 2.12 and 2.14, , where the last inequality uses Theorem 2.10 and the assumption that .
Remark 8.2**.**
Bárány and Grinberg [12] proved Theorem 8.1 without the condition . They also proved it for symmetric bodies under the weaker condition that ; we will recover this fact for symmetric bodies, without restriction on the dimension, as a consequence of Theorem 8.6 below.
Remark 8.3**.**
As pointed out in [12], Theorem 8.1 is sharp. By taking and to be the -th standard basis vector of , we see that for any choice of signs, \big{\|}\sum_{i\in[n]}\varepsilon_{i}x_{i}\big{\|}=n, which implies that .
Remark 8.4**.**
*It is natural to think that the sequence may be monotone with respect to . Unfortunately, this is not true. Swanepoel [81] showed that for every odd and every -dimensional Banach space . Consequently, we have and , whereas we know from Remark 8.3 that . *
Not surprisingly, for special norms, better bounds can be obtained. In particular (see, e.g., [2, Theorem 2.4.1] or [15, Lemma 2.2]), . We will present a proof of this and more general facts in Theorem 8.6. But first let us discuss a quite useful observation about the quantity : it is an isometric invariant, i.e., invariant under nonsingular linear transformations of the unit ball. A way to measure the extent of isometry is using the Banach-Mazur distance : Let , be two -dimensional normed spaces. The Banach-Mazur distance between them is defined as
[TABLE]
Thus and if and only if and are isometric. We also remind that the above notion have a geometrical interpretation. Indeed if we denote by a unit ball of Banach space , then is a minimal positive number such that there exists a linear transformation with:
[TABLE]
Lemma 8.5**.**
If , then
[TABLE]
Proof.
Consider an invertible linear transformation such that and thus , then
[TABLE]
Now we would like to use the ideas of the proof of Theorem 8.1 together with Lemma 8.5 to prove the following statement that will help us to provide sharper bounds for for intermediate norms.
Theorem 8.6**.**
Suppose , where is a symmetric convex body in (i.e., the unit ball of a norm ), and suppose . Then there exist vectors () such that
[TABLE]
where . In particular, by choosing , with , one immediately has
[TABLE]
Proof.
Let , then we may assume, using Lemma 8.5, that . Next, as in the proof of Theorem 8.1 we observe that since , there exists a point such that
[TABLE]
where the last inequality follows from Corollary 7.8. Next, we apply Lemma 2.12 together with to get
[TABLE]
Now we can apply Theorems 2.15 and 6.1 to get
[TABLE]
where the last inequality follows from the fact that is bounded by since .
We note that it follows from F. John Theorem (see, e.g., [64, page 10]) that for any -dimensional Banach space . Thus we have the following corollary, which recovers a result of [12].
Corollary 8.7**.**
Suppose , where is a convex symmetric body in , and suppose . Then there exist vectors () such that
[TABLE]
In particular, by choosing , with , one immediately has
[TABLE]
where .
It is well known that for (see, e.g., [64, page 20]). Thus Theorem 8.6 gives:
Corollary 8.8**.**
For any and any ,
[TABLE]
In particular, we recover the classical fact that , which can be found, e.g., in [2, Theorem 2.4.1]. V. Grinberg (personal communication) informed us of the following elegant and sharp bound generalizing this fact that he obtained in unpublished work: if are subsets of and , then
[TABLE]
The special case of this when each has cardinality 2 is due to Beck [15]. Let us note that the inequality (37) improves upon the bound of that is obtained in the Shapley-Folkman theorem by combining Theorems 2.15 and 6.6.
Finally let us note that the fact that the quantities are for general norms and for Euclidean norm is consistent with the observations in Section 7.4 that the rate of convergence of for a compact set is for general norms and for Euclidean norm (i.e., ).
We do not comment further on the relationship of our study with discrepancy theory, which contains many interesting results and questions when one uses different norms to pick the original unit vectors, and to measure the length of the signed sum (see, e.g., [16, 39, 66]). The interested reader may consult the books [26, 63, 27] for more in this direction, including discussion of algorithmic issues and applications to theoretical computer science. There are also connections to the Steinitz lemma [11], which was originally discovered in the course of extending the Riemann series theorem (on the real line being the set of possible limits by rearrangements of a conditionally convergent sequence of real numbers) to sequences of vectors (where it is called the Lévy-Steinitz theorem, and now understood in quite general settings, see, e.g., [76]).
9 Discussion
Finally we mention some notions of non-convexity that we do not take up in this paper:
Inverse reach: The notion of reach was defined by Federer [34], and plays a role in geometric measure theory. For a set in , the reach of is defined as
[TABLE]
A key property of reach is that if and only if is convex; consequently one may think of
[TABLE]
as a measure of non-convexity. Thäle [84] presents a comprehensive survey of the study of sets with positive reach (however, one should take into account the cautionary note in the review of this article on MathSciNet). 2. 2.
Beer’s index of convexity: First defined and studied by Beer [17], this quantity is defined for a compact set in as the probability that 2 points drawn uniformly from at random “see” each other (i.e., the probability that the line segment connecting them is in ). Clearly this probability is 1 for convex sets, and 0 for finite sets consisting of more than 1 point. Since our study has been framed in terms of measures of non-convexity, it is more natural to consider
[TABLE]
where are i.i.d. from the uniform measure on , and denotes the line segment connecting and . 3. 3.
Convexity ratio: The convexity ratio of a set in is defined as the ratio of the volume of a largest convex subset of to the volume of ; it is clearly 1 for convex sets and can be arbitrarily close to 0 otherwise. For dimension 2, this has been studied, for example, by Goodman [40]. Balko et al. [9] discuss this notion in general dimension, and also give some inequalities relating the convexity ratio and Beer’s index of convexity. Once again, to get a measure of non-convexity, it is more natural to consider
[TABLE]
where denotes a largest convex subset of .
These notions of non-convexity are certainly very interesting, but they behave quite differently from the notions we have explored thus far. For example, if or , the compact set may not be convex, but differ from a convex set by a set of measure zero. For example, if is the union of a unit Euclidean ball and a point separated from it, then
[TABLE]
even though is compact but non-convex. Even restricting to compact connected sets does not help– just connect the disc with a point by a segment, and we retain (38) though remains non-convex.
It is possible that further restricting to connected open sets is the right thing to do here– this may yield a characterization of convex sets using and , but it still is not enough to ensure stability of such a characterization. For example, small would not imply that is close to its convex hull even for this restricted class of sets, because we can take the previous example of a point connected to a disc by a segment and just slightly fatten the segment.
Generalizing this example leads to a curious phenomenon. Consider , where are points in well separated from each other and the origin. Then , but we can send and arbitrarily close to 1 by making go to infinity (since isolated points are never seen for but become very important for the sumset). This is remarkably bad behavior indeed, since it indicates an extreme violation of the monotone decreasing property of or that one might wish to explore, already in dimension 2.
Based on the above discussion, it is clear that the measures of non-convexity are more sensitive to the topology of the set than the functionals we considered in most of this paper. Thus it is natural that the behavior of these additional measures for Minkowski sums should be studied with a different global assumption than in this paper (which has focused on what can be said for compact sets). We hope to investigate this question in future work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Alesker, S. Dar, and V. Milman. A remarkable measure preserving diffeomorphism between two convex bodies in 𝐑 n superscript 𝐑 𝑛 {\bf R}^{n} . Geom. Dedicata , 74(2):201–212, 1999.
- 2[2] N. Alon and J. H. Spencer. The probabilistic method . Wiley-Interscience Series in Discrete Mathematics and Optimization. Wiley-Interscience [John Wiley & Sons], New York, second edition, 2000. With an appendix on the life and work of Paul Erd H os.
- 3[3] R. M. Anderson. An elementary core equivalence theorem. Econometrica , 46(6):1483–1487, 1978.
- 4[4] S. Artstein, K. M. Ball, F. Barthe, and A. Naor. Solution of Shannon’s problem on the monotonicity of entropy. J. Amer. Math. Soc. , 17(4):975–982 (electronic), 2004.
- 5[5] Z. Artstein. Discrete and continuous bang-bang and facial spaces or: look for the extreme points. SIAM Rev. , 22(2):172–185, 1980.
- 6[6] Z. Artstein and R. A. Vitale. A strong law of large numbers for random compact sets. Ann. Probab. , 3(5):879–882, 1975.
- 7[7] S. Artstein-Avidan, D. Florentin, and Y. Ostrover. Remarks about mixed discriminants and volumes. Commun. Contemp. Math. , 16(2):1350031, 14, 2014.
- 8[8] P. Balister and B. Bollobás. Projections, entropy, and sumsets. Combinatorica , 32(2):125–141, 2012.
