The nonlocal isoperimetric problem for polygons: Hardy-Littlewood and Riesz inequalities
Beniamin Bogosel, Dorin Bucur, Ilaria Fragal\`a

TL;DR
This paper explores nonlocal isoperimetric problems for polygons, establishing optimality of regular polygons for Hardy-Littlewood inequalities and revealing complex behaviors for Riesz inequalities depending on the number of sides and kernel choice.
Contribution
It extends classical inequalities to nonlocal polygonal domains, proving regular polygons are optimal for Hardy-Littlewood and analyzing symmetry breaking in Riesz inequalities.
Findings
Regular N-gons are optimal for Hardy-Littlewood inequalities.
Optimality of regular polygons for Riesz inequalities depends on N and kernel parameters.
Symmetry breaking can occur for N ≥ 5 in Riesz inequalities.
Abstract
Given a non-increasing and radially symmetric kernel in , we investigate counterparts of the classical Hardy-Littlewood and Riesz inequalities when the class of admissible domains is the family of polygons with given area and sides. The latter corresponds to study the polygonal isoperimetric problem in nonlocal version. We prove that, for every , the regular -gon is optimal for Hardy-Littlewood inequality. Things go differently for Riesz inequality: while for and it is known that the regular triangle and the square are optimal, for we prove that symmetry or symmetry breaking may occur (i.e. the regular -gon may be optimal or not), depending on the value of and on the choice of the kernel.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNonlinear Partial Differential Equations · Spectral Theory in Mathematical Physics · Differential Equations and Boundary Problems
\stackMath
The nonlocal isoperimetric problem for polygons:
Hardy-Littlewood and Riesz inequalities
Beniamin Bogosel, Dorin Bucur, Ilaria Fragalà
Centre de Mathématiques Appliquées, CNRS, École polytechnique, Institut Polytechnique de Paris, 91120 Palaiseau (France)
Université Savoie Mont Blanc, CNRS UMR 5127
Laboratoire de Mathématiques, Campus Scientifique
73376 Le-Bourget-Du-Lac (France)
Dipartimento di Matematica
Politecnico di Milano
Piazza Leonardo da Vinci, 32
20133 Milano (Italy)
Abstract.
Given a non-increasing and radially symmetric kernel in , we investigate counterparts of the classical Hardy-Littlewood and Riesz inequalities when the class of admissible domains is the family of polygons with given area and sides. The latter corresponds to study the polygonal isoperimetric problem in nonlocal version. We prove that, for every , the regular -gon is optimal for Hardy-Littlewood inequality. Things go differently for Riesz inequality: while for and it is known that the regular triangle and the square are optimal, for we prove that symmetry or symmetry breaking may occur (i.e. the regular -gon may be optimal or not), depending on the value of and on the choice of the kernel.
Key words and phrases:
Polygons, nonlocal functionals, isoperimetric problem, Riesz rearrangement inequality.
2010 Mathematics Subject Classification:
52B60, 28A75, 49Q10, 49K21
1. Introduction
Given a non-negative and non-increasing radially symmetric function in , called in the sequel an admissible kernel, for any measurable set let
[TABLE]
The classical Riesz rearrangement inequality [42] states in particular that, denoting by the ball with the same volume as , it holds . It is a natural question to ask whether symmetry is preserved when passing to the polygonal setting (in dimension ): denoting by the class of polygons with sides and area for definiteness equal to , and by the regular gon in , this amounts to ask whether
[TABLE]
As a general fact, transposing isoperimetric-type inequalities with balls as optimal domains into the setting of polygons with a fixed number of sides is indeed a very natural problem. It has been investigated both in the field of geometric measure theory and in the field of mathematical physics, with drastically different levels of difficulties.
For the isoperimetric inequality in geometric measure theory solved by De Giorgi in [20], i.e. the minimization of perimeter under volume constraint, the polygonal version
[TABLE]
is an elementary result, which can be found in several textbooks in convex geometry.
On the other hand, for isoperimetric-type inequalities in mathematical physics, such as Saint-Venant of Faber-Krahn inequalities (for which we refer to the classical monograph [39] and to the recent book [30]), the polygonal versions
[TABLE]
are conjectures formulated several decades ago by Pólya-Szegö, who also proved them for and . Here is the principal eigenvalue of the Dirichlet Laplacian in , while is the torsional rigidity of (namely the -norm of the unique solution to the equation in ). The inequalities analogue to (4) have been proved for every for the logarithmic capacity [44] and for the Cheeger constant [10] (for related results, see also [5, 11, 32]). At present, (4) are open for any , and they can be included among the major open problems in shape optimization. Their validity is also related to a conjecture by Caffarelli and Lin [14], about the asymptotical optimality of the hexagonal honeycomb for related optimal partition problems, see [13].
To some extent, Riesz inequality can be viewed as a kind of “bridge” between the classical isoperimetric inequality (3) and the physical inequalities (4), as it is intimately connected to each of them. It is useful to briefly explain these connections, before introducing our results.
The relation between Riesz inequality and the classical isoperimetric inequality is easily individuated. Indeed, the functional in (1) differs just by a change of sign and a translation from the nonlocal -perimeter, defined by
[TABLE]
where the quantity is interpreted as an interaction density between two points and . A suitable scaling of in (5) allows to recover the usual notion of perimeter via an asymptotic formula. The concept of nonlocal perimeter has been first introduced in [7], and in recent times it has been widely developed, especially concerning the fractional kernel , , and concerning bounded integrable kernels (see respectively the seminal papers [15, 16] and the recent monograph [36]). In particular, the optimality of balls in the nonlocal isoperimetric inequality has been proved in the fractional case [26, 25, 22, 35], for kernels which are not radially symmetric and decreasing [19], and for some Minkowski type nonlocal perimeters [18]. In this perspective, (2) is equivalent to
[TABLE]
To sketch the relation between the polygonal Riesz inequality and the physical isoperimetric inequalities (4), let us focus for simplicity on the Saint-Venant inequality. It is known that the torsional rigidity satisfies (see [17, Section 4])
[TABLE]
where denotes the heat kernel
[TABLE]
A probabilistic reformulation in the same vein can be given also for the principal frequency, see again [17, Section 4]. Then the classical Saint-Venant and Faber-Krahn inequalities, with the ball as optimal domain, can be obtained as a consequence of the general rearrangement inequality for multiple integrals due to Brascamp-Lieb-Luttinger [8, Theorem 1.2], which in -dimensions reads
[TABLE]
here are measurable non-negative functions on vanishing at infinity, are their symmetric decreasing rearrangements, are real numbers, is a set of finite Lebesgue measure in , and is the ball with the same area as .
Hence, a possible approach to Pólya-Sezgö conjectures (4) would be to prove a polygonal version of the result by Brascamp-Lieb-Luttinger, stating that the inequality (8) remains true when the integrals at the left hand side are extended to a polygon , and the integrals at the right hand side are extended to . Of course one has to start from small values of . For (and a suitable choice of and ), this corresponds exactly to study the Riesz inequality (10). But the problem turns out to be nontrival even for . Indeed in this case it amounts investigate the validity of the following polygonal version of the classical Hardy-Littlewood inequality (see [28, Chapter 10])
[TABLE]
Aim of this paper is to attack the polygonal Hardy-Littlewood inequality (9) and the polygonal Riesz inequality
[TABLE]
To the best of our knowledge, very few results are available in this respect in the literature. Concerning the inequality (9), within the restricted setting of convex polygons, and for particular kernels, is mentioned as an open question by Fejes-Tóth in [21]. Still in the restricted setting of convex polygons, and for the kernel , the inequality is proved in [37]. Concerning the inequality (10), for (triangle and quadrilaterals), though not explicitly stated it can be deduced via Steiner symmetrization from Lemma 3.2 in [8]. More recently, in [6] Bonacini, Cristoferi and Topaloglu have considered the strictly related problem of characterizing “critical” triangles and quadrilaterals. More precisely, a polygon is said to be critical for if, for some positive constant , it holds
[TABLE]
whenever are obtained from by one of the following elementary movements: either the rotation of one side with respect to its midpoint, or the parallel translation of one side with respect to itself (see the Appendix in Section 9 for the detailed definitions). Clearly, a polygon maximizing over must be a critical polygon. In [6] it is proved that, under some weak assumptions on , the regular triangle and the square are respectively the unique critical triangle and quadrilateral. It is also conjectured that the same rigidity property holds true for any and that, consequently, the inequality (10) remains true for every .
In this paper we prove that the polygonal Hardy-Littlewood inequality (9) holds true for every admissible kernel and without any convexity assumption on the admissible polygons (see Theorem 6). The situation concerning the polygonal Riesz inequality (10) is more delicate, because a key point turns out to be the choice of the kernel. In this respect, the above discussion motivates the assertion that the heat kernel is of special relevance. We point out that, for , the corresponding functional , namely the quantity
[TABLE]
is the so-called heat content of the set at time . It represents the quantity of heat kept by the set once it is warmed at constant temperature and its heat is left to diffuse in the plane. We refer to [40] for related isoperimetric properties and to [45, 46] for related asymptotic expansions in the polygonal setting. A natural heuristic way to investigate the validity of the polygonal inequality (10) for , with sufficiently small or sufficiently large, consists in looking at the limiting behaviours of as and as . In the limit as , by [46, Theorem 1] for any polygon it holds
[TABLE]
where the term can be expressed in terms of the inner angles of . Such asymptotic expansion, combined with the classical isoperimetric inequality (3), suggests that inequality (10) should hold when with sufficiently small. On the other hand, in the limit as , starting from the asymptotics of and looking at the leading term, we obtain
[TABLE]
so that the inequality (10) should hold for with sufficiently large provided it holds for the quadratic kernel .
This observation drew our attention to study the inequalities (10) for the quadratic kernel , and more generally for power type kernels
[TABLE]
Notice carefully that such kernels are not admissible in the sense specified at the beginning of the paper, since they are increasing. Thus one should write inequalities (10) for , with the constant chosen so large that on . Equivalently, this amounts to rewrite the reverse inequalities of (10) for . Another family of “simple” kernels, which are of natural interest since their linear combinations can serve to approximate any smooth admissible kernel, are those of the form
[TABLE]
that we call in the sequel characteristic kernels.
Our results about Riesz polygonal inequality are mainly focused on the two families of kernels in (14) and in (13). For characteristic kernels we prove that, when is small enough, inequality (10) holds (see Theorem 1); indeed, in this case we also have a rigidity result characterizing the regular -gon as the unique critical polygon (see Theorem 3). On the other hand, when is large enough, a big surprise is coming: for even, , the inequality turns out to be false! See Theorem 2. In particular, the above mentioned conjecture made in [6] is, in general, false. The heuristic reason is that, for large enough, our problem is equivalent to the problem of finding so-called “largest small -gons”, namely polygons with fixed diameter and maximal area. This is a challenging problem in discrete geometry, for which it is known that symmetry breaking occurs for any even, see Section 2 for more details.
For power-type kernels, by using our polygonal Hardy-Littlewood inequality, we prove that the inequality (10) holds true for and (see Theorem 5). The same strategy fails for non-integers , for odd integers , as well as for higher even integers . In fact, as a consequence of Theorem 2, the inequality (10) turns out to be false also for power-type kernels with sufficiently high exponent (still for even).
The conclusion which stems from our analysis is that optimal polygons for the nonlocal isoperimetric inequality turn out to be sensitive to the choice of the kernel , as it may produce either symmetry or symmetry breaking. This is a highly unexpected phenomenon, which makes the study of the nonlocal isoperimetric inequality quite intriguing: indeed, in the light of our results, the problem becomes to understand which are specifically the kernels yielding symmetry for every . We suspect that this is the case for the heat kernel; we don’t have a proof of this fact, but we give some affirmative numerical results for polynomial approximations of (see Section 3).
Outline of the paper. The paper is organized as follows: in Section 2 we state our main results, complemented with some comments on their proofs and a list of related open problems; in Section 3 we explore numerically some of these open problems; in the subsequent sections we give the proofs of the results stated in Section 2; finally in the Appendix we provide some first and second order shape derivatives which are used in the proofs.
2. Main results
Our main results about the nonlocal polygonal isoperimetric inequality for characteristic kernels read as follows. Below, when , we set for brevity
[TABLE]
Theorem 1** (symmetry).**
For every , there exists such that
[TABLE]
The fact that above is in general finite is related to the result of Reinhardt asserting that, when is even, the regular -gon is not a minimizer of the diameter under an area constraint [41, 43]. In fact we have the following:
Theorem 2** (symmetry breaking).**
For every even, , setting , we have
[TABLE]
Moreover, still for the minimum in (17) equals , and it is attained at a polygon with diameter , which is not the regular polygon.
Comparing Theorems 1 and 2 shows that, for characteristic kernels, the optimality of the regular -gon does depend on the value of (at least for even). This brings the study of the nonlocal isoperimetric inequality for polygons into the more complex perspective of understanding for which and for which kernels symmetry or symmetry breaking occurs.
Before stating some partial results in that direction, let us give some short comments about the proofs of Theorems 1 and 2.
The proof of Theorem 1 is performed in a first stage for convex polygons and then it is extended to the general case. For convex polygons, we argue by contradiction. The idea is that, in the regime of a small , if a convex polygon minimizes the -perimeter over and it has -perimeter strictly smaller than , then must be close to (this follows from a uniform asymptotic estimate for the -perimeter as , where the classical perimeter appears in the leading term). In particular, close to there would be a -critical polygon, that is a polygon satisfying the stationarity condition (11), for . When is sufficiently small, this is not possible thanks to a symmetry result for critical polygons that we state separately in Theorem 3 below, since it may have an independent interest. The second part of the proof dealing with arbitrary polygons requires some more refined arguments, in particular since minimizing sequences may converge to a “generalized polygon” (precisely in the sense of Definition 12), possibly containing self-intersections in its boundary. Roughly speaking, the idea is to reduce the problem to a situation similar to the convex setting: this is achieved by exploiting triangulations in order to identify local concentrations of mass, and by localizing our estimates near the sides of the limit polygon where there is no accumulation of vertices. We refer to Section 4 for the detailed proof.
Let us now state the afore mentioned symmetry result for -critical polygons. To that aim, it is convenient to reformulate more explicitly the shape derivative in the left hand side of (11). This has been done in [6], but to make the paper self-contained we enclose a proof in the Appendix, see Lemma 19. The outcome is the following: if are obtained from respectively by rotating the side with respect to its midpoint , or by a parallel movement of such side with respect to itself, the stationarity condition (11) amounts to ask that, setting , it holds
[TABLE]
The result below states that the validity of eqs (18)-(19) for enforces symmetry, provided the support of the kernel is small enough. It can be viewed as a polygonal version of the Alexandrov-type symmetry recently proved in [12, Corollary 7]. The proof is obtained by using a reflection argument which is reminiscent of [6, 24].
Theorem 3** (symmetry for -critical polygons).**
Let be an admissible kernel and let satisfy equations (18)-(19) for every . When , assume further that , with such that
[TABLE]
*Then is a regular -gon. *
The proof of Theorem 2 is obtained in a completely different way; indeed, it follows as a rather straightforward consequence of a result by Reinhardt asserting that, when is even, the regular -gon is not a minimizer of the diameter under an area constraint (while, for odd, the regular -gon is a minimizer), see [41, 43] and the expository paper [38]. In the particular case , the optimal hexagon was found by Graham [27, 4], see Figure 1. Its construction can be done as follows. First fix two points and at distance one, and then, denoting we determine the other four vertices , , , and : taking as a parameter, and are found from the relations and . For and , a numerical value for is given by . 111The construction is taken from the MathWorld page https://mathworld.wolfram.com/GrahamsBiggestLittleHexagon.html
Denoting by and suitable scalings of Graham and regular hexagons, we have:
- •
At fixed diameter, the area of the Graham hexagon is greater than the area of the regular one, the ratio of the areas being .
- •
At fixed area, the diameter of the Graham hexagon is smaller than the diameter of the regular one and the ratio of their diameters is .
For even, the determination of a polygon minimizing the diameter under an area constraint is a challenging problem in discrete geometry, which remains, to the best of our knowledge, open. Let us just mention that, among equilateral polygons, it has been recently proved that the regular polygon is optimal [1]. Among arbitrary polygons, it has been proved in [23] that for every even the optimal polygon enjoys the following property conjectured by Graham: its skeleton (namely the collection of diameters connecting any two vertices) is not an asterisk but consists of a -cycle and one additional edge. Moreover, for , very accurate numerical solutions have been proposed in [2] and [29]. We also refer to the recent paper [3] for more recent symbolic calculations and for further bibliography.
We now come back to the problem of minimizing the -perimeter over . We are going to focus on some specific kernels which are not characteristic functions. In this respect, let us point out that the phenomenon of symmetry breaking is not a prerogative of characteristic kernels. Indeed from Theorem 2 and keeping the same notation as in its statement, we easily obtain the following
Corollary 4**.**
Let even, .
- (i)
If is a smooth admissible kernel close enough to , with , symmetry breaking occurs, i.e. .
- (ii)
If , there exists such that symmetry breaking occurs for every , i.e. . In particular, for we have .
Regarding statement (ii) above recall that, since power-type kernels with are increasing, the corresponding inequalities (10) must be reversed. Dealing with such kernels, in view of Corollary 4, the question becomes whether the regular polygon is a maximizer of the nonlocal perimeter at least for small . We show that the answer is affirmative in the particular cases and :
Theorem 5** (symmetry for power-type kernels).**
Let or . For every , the regular poygon maximizes over . In equivalent terms, we have the following polygonal Riesz inequalities
[TABLE]
The proof of Theorem 5 relies on the idea to reduce the study of inequalities 21-(22) to the study of Hardy-Littlewood type polygonal inequalities. More precisely, by writing explicitly the polynomials and , it turns out that the minimization of their double integral over is equivalent to the minimization of the single integrals and . We are thus led to the following question: is it true that minimizes over an integral functional of the type ? More in general, for any admissible kernel , one is led to investigate the following Hardy-Littlewood type maximization problem
[TABLE]
As mentioned in the Introduction, in the restricted setting of convex polygons, problem (23) is mentioned as an open question by Fejes Tóth in [21] when is a characteristic kernel. In this case, it amounts to solve the following purely geometric problem: find the polygon in which maximizes the overlap with the ball . Despite its elementary formulation, the solution to such geometric problem is far from being immediate, and it is also the heart of the matter in order to solve problem (23) for arbitrary kernels. As in the proof of Theorem 1, the difficulty comes mainly from the fact that maximizing sequences of polygons may converge to “generalized polygons” with self-intersections in their boundary. We overcome this difficulty via an ad hoc geometric construction, allowing to reduce ourselves to deal with star-shaped polygons; once made this restriction, we can take advantage of first order optimality conditions, which enable us to arrive at the regular -gon.
Theorem 6** (polygonal Hardy-Littlewood inequality).**
Let be an admissible kernel. For every , we have
[TABLE]
Applying inequality (24) allows us to prove (21)-(22), but the same strategy is not successful to obtain the analogous inequality
[TABLE]
for non-integers, or odd integers, or higher exponents . We are just able to prove that (25) continues to hold in some very specific situations, that we gather in the statement below:
Lemma 7**.**
Inequality (25) holds in the following cases:
- (i)
, , under the restriction that is convex and axisymmetric;
- (ii)
, , under the restriction that is a linear image of .
Remark 8*.*
Theorem 6 allows to extend the result in [37] by Morgan and Bolton about the optimality of the hexagonal economic regions for the location problem to other kernels than the average distance, for instance power-type kernels.
Clearly our results raise many new questions, some of which may be very challenging. A short list is given below.
Open problems
- (A)
Characteristic kernels: Determine or estimate the radius in Theorem 1.
- (B)
Power-type kernels: Determine for which values of the inequality (25) holds.
- (C)
Gaussian kernel : Determine whether minimizes the -perimeter over for any . (Alternatively, in terms of the heat content defined in (12), does the inequality hold for any and every ?)
- (D)
Arbitrary admissible kernels : Determine whether minimizes the -perimeter over for every odd () and whether, under some suitable assumptions on , the same holds for every even ().
- (E)
More general kernels: Explore what happens also for kernels which are not locally integrable, but induce a finite perimeter on the class of polygons, as it is for instance the case for the fractional kernel (see e.g. [34, Corollary 1.2]).
3. Numerical results about problems (B) and (C)
In this section we bring some numerical evidence related to open problems (B) and (C). The numerical results are summarized below. Some of them (mainly on the local minimality) could be turned into analytical ones provided the approximations would be controlled and the numerical computations certified. Let us point out that, as soon as the kernel is of polynomial type with sufficiently small degree, the computations we perform are accurate up to rounding errors in double precision. The computations use quadrature rules which are exact for low degree polynomials. This is explained in Section 3.3.
3.1. About problem (B)
We made multiple numerical optimizations with randomized initialization, in order to minimize over the functional
[TABLE]
We used the constrained optimization algorithm interior-point from the Matlab fmincon routine. The computations were performed for , and for . All simulations led to the regular polygon.
Next, in order to extract information about the local minimality of the regular -gon, we looked at the sign of the eigenvalues of the Hessian matrix of the scale-invariant functional defined for all polygons with sides by
[TABLE]
Needless to say, since the above functional is invariant under rigid motions, several zero eigenvalues must be expected, so that local minimality is gained as soon as the other eigenvalues are strictly positive.
We computed the Hessian matrix of our functional under vertices displacement, by using formula (86) in the Appendix and the classical Hessian formula for the area functional which can be found e.g. in [5, Section 2]. We obtained eigenvalues equal to zero (corresponding to translations, rotations, and homotheties) and eigenvalues which are strictly positive.
The computations were performed for , and for even, .
3.2. About problem (C)
Differently from power-type kernels, the Gaussian kernel is no longer homogeneous under homotheties. Hence, after rescaling, it not restrictive to consider the heat kernel at different times, and work with polygons with fixed diameter equal to instead of polygons with fixed area . Actually, dealing with polygons with fixed diameter turns out to be convenient in order to control the approximation made when the power series expansion of the heat kernel
[TABLE]
is replaced by its partial sums
[TABLE]
Notice that, in the case , problem (C) becomes trivial since , and also in the case the regular -gon maximizes for every thanks to Theorem 5.
The idea is then to look at what happens for higher values of such that the approximation of by is sufficiently good. If belong to a polygon with unit diameter and taking , by the inequality holding for any with , we have
[TABLE]
In particular, for (which in our computational strategy described in Section 3.3 below corresponds to a quadrature rule of order ) the inverse of is bounded from above by . Consequently, for polygons with unit diameter (having area at most ), the global numerical error done in evaluating in place of is bounded from above by . Similar estimates yields global errors smaller than when replacing the gradient and the Hessian of by their analogues for , according to the integral formulas in Appendix.
Then, we fix our attention on the functional for . Clearly, working with such functional brings us back to a polynomial setting as in case of problem (B) discussed above, with the difference that now is no longer homogeneous with respect to scalings. Hence we consider the Hessian matrix associated with the functional
[TABLE]
being a Lagrange multiplier chosen so that the regular -gon under consideration is a critical point. The Lagrange multiplier is the ratio of the norms of the gradient of and the gradient of the area; notice indeed that for the regular -gon these gradients are collinear for symmetry reasons. The Hessian matrix is obtained again by using formula (86). We investigate the sign of its eigenvalues corresponding to eigenvectors orthogonal to the gradient of the area (a space of dimension ). We obtained eigenvalues equal to zero (corresponding to translations and rotations) and eigenvalues which are strictly negative.
The computations where performed for and for a few choices of , including the endpoints. Numerically, we observe that the Hessian eigenvalues vary monotonically with : they are negative and have a decreasing absolute value as increases. Therefore, we conjecture that their sign remains negative for all values of in the considered range.
Furthermore, the smallest absolute value of non-zero eigenvalues, which is obtained for and , is larger than . The above discussion about the error estimates done when replacing the heat kernel by its polynomial approximation indicates that, also for the heat kernel, the regular -gon is a local maximizer under area constraint.
These simulations motivate us to conjecture that the regular -gon is a local maximizer of the heat content for every .
Let us also mention that the same computations above were made also for : for , and various choices of an oscillatory behavior can be observed, namely the Hessian at the regular -gon may have positive or negative eigenvalues; however, for the behavior stabilizes and the non-zero eigenvalues become strictly negative.
3.3. Computational strategy
Let us now briefly explain the strategy adopted for the computations in Sections 3.1 and 3.2. When is a polygon, and is a positive even integer, functionals of the type can be computed explicitly in terms of the coordinates of the vertices, and the same assertion holds for the integrals involved in the shape derivatives of such functionals. However, the resulting expressions are difficult to interpret and implement. Thus we choose to adopt a different approach, based on quadrature rules. Given a -gon , we split it into triangles (using an inner node) and we decompose the energy as
[TABLE]
so that we can focus on the computation of integrals made over a product of triangles. A quadrature rule for an integral over a triangle is an approximation of the form
[TABLE]
where are points in (expressed, for instance, using barycentric coordinates in the triangle ), and are the associated weights. A quadrature rule is said to be of order if the approximation (29) is exact when is a polynomial of total degree at most equal to . For any degree , there exist quadrature rules of such degree, the number of quadrature points being increasing with respect to .
An example of a triangulation and choice of quadrature points of degrees and for a regular hexagon is shown in Figure 2.
Handling quadratures rules involving double integrals is more complex, but relies on the same principles. In this case, given two triangles with corresponding quadrature points , (having the same barycentric coordinates) and weights , we have
[TABLE]
as above, the quadrature rule is of order if the approximation (30) is exact when is a polynomial of total degree at most .
In order to generate quadrature rules required in our computations, we used the Matlab toolbox Quadtriangle (accessed in November 2022). We used non product rules, included in the referenced toolbox up to degree .
4. Proof of Theorem 1
We proceed to prove the result first in the simplified setting of convex polygons and then in the general case.
4.1. Proof of Theorem 1 in the convex setting.
To prepare the proof, it is useful to introduce the set defined by
[TABLE]
where
[TABLE]
Clearly is empty set for , while for it is given a circular segment of radius and apothem , see Figure 3.
For , the Lebesgue measure of is immediately determined as
[TABLE]
Below we state a simple geometric lemma which plays a key role in the proof; it provides a lower bound for the -perimeter of a convex -gon and an upper bound for the -perimeter of a regular -gon. They will be exploited in the limit of a vanishing radius. We focus on the case , since for the equality in (16) is valid for every .
Lemma 9**.**
(i) Let be a convex polygon in . Assume that, for every side of , denoting by its length and by its adjacent inner angles, it holds
[TABLE]
Then
[TABLE]
Moreover, if (32) fails for a family of sides (of cardinality at most ), the inequality (33) holds with replaced by , where is the convex polygon obtained by eliminating any such side and prolonguing its two consecutive sides.
(ii) Let denote a regular -gon, with . Assume that, denoting by the length of its sides, it holds
[TABLE]
*Then *
[TABLE]
Proof.
(i) Setting , we have
[TABLE]
Thanks to assumption (32), for every and every , contains a segment of positive length, made by points such that intersects only along the side and is congruent to . For the length of this segment we have the following lower bound:
[TABLE]
and, for points , it holds
[TABLE]
where the last equality follows from (31) and an elementary integration. Hence,
[TABLE]
In case assumption (32) fails for some index, we repeat the proof above with the following only modification: in correspondence of any index for which (32) is false, we remove that side from and we consider the polygon defined as in the statement. By construction, for every side of and every , contains a segment of positive length (parallel to ), made by points such that is congruent to . For the length of this segment we have now the following lower bound:
[TABLE]
where , , and denote the length of , and it adjacent angles. Summing over all the sides of , we find the lower bound .
(ii) We write the equality (36) for . For every , the set of points at distance from contains segments of positive length, bounded from above by the positive quantity , made by points such that is congruent to . For points in such segments, the equality (37) holds. For points such that meets more than one side of , we simply estimate from above by . The measure of these points is bounded from above by . We end up with
[TABLE]
∎
We are now ready to prove Theorem 1 for convex polygons. We argue by contradiction. Assume the statement is false. Then, there exists an infinitesimal sequence of radii and a sequence of convex polygons such that
[TABLE]
Here and in the remaining of the proof, denotes a regular -gon of area . By possibly passing to a subsequence and up to translations, the sequence of convex polygons admits a limit in the Hausdorff complementary topology. There are two possibilities: either , or . Let us show that both cases lead to a contradiction.
Case 1): . Let us consider the sequence of (possibly empty) convex polygons contained into defined by
[TABLE]
Up to a subsequence, we may distinguish to subcases: either , or .
Case 1a): . Up to a further subsequence, either the convex polygons are empty, or they converge to a segment. Anyhow, we have that . For every , the intersection contains a set congruent to . Hence
[TABLE]
where in the last equality we have used (31). On the other hand, for sufficiently large assumption (34) is fulfilled and hence the inequality (35) in Lemma 9 holds with . In view of this fact, and since , the inequality (39) contradicts (38) in the limit as .
Case 1b): . We have
[TABLE]
Since (35) holds with , and since , the inequality (40) contradicts (38) in the limit as .
Case 2): . We apply Lemma 9 (i) to the sequence of convex polygons . We observe that, since , for large enough assumption (32) is certainly satisfied, except possibly for certain indices corresponding to sides of infinitesimal length. Thus we have
[TABLE]
We observe that the coefficients of the polynomial function only depend on the perimeter and on the inner angles of the polygon (see (33)); moreover, the same holds for the polynomial function , because the perimeter and the inner angles of can be easily expressed in terms of the perimeter and the inner angles of . Now, since converge to , the perimeter and the inner angles of converge respectively to the perimeter and to the inner angles of . We conclude that, for sufficiently large, the following lower bound holds:
[TABLE]
where is a fixed constant independent of . By combining the above lower bound with Lemma 9 (ii) (which applies since its assumption (34) is satisfied for sufficiently large), we obtain that
[TABLE]
and hence
[TABLE]
By the classical isoperimetric inequality for convex polygons, this implies that . To conclude, we observe that it is not restrictive to assume that is a minimizer of the -perimeter over the class of convex polygons in with area . (Notice that such a minimizer exists for any sufficiently small, because otherwise a maximizing sequence of polygons would degenerate, yielding a contradiction by the same arguments used to deal with Case 1) above.) Then we have found a sequence of critical polygons for the -perimeter, converging to , and satisfying (38). Since , this contradicts Theorem 3. ∎
4.2. Proof of Theorem 1 in the general case.
Also in this case we prepare the proof with a geometric lemma. For every and every , we set
[TABLE]
Given a family of straight lines , for which do not intersect each other in , with \theta_{i}\in\big{(}\!-\!\frac{\pi}{2},\frac{\pi}{2}\big{)} and , consider the union of strips
[TABLE]
see Figure 4. Recalling that that is the set defined at the beginning of Section 4.1, we prove the following estimate for the measure of :
Lemma 10**.**
Let be defined by (41). Setting \overline{s}:=\mathcal{H}^{1}\big{(}\Sigma\cap(\{0\}\times[0,+\infty))\big{)}, it holds
[TABLE]
Proof.
In order to prove (42), it is not restrictive to assume that , since otherwise it is trivially satisfied. We notice first that, for every \theta\in\big{(}\!-\!\frac{\pi}{2},\frac{\pi}{2}\big{)} and for every , it holds
[TABLE]
Namely, for , (43) holds with equality signs, by the definition of . For , it holds since the symmetric difference between and is contained into the union of the two sets
[TABLE]
and each of these sets has measure bounded from above by . Now, the inequality (42) is a consequence of (43) and of the monotonicity of the map
[TABLE]
which implies that, for , it holds . ∎
Remark 11*.*
In Lemma 10, inequality (42) remains trivially valid replacing by any subset . We shall use this argument for subsets of the type
[TABLE]
where and does not intersect any with , in .
As a further preliminary, let us give the following definition, that was already used by the second and third authors in [10].
Definition 12**.**
A generalized polygon with at most -sides is the limit in the topology of a sequence of classical polygons with at most sides (meant as open polygons) such that .
Recall that the convergence of to in the topology means that, for every ball , we have , being the Hausdorff complementary distance, namely
[TABLE]
( stands for the Euclidean distance from a closed set).
As a consequence of well-known properties of such topology (see for instance [9, 31]), any generalized polygon is an open set of finite Lebesgue measure, which is simply connected (as its complement is connected), but possibly disconnected. Any connected component is delimited by a finite number of line segments, which are pairwise joined at their endpoints to form a closed path, possibly containing self-intersections, given by points or line segments.
We now ready to prove Theorem 1 for arbitrary polygons. As in the convex case, we argue by contradiction, and we denote by a regular -gon or area . If the statement is false there exists an infinitesimal sequence of positive radii and a sequence of polygons such that
[TABLE]
To achieve the proof it is enough to show that
[TABLE]
Indeed, the convergence (46) implies in particular that is convex for large enough, which is a contradiction since we have already proved the statement for convex polygons.
In order to prove (46), we consider for every a triangulation of made by disjoint (open) triangles , with vertices and sides belonging to the family of vertices and diagonals of , such that
[TABLE]
Up to subsequences (which here and in the sequel are not relabeled), there exist sequences of vectors and sets (which may be either a triangle or the empty set), such that
[TABLE]
For every , one of the following three situations occurs:
- (a)
and
- (b)
and
- (c)
For convenience, we divide the remaining of the proof in three steps.
Step1: Situation (a) cannot occur.
Assume by contradiction that we are in situation (a) for some sequence . Hereafter, we omit for simplicity the index . Then, up to a subsequence and to a rigid motion, the vertices of are given by
[TABLE]
where the horizontal side of length is the longest one and, as ,
[TABLE]
We divide the segment into equal segments, of length . At least one of them, say up to a translation the segment , is such that the half strip
[TABLE]
does not contain any other vertex of . On the other hand, is crossed by a certain number of sides of , including two sides of . Thus, with the notation introduced at the beginning of Section 4.2, there exist angles \theta^{k}_{i}\in\big{(}-\frac{\pi}{2},\frac{\pi}{2}\big{)} and positive numbers , for such that , and
[TABLE]
Up to subsequences, for every i=0,\dots,M_{k}-1\color[rgb]{0,0,0}, we have
[TABLE]
Moreover, by construction it holds
[TABLE]
Hence we may define as the largest index in such that . Notice that, as a consequence, we also have
[TABLE]
We set
[TABLE]
Let us observe that, if , then
[TABLE]
Then, the idea is to adopt similar arguments as in the convex case, just by using Lemma 10 in place of Lemma 9. More precisely, we consider the set
[TABLE]
where
[TABLE]
In view of Lemma 10 and of the strict inequality (49), for large enough and for every x\in\Sigma_{k}\cap\big{(}\big{[}-\frac{\ell_{k}}{4N},\frac{\ell_{k}}{4N}\big{]}\times[0,+\infty)\big{)}, it holds
[TABLE]
Then we follow the same proof as in Case 1) of Section 4.1 to get a contradiction. More precisely, denoting by the projection of on the horizontal axis, we distinguish the two cases , and .
Assume that . This implies that . Thus, setting
[TABLE]
we have
[TABLE]
(where the first inequality holds by a proportion argument, which works since the side of length was assumed to be the longest one of ). Then we estimate as follows:
[TABLE]
In view of (50), recalling that |\Delta_{r_{k},\frac{r_{k}}{2}}|=\Big{(}\frac{\pi}{3}-\frac{\sqrt{3}}{4}\Big{)}r_{k}^{2}, and taking into account that, by (48), for every , we conclude that
[TABLE]
in contradiction with (35) and (45).
If , then we estimate as follows:
[TABLE]
so that (51) is again valid, in contradiction with (35) and (45).
Step 2: Identification of local concentrations. Finally, only situations (b) and (c) can occur. Since any sequence of triangles in situation (b) does not affect the limit of the sequence in , in order to describe the geometry of local concentrations, we focus only on the sequences of triangles in situation (c). For any pair of such sequences , we consider the corresponding sequences of vectors such that (47) holds, and we look at whether the distances remain bounded or diverge as . This way we define an equivalence relation on the family of sequences of triangles in situation (c), which splits them into a finite number of equivalence classes. By construction, for , there exist sequences of vectors , with for , such that
[TABLE]
where are generalized polygons, with a total number of sides not larger than and total area equal , i.e.,
[TABLE]
Then we consider the open sets with polygonal boundary obtained as ; we observe that
[TABLE]
with equality if and only if and . Indeed, any open connected component of set has a boundary which is union of closed polygonal lines, each one with at most edges. Then, by removing every bounded connected component of and rescaling the set thus obtained by a factor less than , it is possible to decrease the perimeter by preserving the area. Then the classical polygonal isoperimetric inequality ensures that is not smaller than the sum of the perimeters of regular -gons with total area , and (53) follows from the sub-additivity of the map .
*Step 3: We prove that *
[TABLE]
The above lower bound, combined with the upper bound inequality (35) and with the assumption (45), will imply that (53) holds with equality sign. This implies in particular that, for large enough, the sets must be convex. As we have seen in Section 4.1, this contradicts (45).
Let us prove (54). We fix an index and we localize our estimates around the set ; we may also assume without loss of generality that the corresponding vectors in (52) are equal to zero. Choosing such that , from (52) we have
[TABLE]
Dropping the index for simplicity of notation, we have to show that
[TABLE]
We focus our analysis around a fixed side of . Its endpoints are limit of vertices of , but its interior as well may contain some accumulation points of vertices of . These accumulation points divide our side into several segments (at most ). We pick one of them, say , with lying below the segment. From the -convergence, and since the open segment does not contain any accumulation point of vertices of , if and are sufficiently small so that , then inside the rectangle the structure of is similar to the one of the set in Lemma 10 (cf. (41)).
Precisely, we consider the sides of which intersect the rectangle , and whose supporting lines satisfy as . Assume that, as the index goes from to , those lines are labelled from the bottom to the top. Choosing such that does not meet any other side of , we can locally represent in as the following union of strips:
[TABLE]
Then, using Lemma 10 (and Remark 11), we get
[TABLE]
Inequality (55) follows by repeating the above argument around each side of and letting .
∎
5. Proofs of Theorem 2 and of Corollary 4
Proof of Theorem 2. For brevity, let us denote by the functional
[TABLE]
so that
[TABLE]
Clearly, for , we have
[TABLE]
Setting , by Reinhardt’s Theorem [41], for even there exists a polygon , which is not a regular -gon, such that
[TABLE]
Therefore, for every , we have
[TABLE]
(in particular, the above strict inequality holds for if ). It follows that, for every , the maximum of over equals , or equivalently the minimum of over equals , and they are attained at . ∎
Proof of Corollary 4 (i) Let and as in the statement of Theorem 2. For every , there exists a polygon such that
[TABLE]
Then, it is enough to consider a sequence of non-negative and non-increasing radially symmetric kernels in which converge increasingly to . By the monotone convergence theorem, for large enough we have
[TABLE]
(ii) Again, let and be as in the statement of Theorem 2. Set . If is a diameter of , for and we have
[TABLE]
Hence,
[TABLE]
On the other hand, we have
[TABLE]
By comparing (56) and (57) we infer that, for large enough,
[TABLE]
We now examine in particular the case . Denoting by the regular hexagon with unit diameter and by the Graham hexagon with the same area, it follows that . Then, a direct estimate using gives
[TABLE]
On the other hand, by arguing as done above to obtain (56), we get
[TABLE]
Observing that , , and using the numerical value of shows that, for any , has a lower energy than . ∎
6. Proof of Theorem 3
We argue in two steps.
Step 1. We claim that, if denotes inner angle of at the vertex , it holds . We prove this claim by contradiction, via a reflection argument. Assume . Let be the symmetry axis of the side , let be their intersection point and, for , let be its symmetric about . Given in the plane we denote its reflection about . We have
[TABLE]
Indeed, the strict inclusion for close to readily follows from
[TABLE]
We assert that the inclusion (58) remains valid for all , namely that
[TABLE]
Once proved (59), the contradiction required to achieve the proof of Step 1 readily follows. Indeed, recalling that the inclusion becomes strict for close to , we have
[TABLE]
in contradiction with (18).
It remains to show (59). To that purpose, we distinguish the cases and .
For triangles, (59) is straightforward, since the inequality implies that , see Figure 5, left.
For , the inequality does not imply, in general, that , see Figure 5, right. Nevertheless, thanks to assumption (20), in order to check that (59) still holds, it is enough to show that, for any , if denotes its symmetric about , the trapezoid with vertices is contained into . Again thanks to assumption (20), the inclusion holds true as soon as . This latter property is true because , and .
Step 2. We claim that, if denotes the length of the side , it holds . To prove this claim we are going to assume without loss of generality that . Indeed, if is a triangle, Step 1 proved above already gives that is equilateral.
We point out that, by assumption (20), it holds
[TABLE]
Indeed, in case for some index , , with mid-point of would not be contained into two consecutive sides of , against (20).
Next we recall that, by Step 1, all the inner angles of are equal to a fixed angle . Since , we have . Therefore, points such that intersects another side of (which by (20) is necessarily a consecutive side) are points whose distance from or from does not exceed . For such points, is a function depending on (or ), , and , but not on .
Taking (60) into account, we conclude that for a suitable function , it holds
[TABLE]
where is the norm of in . We now enforce condition (19) to deduce
[TABLE]
Clearly this system can be satisfied only if either all the ’s are equal, or . But the latter equality cannot hold: indeed, by (19), is the integral mean over a side of of the function , and such function is always less than or equal to , with strict inequality near the vertices. ∎
7. Proof of Theorem 5 and Lemma 7
Proof of Theorem 5. Taking , for every we have
[TABLE]
Since the energy is invariant by translations, it is not restrictive to assume that has its baricenter at the origin, and hence the result follows from Theorem 6.
Taking , for every we have
[TABLE]
where the last equality holds up to assuming as done above that has its baricenter at the origin.
Now we use the inequality , with equality if and only if . Applying it for , we get
[TABLE]
where the second inequality follows from Theorem 6, and last equality holds since
[TABLE]
To check these equalities, we decompose a regular -gon centred at the origin into triangles , with .
Concerning the first equality in (61), on each triangle we have
[TABLE]
inserting the expressions of and summing over we get
[TABLE]
Concerning the second equality in (61), on each triangle we have
[TABLE]
[TABLE]
Inserting the expressions of , we see that for every , so that
[TABLE]
[TABLE]
Now,
[TABLE]
[TABLE]
Using the formulas
[TABLE]
[TABLE]
we conclude that
[TABLE]
[TABLE]
∎
Proof of Lemma 7.
(i) If is axially symmetric, some elementary computations give
[TABLE]
where the inequality is obtained by invoking as usual Theorem 6. Since
[TABLE]
to conclude the proof it is enough to show that
[TABLE]
To that aim, we exploit the assumption that is an axisymmetric convex octagon. Assuming without loss of generality that has four vertices at the points , , and one in the region , we have that contains an axisymmetric convex octagon , which has still four vertices at the points , and one on the straight line . Then
[TABLE]
(ii) We can assume without loss of generality that . Indeed, once the inequality is proved for such , one can pass to the limit as . The proof is inspired from [32]. In the remaining of this proof, the functional with will be denoted for brevity by , i.e. we set
[TABLE]
Our target is to show that, for every real volume preserving real matrix , it holds
[TABLE]
We have , with diagonal and . Since for any and any , it is not restrictive to assume that . Moreover, by considering the regular polygon in place of , we may assume that also . We are thus reduced to show that
[TABLE]
We consider the family of polygons , with , so that , and . We claim that for every , or equivalently that the map satisfies for every . Indeed, let us show that
[TABLE]
By arguing as in [32, Lemma 4.3 and Lemma 4.4], we are allowed to differentiate under the sign of integral. Setting , and writing for brevity in place of , by direct computations we have
[TABLE]
where
[TABLE]
and
[TABLE]
Hence conditions (62) are satisfied (we exploit here the assumption ).
∎
8. Proof of Theorem 6
We consider the maximization problem
[TABLE]
and we proceed as follows:
– In Section 8.1 we prove that it is not restrictive to take (cf. Proposition 13).
– In Section 8.2 we prove that problem (63) admits a solution, which is a classical star-shaped polygon (cf. Proposition 14).
– In Section 8.3 we prove that, among classical star-shaped polygons, the regular -gon is optimal (cf. Proposition 17).
The validity of Theorem 6 follows at once by combining Propositions 13, 14, and 17.
In the sequel, we write for brevity in place of .
8.1. Reduction to the characteristic kernel
Proposition 13**.**
*If a polygon solves problem (63) when for all , then it solves problem (63) for every admissible kernel . *
Proof.
Assume solves (63) for every . Consider a sequence of radii and some positive reals . Then by optimality of for every radius , we have
[TABLE]
Since every radially decreasing step function can be written in the form it follows that solves (63) for radially decreasing step functions.
Every radially decreasing function can be written as the limit of an increasing sequence of radial step functions . Passing to the limit in the inequalities
[TABLE]
shows that solves (63) for arbitrary admissible kernels.
∎
8.2. Existence and reduction to star-shaped polygons
In view of Proposition 13, we are going to focus our attention on the maximization problem
[TABLE]
Proposition 14**.**
Problem (64) admits a solution. Moreover, every solution is star-shaped.
As a preliminary remark, let us observe that there are some ranges for the value of for which problem (64) can be elementarily solved. More specifically, let and denote respectively the regular -gons circumscribed and inscribed to . Then:
- –
If , the maximum in (64) is equal to , and it is attained either at infinitely many admissible polygons among which the regular -gon of area (if the inequality is strict), or uniquely at (if the inequality holds with equality sign).
- –
If , the maximum in (64) is equal to , and it is attained either at infinitely many admissible polygons among which the regular -gon of area (if the inequality is strict), or uniquely at (if the inequality holds with equality sign).
Thus, in the remaining of this section we always tacitly assume that is chosen so that
[TABLE]
(this is just for definiteness, as our proof below works also in the ‘trivial’ cases left).
The proof of Proposition 14 requires as a key ingredient a geometric construction that we state separately in the next lemma, along with its application in our problem in the subsequent remark.
Lemma 15**.**
Let be a polygon with sides such that and . Then there exists another polygon with sides, which is star-shaped and satisfies the inclusions
[TABLE]
Remark 16*.*
Let and be polygons as in Lemma 15. We claim that, starting from , it is easy to construct another star-shaped polygon , still having sides, such that
[TABLE]
Indeed, from the first inclusion in (66) we have , with . Compare then the areas of and . In case , we simply define . In the case left, namely when , we define as the polygon homothetic to which as the same area as , namely we take . Clearly, such is a still a star-shaped polygon with sides, and it is easy to check that that its energy is not less than the energy of . Actually we have:
[TABLE]
where the last inequality follows after an immediate computation as a consequence of the two inequalities and , the latter holding by the second inclusion in (66).
Let us assume for a moment that Lemma 15 holds true, and let us show how Proposition 14 follows. Let be a maximizing sequence for problem (64). Clearly, up to a subsequence each polygon has a non-negligible intersection both with and with its complement (recall we are assuming (65)). Then, for every , we denote by the star-shaped polygon given by Lemma 15. Proceeding as in Remark 16, we obtain a new polygon with its star-shapedness centre inside , which satisfies (67). Thus, is still a maximizing sequence.
Now, by the compactness and lower semicontinuity properties of the Hausdorff complementary topology [31, Section 2], up to a subsequence, we may assume that
[TABLE]
for some which is a generalized polygon with sides, according to Definition 12, having area at most . We observe that the perimeter of the sets is uniformly bounded from above (since all the polygons have a fixed number of sides). Hence, by the compact embedding of into , the energy converges to .
Notice carefully that the limit generalized polygon is still star-shaped (this is precisely the scope reached through the modification of the sequence into the sequence ).
It may still occur that contains some self-intersections, but they can only be contact segments between two consecutive sides. Hence, it is enough to remove any such contact segment, in order to transform into a classical polygon, which will be a solution to problem (64). ∎
We now turn to the most delicate part of the proof, namely the geometric construction in Lemma 15.
Proof of Lemma 15. Let be a polygon as in the assumptions of the Lemma. To prove the statement, we can further assume with no loss of generality that has no side tangent to and no vertex in . Indeed, if this is not the case, once the Lemma is proved for polygons with no side tangent to and no vertex in , we can approximate (in the Hausdorff complementary topology) by a sequence of polygons satisfying such additional conditions, and apply the Lemma to each : we find a sequence of polygons , whose limit polygon (which exists up to passing to a subsequence and is still star-shaped) does the job for .
Thus, let be a polygon as in the assumptions of the Lemma, which in addition has no side tangent to , and no vertex in . For the sake of clearness, we give first the construction of the polygon in a simplified situation, namely when the intersection between and consists precisely of arcs of circle, and then we proceed in the general case.
Case when each side of has both its endpoints outside , and intersects . (Equivalently, the intersection between and consists precisely of arcs of circle.) Starting from a fixed endpoint of such an arc, say , and following a counter-clockwise oriented parametrization of , name these arcs , for ; none of these arcs is degenerated into a point, since by assumption no side of is tangent to , see Figure 6, left.
For , let be the straight line through and , with the convention , and let be the (open) half-plane determined by which contains all the points for and .
[TABLE]
By construction, is a classical convex polygon in , see Figure 6, right.
The first inclusion in (66) is satisfied because any circular segment delimited by the arc and the segment cannot intersect (otherwise the arc would be crossed by a side of ).
The second inclusion in (66) is satisfied because, denoting by the common vertex of the two consecutive sides of containing and , and by the intersection of the straight lines (through and ) and (through and ), it holds
[TABLE]
This is due to the fact that the point belongs to both the half-plane determined by the straight line trhough and and containing , and the half-plane determined by and and containing .
General case. Consider the intersection between and . Such intersection consists now of arcs of circle, with . Starting from a fixed endpoint of such an arc, say , and following a counter-clockwise parametrization of , name these arcs , for . Notice that, if we equip with an oriented parametrization such that lies on the left of each side, then, at every point , the side of passing through is entering into .
For , let be the half-planes defined as above. Let also denote the (possibly empty) family of vertices of lying inside (recall that by assumption no vertex of lies on .
We point out that, if one would define as in (69), none of the two inclusions in (69) would be in general satisfied: the former due to the possible presence of vertices inside , the latter due to the possible presence of sides exterior to (in both cases, with possible self-intersections occuring in ).
For this reason, the definition of is more involved: we are going to construct it as the union of two sets, denoted by and , which lie respectively inside and outside . Such sets are “curvilinear” polygons, whose boundaries do not contain self-intersections, and consist in a finite number of “sides”, meant as arcs of circle lying on or line segments (in case of , the segments lie inside , while in case of they lie outside). The closures of the two sets and intersect precisely at the arcs , so that the set
[TABLE]
turns out to be a classical polygon, which by construction will be a star-shaped one.
Let us specify how and are defined. We set
[TABLE]
where denotes the convex envelope. By construction, specifically thanks to the presence of the vertices in the above definition, we have
[TABLE]
In order to define , let us choose a point in the interior of , say , which does not belong to any of the straight lines supporting the edges of .
Let us denote by the family of all the straight lines supporting the non-circular edges of . For every , we introduce the set
[TABLE]
where
[TABLE]
We define
[TABLE]
By construction we have
[TABLE]
In view of (71), and (72), the set defined in (70) is a classical star-shaped polygon which satisifes both inclusions in (66) as required.
To conclude, it remains to show that , namely that our procedure respects the constraint on the number of sides. We set
[TABLE]
so that . Denoting by the family of sides of , we have
[TABLE]
where and are the families of non-circular sides respectively of and of . To prove that (namely that ), we are going to show that
[TABLE]
*(i) Counting inside. * For every , let us denote by the vertices of lying in the interior of the circular segment delimited by the arc and the line segment . Then the number of (non-circular) sides of which join to is exactly . To any such side, we can associate a side which intersects , in the following way:
- –
to the side starting at , we associate the only side of passing through , which is entering into ;
- –
to the side starting at , we associate the side of which starts at in the positive orientation of .
Since this association is injective, the first inequality in (73) holds true.
(ii) Counting outside. As above, to any side in we can associate a side of , which in this case does not intersect . Specifically, we distinguish two cases: a side in is either a side of which does not intersect or a newly created one.
- –
to a side which does not intersect , we associate the corresponding side of ; note that the association is injective, because a side of cannot be simultaneously in the boundary of two distinct sets ’s.
- –
to a newly created side, we associate a side of which is not part of the boundary of in the following way: a newly created side occurs when some point is a vertex of and, for sufficiently small, we still have . This means that is an endpoint of a side of which is not part of the boundary of : this is precisely the side we associate to the newly created one (again, with an injective association).
Thus also the second equality in (73) holds true, and our proof is achived. ∎
8.3. Optimality of the regular -gon
Proposition 17**.**
Problem (64) is solved by the regular -gon of area .
Proof.
Let be a solution to problem (64), which exists by Proposition 14. As in the proof of Proposition 14, we are going to assume with no loss of generality that the inequalities (65) are satisfied. We are going to prove the result through several claims. We stress that in each of these claims we estimate the variation of the area and of the “energy” when is perturbed by some kind of deformation, preserving the number of sides: this strategy is allowed precisely by the crucial information is a classical polygon, whose boundary does not contain self-intersections.
No side entirely outside . Indeed, assume that has a side which does not intersect . By the right inequality in (65), there exists another side which intersects . We move simultaneously and , both in a parallell way to themselves, respectively towards the interior and towards the exterior of : the area is preserved while the energy increases, contradicting optimality.
No side with one vertex in and one vertex outside . Indeed, assume that as such a side . We perform a rotation of around its mid-point, in such way that the vertex of which lies inside moves towards the exterior of : the area is preserved at first order, while the energy increases, again contradicting optimality.
No vertex in . Assume by contradiction that some side of has an endpoint in . By the previous claim, we know that the other endpoint of cannot lie outside , hence it lies either in or on . On the other hand, by the left inequality in (65), we can exclude that all vertices of lie in . We deduce that necessarily contains a chain of consecutive sides, all entirely contained into , such that the first and the last sides in the chain have one vertex in and the other one in , while all the intermediate sides in the chain are entirely contained into . Now, we can move to all the vertices of the chain lying in so to construct another polygon which satisfies the inclusions (66). Starting from this polygon and arguing as in Remark 16, we find another polygon , with , which has a strictly larger energy, contradicting optimality.
* is inscribed into a circle, concentric with , of radius .* By the previous item, we may associate with each side of a chord of , given by its intersection with (a priori possibly coinciding with the side itself). We perform a rotation of a fixed arbitrary side around its mid point: the first order optimality conditions yield that the mid-point of any side must coincide with the mid-point of the chord associated with it (apply Lemma 19, eq. (79) with and eq. (81) from the Appendix in Section 9). We infer that all the vertices of have the same distance from the center of , namely that is inscribed into a circle concentric with . By the left inequality in (65), this circle has radius strictly larger than .
* is a regular polygon.* We make a simultaneous parallel movement of two different sides in such a way to preserve the area of . Denoting by the lengths of the sides of and by the lengths of the corresponding chords (obtained by intersecting the sides with ), the first order optimality conditions yield
[TABLE]
(apply Lemma 19, eq. (82) with and eq. (84)). Combined with the previous item, this yields that for every , and hence is a regular polygon.
* is the regular -gon of area .* We already know from the previous steps that is a regular polygon, with number of vertices at most and area at most .
If has a number of sides strictly less than , we can add a side just by “cutting” a corner which lies outside . In this case we obtain a new optimal polygon with an edge not intersecting , which contradicts the first step of this proof. Hence, the optimal polygon is a regular -gon centred with . Then, since regular -gons centred with are monotone by inclusions, the optimal polygon must have the maximal admissible area, i.e. area equal to . ∎
9. Appendix: first and second order shape derivatives
9.1. General formulas
When is an integrand of class , integral energies on such as
[TABLE]
are twice differentiable with respect to domain perturbations. More precisely, given a Lipschitz velocity field , let denote a one parameter family of diffeomorphisms from into itself with initial velocity , i.e. .
The first and second order Fréchet shape derivatives of , meant respectively as
[TABLE]
exist, and their computation as stated in the next lemma is classical, see [31, Theorem 5.2.2, eq. (5.11)], [33, Section 2].
Lemma 18**.**
Assume is of class . Then, for any open bounded Lipschitz domain and any Lipschitz deformation , it holds
[TABLE]
and
[TABLE]
Since the above result is valid in every space dimension, it applies in particular to energies of the type on . In that case, we need to consider “doubled” vector fields of the form , see Section 9.3 for more details.
9.2. First order shape derivative under rotation/ parallel movement of a side
We give hereafter the expressions of the first order shape derivatives for the energies
[TABLE]
when a polygon is perturbed by two distinct relevant deformations, preserving the number of sides, which have been previously considered in [10] (see also [6, 24]). We enclose their definitions to make the presentation self-contained. Let be a fixed polygon, and let be a fixed side of , with consecutive sides and .
(i) The polygons are obtained from by rotation of the side around its mid-point if they are obtained from by keeping the other sides are fixed, and replacing the three sides by the new sides described as follows
- –
lies on the straight-line obtained by rotating of an oriented angle , around the mid-point of , the straight-line containing ;
- –
and lie on the same straight-line containing respectively and ;
- –
the lengths of , and , are chosen so that the three sides are consecutive.
(ii) The polygons are obtained from by parallel movemenf of the side if they are obtained from by keeping the other sides are fixed, and replacing the three sides by the new sides described as follows
- –
lies on the straight-line parallel to having signed distance from ;
- –
and lie on the same straight-line containing respectively and ;
- –
the lengths of , and , are chosen so that the three sides are consecutive.
It follows from Lemma 18 that
[TABLE]
where , X=\frac{d}{d{\varepsilon}}\phi_{\varepsilon}\big{|}_{\varepsilon=0} is the initial velocity of the deformation, and is the unit outward normal defined -a.e. on .
Now, some elementary geometric considerations show that, in cases (i) and (ii), the normal component of the velocity field is given respectively by
[TABLE]
where are the endpoints of and is its midpoint.
We end up with the following
Lemma 19**.**
(i) If are obtained from by rotation of the side around its mid-point, it holds
[TABLE]
In particular, it follows from (79) taking that
[TABLE]
(ii) If are obtained from by parallel movement of the side with respect to itself, it holds
[TABLE]
In particular, it follows from (79) taking that
[TABLE]
∎
9.3. Gradient and Hessian under vertices displacement
Let be a -gon with vertices , and let be a triangulation of such that the edges of are edges of some triangles . An example is shown in Figure 9. Following [33, 5], let denote the piece-wise affine function on the triangulation such that .
Given vectors and which perturb the vertices , respectively, consider the double perturbation fields
[TABLE]
From the general results recalled in Section 9.1, we have
[TABLE]
Now, the different terms appearing in the above equalities can be made explicit by using the expressions of (the computations are similar to those in [33, Appendix A]). We have the following formulas:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Inserting these formulas into the expressions of and , we find that
[TABLE]
where the vector and the matrix , representing respectively the gradient and the Hessian of with respect to the vertices, are given by
[TABLE]
and
[TABLE]
*Note: On behalf of all authors, the corresponding author states that there is no conflict of interest. *
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Audet, Maximal area of equilateral small polygons , Amer. Math. Monthly 124 (2017), no. 2, 175–178.
- 2[2] C. Audet, P. Hansen, F. Messine, and J. Xiong, The largest small octagon , J. Combin. Theory Ser. A 98 (2002), no. 1, 46–59.
- 3[3] C. Audet, P. Hansen, and D. Svrtan, Using symbolic calculations to determine largest small polygons , J. Global Optim. 81 (2021), no. 1, 261–268.
- 4[4] H. Bieri, Ungelöste Probleme: Zweiter Nachtrag zu Nr. 12 , Elem. Math. 16 (1961), 105–106.
- 5[5] B. Bogosel and D. Bucur, On the Polygonal Faber-Krahn Inequality , Arxiv ar Xiv:2203.16409.
- 6[6] M. Bonacini, R. Cristoferi, and I. Topaloglu, Riesz-type inequalities and overdetermined problems for triangles and quadrilaterals , J. Geom. Anal. 32 (2022), no. 2, Paper No. 48, 31.
- 7[7] J. Bourgain, H. Brezis, and P. Mironescu, Another look at Sobolev spaces , Optimal control and partial differential equations, IOS, Amsterdam, 2001, pp. 439–455.
- 8[8] H.J. Brascamp, E.H. Lieb, and J.M. Luttinger, A general rearrangement inequality for multiple integrals , J. Functional Analysis 17 (1974), 227–237.
