Lebesgue and gaussian measure of unions of basic semi-algebraic sets
Jean Lasserre (LAAS-MAC), Youssouf Emin (LAAS-MAC)

TL;DR
This paper introduces a systematic numerical method using semidefinite programming to approximate the measure of unions of semi-algebraic sets with arbitrary precision, leveraging available moments of the measure.
Contribution
It develops a hierarchy of semidefinite programs that converges to the measure of unions of semi-algebraic sets, enabling precise approximation from both above and below.
Findings
Convergent hierarchy of semidefinite programs for measure approximation.
Approximation of moments of the measure restricted to semi-algebraic sets.
Method applicable to Lebesgue measure with compact sets.
Abstract
Given a finite Borel measure on R n and basic semi-algebraic sets \_i R n , i = 1,. .. , p, we provide a systematic numerical scheme to approximate as closely as desired (\cup\_i \_i), when all moments of are available (and finite). More precisely , we provide a hierarchy of semidefinite programs whose associated sequence of optimal values is monotone and converges to the desired value from above. The same methodology applied to the complement R n \ (\cup\_i \_i) provides a monotone sequence that converges to the desired value from below. When is the Lebesgue measure we assume that := \cup\_i \_i is compact and contained in a known box B and in this case the complement is taken to be B \ . In fact, not only () but also every finite vector of moments of \_ (the restriction of …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Optimization and Variational Analysis · Advanced Control Systems Optimization
Lebesgue and Gaussian measure of unions of basic semi-algebraic sets
Jean B. Lasserre
LAAS-CNRS and Institute of Mathematics
University of Toulouse
LAAS, 7 avenue du Colonel Roche
31077 Toulouse Cédex 4, France
and
Youssouf Emin
Ecole Polytechnique
91 128 Palaiseau Cedex, France
Jean B Lasserre: 7 Avenue du Colonel Roche, BP 54200, 31031 Toulouse cedex 4, France.
Tel: +66561336415; Fax: +33561336936; Email: [email protected]
Youssouf Emin: Ecole Polytechnique, 91 128 Palaiseau, Cedex, France
Email: [email protected]
Abstract
Given a finite Borel measure on and basic semi-algebraic sets , , we provide a systematic numerical scheme to approximate as closely as desired , when all moments of are available (and finite). More precisely, we provide a hierarchy of semidefinite programs whose associated sequence of optimal values is monotone and converges to the desired value from above. The same methodology applied to the complement provides a monotone sequence that converges to the desired value from below. When is the Lebesgue measure we assume that is compact and contained in a known box and in this case the complement is taken to be . In fact, not only but also every finite vector of moments of (the restriction of on ) can be approximated as closely as desired, and so permits to approximate the integral on of any given polynomial.
Keywords: Lebesgue and Gaussian measure; semi-algebraic sets; moment problem and sums of squares; semidefinite programming; convex optimization
MSC: 44A60 28A75 90C05 90C22
1. Introduction
Given a set and a finite Borel measure on , computing is a very challenging problem. In fact even approximating the Lebesgue volume of a convex body (e.g. a polytope) is difficult; see e.g. Bollobás [2] and Dyer and Frieze [8]. However, in the latter case some efficient (non deterministic) algorithms with probabilistic guarantees are available and for more details the interested reader is referred to e.g. Dyer et al. [9], Cousins and Vempala [3, 4] and the references therein.
In the non convex case no such algorithm is available and one is left with approximating with Monte Carlo (or Quasi-Monte-Carlo) type methods as described in e.g. Niederreiter [17]. That is, one first generates a sample of points in following the distribution on and then one counts the number of points that fall into . This realization of the random variable provides an estimate of but by no means an upper bound or a lower bound on . Of course this method is quite fast, especially is small dimension.
Yet, as is indeed very difficult to compute exactly, a less ambitious but still useful goal would be to provide upper and/or lower bounds on . Even better, a converging sequence of upper (or lower) bounds would be highly desirable. This is the strategy proposed in Henrion et al. [10] when is a compact basic semi-algebraic set and is the Lebesgue measure. In [10] the authors have provided a (deterministic) numerical scheme which yields a monotone sequence of upper bounds converging to . It consists of solving a hierarchy of semidefinite programs of increasing size. By repeating the procedure but now with the complement , one also obtains a monotone sequence of lower bounds converging to . However, even on typical or -dimensional examples, the convergence was rather slow and the authors proposed a slight modification which turned out to be much more efficient; the convergence was much faster but unfortunately not monotone anymore.
Contribution
The purpose of this paper is to introduce a deterministic method to approximate (in principle as closely as desired) the measure of the union of finitely many basic semi-algebraic set. The finite Borel measure is any measure whose all moments are finite, e.g., the Lebesgue measure when is compact, the Gausssian measure for non-compact set .
The method is similar in spirit to the one in [10] for a compact basic semi-algebraic set and the one in [13] for computing Gaussian measures of basic closed semi-algebraic sets (not necessarily compact), but with two important novelties.
In contrast to [10] and [13], we consider a finite union of (non disjoint) basic semi-algebraic sets, which complicates matters significantly.
We include a technique to accelerate the convergence different from the one described in [10]. Indeed in contrast to [10], it has the highly desirable feature to maintain the monotone convergence to which is essential if one wishes to obtain upper and lower bounds. It consists of using moments constraints coming from a particular application of Stokes’ theorem.
In fact this numerical scheme allows to approximate not only but also any fixed finite sequence of moments of the measure (where is the restriction of to ).
Remark 1.1**.**
One might invoke the inclusion-exclusion principle which states that
[TABLE]
so that in principle it suffices to compute (or approximate) for all possible intersections of the ’s, e.g. by the approach of [10] or [13]. But this approach has two major drawbacks. First there are possibly such sets and secondly, to compute an upper bound one has to compute an upper bound for such intersections with an odd number of elementary sets , and a lower bound for such intersections with an even number of elementary sets. The latter lower bound in turn is obtained by computing an upper bound for the complement. This makes the whole procedure tedious and complicated. Finally, Bonferroni’s inequalities also provide a (finite) sequence of upper and lower bounds on but computing those bounds involves sums similar to the right-hand-side of (1.1), hence with the same drawbacks just mentioned. Our proposed technique is direct with no partial computation on intersections of elementary sets .
Of course, the technique described in this paper is computationally expensive. In particular, its applicability is limited by the performance of the state-of-the-art semidefinite solvers because the size of the semidefinite programs increases fast with the rank in the hierarchy. Therefore it makes its application limited to small dimensional problems (). For higher dimensions only a few steps in the hierarchy can be implemented and therefore only upper and lower bounds (possibly crude) are expected. But the reader should keep in mind that the problem is very difficult and to the best of our knowledge we are not aware of an algorithm (at least at this level of generality) which provides certified upper and lower bounds with such convergence properties (even for convex sets and in particular for non compact sets ). In our opinion this methodology should be viewed as complementary to (rather than competing with) probabilistic methods.
2. Notation, definitions and preliminary results
2.1. Notation and definitions
Let be the ring of polynomials in the variables . Denote by the vector space of polynomials of degree at most , which has dimension , with e.g., the usual canonical basis of monomials, where , is the set of natural numbers including [math] and . Also, denote by (resp. ) the cone of sums of squares (s.o.s.) polynomials (resp. s.o.s. polynomials of degree at most ). If , we write in the canonical basis and denote by its vector of coefficients. Finally, let denote the space of real symmetric matrices, with inner product . We use the notation (resp. ) to denote that is positive semidefinite (definite). With , the quadratic module generated by polynomials , is defined by:
[TABLE]
Definition 2.1** (Archimedean assumption).**
The quadratic module is Archimedean if there exists such that the quadratic polynomial belongs to . Notice that is an algebraic certificate that the set is compact.
If the set is compact then for some , and one may always include the redundant quadratic constraint in the definition of without changing . Then the quadratic module is Archimidean.
Moment and localizing matrix
With a real sequence , one may associate the (Riesz) linear functional defined by
[TABLE]
Denote by the moment matrix associated with , the real symmetric matrix with rows and columns indexed in the basis of monomials , and with entries:
[TABLE]
Next, given , denote by the localizing moment matrix associated with and , the real symmetric matrix with rows and columns indexed in the basis of monomials , and with entries:
[TABLE]
If is the sequence of moments of some Borel measure on then for all . However the converse is not true in general and it is related to the well-known fact that there are positive polynomials that are not sums of squares. Similarly, if the support of is contained in then for all . For more details the interested reader is referred to e.g. [14, Chapter 3].
Given a Borel set let be the space of finite signed Borel measures on and let be the convex cone of finite Borel measures on .
2.2. The measure of a basic semi-algebraic set
Let with and let be a finite Borel measure whose support is . (Typically is the Lebesgue measure on a box and one wishes to compute the Lebesgue volume ; alternatively , is the Gaussian measure and one wishes to compute .)
An infinite-dimensional linear program
Let be positive almost everywhere on and consider the following infinite-dimensional LP problem :
[TABLE]
Theorem 2.2** ([10]).**
The measure (the restriction of to ) is the unique optimal solution of . In particular, if for all , then .
Semidefinite relaxations
Of course problem in (2.2) is infinite-dimensional and cannot be solved directly. However, when is a basic semi-algebraic set then Theorem 2.2 can be further exploited. So given , let be the basic semi-algebraic set
[TABLE]
assumed to nonempty and compact. Let and let be a finite Borel measure whose all moments with
[TABLE]
are available in closed form or can be computed.
To approximate as closely as desired in [10] the authors propose to solve the following hierarchy of semidefinite programs111A semidefinite program (SDP) is a conic convex optimization problem with a remarkable modeling power. It can be solved efficiently (in time polynomial in its input size) up to arbitrary precision fixed in advance; see e.g. Anjos and Lasserre [1] indexed by :
[TABLE]
Observe that is a relaxation of , and so for all . In addition, the sequence is monotone non increasing. The dual of (2.3) is the semidefinite program:
[TABLE]
and by weak duality, for all .
Theorem 2.3** ([10]).**
Assume that is Archimedean. Then as . If has nonempty interior then and (2.4) has an optimal solution .
So when , provides us with a monotone sequence of upper bounds on . Unfortunately the convergence is rather slow as observed on several numerical examples. This is because in the dual (2.4) the optimal solution tries to approximate from above (in ) the discontinuous function , which implies an annoying Gibb’s phenomenon222The Gibbs’ phenomenon appears at a jump discontinuity when one approximates a piecewise function with a continuous function, e.g. by its Fourier series.. To remedy this problem the authors in [10] propose to use a polynomial , nonnegative on and which vanishes on . In this case the convergence as is still monotone and if denotes an optimal solution of (2.3) then as . However, while faster than with , the latter convergence of to is not monotone anymore, a rather annoying feature which prevents from obtaining a non increasing sequence of upper bounds.
3. Main result
The context
Let be a box, and for every , let , for some polynomials . Assume that has been chosen so as to satisfy:
[TABLE]
The goal is to provide a numerical scheme to approximate as closely as desired the Lebesgue volume . (We will see how to adapt the methodology to also approximate as closely as desired when is not necessarily compact and is a Gaussian measure.) One possible approach described below is to use the powerful inclusion-exclusion principle and/or the associated Bonferroni inequalities.
3.1. The inclusion-exclusion principle and Bonferroni Inequalities
Let :
[TABLE]
By the inclusion-exclusion principle,
[TABLE]
which allows us to work with intersections of the ’s only. In addition, the Bonferroni inequalities state that
[TABLE]
which provides sequences of (increasingly tighter) upper and lower bounds.
Therefore to compute we only have to compute the measure of the intersection , for all . Notice that there are such sets. As each is a compact basic semi-algebraic set, one may apply the methodology described in §2.2, to obtain a sequence which converges to as , and therefore
[TABLE]
Notice that the convergence is not monotone non increasing even if one solves (2.3) with because we sum up negative and positive terms. To maintain the monotone convergence (when ) it suffices to compute a lower bound on the complement when is even. However as already mentioned the convergence is expected to be rather slow.
To accelerate the convergence one may use when one solves (2.3) with as on and on . But then the convergence
[TABLE]
(where is an optimal solution of (2.3) with ) is not monotone anymore.
3.2. A direct approach
In this section we describe a direct approach with two distinguishing features:
- •
It does not use the inclusion-exclusion principle and the need to approximate for all such sets.
- •
The convergence to (and also to is monotone non increasing, that is, we can compute two sequences and such that:
[TABLE]
Recall that any finite number of moments
[TABLE]
are either available in closed-form or can be obtained numerically.
A multi infinite-dimensional linear program
As in §2.2 we first introduce an infinite-dimensional LP problem whose unique optimal solution is the restriction of on (and whose dual has a clear interpretation).
Let be positive almost everywhere on and consider the following infinite-dimensional LP problem :
[TABLE]
Theorem 3.1**.**
Problem has an optimal solution and every optimal solution satisfies , where is the restriction of to .
Proof.
We first prove that has an optimal solution. The set is weakly sequentially compact; see e.g. Dunford & Schwartz [7, Theorem 1, p. 305]. Therefore let be a maximizing sequence of feasible solutions of . There exists a subsequence such that for every , for some . The above weak convergence and implies , that is, for all . Weak convergence again implies and so is a feasible solution of . Finally weak convergence also implies
[TABLE]
which proves that is an optimal solution of .
We next prove that . Indeed, firstly observe that for every feasible solution of , . On the other hand, for every , denote by the measurable function defined on by :
[TABLE]
The (discontinuous) functions form a partition of unity subordinate to the open cover . For every , let be the finite Borel measure defined by:
[TABLE]
Hence, . Therefore is a feasible solution of such that , and so , i.e., is an optimal solution of . In fact, every optimal solution of satisfies , and therefore is an optimal solution of . By Theorem 2.2 this solution is unique, which yields the desired result. ∎
A hierarchy of semidefinite relaxations
Let be the sequence of all moments of , that is,
[TABLE]
Let be a box and be a compact semi-algebraic as in (3.1). With no loss of generality and possibly after scaling, we may and will assume that and is a probability measure. Therefore for all .
Let and let be a given polynomial positive almost everywhere on (and define ). For , consider the following hierarchy of semidefinite programs indexed by :
[TABLE]
Observe that for all . Indeed, if is the sequence of moments of an optimal solution of in (3.2) then is also a feasible solution of .
Theorem 3.2**.**
Consider the semidefinite programs , . Then :
(i) has an optimal solution and the associated sequence of optimal values is monotone non increasing and converges to , that is:
[TABLE]
(ii) Let be an optimal solution of . Then for each :
[TABLE]
and in particular .
Proof.
For a sequence , let and recall that if then for every ; see [14, Proposition 3.6]. Next, observe that from and ,
[TABLE]
Hence the diagonal elements are all nonnegative which in turn implies for all . As then by [14, Proposition 3.6 ] for every , and so the feasible set of semidefinite program is closed, bounded, hence compact, and therefore has an optimal solution.
Next, let be an optimal solution of and by completing with zeros, make an element of the unit ball of (where is the Banach space of bounded sequences, equipped with the sup-norm). As is the topological dual of , by the Banach-Alaoglu Theorem, there exists and a subsequence such that as , for the weak topology . In particular,
[TABLE]
Next let be fixed arbitrary. From the pointwise convergence (3.7) we also obtain and for every . Similary, for every and . As was arbitrary, by Putinar’s Positivistellensatz [18], has a representing measure supported on for all , and . In particular from (3.7), as ,
[TABLE]
Therefore is admissible for problem with value , and so is an optimal solution of . Finally, by Theorem 3.1, . And so for each :
[TABLE]
As the converging subsequence was arbitrary, it follows that in fact the whole sequence converges to , for all , that is, (3.6) holds. ∎
The dual of
Let for all , . The dual of the semidefinite program is the semidefinite program:
[TABLE]
Proposition 3.3**.**
Assume that for every , both and have nonempty interior. Then there is no duality gap between (3.5) and its dual (3.8), that is, for all . Moreover (3.8) has an optimal solution .
Proof.
Let be the measures defined in (3.3) the proof of the Theorem 3.1 and let be the sequence of moments up to degree of , . As every has nonempty interior, then clearly and for every and . As also has nonempty interior then . Therefore Slater’s condition holds for . In addition, the set of admissible solution of is nonempty (set and for all ), and therefore a standard result in conic convex optimization yields the desired result333In fact as the set of optimal solutions of (3.5) is compact, the absence of a duality gap between (3.5) and (3.8) also follows from [19] without the conditions and ..
∎
As in the case of a basic closed semi-algebraic set, when is the constant function the convergence is monotone non increasing, a highly desirable feature. However in typical examples this convergence is rather slow. Again one may take for a function that is nonnegative on and which vanishes on . This accelerates the convergence both and as , but if by construction the former is monotone non increasing, the latter is not monotone anymore, a rather annoying feature if the goal is to obtain a converging sequence of upper bounds. In the next section we describe a technique that allows to accelerate significantly the convergence as , while maintaining its monotone non increasing character.
3.3. Convergence improvement using Stokes’ formula
In this section we show how to improve significantly the monotone non increasing convergence of (i.e. with ) to . To do this we will use Stokes’ theorem for integration and in the sequel, to avoid technicalities we assume that is the closure of its interior, i.e., . The basic idea is simple to express in informal terms.
Since we know in advance that in (3.3) is an optimal solution of problem , every additional information in terms of linear constraints on the moments of can be included in without changing its optimal value. BUT when included in the relaxation it will provide useful additional constraints that restrict the feasible set of and so make its optimal value necessarily smaller.
Suppose for the moment that is compact with smooth boundary, and assume that the measure has a density with respect to Lebesgue measure of the form for some polynomial . Let be a given vector field and . Then Stokes’ theorem states:
[TABLE]
where is outward pointing normal at , and is the -dimensional Hausdorff measure on . In particular if vanishes on and (where ) Stokes’ formula becomes
[TABLE]
To exploit (3.9) in our particular context where is defined in (2.2), let and let with arbitrary. Then vanishes on and on for all , . Hence by (3.9):
[TABLE]
for all , , where
[TABLE]
Recalling how is defined in (3.3), it can be written as
[TABLE]
where each is supported on and has a constant density w.r.t. . Therefore, for every :
[TABLE]
Hence (3.12) provides additional useful information on the optimal solution of defined in (3.3). Namely it translates into
[TABLE]
i.e., linear constraints on the moments of , for every .
Plugging this additional linear constraints on the moments of into the relaxation , yields the following new hierarchy of SDP-relaxation :
[TABLE]
By construction holds for every , and the analogue of Theorem 3.2 (with ) reads:
Theorem 3.4**.**
Consider the semidefinite programs , , defined in (3.13). Then :
(i) has an optimal solution and the associated sequence of optimal values is monotone non increasing and converges to , that is:
[TABLE]
(ii) Let be an optimal solution of . Then for each :
[TABLE]
The proof being almost a verbatim copy of that of Theorem 3.2, is omitted.
The important feature of Theorem 3.4 is that we now have the monotone non increasing convergence (compare with (3.6) (with ) in Theorem 3.2).
3.4. Gaussian measure of non compact sets
So far Theorem 3.2 and Theorem 3.4 have been given for supported on a box , and so only for sets in (3.1) that are compact.
It turns out that for a Gaussian measure of (possibly non-compact) sets , Theorem 3.2 (resp. Theorem 3.4) is still valid with exactly the same statement and exactly the same semidefinite relaxations (3.5) (resp. (3.13)), except that now is the vector of moments of the Gaussian measure (instead of the moments of the Lebesgue measure on previously).
However in the gaussian case the proof of Theorem 3.2(i)-(ii) and Theorem 3.4(i)-(ii) uses quite different arguments (some already used in [13] for a basic semi-algebraic set). Indeed as is not necessarily compact :
-
The uniform bound is not valid any more for the relaxations and .
-
One cannot invoke Putinar’s Positivstellensatz [18] any more.
-
The standard version of Stokes’ theorem where is compact cannot be invoked anymore either.
The new arguments that we need are the following:
A crucial fact is that satisfies Carleman’s condition
[TABLE]
Then a sequence such that for all , and
[TABLE]
has a unique representing measure on which is moment determinate; see for instance [14, Proposition 3.5, p. 60].
If in addition for all (where ), and as satisfies (3.15), then for all in the support of ; see Lasserre [15]. This argument is used to show that is supported on .
To obtain a version of Stokes for non-compact set with boundary , we invoke a limiting argument that uses (the standard) Stokes’s theorem on the compact (where ). Letting and using the Monotone and Bounded Convergence theorems yields the desired result. For more details the reader is referred to [13] where such arguments have been used in the case of a basic semi-algebraic set.
Finally it is worth emphasizing that this methodology also works for any measure that satisfies (3.15) (and whose moments are known or can be computed); an important spacial case is the exponential measure on the positive orthant .
Remark 3.5**.**
As mentioned above, in [13] the first author has already used Stokes’ formula to accelerate the convergence of a hierarchy of semidefinite relaxations to approximate the Gaussian measure of a basic semi-algebraic set , not necessarily compact. The important and non trivial novelty here is that (i) is now a union of basic semi-algebraic sets, and (ii) even if this complicates matters significantly, we are still able to work with measures , each supported on (a basic semi-algebraic set). It turns out that where is each has a piecewise constant density w.r.t. (constant on each of the possible intersections ). By using a family of polynomials that all vanish on the boundary of each , we can exploit Stokes’ Theorem on each piece and sum up to obtain a family of linear constraints on the moments of .
4. Numerical experiments and discussion
For illustration purposes we have applied the methodology on a few (simple) examples. We report some numerical experiments carried out in Matlab and GloptiPoly3 [11], a software package for manipulating and solving generalized problems of moments. The SDP problems were solved with SeDuMi 1.1R3.
4.1. Lebesgue volume of a union of two ellipsoids
We first consider a simple example of two ellipsoids in where the exact value can be computed exactly so that we can compare with our upper bounds. So we want to compute the Lebesgue measure of with and . In this example we take .
The results are displayed in the Figure 1 with: in orange the approximation of the Lebesgue volume without using Stokes’ formulas, in red the approximation when using Stokes’ formulas and in blue the exact value of .
We next consider a union of two ellipsoids in dimension . Let , , and . Results are displayed in Figure 2. In both examples one can check that the convergence is much faster when using Stokes’ formula.
4.2. Lebesgue measure a union of three ellipsoids
We next consider a union of three ellipsoid in dimension , with:
[TABLE]
[TABLE]
[TABLE]
and . In Figure 3 we also compare our results with those obtained when using Bonferroni inequalities. In red the upper bounds obtained by solving , in orange the lower bounds obtained by solving for the complement, and in blue the upper bounds obtained by using Bonferroni inequalities. (For a fair comparison, for each relaxation in Bonferroni case we also use appropriate Stokes’ constraints.)
4.3. Examples for the Gaussian measure
In this section we consider the Gaussian measure with variance . For each example we have computed two upper-bounds and two lower-bounds for . The first (resp. second) upper-bound (resp. ) is obtained by solving the semidefinite relaxation (resp. ). Similary, the lower-bounds (resp. ) are obtained from upper bounds for the complement . The respective relative error-gap are denoted by and .
Example 1**.**
In this example is the union of two ellipsoids. Let with and for the values
[TABLE]
,
[TABLE]
In this case can be computed exactly and so we have displayed the values of the relative errors denoted by and respectively, depending on whether or not we have used Stokes’ formula. As one can see in Table 1 for a reasonable value the relative error (when using Stokes’ formula) is quite good. The respective behaviors are displayed in Figure 4.
Example 2**.**
Consider with and for the values
,
[TABLE]
In this case is not a compact set as it is unbounded. The results for displayed in Table 2 show that a good value is already obtained when using Stokes’ formula. The respective behaviors of and displayed in Figure 5 also show that using Stokes’ formula yields a significant improvement.
Example 3**.**
Consider with and for the values
,
[TABLE]
Again is not compact. The results in Table 3 and the respective behaviors of and displayed in Figure 6 confirm that using Stokes’ formula yields a significant improvement.
Example 4**.**
We next consider an example in dimension . Let and for the values
,
[TABLE]
Results for are displayed in Table 4 and the relative errors and are displayed in Figure 7. The quality of results is comparable to that in Examples 2 and 3 for .
Example 5**.**
Still in dimension , let with and , where and
[TABLE]
The relative errors and are displayed in Figure 8.
One can see that in all examples quite good approximations are obtained with relatively few moments (up to order for and for ) provided that we use the hierarchy (3.13) with the additional moments constraints induced by Stokes’ formula. The convergence of the hierarchy (3.5) (without those Stokes constraints) is indeed much slower.
For all the examples that we have treated, the (crucial) moment and localizing matrices involved in (3.5) and in (3.13) have been expressed in the canonical basis of monomials for simplicity and easyness of implementation of the SDP relaxations. But this choice is in fact the worst from a numerical point of view (numerical stability and robustness) which prevented us from solving (3.5) and (3.13) for when . It is very likely that the basis of orthonormal polynomials w.r.t. (Legendre for the Lebesgue measure on and Hermite for the Gaussian measure ) is a much better (and recommended) choice. Such a more sophisticated implementation was beyond the scope of this paper.
Conclusion
In this paper we have provided a numerical scheme to approximate as closely as desired the measure of a finite union of basic semi-algebraic sets (the case of a single basic semi-algebraic set was treated in [13])). Surprisingly, even though the case of a union of semi-algebraic sets complicates matters significantly we are still able to adapt the methodology developed in [13] and provide a monotone non-increasing (resp. non-decreasing) sequence of upper (resp. lower) bounds that converges to as the number of moments considered increases. In addition we are also able to use additional moment constraints induced by an appropriate application of Stokes’ Theorem which permits to improve significantly the convergence. In fact those additional moment constraints are crucial to obtain good bounds rapidly as they permit strongly attenuate a Gibbs’ phenomenon that otherwise appears.
Our current implementation could be significantly improved by using a basis for polynomials more appropriate than the usual canonical basis of monomials (the worst choice from a numerical stability point of view). For instance in doing so it should be possible to implement step of the hierarchy in dimension , and step for . As the convergence seems to be fast, each additional step of the hierarchy can yield a significant improvement.
The methodology was presented for the Lebesgue measure when is compact and the Gaussian measure for non-compact sets , but in fact and remarkably, the same methodology works for any measure that satisfies Carleman’s condition and provided that all its moments are available (or can be computed easily).
Of course the methodology proposed in this paper is computationally expensive, especially when compared with Monte-Carlo type methods. But the latter provide only an estimate of and by no means an upper or lower bound on and therefore these two types of methods should be seen as complementary rather than competing. In its present form it is also limited to small dimension problems (typically ) because since each upper (or lower) bound requires to solve a semidefinite program whose size increases fast in the hierarchy, one is limited by the current efficiency of state-of-the-art semidefinite solvers. However to the best of our knowledge this is the first method that provides a sequence of upper and lower bounds with strong asymptotic guarantees, at least at this level of generality.
Acknowledgement
Research funded by by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement ERC-ADG 666981 TAMING)”
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Anjos M. , Lasserre J.B. (Eds.). Handbook of Semidefinite, Conic and Polynomial Optimization , Springer, New York, 2012.
- 2[2] Bollobás B. Volume estimates and rapid mixing. In: Flavors of Geometry , MSRI Publications 31, 1997, pp. 151–180.
- 3[3] Cousins B., Vempala S. A cubic algorithm for computing gaussian volume. Proceedings of the 2014 ACM-SIAM Symposium on Discrete Algorithms (SODA 14), Portland, January 2014.
- 4[4] Cousins B., Vempala S. A Practical Volume Algorithm, Math. Program. Comput. 8 , pp. 133–160, 2016.
- 5[5] Curto R.E., Fialkow L.A. Flat extensions of positive moment matrices: recursively generated relations , Memoirs. Amer. Math. Soc. 136 , AMS, Providence, 1998.
- 6[6] Curto R.E., Fialkow L.A. The truncated K-moment problem in several variables, J. Operator Theory 54 , pp. 189–226, 2005.
- 7[7] Dunford N., J. Schwartz. Linear Operators. Part I: General Theory , John Wiley & Sons, Inc., New York, 1958.
- 8[8] Dyer M.E., Frieze A.M. The complexity of computing the volume of a polyhedron, SIAM J. Comput. 17 , pp. 967–974, 1988.
