Stochastic Polynomial Optimization
Jiawang Nie, Liu Yang, Suhan Zhong

TL;DR
This paper introduces a stochastic polynomial optimization framework using sample averages and Moment-SOS relaxations, analyzing its properties and demonstrating effectiveness through numerical experiments.
Contribution
It presents a novel approach combining sample average approximation with Moment-SOS relaxations for stochastic polynomial optimization.
Findings
Effective solution method demonstrated via numerical experiments
Properties of the proposed optimization and relaxations analyzed
Sample average approach integrated with Moment-SOS relaxations
Abstract
This paper studies stochastic optimization problems with polynomials. We propose an optimization model with sample averages and perturbations. The Lasserre type Moment-SOS relaxations are used to solve the sample average optimization. Properties of the optimization and its relaxations are studied. Numerical experiments are presented.
| 0 | ||||||
| \@slowromancapi@ | solvable? | yes | yes | yes | yes | yes |
| time(sec.) | 1.13 | 1.34 | 1.23 | 1.23 | 1.23 | |
| 1.29e-06 | 6.41e-07 | 2.38e-07 | 1.30e-07 | 4.75e-08 | ||
| 2.05e-02 | 2.05e-02 | 2.07e-02 | 3.41e-02 | 3.33e-01 | ||
| \@slowromancapii@ | solvable? | yes | yes | yes | yes | yes |
| time(sec.) | 1.12 | 1.29 | 1.23 | 1.21 | 1.22 | |
| 9.15e-07 | 6.41e-07 | 2.06e-07 | 1.27e-07 | 1.53e-08 | ||
| 4.84e-04 | 4.86e-04 | 6.91e-04 | 1.38e-02 | 3.13e-01 | ||
| \@slowromancapiii@ | solvable? | no | yes | yes | yes | yes |
| time(sec.) | 2.14 | 2.78 | 1.40 | 1.16 | 1.15 | |
| n.a. | 1.13e+03 | 7.50e-08 | 1.25e-07 | 8.22e-09 | ||
| n.a. | 7.14e+05 | 3.85e-02 | 7.00e-03 | 2.94e-01 | ||
| \@slowromancapiv@ | solvable? | no | yes | yes | yes | yes |
| time(sec.) | 2.19 | 2.79 | 1.40 | 1.14 | 1.16 | |
| n.a. | 1.35e+03 | 1.89e-07 | 1.26e-07 | 1.54e-08 | ||
| n.a. | 6.52e+05 | 4.73e-04 | 1.32e-02 | 3.14e-01 | ||
| 0 | |||||
| \@slowromancapi@ | solvable? | yes | yes | yes | yes |
| time(sec.) | 0.37 | 0.29 | 0.15 | 0.26 | |
| 1.81e+05 | 1.29e-08 | 5.28e-09 | 9.53e-09 | ||
| 1.81e+05 | 1.48e-02 | 1.48e-02 | 1.47e-02 | ||
| \@slowromancapii@ | solvable? | yes | yes | yes | yes |
| time(sec.) | 0.23 | 0.12 | 0.14 | 0.12 | |
| 1.87e+05 | 1.29e-08 | 5.61e-09 | 9.64e-09 | ||
| 1.87e+05 | 6.13e-03 | 6.12e-03 | 6.12e-03 | ||
| \@slowromancapiii@ | solvable? | yes | yes | yes | yes |
| time(sec.) | 0.12 | 0.13 | 0.10 | 0.09 | |
| 5.58e-02 | 1.29e-08 | 5.61e-09 | 9.70e-09 | ||
| 1.40e-02 | 8.56e-03 | 8.56e-03 | 8.55e-03 | ||
| \@slowromancapi@ | solvable? | yes | yes | yes |
| time(sec.) | 0.17 | 0.13 | 0.12 | |
| 1.86e+03 | 4.18e-06 | 3.55e-06 | ||
| 3.44e+01 | 2.09e-04 | 2.30e-02 | ||
| \@slowromancapii@ | solvable? | yes | yes | yes |
| time(sec.) | 0.13 | 0.14 | 0.16 | |
| 1.85e+03 | 4.17e-06 | 3.56e-06 | ||
| 3.44e+01 | 1.29e-02 | 9.87e-03 | ||
| \@slowromancapi@ | |||||
| solvable? | no | yes | yes | yes | |
| time(sec.) | 0.12 | 0.10 | 0.07 | 0.09 | |
| n.a. | 5.31e-03 | 3.36e-04 | 1.09e-04 | ||
| n.a. | 5.44e-03 | 4.61e-04 | 2.34e-04 | ||
| \@slowromancapii@ | |||||
| solvable? | yes | yes | yes | yes | |
| time(sec.) | 0.08 | 0.08 | 0.07 | 0.06 | |
| 5.03e-09 | 1.84e-09 | 1.61e-09 | 1.86e-09 | ||
| 2.50e-04 | 2.50e-04 | 2.50e-04 | 2.50e-04 |
| \@slowromancapi@ | \@slowromancapii@ | \@slowromancapiii@ | \@slowromancapiv@ | |
| 2.11 | 1.96 | 2.01 | 2.02 | |
| 6.43 | 5.71 | 6.13 | 6.07 | |
| 0.807543 | 3.3618e-10 | 0.073413 | 0.146826 |
| \@slowromancapiii@ | \@slowromancapiv@ | |||||
| 0 | 0.1 | 0 | 0.2 | |||
| solvable? | no | yes | yes | no | yes | yes |
| time(sec.) | 0.23 | 0.20 | 0.11 | 0.10 | 0.16 | 0.07 |
| n.a. | 5.30e+01 | 2.02e-01 | n.a. | 6.47e+01 | 2.28e-01 | |
| n.a. | 5.39e+01 | 3.90e-01 | n.a. | 6.49e+01 | 2.96e-01 | |
| n.a. | 3.09e-02 | 1.28e-01 | n.a. | 5.86e-02 | 2.73e-01 | |
| n.a. | n.a. | |||||
| \@slowromancapi@ | \@slowromancapii@ | \@slowromancapiii@ | \@slowromancapiv@ | |
| 0.98 | 1.08 | 1.01 | 0.97 | |
| 1.06 | 0.96 | 1.02 | 0.96 | |
| 0.023094 | 0.023094 | 0.017321 | 1.1076e-09 |
| \@slowromancapii@ | \@slowromancapiii@ | |||||
| 0 | 0.05 | 0 | 0.05 | |||
| solvable? | no | yes | yes | no | yes | yes |
| time(sec.) | 0.06 | 0.21 | 0.06 | 0.06 | 0.13 | 0.05 |
| n.a. | 1.13e+02 | 1.36e-02 | n.a. | 5.65e+00 | 6.25e-03 | |
| n.a. | 1.13e+02 | 4.21e-03 | n.a. | 5.65e+00 | 2.80e-03 | |
| n.a. | 1.20e-03 | 1.85e-02 | n.a. | 6.08e-03 | 2.58e-02 | |
| n.a. | n.a. | |||||
| \@slowromancapi@ | \@slowromancapii@ | \@slowromancapiii@ | |
| 0.99 | 1.03 | 1.00 | |
| 1.32 | 1.38 | 1.33 | |
| 1.97 | 2.09 | 1.99 | |
| 0.508637 | 0.518810 | 0.508637 |
| \@slowromancapi@ | \@slowromancapii@ | \@slowromancapiii@ | ||||
| 0 | 0 | 0 | ||||
| solvable? | yes | yes | yes | yes | yes | yes |
| time(sec.) | 0.07 | 0.06 | 0.07 | 0.06 | 0.08 | 0.06 |
| 7.08e+01 | 1.07e-06 | 7.08e+01 | 8.44e-07 | 7.08e+01 | 1.07e-06 | |
| 2.07e+01 | 2.00e-02 | 2.07e+01 | 1.00e-02 | 2.07e+01 | 1.00e-02 | |
| 1.67e+00 | 2.97e-08 | 1.67e+00 | 2.97e-08 | 1.67e+00 | 2.97e-08 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Polynomial and algebraic computation · Risk and Portfolio Optimization
Stochastic Polynomial Optimization
Jiawang Nie
Jiawang Nie, Suhan Zhong, Department of Mathematics, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, USA, 92093.
[email protected],[email protected]
,
Liu Yang
Liu Yang, school of Mathematics and Computational Sciences, Xiangtan University, Xiangtan, Hunan, China, 411105.
and
Suhan Zhong
Abstract.
This paper studies stochastic optimization problems with polynomials. We propose an optimization model with sample averages and perturbations. The Lasserre type Moment-SOS relaxations are used to solve the sample average optimization. Properties of the optimization and its relaxations are studied. Numerical experiments are presented.
Key words and phrases:
stochastic optimization, polynomial, Moment-SOS relaxation, semidefinite program
2010 Mathematics Subject Classification:
90C15,90C22,90C31,65K05
1. Introduction
Stochastic optimization is about functions that depend on random variables. A typical stochastic optimization problem is
[TABLE]
where is a function in . The variable is a random vector, and the decision variable is required to be contained in a set . In (1.1), the symbol denotes the expectation of a function in the random vector . Frequently used methods for solving stochastic optimization are often based on sample average approximation (SAA). We refer to [5, 8, 17, 24, 25, 33, 35, 36, 38, 39, 41] for related work on stochastic optimization. The SAA methods use sample averages to approximate the expectation function , transforming the stochastic optimization into deterministic optimization. Many classical SAA methods assume the objective functions are convex and are based on evaluations of gradients or subgradients. They can also be applied to nonconvex problems, however, the global optimality may not be guaranteed. There exists relatively less work on nonconvex stochastic optimization [1, 9, 10]. Generally, nonconvex stochastic optimization problems are computationally challenging, because the deterministic case is already difficult.
This paper discusses the special case of stochastic polynomial optimization, i.e., is a polynomial function in and is a semialgebraic set defined by polynomials. When does not depend on , this is the case of polynomial optimization. The Lasserre type Moment-SOS relaxations are efficient and reliable for solving polynomial optimization [20, 23]. When depends on the random vector , the objective is the expectation of with respect to . Hence, is still a polynomial. When the distribution of is known explicitly (e.g, Gaussian, Poisson, etc), the objective can be expressed by integral formula. However, when the distribution of is not known exactly, or its density function is too complicated for evaluating the expectation, it is not practical to get an explicit formula for . In most methods for stochastic optimization, the objective is approximated by sample averages. In this article, we discuss how to use the efficient Moment-SOS relaxation for solving stochastic optimization with sample average approximation.
In stochastic polynomial optimization, we assume that
[TABLE]
is a polynomial in . Here, each coefficient is a measurable function in . The feasible set is assumed to be in the form
[TABLE]
for given polynomials in . For each , is a measurable function in . So the objective is also a polynomial. The stochastic optimization can be expressed as
[TABLE]
The coefficients of the polynomial are typically not known explicitly, because the true distribution of is usually not known exactly. However, they can be estimated by sample average approximation. In applications, we can generate samples of , say, random samples . The expectation function can be approximated by the sample average
[TABLE]
If each sample obeys the same distribution of , then . Furthermore, when all are independently identically distributed (i.i.d.), the Law of Large Numbers (LLN) (see [15]) implies that
[TABLE]
with probability one and under some regularity conditions. The resulting sample average approximation for (1.3) is
[TABLE]
This is also a polynomial optimization problem. It can be solved globally by the Lasserre type Moment-SOS hierarchy of relaxations [18].
Sample average approximation methods have good statistical properties. For convenience, denote by the optimal values of (1.3) and (1.4) respectively, and denote by their optimizer sets respectively. Assume that: i) is continuous and is nonempty; ii) there is a compact set such that and converges uniformly to on , with probability one; iii) for all large enough, . Then, it can be shown (see [39]) that and 111For sets , their distance is defined as
as , with probability one. Moreover, if are independently identically distributed as is, then . That is, as the sample size increases, the sample average optimization gives better approximation for (1.3). We refer to [39] for more details about the convergence of the sample average optimization.
When is a polynomial in , the sample average approximation (1.4) is also a polynomial optimization problem. The Lasserre type Moment-SOS relaxations can be applied to solve it. However, the following concerns need to be addressed:
- •
For given samples , the optimizer set of (1.4) may (or may not) be far away from the optimizer set of (1.3). For instance, it is possible that (1.3) is bounded from below and has a global minimizer, while (1.4) is unbounded from below and has no global minimizers.
- •
The sample average is only an approximation for the objective . Usually, the optimizer sets of (1.3) and (1.4) are not exactly same. Therefore, the sample average approximation (1.4) does not need to be solved exactly. However, we still expect that the optimizer set of (1.4) (if it is nonempty) is a good approximation for the optimizer set of (1.3), though they might be very different. Generally, the optimizer set can not be determined exactly, unless the objective can be determined exactly.
To address the above concerns, we propose the following perturbation sample average approximation (PSAA) model
[TABLE]
where is the smallest integer such that
[TABLE]
and is the vector of monomials in and with degrees at most (see (2.1)). The norm is the standard Euclidean norm. We use the Lasserre type Moment-SOS relaxation of degree for solving (1.5). A small parameter is often selected, for (1.5) to approximate (1.3) well. In this article, we discuss properties of (1.5), as well as its Moment-SOS relaxations. The perturbation term plays an important role in sample average approximation. The paper is organized as follows. We review some basics for polynomial optimization in Section 2. The properties of the perturbed sample average optimization (1.5) are discussed in Section 3. The numerical experiments are given in Section 4.
2. Preliminaries
Notation
The symbols denote the set of nonnegative integers and real numbers, respectively. For given and a real scalar , denotes the closed ball in centered at with radius , under the standard Euclidean norm. For a real symmetric matrix , we write (resp., ) by meaning that is positive semidefinite (resp., positive definite). The symbol denotes the ring of polynomials with real coefficients and in . For a polynomial , refers to its total degree. For a tuple of polynomials , refers to the maximum of the degrees of . For a degree , stands for the space of all real polynomials in and of degrees no more than . For a nonnegative integer vector , denote
[TABLE]
For convenience, denote the monomial power set
[TABLE]
For a degree , denotes the vector of all monomials of degrees at most , ordered in the graded lexicographic ordering, i.e.,
[TABLE]
(The superscript T denotes the transpose of a vector or matrix.) For , denotes the smallest integer greater than or equal to . For a function in a random vector , stands for the expectation of , with respect to the distribution of .
A polynomial is said to be a sum of squares (SOS) if , for some and polynomials . Clearly, if is SOS and has degree , then each must have degree at most . We use to denote the cone of all SOS polynomials, and to denote the truncation of SOS polynomials in . Checking whether a polynomial is SOS or not can be done by solving a semidefintie program [18, 31].
For a tuple of polynomials in , its quadratic module is the set
[TABLE]
The -th truncation of is
[TABLE]
It clearly holds the nesting relation of containment
[TABLE]
Indeed, each is a convex cone of the space . The tuple determines the semialgebraic set in (1.2). Obviously, if , then on . To ensure , we often require on . The quadratic module is said to be archimedean if there exists a single polynomial such that the inequality defines a compact set in . If is archimedean, then the set must be compact. The converse is not necessarily true. However, if is compact (say, for some radius ), one can always enforce to be archimedean by adding the redundant polynomial to the tuple . When is archimedean, if on , then we must have . This conclusion is referred to Putinar’s Positivstellensatz, which was shown in [34]. Interestingly, when on , we still have , under some optimality conditions [28].
For a given dimension and degree , denote by the space of real vectors that are indexed by , i.e.,
[TABLE]
Each vector in is called a truncated multi-sequence (tms) of degree . A tms gives the linear functional acting on as
[TABLE]
The is called a Riesz functional. For and , we denote
[TABLE]
The tms is said to admit a representing measure supported in a set if there exists a Borel measure , supported in , such that for all . This is equivalent to that for all . Such is called a -representing measure for . We refer to [7, 12, 29] for recent work on truncated moment problems.
For a polynomial , the th localizing matrix of associated to a tms , is the symmetric matrix such that
[TABLE]
for all polynomials , with . In the above, the denotes the coefficient vector of the polynomial . For instance, when and , for , we have
[TABLE]
For the special case of constant one polynomial , is reduced to the so-called moment matrix
[TABLE]
The columns and rows of , as well as , are labelled by with . For instance, for and , we have
[TABLE]
Suppose is a tuple of polynomials in . Consider the cone of tms of degree
[TABLE]
It is a closed convex cone in . Consider the projection map:
[TABLE]
Let be the semialgebraic set as in (1.2), then
[TABLE]
This is because for each , the tms belongs to and . Therefore, the linear section of the cone is a lifted convex relaxation of the set . The cone and the quadratic module are dual to each other. This is because for all and . We refer to [20, 23, 30] for these basic properties. Interestingly, the containment in (2.8) is an equality when the polynomials are SOS-concave [11].
There exists much work for polynomial optimization. The Lasserre type Moment-SOS hierarchy of relaxations were introduced in [18]. The Moment-SOS hierarchy was proved to have finite convergence under some optimality conditions [16, 19, 28]. The flat extension or flat truncation condition can be used to certify its convergence [7, 13, 26]. For unconstrained optimization, the performance of the standard SOS relaxations was studied in [31, 32]. For the special case of finite feasible sets, the convergence was studied in [21, 22, 27]. We refer to [3, 20, 23, 37] for more detailed introductions to polynomial optimization.
3. The sample average optimization
Let be given samples for the random vector . Consider the sample average function
[TABLE]
When is a polynomial in , is also a polynomial in . We assume that the feasible set is given as in (1.2), for a tuple of polynomials. Let be the degree:
[TABLE]
Instead of solving (1.4) directly, we propose to solve the sample average optimization with perturbation
[TABLE]
for a small parameter . The Lasserre type moment relaxation can be applied to solve (3.2). Recall the notation , , as in Section 2. Observe that
[TABLE]
for all and all . If we replace the monomial vector by a tms , then (3.2) is relaxed to the following convex optimization
[TABLE]
It is a semidefinite program, with a norm function in the objective. The relaxation (3.3) is said to be tight if its optimal value is the same as that of (3.2). In this paper, we choose to be the standard Euclidean norm, but any other kind of vector norms can also be used. The equality constraint means that the first entry of is equal to one. The set of all satisfying linear matrix inequalities in (3.3) is just the cone , defined as in (2.6). The cone and the truncated quadratic module are dual to each other. Therefore, the Lagrange function for (3.3) is
[TABLE]
for dual variables and . The function has a finite minimum value for if and only if
[TABLE]
for which case the minimum value is . (The denotes the coefficient vector of .) Therefore, the dual optimization problem of (3.3) is
[TABLE]
Because the sample average is only an approximation for , it is possible that there is no scalar such that . The perturbation term in (3.3) motivates us to find the maximum such that , for some polynomial whose coefficient vector has a small norm. This leads to the following algorithm.
Algorithm 3.1**.**
Generate samples , according to the distribution of . Choose a small perturbation parameter .
- Step 1
Compute the sample average .
- Step 2
Solve the semidefinite relaxation problem (3.3). If (3.3) is infeasible, increase the value of (e.g., let ), until (3.3) has a minimizer, which we denote as .
- Step 3
Let , where is the projection map in (2.7), or equivalently, let
[TABLE]
Output as a candidate minimizer for the sample average optimization with perturbation (3.2), and stop.
For , the minimizer of the relaxation (3.3) is always unique (if it exists), because its objective is strictly convex. Our numerical experiments demonstrate that Algorithm 3.1 is efficient for solving (3.2).
Theorem 3.2**.**
Assume that is a minimizer of (3.2) and is a minimizer of (3.3). Then, for , the relaxation (3.3) is tight if and only if . In particular, for the case , the point is a minimizer of (3.2).
Proof.
Let be optimal values of (3.2) and (3.3) respectively.
“” It is clear that . If , then for one can show that . Hence, , , and each (see [13, 26]). So, is a feasible point of (3.2) and
[TABLE]
Therefore, , is a minimizer of (3.2), and the relaxation (3.3) is tight.
“” Let , then and . If the relaxation (3.3) is tight, then and is a minimizer of (3.3). For , the objective of (3.3) is strictly convex, so its minimizer must be unique. Hence, and
[TABLE]
Therefore, . ∎
When the sample average is unbounded from below on the feasible set , the moment relaxation (3.3) might still be unbounded from below if is small. However, if is big, then (3.3) must be feasible and has a minimizer. Indeed, we have the following theorem.
Theorem 3.3**.**
Suppose the feasible set has nonempty interior. If is big, both (3.3) and (3.4) have optimizers and their optimal values are the same.
Proof.
When has nonempty interior, the quadratic module is a closed cone (see [23, Theorem 3.49]) and the cone has nonempty interior. For instance, let be the Gaussian measure, then the tms
[TABLE]
is an interior point of the cone . In other words, and all . This is because and for all nonzero polynomials . Moreover, . The convex relaxation (3.3) is strictly feasible (i.e., there is a feasible such that each matrix in (3.3) is positive definite). When is big, the SOS relaxation (3.4) is also strictly feasible. For instance, for the choice
[TABLE]
we have that
[TABLE]
In the above, int denotes the interior of a set. Therefore, for big , both (3.3) and (3.4) have strictly feasible points. By the strong duality theorem (see [2, 4]), they have the same optimal value and they both achieve the optimal value, i.e., they have optimizers. ∎
In applications, however, we often choose a small , because we expect that (3.2) is a good approximation for (1.4). In (3.3), the value of affects the performance of (3.3). When is too small, (3.3) might be unbounded from below and has no minimizers. If is big, (3.3) might give a loose approximation for (1.4). For efficiency, we often anticipate the smallest value of such that (3.3) is bounded from below and has a minimizer. When has nonempty interior, the relaxation (3.3) is strictly feasible, i.e., there exists such that all the matrices and are positive definite. Therefore, the strong duality holds between (3.3) and (3.4). To ensure that (3.3) is solvable (i.e., it has a minimizer), the dual optimization problem (3.4) needs to be feasible. Consider the optimization problem
[TABLE]
The above is a convex optimization problem with semidefinite constraints. In computational practice, we often choose in a heuristic way, e.g., . If such is not enough, we can increase its value until (3.3) performs well.
4. Numerical Experiments
This section gives numerical experiments of applying Algorithm 3.1 to solve stochastic polynomial optimization. The computation is implemented in MATLAB R2018a, in a Laptop with CPU 8th Generation Intel® Core™ i5-8250U and RAM 16 GB. The moment relaxation (3.3) is solved by the software GloptiPoly 3 [14], which calls the semidefinite program solver SeDuMi [40]. The computational results are displayed with four decimal digits. We use “PSAA” to denote the perturbation sample average approximation model (3.2). Its relaxation (3.3) is said to be solvable if it has a minimizer . For such a case, let be the point given in Step 3 of Algorithm 3.1, i.e., . Otherwise, (3.3) is said to be not solvable and we use “n.a.” to indicate the relevant values are not available. We use and to denote the optimal value and the minimizer of (1.3) respectively. The symbol stands for the sample average of the random vector , while refers to its th entry:
[TABLE]
In our numerical examples, we use the following classical distributions for random variables (see [6]; let denote the Dirac function supported at ):
- •
denotes the Bernoullian distribution with success probability , whose density function is .
- •
denotes the geometric distribution with success probability , whose density function is .
- •
denotes the Poisson distribution with parameter , whose density function is .
- •
, with , denotes the uniform distribution on .
- •
denotes the normal distribution with the expectation and covariance matrix .
Example 4.1**.**
Consider the stochastic optimization problem
[TABLE]
where
[TABLE]
We have . The constraining polynomial tuple is
[TABLE]
The feasible set is a polyhedron and the exact objective is
[TABLE]
For the optimization (1.3), the optimal value and the minimizer are
[TABLE]
Since the approximation polynomial is uniquely determined by and , we can pick some typical sample averages to explore the performance of our optimization model (3.2) and its relaxation (3.3). For instance, we consider the samples of such that
[TABLE]
There are four cases of the signs:
[TABLE]
We solve the optimization model (3.2) for each case. The computational results are reported in Table 1. The PSAA model (3.2) has clear advantages for cases @slowromancapiii@ and @slowromancapiv@, when they are compared to the unperturbed case (i.e, ). It gives reliable optimizers, while the classical SAA model (1.4) does not return good ones.
Example 4.2**.**
Consider the stochastic polynomial optimization
[TABLE]
where
[TABLE]
If is evaluated exactly, the optimal value and the minimizer of (1.3) are
[TABLE]
As in Example 4.1, we explore the performance of (3.2)-(3.3) for the following cases of samples:
[TABLE]
The computational results are reported in Table 2. The PSAA model (3.2) performs better than the classical SAA model (1.4) (i.e. ). For all these cases, (3.2) gives more reliable optimizers. Moreover, solving the relaxation (3.3) costs less computational time for cases @slowromancapi@ and @slowromancapii@.
Example 4.3**.**
Consider the stochastic optimization
[TABLE]
where
[TABLE]
The exact objective can be evaluated as
[TABLE]
For the optimization (1.3), its optimal value and minimizer are
[TABLE]
For convenience, we use “” to denote a random sample average of size with error in the order of . Consider two cases:
[TABLE]
The numerical results are reported in Table 3. The PSAA model (3.2) gives a better optimizer than the unperturbed one, for both cases.
Example 4.4**.**
Consider the stochastic polynomial optimization
[TABLE]
where
[TABLE]
The feasible set is a simplex, which is compact and satisfies the archimedean condition. For all samples , the sample average is bounded from below on and it has a minimizer. For this example, and , so
[TABLE]
The optimal value and minimizer of (1.3) are
[TABLE]
We consider two cases of samples
[TABLE]
In the above, is the minimum value of (3.5).
The numerical results are reported in Table 4. The PSAA model (3.2) performs very well for both cases. Compared with the classical SAA model (1.4) (i.e., ), it has quite clear advantages for case @slowromancapi@. It successfully returned a good minimizer, while (1.4) is unbounded from below and does not return a minimizer.
Example 4.5**.**
Consider the unconstrained stochastic optimization
[TABLE]
where
[TABLE]
Evaluating the expectation, we get and
[TABLE]
The optimal value and minimizer of (1.3) are
[TABLE]
Approximate by their sample averages , i.e., . We make samples of different sizes and compute in (3.5) for each case.
We focus on cases @slowromancapiii@ and @slowromancapiv@.
The computational results are reported in Table 6. The perturbation term in the PSAA model (3.2) makes a big difference for computing reliable minimizers. The PSAA model returned minimizers that are close to the optimizer of (1.3), while the classical SAA model (i.e., ) is unbounded from below and fails to return a minimizer.
Example 4.6**.**
Consider the stochastic polynomial optimization
[TABLE]
where
[TABLE]
Note that and the exact objective is
[TABLE]
Its optimal value and minimizer are
[TABLE]
Denote . We make samples of different sizes and compute in (3.5) for each case. The values of are given in Table 7.
We focus on cases @slowromancapii@ and @slowromancapiii@, for which the numerical results are reported in Table 8.
While the optimization (1.4) is unbounded from below via the classical SAA model, our PSAA model (3.2) provides good lower bounds and returns reliable minimizers for all cases.
Example 4.7**.**
Consider the stochastic polynomial optimization
[TABLE]
where and
[TABLE]
One can see that , , and the exact objective
[TABLE]
Its optimal value and minimizer are
[TABLE]
As in Examples 4.5-4.6, we approximate by sample averages where s_{2}=\sum_{k=1}^{N}\big{(}\xi^{(k)}\big{)}^{2},\ s_{3}=\sum_{k=1}^{N}\big{(}\xi^{(k)}\big{)}^{3}. Several samples of different sizes are made. The values of in (3.5) are given in Table 9.
We apply the classical SAA model (i.e., ) and the PSAA model (i.e., ) to cases @slowromancapi@, @slowromancapii@ and @slowromancapiii@.
The computational results are reported in the Table 10. The PSAA model (3.2) performs much better than the classical SAA model (1.4), as (3.2) gives more reliable minimizers for all cases.
5. Conclusion
This paper proposes a sample average optimization model with a perturbation term for solving stochastic polynomial optimization. The perturbation optimization model performs better than the classical one without perturbations. The Lasserre type moment relaxations are used to solve the perturbation optimization. In particular, we show that the moment relaxation is tight if and only if the moment matrix of the minimizer is rank one. Numerical experiments demonstrated advantages of our perturbation optimization model.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Bastin, C. Cirillo and P. Toint Convergence theory for nonconvex stochastic programming with an application to mixed logit. Math. Program. , 108(2–3), pp. 207–234, 2006.
- 2[2] A. Ben-Tal and A. Nemirovski, Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications , MPS-SIAM Series on Optimization, SIAM, Philadelphia, 2001.
- 3[3] G. Blekherman, P. Parrilo and R. Thomas (eds.), Semidefinite optimization and convex algebraic geometry , MOS-SIAM Series on Optimization, SIAM, 2013.
- 4[4] S. Boyd and L. Vandenberghe, Convex Optimization , Cambridge University Press, 2004.
- 5[5] M. Branda. Sample approximation technique for mixed-integer stochastic programming problems with several chance constraints. Operations Research Letters , 40(3), pp. 207–211, 2012.
- 6[6] K. Chung, A course in probability theory , Academic press, 2001.
- 7[7] R. Curto and L. Fialkow. Truncated K-moment problems in several variables. Journal of Operator Theory , 54(2005), pp. 189-226.
- 8[8] M. Fu (eds.), Handbook of simulation optimization , volume 216, Springer, New York, 2015.
