On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse

Matthias Claus; R\"udiger Schultz; Kai Sp\"urkel; Tobias Wollenberg

arXiv:1812.09879·math.OC·December 27, 2018

On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse

Matthias Claus, R\"udiger Schultz, Kai Sp\"urkel, Tobias Wollenberg

PDF

TL;DR

This paper introduces mean-risk models for stochastic semidefinite programs with continuous recourse, analyzing their structural properties and stability under distribution perturbations, with implications for computational approaches.

Contribution

It develops a framework for risk-averse stochastic SDPs, exploring convexity, continuity, and stability, and presents extended formulations for finite discrete distributions.

Findings

01

Mean-risk models exhibit convexity and Lipschitz continuity.

02

Extended formulations lead to deterministic mixed-integer SDPs.

03

Models are stable under distribution perturbations.

Abstract

The vast majority of the literature on stochastic semidefinite programs (stochastic SDPs) with recourse is concerned with risk-neutral models. In this paper, we introduce mean-risk models for stochastic SDPs and study structural properties as convexity and (Lipschitz) continuity. Special emphasis is placed on stability with respect to changes of the underlying probability distribution. Perturbations of the true distribution may arise from incomplete information or working with (finite discrete) approximations for the sake of computational efficiency. We discuss extended formulations for stochastic SDPs under finite discrete distributions, which turn out to be deterministic (mixed-integer) SDPs that are (almost) block-structured for many popular risk measures.

Equations163

x, y min {c ∙ x + q ∙ y ∣ T ∙ x + W ∙ y = z, x \in X, y \in S_{+}^{m}},

x, y min {c ∙ x + q ∙ y ∣ T ∙ x + W ∙ y = z, x \in X, y \in S_{+}^{m}},

y min {q ∙ y ∣ W ∙ y = Z (ω) - T ∙ x, y \in S_{+}^{m}} .

y min {q ∙ y ∣ W ∙ y = Z (ω) - T ∙ x, y \in S_{+}^{m}} .

φ (t) := y min {q ∙ y ∣ W ∙ y = t, y \in S_{+}^{m}} .

φ (t) := y min {q ∙ y ∣ W ∙ y = t, y \in S_{+}^{m}} .

x min {f (x, Z (\cdot)) ∣ x \in X} .

x min {f (x, Z (\cdot)) ∣ x \in X} .

{f (x, Z (\cdot)) ∣ x \in X} \subseteq X \subseteq L^{0} (Ω, F, P)

{f (x, Z (\cdot)) ∣ x \in X} \subseteq X \subseteq L^{0} (Ω, F, P)

x min {Q_{R} (x) ∣ x \in X},

x min {Q_{R} (x) ∣ x \in X},

-W^{\top}v=\lim_{k\to\infty}-W^{\top}v_{k}=\lim_{k\to\infty}\frac{1}{\|u_{k}\|}\big{(}q-W^{\top}u_{k}\big{)}\in\mathcal{S}^{m}_{+}.

-W^{\top}v=\lim_{k\to\infty}-W^{\top}v_{k}=\lim_{k\to\infty}\frac{1}{\|u_{k}\|}\big{(}q-W^{\top}u_{k}\big{)}\in\mathcal{S}^{m}_{+}.

α \to \infty lim v^{⊤} (u_{0} + α v) = α \to \infty lim v^{⊤} u_{0} + α ∥ v ∥^{2} = \infty,

α \to \infty lim v^{⊤} (u_{0} + α v) = α \to \infty lim v^{⊤} u_{0} + α ∥ v ∥^{2} = \infty,

φ (t) = u max {t^{⊤} u ∣ u \in M_{D}} \forall t \in R^{s} .

φ (t) = u max {t^{⊤} u ∣ u \in M_{D}} \forall t \in R^{s} .

φ (λ t_{1} + (1 - λ) t_{2})

φ (λ t_{1} + (1 - λ) t_{2})

\leq λ u \in M_{D} max t_{1}^{T} u + (1 - λ) u \in M_{D} max t_{2}^{T} u

= λ φ (t_{1}) + (1 - λ) φ (t_{2}),

- ∥ u_{2} ∥ \cdot ∥ t_{1} - t_{2} ∥ \leq t_{1}^{⊤} u_{2} - t_{2}^{⊤} u_{2} \leq φ (t_{1}) - φ (t_{2}) \leq t_{1}^{⊤} u_{1} - t_{2}^{⊤} u_{1} \leq ∥ u_{1} ∥ \cdot ∥ t_{1} - t_{2} ∥

- ∥ u_{2} ∥ \cdot ∥ t_{1} - t_{2} ∥ \leq t_{1}^{⊤} u_{2} - t_{2}^{⊤} u_{2} \leq φ (t_{1}) - φ (t_{2}) \leq t_{1}^{⊤} u_{1} - t_{2}^{⊤} u_{1} \leq ∥ u_{1} ∥ \cdot ∥ t_{1} - t_{2} ∥

∣ φ (t_{1}) - φ (t_{2}) ∣ \leq L_{φ} \cdot ∥ t_{1} - t_{2} ∥

∣ φ (t_{1}) - φ (t_{2}) ∣ \leq L_{φ} \cdot ∥ t_{1} - t_{2} ∥

\partial φ (t) = Argmax {u^{⊤} t ∣ u \in M_{D}} .

\partial φ (t) = Argmax {u^{⊤} t ∣ u \in M_{D}} .

φ_{l} : R^{s} \to \overline{R}, φ_{l} (t) := min {q_{l}^{⊤} y_{l} ∣ W_{l} y_{l} = t, y_{l} \in R_{+}^{m}}

φ_{l} : R^{s} \to \overline{R}, φ_{l} (t) := min {q_{l}^{⊤} y_{l} ∣ W_{l} y_{l} = t, y_{l} \in R_{+}^{m}}

φ_{l} (t) = j = 1, ..., N max d_{j}^{⊤} t,

φ_{l} (t) = j = 1, ..., N max d_{j}^{⊤} t,

min {[1000] ∙ y ∣ [0 \frac{1}{2} \frac{1}{2} 0] ∙ y = t, y \in S_{+}^{2}} .

min {[1000] ∙ y ∣ [0 \frac{1}{2} \frac{1}{2} 0] ∙ y = t, y \in S_{+}^{2}} .

[∣ t ∣ + 1 t t ∣ t ∣ + 1] \in int S_{+}^{2} and [0 \frac{1}{2} \frac{1}{2} 0] ∙ [∣ t ∣ + 1 t t ∣ t ∣ + 1] = t .

[∣ t ∣ + 1 t t ∣ t ∣ + 1] \in int S_{+}^{2} and [0 \frac{1}{2} \frac{1}{2} 0] ∙ [∣ t ∣ + 1 t t ∣ t ∣ + 1] = t .

M_{D} = {u \in R ∣ [1000] - [0 \frac{1}{2} \frac{1}{2} 0] \cdot u \in S_{+}^{2}} = {0} .

M_{D} = {u \in R ∣ [1000] - [0 \frac{1}{2} \frac{1}{2} 0] \cdot u \in S_{+}^{2}} = {0} .

[y_{11} \frac{t}{2} \frac{t}{2} y_{22}] \in S_{+}^{2} \Leftrightarrow y_{11} > 0, y_{22} > 0, y_{11} y_{22} - (\frac{t}{2})^{2} \geq 0,

[y_{11} \frac{t}{2} \frac{t}{2} y_{22}] \in S_{+}^{2} \Leftrightarrow y_{11} > 0, y_{22} > 0, y_{11} y_{22} - (\frac{t}{2})^{2} \geq 0,

M_{s}^{p} := {μ \in P (R^{s}) ∣ \int_{R^{s}} ∥ t ∥^{p} μ (d t) < \infty}

M_{s}^{p} := {μ \in P (R^{s}) ∣ \int_{R^{s}} ∥ t ∥^{p} μ (d t) < \infty}

∥ F (x) ∥_{L^{1}}

∥ F (x) ∥_{L^{1}}

\leq ∣ c ∙ x ∣ + ∣ φ (0) ∣ + \int_{R^{s}} ∣ φ (z - T ∙ x) - φ (0) ∣ (P \circ Z^{- 1}) (d z)

\leq ∣ c ∙ x ∣ + ∣ φ (0) ∣ + L_{φ} ∥ T ∙ x ∥ + L_{φ} \int_{R^{s}} ∥ z ∥ (P \circ Z^{- 1}) (d z) < \infty

f (λ x_{1} + (1 - λ) x_{2}, z) \leq λ f (x_{1}, z) + (1 - λ) f (x_{2}, z)

f (λ x_{1} + (1 - λ) x_{2}, z) \leq λ f (x_{1}, z) + (1 - λ) f (x_{2}, z)

∥ F (x_{1}) - F (x_{2}) ∥_{L^{1}}

∥ F (x_{1}) - F (x_{2}) ∥_{L^{1}}

\leq ∥ c ∥ \cdot ∥ x_{1} - x_{2} ∥ + L_{φ} \cdot ∥ T ∥ \cdot ∥ x_{1} - x_{2} ∥

R [λ Z_{1} + (1 - λ) Z_{2}] \leq λ R [Z_{1}] + (1 - λ) R [Z_{2}] .

R [λ Z_{1} + (1 - λ) Z_{2}] \leq λ R [Z_{1}] + (1 - λ) R [Z_{2}] .

EE_{η} [Y] = \int_{Ω} max {Y (ω) - η, 0} P (d ω) .

EE_{η} [Y] = \int_{Ω} max {Y (ω) - η, 0} P (d ω) .

\text{$\mathbb{CV}$@R}_{\alpha}:L^{1}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R},\;\text{$\mathbb{CV}$@R}_{\alpha}[Y]=\min_{\eta\in\mathbb{R}}\big{\{}\,\eta+\frac{1}{1-\alpha}\mathbb{EE}_{\eta}(Y)\,\big{\}}

\text{$\mathbb{CV}$@R}_{\alpha}:L^{1}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R},\;\text{$\mathbb{CV}$@R}_{\alpha}[Y]=\min_{\eta\in\mathbb{R}}\big{\{}\,\eta+\frac{1}{1-\alpha}\mathbb{EE}_{\eta}(Y)\,\big{\}}

V @R_{α} : L^{0} (Ω, F, P) \to R, V @R_{α} [Y] = in f {t ∣ P (Z (ω) \leq t) \geq α}

V @R_{α} : L^{0} (Ω, F, P) \to R, V @R_{α} [Y] = in f {t ∣ P (Z (ω) \leq t) \geq α}

\displaystyle\text{$\mathbb{M}$ad}^{+}_{p}[Y]=\Big{(}\int\,\max\{0,Y(\omega)-\mathbb{E}_{\mathbb{P}}[Z]\}^{p}\;\mathbb{P}(\text{d}\omega)\Big{)}^{\frac{1}{p}}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

∎

11institutetext: M. Claus 22institutetext: R. Schultz 33institutetext: K. Spürkel 44institutetext: T. Wollenberg 55institutetext: University Duisburg-Essen

Thea-Leymann-Straße 9

D-45127 Essen

Tel.: +49 201 183 6887

55email: [email protected]

On Risk-Averse Stochastic Semidefinite Programs with Continuous Recourse

††thanks: The authors gratefully acknowledge the support of the German Research Foundation (DFG) within the collaborative research center TRR 154 “Mathematical Modeling, Simulation and Optimization Using the Example of Gas Networks”.

Matthias Claus

Rüdiger Schultz

Kai Spürkel

Tobias Wollenberg

(Received: date / Accepted: date)

Abstract

The vast majority of the literature on stochastic semidefinite programs (stochastic SDPs) with recourse is concerned with risk-neutral models. In this paper, we introduce mean-risk models for stochastic SDPs and study structural properties as convexity and (Lipschitz) continuity. Special emphasis is placed on stability with respect to changes of the underlying probability distribution. Perturbations of the true distribution may arise from incomplete information or working with (finite discrete) approximations for the sake of computational efficiency. We discuss extended formulations for stochastic SDPs under finite discrete distributions, which turn out to be deterministic (mixed-integer) SDPs that are (almost) block-structured for many popular risk measures.

Keywords:

Stochastic Semidefinite Programming Mean-Risk Models Stability Analysis Extended Formulations

1 Introduction

Stochastic semidefinite programs with recourse were first considered by Ariyawansa and Zhu in AriyawansaZhu2006 , where, for finite discrete distributions, the authors reformulate the risk-neutral stochastic SDP as a block-structured deterministic SDP and discuss an application to the stochastic version of the minimum-volume covering ellipsoid problem (cf. SunFreund2004 , VandenbergheBoyd1996 ). In ZhuAriyawansa2011 , the same authors give a multitude of other applications, including problems in geometry, location aided routing, RC circuit design and structural optimization.

Some approaches to the algorithmic treatment of risk neutral programs with linear recourse carry over to expectation based stochastic SDPs. Extending the results of Zhao (cf. Zhao2001 ), Mehrotra and Özevin derive a polynomial logarithmic barrier algorithm employing Bender’s decomposition (cf. MehrotraOezevin2007 ). Using the volumetric barrier of Vaidya (cf. Vaidya1996 ), Ariyawansa and Zhu construct algorithms of similar complexity in AriyawansaZhu2011 . Furthermore, in JinAriyawansaZhu2012 , Jin, Ariyawansa and Zhu propose homogeneous self-dual algorithms with complexities comparable to the ones of the methods mentioned before. Motivated by an application in multi-antenna wireless networks, Gaujal and Mertikopoulos establish a stochastic approximation algorithm in GaujalMertikopoulos2016 .

Chance constrained SDP models have been introduced by Ariyawansa and Zhu in (Zhu2006, , Chapter 3), where an application to the stochastic minimum-volume covering ellipsoid problem is considered. A different approach towards risk-aversion is taken by Schultz and Wollenberg, who consider stochastic mixed-integer semidefinite programs arising from unit commitment problems in AC transmission systems. Based on Lagrangian relaxation of the nonanticipativity constraint, a decomposition algorithm for minimizing a weighted sum of the expectation and the probability of exceeding a certain threshold is proposed in SchultzWollenberg2017 .

The present work extends the models of SchultzWollenberg2017 and AriyawansaZhu2011 by considering more general risk measures. Instead of focussing on a certain application, we discuss structural properties as convexity and (Lipschitz) continuity of the resulting objective functions. Consequences for quantitative stability of the stochastic SDP models under perturbations of the underlying distribution are pointed out. Such perturbations may arise from incomplete information about the distribution or the choice to work with a simpler (possibly finite discrete) approximation for reasons of computational efficiency.

Furthermore, we establish sufficient conditions for differentiabiliy in the risk neutral setting. Finally, for finite discrete distributions, we establish equivalent SDPs for various risk measures and give indications on how to exploit their special structure for numerical treatment.

2 Two-Stage Stochastic SDPs with Continuous Recourse

Let $\mathcal{S}^{k}_{+}$ denote the cone of symmetric positive semidefinite matrices in $\mathbb{R}^{k\times k}$ . The componentwise Frobenius product of $A=(a_{1},\ldots a_{s})^{\top}\in(\mathcal{S}^{k}_{+})^{l}$ and $x\in\mathcal{S}^{k}_{+}$ is defined as $A\bullet x:=\big{(}\mathrm{tr}(a_{1}x),\ldots,\mathrm{tr}(a_{s}x)\big{)}^{\top}\in\mathbb{R}^{s}$ . Furthermore, the Frobenius norm on $\mathcal{S}^{k}_{+}$ is given by $\|x\|:=\sqrt{x\bullet x}$ .

We shall consider the parametric SDP

[TABLE]

where $z\in\mathbb{R}^{s}$ enters as a parameter. The data is comprised of $c\in\mathcal{S}^{n}_{+}$ , $q\in\mathcal{S}^{m}_{+}$ , $T\in(\mathcal{S}^{n}_{+})^{s}$ , $W\in(\mathcal{S}^{m}_{+})^{s}$ and a nonempty, closed, convex set $X\subseteq\mathcal{S}^{n}_{+}$ . The set $X$ is usually given as a spectrahedron, i.e. the intersection of the solution sets of a finite number of affine matrix inequalities with the cone of positive semidefinite matrices.

Let $z=Z(\omega)$ be the realization of a random vector $Z:\Omega\to\mathbb{R}^{s}$ on some probability space $(\Omega,\mathcal{F},\mathbb{P})$ . A two-stage stochastic SDP arises from (P( $z$ )) if the decision $x$ has to be taken without knowledge of the particular realization $Z(\omega)$ , while $y$ can be chosen after observing the previously unknown parameter. In this setting, the optimal decision $y$ is governed by the recourse problem

[TABLE]

Let $\varphi:\mathbb{R}^{s}\to\overline{\mathbb{R}}$ denote the optimal value function of (1) with respect to the right-hand side of the system of matrix equations in its constraints, i.e.

[TABLE]

Introducing the function $f:\mathcal{S}^{n}_{+}\times\mathbb{R}^{s}\to\overline{\mathbb{R}}$ , $f(x,z):=c\bullet x+\varphi(z-T\bullet x)$ we may rewrite (P( $Z(\cdot)$ ) as

[TABLE]

Due to the assumed interplay between decision and observation, problem (2) is not well-defined without further modelling choices. For any $x$ , $f(x,Z(\cdot))$ belongs to the space $L^{0}(\Omega,\mathcal{F},\mathbb{P})$ of extended real-valued random variables on the underlying probability space. We thus may fix any functional $\mathcal{R}:\mathcal{X}\to\overline{\mathbb{R}}$ satisfying

[TABLE]

and consider the optimization problem

[TABLE]

where the mapping $Q_{\mathcal{R}}:\mathcal{S}^{n}_{+}\to\overline{\mathbb{R}}$ is given by $Q_{\mathcal{R}}(x)=\mathcal{R}[f(x,Z(\cdot))]$ .

We shall work with the following assumptions:

A1

(Complete recourse) $W\bullet\mathcal{S}^{m}_{+}=\mathbb{R}^{s}$ .

A2

(Strict dual feasibility) There is some $u\in\mathbb{R}^{s}$ such that $q-W^{\top}u$ is positive definite.

Similar, yet more restrictive assumptions are also made in MehrotraOezevin2007 .

Lemma 1

Assume A2, then A1 holds if and only if $M_{D}:=\{u\in\mathbb{R}^{s}\;|\;q-W^{\top}u\in\mathcal{S}^{m}_{+}\}$ is compact.

Proof

$M_{D}$ is closed due to the closedness of $S^{m}_{+}$ . Suppose that $M_{D}$ is unbounded, i.e. that there exists a sequence $\{u_{k}\}_{k\in\mathbb{N}}\subseteq M_{D}$ with $\lim_{k\to\infty}\|u_{k}\|=\infty$ . Define $v_{k}:=u_{k}/\|u_{k}\|$ , then ${\|v_{k}\|}=1$ holds for all $k\in\mathbb{N}$ . Therefore, the sequence $\{v_{k}\}_{k\in\mathbb{N}}$ can be assumed to converge to some $v\neq 0$ without loss of generality. By $u_{k}\in M_{D}$ we have $q-W^{\top}u_{k}\in\mathcal{S}^{m}_{+}$ for all $k\in\mathbb{N}$ . Thus,

[TABLE]

Now select any $u_{0}\in M_{D}$ . Then $u_{0}+\alpha v\in M_{D}$ holds for any $\alpha\geq 0$ and we have

[TABLE]

verifying $\sup\{v^{\top}u\;|\;q-W^{\top}u\in\mathcal{S}^{m}_{+}\}=\infty$ . By duality, the set $\{y\in\mathcal{S}^{m}_{+}\;|\;W\bullet y=v\}$ has to be empty, which contradicts A1.

Let $M_{D}$ be compact, then once again by duality for arbitrary $t\in\mathbb{R}^{s}$ , there exists $u\in M_{D}$ with $\min\{q\bullet y\;|\;W\bullet y=t,\;y\in\mathcal{S}^{m}_{+}\}=t^{\top}u$ , which implies $t\in W\bullet\mathcal{S}^{m}_{+}$ and thus A1. ∎

The lemma above shows that $\sup\{t^{\top}u\;|\;q-W^{\top}u\in\mathcal{S}^{m}_{+}\}$ is attained for any $t\in\mathbb{R}^{s}$ whenever A1 and A2 hold true.

Lemma 2

Assume A1 and A2, then $\varphi$ is finite, convex and Lipschitz continuous on $\mathbb{R}^{s}$ .

Proof

Due to A1 and A2, strong duality holds true for the SDP defining $\varphi$ . We thus have

[TABLE]

As $M_{D}$ is nonempty and compact by Lemma 1, $\varphi$ is finite on $\mathbb{R}^{s}$ .

Furthermore, for arbitrary $\lambda\in[0,1]$ and $t_{1},t_{2}\in\mathbb{R}^{s}$ , strong duality implies

[TABLE]

which proves the asserted convexity of $\varphi$ .

To establish Lipschitz continuity, let $t_{1},t_{2}\in\mathbb{R}^{s}$ be arbitrary and fixed. Then by strong duality and the compactness of $M_{D}$ , there exists $u_{1},u_{2}\in M_{D}$ such that $\varphi(t_{1})=t_{1}^{\top}u_{1}$ and $\varphi(t_{2})=t_{2}^{\top}u_{2}$ . By $t_{1}^{\top}u_{1}\geq t_{1}^{\top}u_{2}$ and $t_{2}^{\top}u_{2}\geq t_{2}^{\top}u_{1}$ we have

[TABLE]

and thus $|\varphi(t_{1})-\varphi(t_{2})|\leq\max\{\|u_{1}\|,\|u_{2}\|\}\|t_{1}-t_{2}\|$ . Set $L_{\varphi}:=\max\nolimits_{u\in M_{D}}\|u\|<\infty$ , then

[TABLE]

holds for all $t_{1},t_{2}\in\mathbb{R}^{s}$ , which completes the proof. ∎

Remark 1

Under assumptions A1 and A2, $\varphi$ is finite and convex, which implies directional differentiability by (Rockafellar1970, , Theorem 25.4). Furthermore, the subdifferential of $\varphi$ is convex, compact and admits the representation

[TABLE]

By (Rockafellar1970, , Theorem 25.1), $\varphi$ is differentiable at $t$ if and only if $\partial\varphi(t)$ is a singleton. In that case, we have $\partial\varphi(t)=\{\nabla\varphi(t)\}$ .

Remark 2

In two-stage stochastic linear programming, the counterpart of $\varphi$ is the optimal value function of a linear program:

[TABLE]

with $q_{l}\in\mathbb{R}^{m}$ and $W_{l}\in\mathbb{R}^{s\times m}$ . By linear programming theory, $\varphi_{l}$ is finite on $\mathbb{R}^{s}$ iff $W_{l}(\mathbb{R}^{m}_{+})=\mathbb{R}^{s}$ and $M_{D_{l}}=\{u\in\mathbb{R}^{s}\;|\;W_{l}^{\top}u\leq q\}\neq\emptyset$ . In this situation, $\varphi_{l}$ admits the representation

[TABLE]

where $d_{1},...,d_{N}$ denote the vertices of the polytope $M_{D_{l}}$ . In particular, $\varphi_{l}$ is piecewise linear, convex and Lipschitz continuous.

The following example shows that the assumptions A1 and $M_{D}\neq\emptyset$ are not sufficient to ensure that the optimal value in the problem defining $\varphi(t)$ is attained for all $t\in\mathbb{R}^{s}$ .

Example 1

For $t\in\mathbb{R}$ , consider the SDP

[TABLE]

For any $t\in\mathbb{R}$ we have

[TABLE]

Consequently, A1 is fulfilled. Moreover, we have

[TABLE]

As (4) is strictly feasible for any right-hand side $t\in\mathbb{R}^{s}$ , strong duality holds and (5) implies that the infimum of (4) is zero. Furthermore, for any $t\in\mathbb{R}\setminus\{0\}$ we have

[TABLE]

which yields the lower bound $y_{11}\geq t^{2}/(4y_{22})>0$ for any $y$ that is feasible for (4). Consequently, the optimal value in (4) is not attained if $t\neq 0$ .

3 Structure of Risk-Averse Stochastic SDPs

Let us now return to problem (3) and consider various choices of $\mathcal{R}$ . To ensure finiteness, we shall work with moment conditions on the Borel probability measure $\mathbb{P}\circ Z^{-1}$ induced by the underlying random vector $Z(\cdot)$ . Let $\mathcal{P}(\mathbb{R}^{s})$ denote the space of all Borel probability measures on $\mathbb{R}^{s}$ and

[TABLE]

be the subspace of measures having finite moments of order $p\geq 1$ .

Lemma 3

Assume A1, A2 and $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{1}_{s}$ . Then $f(x,Z(\cdot))\in L^{1}(\Omega,\mathcal{F},\mathbb{P})$ for all $x\in\mathcal{S}^{n}_{+}$ and the mapping $F:\mathcal{S}^{m}_{+}\to L^{1}(\Omega,\mathcal{F},\mathbb{P})$ , $F(x):=f(x,Z(\cdot))$ is convex and Lipschitz continuous with constant $\|c\|+L_{\varphi}\cdot\|T\|$ .

Proof

For any $x\in\mathcal{S}^{n}_{+}$ we have

[TABLE]

by Lemma 2.

For any $x_{1},x_{2}\in\mathcal{S}^{n}_{+}$ , $\lambda\in[0,1]$ and $z\in\mathbb{R}^{s}$ , the convexity of $\varphi$ yields

[TABLE]

and thus in particular $F(\lambda x_{1}+(1-\lambda)x_{2})\leq\lambda F(x_{1})+(1-\lambda)F(x_{2})$ with respect to the $\mathbb{P}$ -almost sure partial order, proving the asserted convexity of $F$ .

Finally,

[TABLE]

holds for all $x_{1},x_{2}\in\mathcal{S}^{n}_{+}$ . ∎

Definition 1

A mapping $\mathcal{R}:\mathcal{X}\to\mathbb{R}\cup\{\infty\}$ defined on some linear subspace $\mathcal{X}$ of $L^{0}(\Omega,\mathcal{F},\mathbb{P})$ containing the constants is called a convex risk measure if the following conditions are fulfilled:

(Convexity) For any $Z_{1},Z_{2}\in\mathcal{X}$ and $\lambda\in[0,1]$ we have

[TABLE] 2. 2.

(Monotonicity) $\mathcal{R}[Z_{1}]\leq\mathcal{R}[Z_{2}]$ for all $Z_{1},Z_{2}\in\mathcal{X}$ satisfying $Z_{1}\leq Z_{2}$ with respect to the $\mathbb{P}$ -almost sure partial order. 3. 3.

(Translation equivariance) $\mathcal{R}[Z_{1}+z_{2}]=\mathcal{R}[Z_{1}]+z_{2}$ for all $Z_{1}\in\mathcal{X}$ and $z_{2}\in\mathbb{R}$ .

A convex risk measure $\mathcal{R}$ is coherent if the following holds true:

(Positive homogeneity) $\mathcal{R}[z_{2}Z_{1}]=z_{2}\cdot\mathcal{R}[Z_{1}]$ for all $Z_{1}\in\mathcal{X}$ and $z_{2}\in[0,\infty)$ .

Definition 2

A mapping $\mathcal{R}:L^{0}(\Omega,\mathcal{F},\mathbb{P})\supseteq\mathcal{X}\to\mathbb{R}\cup\{\infty\}$ is called law-invariant if for all $Z_{1},Z_{2}\in L^{0}(\Omega,\mathcal{F},\mathbb{P})$ with $\mathbb{P}\circ Z_{1}^{-1}=\mathbb{P}\circ Z_{2}^{-1}$ we have $\mathcal{R}[Z_{1}]=\mathcal{R}[Z_{2}]$ .

We shall give some examples of risk-measures frequently used in stochastic programming as listed in RuszczynskiShapiro2003 , pp. 447-448, and ShapiroDentchevaRuszczynski2009 . Later we will give extensive formulations of discrete mean-risk SDPs based on these risk-measures:

(i)

The expectation $\mathbb{E}:L^{1}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ is a law-invariant coherent risk-measure.

(ii)

The expected excess over threshold $\eta\in\mathbb{R}$ (as used in SchultzTiedemann2006 ) is the mapping $\mathbb{EE}_{\eta}:L^{1}(\Omega,\mathcal{F},\mathbb{P})\rightarrow\mathbb{R}$ defined by

[TABLE]

This is a non-decreasing, convex and law-invariant risk measure, but in general not translation-equivariant.

(iii)

The conditional value-at-risk at level $\alpha\in\left(0,1\right)$

[TABLE]

is law-invariant and coherent (cf. Pflug2000 ).

(iv)

The value-at-risk at level $\alpha\in\left(0,1\right)$

[TABLE]

is nondecreasing, law-invariant, translation-equivariant and positively homogenous, but in general non-convex.

(v)

The upper semi-deviation of order $p$ is the mapping $\text{$ \mathbb{M} $ad}^{+}_{p}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\rightarrow\mathbb{R}$ defined by

[TABLE]

For $\rho\in\left[0,1\right]$ this gives rise to the law-invariant and coherent risk measure $\mathbb{E}+\rho\,\mathbb{M}\text{ad}_{p}$ (cf. ShapiroDentchevaRuszczynski2009 , p. 276).

Proposition 1

Assume A1 and A2, let $\mathcal{X}$ be a convex subset of $L^{0}(\Omega,\mathcal{F},\mathbb{P})$ that contains $F(\mathcal{S}^{n}_{+})$ and fix a convex and nondecreasing mapping $\mathcal{R}:\mathcal{X}\to\mathbb{R}$ . Then $Q_{\mathcal{R}}$ is finite and convex on $\mathcal{S}^{n}_{+}$ . In particular, problem (3) is convex.

Proof

Finiteness of $Q_{\mathcal{R}}$ follows directly from the finiteness of $\mathcal{R}$ . Furthermore, for any $x_{1},x_{2}\in\mathcal{S}^{n}_{+}$ and $\lambda\in[0,1]$ we have

[TABLE]

The first inequality above holds due to the monotonicity of $\mathcal{R}$ and the convexity of $F$ (by Lemma 3), while the second one is justified by the convexity of $\mathcal{R}$ . ∎

Proposition 2

Assume A1, A2 and that the support of $\mathbb{P}\circ Z^{-1}$ is bounded. Furthermore, let $\mathcal{R}:L^{\infty}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}\cup\{\infty\}$ be a coherent risk measure and assume that there is some $Y\in L^{\infty}(\Omega,\mathcal{F},\mathbb{P})$ such that $\mathcal{R}[Y]<\infty$ . Then $Q_{\mathcal{R}}$ is finite and Lipschitz continuous with constant $\|c\|+L_{\varphi}\cdot\|T\|$ on $\mathcal{S}^{n}_{+}$ .

Proof

$\mathcal{R}$ is finite and Lipschitz continuous with constant $1$ with respect to the $L^{\infty}$ -norm on by $L^{\infty}(\Omega,\mathcal{F},\mathbb{P})$ by (FoellmerSchied2004, , Lemma 4.3).

For any $x\in\mathcal{S}^{n}_{+}$ , the mapping $f(x,\cdot)$ is continuous by Lemma 2, which implies

[TABLE]

Thus, $F(\mathcal{S}^{n}_{+})\subseteq L^{\infty}(\Omega,\mathcal{F},\mathbb{P})$ , which implies the asserted finiteness of $Q_{\mathcal{R}}$ .

Furthermore, for any $x_{1},x_{2}\in\mathcal{S}^{n}_{+}$ , we have

[TABLE]

by Lemma 3. ∎

If the support of $\mathbb{P}\circ Z^{-1}$ is unbounded, $F(\mathcal{S}^{n}_{+})$ may fail to be a subset of $L^{\infty}(\Omega,\mathcal{F},\mathbb{P})$ . While Lipschitz continuity with respect to any $L^{p}$ -norm with $p<\infty$ does not hold for general coherent risk measures, the Conditional Value-at-Risk $\mathrm{CVaR}_{\alpha}$ is known to be Lipschitz continuous with respect to the $L^{1}$ -norm with constant $\frac{1}{1-\alpha}$ (cf. (Pichler2017, , Corollary 3.7)). Using the Kusuoka representation (cf. Kusuoka2001 ), this allows to replace the boundedness of the support of $\mathbb{P}\circ Z^{-1}$ with a less restrictive assumption on the moments of $\mathbb{P}\circ Z^{-1}$ for special classes of risk measures.

Definition 3

Random variables $Z_{1}$ and $Z_{2}$ are called comonotonic if $(Z_{1},Z_{2})$ is distributionally equivalent to $(F^{-1}_{Z_{1}}(U),F^{-1}_{Z_{2}}(U))$ where $U$ is uniformly distributed on $[0,1]$ .

A coherent risk measure $\mathcal{R}:\mathcal{X}\to\mathbb{R}$ is said to be comonotonic if for any two comonotonic random variables $Z_{1},Z_{2}\in\mathcal{X}$ we have $\mathcal{R}(Z_{1}+Z_{2})=\mathcal{R}(Z_{1})+\mathcal{R}(Z_{2})$ .

For a discussion of comonotonicity we refer to DhaeneEtAl2002 and DhaeneEtAl2006 . A proof of the following result is given in (Shapiro2013, , Theorem 2):

Theorem 3.1

*A law-invariant coherent risk measure $\mathcal{R}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ with

$p\in[1,\infty)$ is comonotonic if and only if there exists probability measure $\nu$ on $[0,1)$ such that*

[TABLE]

holds for all $Y\in L^{p}(\Omega,\mathcal{F},\mathbb{P})$ . Furthermore, the measure $\nu$ in representation (7) is defined uniquely.

Example 2

Using $\delta_{\alpha_{0}}$ to denote the Dirac measure at $\alpha_{0}\in[0,1)$

[TABLE]

and, in particular,

[TABLE]

hold for all $Y\in L^{1}(\Omega,\mathcal{F},\mathbb{P})$ .

Proposition 3

Let $\mathcal{R}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ with $p\in[1,\infty)$ be a law-invariant, comonotonic coherent risk measure. Assume A1, A2, $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{p}_{s}$ and

[TABLE]

where $\nu$ denotes the uniquely defined probability measure form representation (7). Then $Q_{\mathcal{R}}$ is Lipschitz continuous with constant $L_{\nu}\cdot(\|c\|+L_{\varphi}\cdot\|T\|)$ on $\mathcal{S}^{n}_{+}$ .

Proof

For any $x_{1},x_{2}\in\mathcal{S}^{n}_{+}$ , we have

[TABLE]

The second inequality above holds due to (Pichler2017, , Corollary 3.7), while the third one is justified by Lemma 3. ∎

We shall now study the dependence of $Q_{\mathcal{R}}$ on the underlying probability measure $\mathbb{P}\circ Z^{1}$ . This is motivated by the fact that in applications the true probability distribution of the random parameter may be unknown. In such situations, one may work with an approximation if the optimal value function and the optimal solution set mapping of (3) are at least semicontinuous with respect to changes of the underlying distribution.

Let $(\Omega_{0},\mathcal{F}_{0},\mathbb{P}_{0})$ be an atomless probability space, i.e. assume that for any $A\in\mathcal{F}_{0}$ with $\mathbb{P}_{0}(A)>0$ there exists some $B\subsetneq A$ with $B\in\mathcal{F}_{0}$ and $\mathbb{P}_{0}(B)>0$ , and fix any $p\geq 1$ . Then for any $\nu\in\mathcal{M}^{1}_{p}$ there exists some $Z_{\nu}\in L^{p}(\Omega_{0},\mathcal{F}_{0},\mathbb{P}_{0})$ such that $\mathbb{P}_{0}\circ Z_{\nu}^{-1}$ . Thus, given any law-invariant mapping $\mathcal{R}_{0}:L^{p}(\Omega_{0},\mathcal{F}_{0},\mathbb{P}_{0})\to\mathbb{R}$ , the function

[TABLE]

is well-defined. Furthermore, we can construct a mapping $\mathcal{R}_{\mathcal{R}_{0}}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ by setting $\mathcal{R}_{\mathcal{R}_{0}}[Z_{1}]:=\Theta_{\mathcal{R}_{0}}[\mathbb{P}\circ Z_{1}^{-1}]$ . To ease the notation, we shall assume that $(\Omega,\mathcal{F},\mathbb{P})$ itself is atomless. Given any law-invariant mapping $\mathcal{R}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ , we shall consider the function

[TABLE]

For the following analysis, we equip the space $\mathcal{P}(\mathbb{R}^{s})$ with the topology of weak convergence, where a sequence $\{\mu_{k}\}_{k\in\mathbb{N}}\subseteq\mathcal{P}(\mathbb{R}^{s})$ converges to some $\mu\in\mathcal{P}(\mathbb{R}^{s})$ , written $\mu_{k}\stackrel{{\scriptstyle w}}{{\rightarrow}}\mu$ if and only if

[TABLE]

holds for any bounded and continuous function $h:\mathbb{R}^{s}\to\mathbb{R}$ . It is well known that even for linear recourse one cannot expect weak continuity of $\mathcal{Q}_{\mathcal{R}}$ on the entire space $\mathcal{S}^{n}_{+}\times\mathcal{M}^{p}_{s}$ . Along the lines of ClausKraetschmerSchultz2017 , we shall thus restrict the analysis to appropriate subspaces.

Definition 4

A set $\mathcal{M}\subseteq\mathcal{M}^{p}_{s}$ is called locally uniformly $\|\cdot\|^{p}$ -integrating if for any $\mu\in\mathcal{M}$ and any $\epsilon>0$ there exists some open neighborhood $\mathcal{N}$ of $\mu$ with respect to the topology of weak convergence such that

[TABLE]

Example 3

(a) For any $K,\epsilon>0$ and $p\geq 1$ , the set

[TABLE]

of measures having uniformly bounded moments of order $1+\epsilon$ is locally uniformly $\|\cdot|^{p}$ -integrating (cf. (Claus2016, , Lemma 2.69)).

(b) For any $p\geq 1$ and compact set $\Xi\subset\mathbb{R}^{s}$ , the set

[TABLE]

of measures with support in $\Xi$ is locally uniformly $\|\cdot\|^{p}$ -integrating by (KraetschmerSchiedZaehle2017, , Lemma 5.1).

(c) Any singleton $\{\mu\}\subseteq\mathcal{M}^{p}_{s}$ is locally uniformly $\|\cdot\|^{p}$ -integrating for any $p\geq 1$ by (KraetschmerSchiedZaehle2017, , Lemma 5.2).

Theorem 3.2

Let $\mathcal{R}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ with $p\geq 1$ be law-invariant, convex and nondecreasing. Assume A1 and A2 and let $\mathcal{M}\subseteq\mathcal{M}^{p}_{s}$ be locally uniformly $\|\cdot\|^{p}$ -integrating. Then the following statements hold true:

The restriction of $\mathcal{Q}_{\mathcal{R}}$ to the set $\mathcal{S}^{n}_{+}\times\mathcal{M}$ is continuous with respect to the product topology of the the standard topology on $\mathcal{S}^{n}_{+}$ and the relative topology of weak convergence on $\mathcal{M}$ . 2. 2.

The optimal value function

[TABLE]

is weakly upper semicontinuous.

Additionally assume that $X$ is compact. Then

$\phi$ * is weakly continuous.* 2. 4.

The optimal solution set mapping

[TABLE]

is weakly upper semicontinuous in the sense of Berge, i.e. for any $\mu_{0}\in\mathcal{M}$ and any open set $\mathcal{O}\subseteq\mathcal{S}^{n}_{+}$ with $\Phi(\mu_{0})\subseteq\mathcal{O}$ there exists a weakly open neighborhood $\mathcal{N}$ of $\mu_{0}$ such that $\Phi(\mu)\subseteq\mathcal{O}$ for all $\mu\in\mathcal{N}\cap\mathcal{M}$ . Furthermore, $\Phi(\mu)$ is nonempty and compact for any $\mu\in\mathcal{M}$ .

Proof

Invoking Lemma 2, the result follows from (ClausKraetschmerSchultz2017, , Corollary 2). ∎

Corollary 1

Let $\mathcal{R}:L^{p}(\Omega,\mathcal{F},\mathbb{P})\to\mathbb{R}$ with $p\geq 1$ be law-invariant, convex and nondecreasing and assume A1 and A2. Then $Q_{\mathcal{R}}$ is continuous.

Proof

By part (c) of Example 3 we may apply the first part of Theorem 3.2 to $\mathcal{M}=\{\mathbb{P}\circ Z^{-1}\}$ . The asserted continuity follows from $Q_{\mathcal{R}}(x)=\mathcal{Q}_{\mathcal{R}}(x,\mathbb{P}\circ Z^{-1})$ for any $x\in\mathcal{S}^{n}_{+}$ . ∎

We shall now turn our attention to questions of differentiability, but confine the analysis to the risk neutral model.

Lemma 4

Assume A1, A2 and $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{1}_{s}$ , then the functional $Q_{\mathbb{E}}:\mathcal{S}^{n}_{+}\to\mathbb{R}$ , $Q_{\mathbb{E}}(x):=\mathbb{E}[F(x)]$ is directionally differentiable and

[TABLE]

holds for all $x,v\in\mathcal{S}^{n}_{+}$ .

Proof

$Q_{\mathbb{E}}$ is finite valued by Lemma 3, convex by Proposition 1 and thus directionally differentiable (cf. (Rockafellar1970, , Theorem 25.4)). Furthermore, $\varphi^{\prime}(\cdot-Tx;v)$ is a pointwise limit of measurable functions and thus measurable for any $x,v\in\mathcal{S}^{n}_{+}$ . The asserted representation of the directional derivative is justified by Lemma 2 and (Bertsekas1973, , Proposition 2.1). ∎

Sufficient conditions for differentiability $Q_{\mathbb{E}}$ can be obtained using the same arguments as for linear recourse (cf. ShapiroDentchevaRuszczynski2009 ).

Lemma 5

Assume A1, A2 and $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{1}_{s}$ and let $x_{0}\in\mathcal{S}^{n}_{+}$ be such that

[TABLE]

is a singleton for $(\mathbb{P}\circ Z^{-1})$ -almost all $z\in\mathbb{R}^{s}$ . Then $Q_{\mathbb{E}}$ is differentiable at $x_{0}$ .

Proof

For $(\mathbb{P}\circ Z^{-1})$ -almost all $z\in\mathbb{R}^{s}$ , $h_{z}:\mathcal{S}^{n}_{+}\to\mathbb{R}$ , $h_{z}(x)=c\bullet x+\varphi(z-T\bullet x)$ is differentiable with measurable derivative

[TABLE]

Consider the functions $g_{z}:\mathcal{S}^{n}_{+}\to\mathbb{R}$ defined by

[TABLE]

then $\lim_{x\to x_{0}}g_{z}(x)=0$ holds for $(\mathbb{P}\circ Z^{-1})$ -almost all $z\in\mathbb{R}^{s}$ . Furthermore, Lemma 2 implies $\|g_{z}(x)\|\leq 2(L_{\varphi}\|T\|+\|c\|)$ for all $x\in\mathcal{S}^{n}_{+}$ and $z\in\mathbb{R}^{s}$ . Hence, by Lebesgue’s dominated convergence theorem, we have

[TABLE]

Consequently, $Q_{\mathbb{E}}$ is differentiable at $x_{0}$ and $Q_{\mathbb{E}}^{\prime}(x_{0})=\int_{\mathbb{R}^{s}}h_{z}^{\prime}(x_{0})~{}(\mathbb{P}\circ Z^{-1})(dz)$ . ∎

Corollary 2

Assume A1, A2 and that $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{1}_{s}$ is absolutely continuous with respect to the Lebesgue measure. Then $Q_{\mathbb{E}}$ is continuously differentiable on $\mathcal{S}^{n}_{+}$ .

Proof

Let $N_{\varphi}\subset\mathbb{R}^{s}$ denote the set of points of nondifferentiability of $\varphi$ . By (Rockafellar1970, , Theorem 25.5),

[TABLE]

is a null set with respect to the Lebesgue measure for any $x\in\mathcal{S}^{n}_{+}$ , which implies $(\mathbb{P}\circ Z^{-1})[N_{x}]=0$ . Consequently, $Q_{\mathbb{E}}$ is differentiable on $\mathcal{S}^{n}_{+}$ . Continuity of the derivative follows from (Rockafellar1970, , Theorem 25.5) and the convexity of $Q_{\mathbb{E}}$ . ∎

Remark 3

Assuming A1, A2 and $\mathbb{P}\circ Z^{-1}\in\mathcal{M}^{1}_{s}$ , the subdifferential of $Q_{\mathbb{E}}$ admits the representation

[TABLE]

Furhter details are given in Bertsekas1973 .

Corollary 3

Assume A2 and that the underlying random variable $Z$ follows a finite discrete distribution with realizations $z_{1},\ldots,z_{S}\in\mathbb{R}^{s}$ and respective probabilities $\pi_{1},\ldots,\pi_{S}>0$ . Furthermore, assume that $\{y\in\mathcal{S}^{m}_{+}\;|\;W\bullet y=z_{i}-T\bullet x\}$ is nonempty for any $i\in\{1,\ldots,S\}$ and $x\in\mathcal{S}^{n}_{+}$ . Then

[TABLE]

holds for any $x\in\mathcal{S}^{n}_{+}$ .

Proof

The result follows directly from (Rockafellar1970, , Theorem 23.8). ∎

4 Extensive Formulations for Finite Discrete Distributions

Throughout this section, we shall assume A1, A2 and that the underlying random variable $Z$ follows a finite discrete distribution with realizations $z_{1},\ldots,z_{S}\in\mathbb{R}^{s}$ and respective probabilities $\pi_{1},\ldots,\pi_{S}>0$ . Furthermore, we denote the index set $\{1,\ldots,S\}$ by $\mathcal{I}_{S}$ .

It is well known that in the risk neutral setting, the stochastic SDP admits a reformulation as a block-structured SDP (cf. AriyawansaZhu2006 , MehrotraOezevin2007 ):

Proposition 4

The risk neutral stochastic SDP

[TABLE]

is equivalent to the SDP

[TABLE]

in the sense that the infimal values of the problems coincide. Furthermore, $x$ is an optimal solution for (8) if and only if there exist $v$ and $y_{1},\ldots,y_{S}$ such that $(x,v,y_{1},\ldots,y_{S})$ is an optimal solution for (9).

Proof

By definition of $\varphi$ ,

[TABLE]

holds for any $x\in X$ , $y_{1},\ldots,y_{S}\in\mathcal{S}^{m}_{+}$ satisfying $T\bullet x+W\bullet y_{i}=z_{i}$ for all $i\in\mathcal{I}_{S}$ . Thus, the infimal value of (8) is less or equal to the infimal value of (9). Furhtermore, (10) is satisfied as equality if and only if

[TABLE]

holds for all $i\in\mathcal{I}_{S}$ . The optimal solution set above is nonempty by strong duality, which holds due to A1 and A2. ∎

We continue with extensive formulations of the SDP (3) for mean-risk models based on the risk measures immediately following Definition 2. In this context, $\rho$ shall always be a nonnegative, predefined parameter indicating risk-aversion in the optimization.

Proposition 5

[TABLE]

with $\eta\in\mathbb{R}$ as a given parameter, can be equivalently restated as

[TABLE]

Proof

As the objective function of (12) is increasing with respect to $v$ , any optimal solution $(x,v_{1},\ldots,v_{S},y_{1},\ldots,y_{S})$ satisfies $v_{i}=\max\{c\bullet x+q\bullet y_{i}-\eta,0\}$ for all $i\in\mathcal{I}_{S}$ . The asserted equivalence of (11) and (12) then follows as in the proof of Proposition 4. ∎

Proposition 6

[TABLE]

can be equivalently restated as

[TABLE]

Proof

This follows directly from the variational representation of $\mathbb{CV}@R$ in (6). The expected-excess can be pushed into the restrictions by the same trick as in Proposition 5. ∎

As in in the risk-neutral case, problems (12) and (13) exhibit a block structure, i.e. there is no coupling constraint involving variables associated with different scenarios. This allows for a direct adaptation of the decomposition algorithms established for the expectation based model.

Proposition 7

Consider the problem

[TABLE]

with compact set $X$ . This problem can be equivalently restated as the following SDP with binary variables

[TABLE]

if $M\in\mathbb{R}$ is chosen sufficiently big.

Proof

As in the preceding propositions introduce a dummy variable $\eta$ to push $\mathbb{V}@R[\varphi(z-T\bullet x)]$ into the restrictions as $\eta\geq\mathbb{V}@R[\varphi(z-T\bullet x)]$ and minimize over $\eta$ . Note that $\eta\geq\mathbb{V}@R[\varphi(z-T\bullet x)]$ is equivalent to

[TABLE]

As for given $x\in X$ feasible points to the second stage problem corresponding to realization $z_{i}$ are denoted as $y_{i}$ , (15) can be rewritten as

[TABLE]

This conditional summation can in turn be cast into inequalities with

binary variables $\delta_{i}$ , $i\in\mathcal{I}_{S}$ ,

[TABLE]

if $M$ is chosen such that $\eta-q\bullet y_{i}<M$ for all feasible $y_{i}$ and all $\eta$ close to $\mathbb{V}@R[\varphi(z_{i}-T\bullet x)]$ . Since $-q\bullet y_{i}\leq-\varphi(z_{i}-T\bullet x)$ the existence of $M$ follows from compactness of $X$ , as $\max_{x\in X}\varphi(z_{i}-T\bullet x)<\infty$ for all $i\in\mathcal{I}_{S}$ . ∎

Unlike the previous models, (14) does not decompose scenariowise due to the coupling constraint $\sum_{i=1}^{S}\delta_{i}\,\pi_{i}\geq\alpha$ , which involves variables from all scenarios. Furthermore, it has an additional binary variable for each scenario. Problems of a similar structure have been considered in the context of minimizing a weighted sum of the expectation and the probability of exceeding a fixed threshold in SchultzWollenberg2017 , where Lagrangian relaxation of the coupling constraint enables an approach based on Bender’s decomposition. This direction seems also very promising for the algorithmic treatment of (14).

Proposition 8

[TABLE]

can be equivalently restated as

[TABLE]

Proof

Analogous to Proposition 5. ∎

Unlike (14), the equivalent SDP in Proposition 8 contains an individual coupling constraint for each scenario. While Lagrangian relaxation still is possible, it remains to be examined whether this approach is sensible form a computational point of view.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) K. A. Ariyawansa, Y. Zhu, Stochastic semidefinite programming: a new paradigm for stochastic optimization , 4OR, 4(3), pp. 239-253 (2006)
2(2) K. A. Ariyawansa, Y. Zhu, A class of polynomial volumetric barrier decomposition algorithms for stochastic semidefinite programming , Mathematics of Computation, 80, no. 275, pp.1639-1661 (2011)
3(3) D. P. Bertsekas, Stochastic optimization problems with nondifferentiable cost functionals , Journal of Optimization Theory and Applications, 12, pp. 218-231 (1973)
4(4) M. Claus, Advancing stability analysis of mean-risk stochastic programs : bilevel and two-stage models , Ph D thesis, University of Duisburg-Essen (2016)
5(5) M. Claus, V. Krätschmer and R. Schultz, Weak continuity of risk functionals with applications to stochastic programming , SIAM Journal on Optimization, 27(1), pp. 91-108 (2017)
6(6) J. Dhaene, M. Denuit, M. J. Goovaerts, R. Kaas, D. Vyncke, The concept of comonotonicity in actuarial science and finance: theory , Insurance: Math. Econom., 31, pp. 3-33 (2002)
7(7) J. Dhaene, S. Vanduffel, M. J. Goovaerts, R. Kaas, Q. Tang, D. Vyncke, Risk Measures and Comonotonicity: A Review , Stochastic Models, 22, pp. 573-606 (2006)
8(8) H. Föllmer, A. Schied, Stochastic Finance: An Introduction in Discrete Time , 2nd ed., de Gruyter Stud. Math. 27, de Gruyter, Berlin (2004)