On estimating the regular normal cone to constraint systems and stationarity conditions

Matúš Benko; Helmut Gfrerer

arXiv:1902.07512·math.OC·February 21, 2019

TL;DR

This paper introduces two new methods for estimating the regular normal cone in constraint systems and proposes a stronger stationarity concept, enhancing the derivation of necessary optimality conditions in mathematical programming.

Contribution

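The paper derives two estimates of the regular normal cone to sets of the form {x | F(x) in D} with nonconvex D: an upper estimate built from pairs of closed convex cones contained in the tangent cone to D, and a condition ensuring equality in the standard lower estimate. The upper estimate gives rise to Q- and Q_M-stationarity, the latter being stronger than M-stationarity.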

Findings

1. Two novel approaches for estimating the regular normal cone
2. Introduction of a stronger stationarity concept than M-stationarity
3. Application to three classes of mathematical programs

Abstract

Estimating the regular normal cone to constraint systems plays an important role for the derivation of sharp necessary optimality conditions. We present two novel approaches and introduce a new stationarity concept which is stronger than M-stationarity. We apply our theory to three classes of mathematical programs frequently arising in the literature.

Equations

$$\Omega:=\{x\in\mathbb{R}^{n}\,|\,F(x)\in D\}$$

$$\min f(x)\quad\mbox{subject to}\quad x\in\Omega$$

$$-\nabla f(\bar{x})\in\widehat{N}_{\Omega}(\bar{x}).$$

$$\widehat{N}_{\Omega}(\bar{x})=\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x})).$$

$$\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))\subset\widehat{N}_{\Omega}(\bar{x})$$

$$\nabla F(\bar{x})^{-1}Q:=\{u\,|\,\nabla F(\bar{x})u\in Q\}$$

$$T_{\Gamma}(\bar{z}):=\{u\in\mathbb{R}^{d}\,|\,\exists\,t_{k}\searrow 0,\;u_{k}\to u\;\mbox{ with }\;\bar{z}+t_{k}u_{k}\in\Gamma\;\forall\,k\}.$$

$$\widehat{N}_{\Gamma}(\bar{z}):=(T_{\Gamma}(\bar{z}))^{\circ}.$$

$$N_{\Gamma}(\bar{z}):=\{z^{\ast}\,|\,\exists\,z_{k}\overset{\Gamma}{\to}\bar{z},\;z_{k}^{\ast}\to z^{\ast}\;\mbox{ with }\;z_{k}^{\ast}\in\widehat{N}_{\Gamma}(z_{k})\;\forall\,k\}.$$

$$\widehat{N}_{\Gamma}(\bar{z})\subset N_{\Gamma}(\bar{z}).$$

$$(C_{1}\cup C_{2})^{\circ}=(C_{1}+C_{2})^{\circ}=C_{1}^{\circ}\cap C_{2}^{\circ},\quad(C_{1}\cap C_{2})^{\circ}={\rm cl\,}(C_{1}^{\circ}+C_{2}^{\circ})$$

$$\Big(\prod_{i=1}^{m}P_{i}\Big)^{\circ}\cap\Big(\prod_{i=1}^{m}Q_{i}\Big)^{\circ}=\Big(\prod_{i=1}^{m}P_{i}^{\circ}\Big)\cap\Big(\prod_{i=1}^{m}Q_{i}^{\circ}\Big)=\prod_{i=1}^{m}\left(P_{i}^{\circ}\cap Q_{i}^{\circ}\right)=\prod_{i=1}^{m}\left(P_{i}\cup Q_{i}\right)^{\circ}.$$

$$\{u\,|\,Au\in{\rm conv\,}C\}^{\circ}=A^{T}C^{\circ}$$

$$(AS_{1})\cap(AS_{2})=A(S_{1}\cap(\ker A+S_{2})).$$

$$T_{\Omega}(\bar{x})=T^{\rm lin}_{\Omega}(\bar{x}),$$

$$(T_{\Omega}(\bar{x}))^{\circ}=(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}.$$

$$d(u,\Psi^{-1}(v))\leq\kappa\,d(v,\Psi(u))\quad\forall(u,v)\in U\times V.$$

$$d(u,\Psi^{-1}(\bar{v}))\leq\kappa\,d(\bar{v},\Psi(u))\quad\forall u\in U.$$

$$M(x):=F(x)-D$$

$$\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))\subset\widehat{N}_{\Omega}(\bar{x}).$$

$$N_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$

$$0\in\nabla f(\bar{x})+\widehat{N}_{\Omega}(\bar{x}).$$

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x})).$$

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$

$$\langle\nabla f(\bar{x}),u\rangle\geq 0\quad\forall u\in T_{\Omega}(\bar{x})$$

$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$

$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{T_{D}(F(\bar{x}))}(0)\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}N_{T_{D}(F(\bar{x}))}(0)$$

$$(\nabla F(\bar{x})^{-1}Q_{i})^{\circ}=\nabla F(\bar{x})^{T}Q_{i}^{\circ},\quad i=1,2$$

$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\big)=(\nabla F(\bar{x})^{T}Q_{1}^{\circ})\cap(\nabla F(\bar{x})^{T}Q_{2}^{\circ}).$$

Full text

On estimating the regular normal cone to constraint systems and stationarity conditions

This is an Accepted Manuscript of an article published by Taylor & Francis in Optimization on 31 October 2016, available online: http://www.tandfonline.com/10.1080/02331934.2016.1252915

Matúš Benko, Helmut Gfrerer

Institute of Computational Mathematics, Johannes Kepler University Linz, A-4040 Linz, Austria

**Key words.** Regular normal cone; B-, M-, S-stationarity; complementarity constraints; vanishing constraints; generalized equations.

**AMS subject classification.** 49J53, 90C46.

1 Introduction

This paper deals with the computation of the regular normal cone $\widehat{N}_{\Omega}(\bar{x})$ to sets of the form

$$\Omega:=\{x\in\mathbb{R}^{n}\,|\,F(x)\in D\}\tag{1}$$

at some point $\bar{x}\in\Omega$ , where $F:\mathbb{R}^{n}\to\mathbb{R}^{m}$ is a mapping continuously differentiable at $\bar{x}$ and $D\subset\mathbb{R}^{m}$ is a closed set.

This task is of particular importance for the development of first order optimality conditions of the nonlinear program

$$\min f(x)\quad\mbox{subject to}\quad x\in\Omega\tag{2}$$

since the basic optimality condition, see e.g. [27, Theorem 6.12], states that the negative gradient of the objective at a local minimizer $\bar{x}$ belongs to the regular normal cone to the constraints at $\bar{x}$ , i.e.

$$-\nabla f(\bar{x})\in\widehat{N}_{\Omega}(\bar{x}).$$

When $D$ is convex, the computation of the regular normal cone is well understood, see e.g. [2]. Under some constraint qualification condition an exact formula reads as

$$\widehat{N}_{\Omega}(\bar{x})=\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x})).$$

The situation is considerably more complicated when $D$ is not convex. This occurs, for instance, when so-called equilibrium constraints are present among the constraints. Such programs are usually termed mathematical programs with equilibrium constraints (MPEC). The equilibrium can often be described by a lower-level optimization problem, by variational inequalities or by complementarity constraints. Some of these equilibrium constraints can be written as smooth equalities and inequalities, but these constraints usually do not satisfy the common constraint qualifications of nonlinear programming. Alternative formulations yield either a nonsmooth mapping or the system (1) with nonconvex $D$, the case considered in this paper. Prominent examples are mathematical programs with complementarity constraints (MPCC) and mathematical programs with vanishing constraints (MPVC). We refer the reader to the paper [28] for further examples.

When $D$ is not convex, only inclusions for the regular normal cone are known in general. The lower estimate is given by

$$\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))\subset\widehat{N}_{\Omega}(\bar{x})\tag{4}$$

and is known to hold with equality if the Jacobian $\nabla F(\bar{x})$ has full rank, cf. [27, Exercise 6.7]. When we have equality in (4), the corresponding optimality conditions are usually called S-stationarity (strong stationarity) conditions in the literature on mathematical programs with equilibrium constraints (MPECs). The main drawback of the S-stationarity conditions is that they require strong constraint qualification conditions.

If one weakens the constraint qualification condition, then the inclusion (4) will be strict in general. In this situation one has to consider an upper estimate for the regular normal cone $\widehat{N}_{\Omega}(\bar{x})$. A commonly used upper estimate is provided by the so-called limiting normal cone to $\Omega$ at $\bar{x}$. The limiting normal cone has the advantage that many calculus rules are available for its calculation; we refer the reader to the textbooks [22, 23, 27]. Optimality conditions based on this upper estimate are usually called M-stationarity conditions. A main disadvantage of this approach is that the regular normal cone is, in general, strictly contained in the limiting normal cone. Therefore, in general, M-stationarity does not preclude the existence of feasible descent directions.

The aim of this paper is to provide estimates to the regular normal cone $\widehat{N}_{\Omega}(\bar{x})$ which are valid under very weak constraint qualification conditions and are tighter than the one based on the limiting normal cone.

For this purpose we present two new approaches. The first one is motivated by a result due to Pang and Fukushima [24] and yields an upper bound for the regular normal cone which is exact under some suitable assumptions. This upper estimate for the regular normal cone constitutes a new stationarity concept called ${\cal Q}_{M}$-stationarity, which is shown to be stronger than M-stationarity. We apply this approach to MPCC and improve the result due to Pang and Fukushima [24]. For MPVC we derive a new qualification condition, which resembles the well-known Mangasarian-Fromovitz constraint qualification (MFCQ) of nonlinear programming and allows the exact computation of the regular normal cone for MPVC. The obtained results are much stronger than the known results from the literature [1, 3, 18, 19, 20, 21]. Finally, we analyze MPECs where the constraints are given by a generalized equation (GE) involving the normal cone mapping to $C^{2}$ inequalities together with parameter constraints. Again we derive upper bounds for the regular normal cone which can be exact under certain conditions and can be employed to replace the commonly used conditions as in [16, Theorem 3.4].

In the second approach treated in this paper we focus on the lower inclusion (4) for the regular normal cone and state a condition which ensures equality. This new condition is an extension of the recent result [10, Theorem 4] and we apply it also to MPECs with an additional parameter constraint.

The paper is organized as follows. In section 2 we present some basic definitions and results from variational analysis together with the definitions of various stationarity concepts. In section 3 we give the theoretical background for the two approaches presented in this paper for estimating the regular normal cone as well as the new concepts of ${\cal Q}$ -stationarity and ${\cal Q}_{M}$ -stationarity, respectively. In sections 4, 5 and 6 we apply the results from section 3 to MPCC, MPVC and an MPEC, respectively.

Our notation is basically standard. $K^{\circ}$ stands for the polar to a cone $K$ and $\mathop{\rm span\,}\limits\{u_{1},\ldots,u_{N}\}$ stands for the subspace generated by the vectors $u_{1},\ldots,u_{N}$ . By $\nabla F(\bar{x})$ we normally denote the Jacobian of the mapping $F$ at $\bar{x}$ , but occasionally we use it like a linear mapping to write

$$\nabla F(\bar{x})^{-1}Q:=\{u\,|\,\nabla F(\bar{x})u\in Q\}$$

for a set $Q$ . To ease the notation the Minkowski sum of a singleton $\{a\}$ and a set $A$ is denoted by $a+A$ .

2 Preliminaries

Let us start with geometric objects. Given a set $\Gamma\subset\mathbb{R}^{d}$ and a point $\bar{z}\in\Gamma$ , define the (Bouligand-Severi) tangent/contingent cone to $\Gamma$ at $\bar{z}$ by

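$$T_{\Gamma}(\bar{z}):=\{u\in\mathbb{R}^{d}\,|\,\exists\,t_{k}\searrow 0,\;u_{k}\to u\;\mbox{ with }\;\bar{z}+t_{k}u_{k}\in\Gamma\;\forall\,k\}.$$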

Note that one has $T_{\Gamma}(\bar{z})=\mathbb{R}_{+}(\Gamma-\bar{z})$ when $\Gamma$ is a convex polyhedron.

The (Fréchet) regular normal cone to $\Gamma$ at $\bar{z}\in\Gamma$ can be defined as the polar cone to the tangent cone by

$$\widehat{N}_{\Gamma}(\bar{z}):=(T_{\Gamma}(\bar{z}))^{\circ}.$$

Further, the (Mordukhovich) limiting/basic normal cone to $\Gamma$ at $\bar{z}\in\Gamma$ is given by

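$$N_{\Gamma}(\bar{z}):=\{z^{\ast}\,|\,\exists\,z_{k}\overset{\Gamma}{\to}\bar{z},\;z_{k}^{\ast}\to z^{\ast}\;\mbox{ with }\;z_{k}^{\ast}\in\widehat{N}_{\Gamma}(z_{k})\;\forall\,k\}.$$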

Note that the tangent/contingent cone and the regular normal cone reduce to the classical tangent cone and normal cone of convex analysis, respectively, when the set $\Gamma$ is convex. We put $T_{\Gamma}(\bar{z})=\widehat{N}_{\Gamma}(\bar{z})=N_{\Gamma}(\bar{z})=\emptyset$ , if $\bar{z}\not\in\Gamma$ . Note that we always have

$$\widehat{N}_{\Gamma}(\bar{z})\subset N_{\Gamma}(\bar{z}).$$

Next we recall some rules for calculating polar cones. For two closed convex cones $C_{1}$ and $C_{2}$ we have

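$$(C_{1}\cup C_{2})^{\circ}=(C_{1}+C_{2})^{\circ}=C_{1}^{\circ}\cap C_{2}^{\circ},\quad(C_{1}\cap C_{2})^{\circ}={\rm cl\,}(C_{1}^{\circ}+C_{2}^{\circ})$$

and for closed convex cones $P_{i},Q_{i}$, $i=1,\ldots,m$ we have

$$\Big(\prod_{i=1}^{m}P_{i}\Big)^{\circ}\cap\Big(\prod_{i=1}^{m}Q_{i}\Big)^{\circ}=\Big(\prod_{i=1}^{m}P_{i}^{\circ}\Big)\cap\Big(\prod_{i=1}^{m}Q_{i}^{\circ}\Big)=\prod_{i=1}^{m}\left(P_{i}^{\circ}\cap Q_{i}^{\circ}\right)=\prod_{i=1}^{m}\left(P_{i}\cup Q_{i}\right)^{\circ}.$$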

Proposition 1.

Let $A$ be an $s\times d$ matrix, let $C\subset\mathbb{R}^{s}$ be a cone and assume that either there exists some $u$ such that $Au\in{\rm ri\,}{\rm conv\,}C$ or $C$ is polyhedral, i.e. $C$ is the union of finitely many convex polyhedral cones $C_{1},\ldots,C_{p}$ . Then

$$\{u\,|\,Au\in{\rm conv\,}C\}^{\circ}=A^{T}C^{\circ}$$

Proof.

In case when there exists some $u$ with $Au\in{\rm ri\,}{\rm conv\,}C$ , the statement follows from [26, Corollary 16.3.2]. Now consider the case when $C$ is polyhedral. Then ${\rm conv\,}C=\sum_{i=1}^{p}C_{i}$ is a convex polyhedral set by [26, Corollary 19.3.2] and so is its polar $({\rm conv\,}C)^{\circ}=C^{\circ}=\bigcap_{i=1}^{p}C_{i}^{\circ}$ by [26, Corollary 19.2.2]. By virtue of [26, Theorem 19.3] the set $A^{T}C^{\circ}$ is again convex and polyhedral and now the statement follows from [26, Corollary 16.3.2] by taking into account that convex polyhedral sets are always closed. ∎

Lemma 1.

Let $A$ be an $s\times d$ matrix and let $S_{1},S_{2}\subset\mathbb{R}^{d}$ be two sets. Then

$$(AS_{1})\cap(AS_{2})=A(S_{1}\cap(\ker A+S_{2})).$$

Proof.

If $z\in(AS_{1})\cap(AS_{2})$ , then there are $s_{1}\in S_{1}$ , $s_{2}\in S_{2}$ with $z=As_{1}=As_{2}$ . Since $s_{1}=s_{2}+(s_{1}-s_{2})$ and $A(s_{1}-s_{2})=0$ , the properties $s_{1}\in S_{1}\cap(\ker A+S_{2})$ and $z\in A(S_{1}\cap(\ker A+S_{2}))$ follow. Conversely, if $z\in A(S_{1}\cap(\ker A+S_{2}))$ , then there are $s_{1}\in S_{1}$ , $s_{2}\in S_{2}$ and $r\in\ker A$ such that $s_{1}=r+s_{2}$ and $z=As_{1}\in AS_{1}$ . It follows that $z=A(r+s_{2})=As_{2}\in AS_{2}$ and thus $z\in(AS_{1})\cap(AS_{2})$ . ∎
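The identity of Lemma 1 can be checked on small finite sets. The sketch below is only an illustration under assumptions that are not taken from the paper: the integer matrix `A`, the sets `S1`, `S2` and all function names are hypothetical. Membership of $s_{1}$ in $\ker A+S_{2}$ is tested through the equivalent condition $A(s_{1}-s_{2})=0$ for some $s_{2}\in S_{2}$, which is exactly the step used in the proof.

```python
# Finite-set check of (A S1) n (A S2) = A(S1 n (ker A + S2)).
# A, S1 and S2 are hypothetical integer data chosen for illustration.

def apply(A, s):
    """Matrix-vector product A s for a matrix given as a list of rows."""
    return tuple(sum(a * x for a, x in zip(row, s)) for row in A)

def intersection_of_images(A, S1, S2):
    """Left-hand side: (A S1) intersected with (A S2)."""
    return {apply(A, s) for s in S1} & {apply(A, s) for s in S2}

def image_of_restricted_set(A, S1, S2):
    """Right-hand side: A applied to S1 intersected with (ker A + S2)."""
    zero = tuple(0 for _ in A)
    # s1 lies in ker A + S2 iff s1 - s2 is in ker A for some s2 in S2.
    kept = [
        s1 for s1 in S1
        if any(apply(A, tuple(a - b for a, b in zip(s1, s2))) == zero for s2 in S2)
    ]
    return {apply(A, s) for s in kept}

A = [[1, 0]]                      # projection onto the first coordinate
S1 = {(0, 0), (1, 5), (2, 1)}
S2 = {(1, -3), (2, 2), (3, 0)}

lhs = intersection_of_images(A, S1, S2)
rhs = image_of_restricted_set(A, S1, S2)
print(lhs == rhs, sorted(lhs))
```

Integer data keep the kernel test exact; with floating-point matrices a tolerance would be needed.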

We now introduce generalizations of the Abadie constraint qualification condition and the Guignard constraint qualification condition, respectively, as known from nonlinear programming.

Definition 1.

Let $\Omega$ be given by (1) and let $\bar{x}\in\Omega$ .

1. We say that the generalized Abadie constraint qualification (GACQ) holds at $\bar{x}$ if

$$T_{\Omega}(\bar{x})=T^{\rm lin}_{\Omega}(\bar{x}),$$

where $T^{\rm lin}_{\Omega}(\bar{x}):=\{u\in\mathbb{R}^{n}\,|\,\nabla F(\bar{x})u\in T_{D}(F(\bar{x}))\}$ denotes the linearized cone.

2. We say that the generalized Guignard constraint qualification (GGCQ) holds at $\bar{x}$ if

$$(T_{\Omega}(\bar{x}))^{\circ}=(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}.$$

Obviously GGCQ is weaker than GACQ, but GACQ is easier to verify because several advanced methods from variational analysis are available. To this end we need the concepts of metric regularity and metric subregularity of multifunctions.
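A standard illustration of the gap between the two conditions, which is not taken from the paper, is the complementarity system written with smooth data: $F(x)=(x_{1},x_{2},x_{1}x_{2})$, $D=\mathbb{R}_{+}\times\mathbb{R}_{+}\times\{0\}$ and $\bar{x}=0$. Here $T_{\Omega}(0)$ is the union of the two nonnegative coordinate axes, whereas $T^{\rm lin}_{\Omega}(0)=\mathbb{R}^{2}_{+}$, so GACQ fails while both cones have the same polar and GGCQ holds. The following sketch checks this on an integer grid. All names and the grid are assumptions made for the illustration, and the grid test of the polar is reliable here only because both cones are generated by grid points.

```python
# GACQ versus GGCQ for F(x) = (x1, x2, x1*x2), D = R_+ x R_+ x {0} at x = 0
# (assumed textbook-style example, not from the paper).

def in_tangent_cone(u):
    """T_Omega(0): union of the two nonnegative coordinate axes."""
    return u[0] >= 0 and u[1] >= 0 and u[0] * u[1] == 0

def in_linearized_cone(u):
    """T^lin_Omega(0) = {u : (u1, u2, 0) in R_+ x R_+ x {0}}."""
    return u[0] >= 0 and u[1] >= 0

GRID = [(i, j) for i in range(-3, 4) for j in range(-3, 4)]

def in_polar(v, member):
    """v is in the polar iff <v, u> <= 0 for every grid direction u of the cone."""
    return all(v[0] * u[0] + v[1] * u[1] <= 0 for u in GRID if member(u))

# GACQ compares the cones themselves, GGCQ only their polars.
gacq_holds = all(in_tangent_cone(u) == in_linearized_cone(u) for u in GRID)
ggcq_holds = all(
    in_polar(v, in_tangent_cone) == in_polar(v, in_linearized_cone) for v in GRID
)
print("GACQ:", gacq_holds, "GGCQ:", ggcq_holds)
```

The direction $(1,1)$ belongs to the linearized cone but not to the tangent cone, which is what makes `gacq_holds` false.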

Definition 2.

Let $\Psi:\mathbb{R}^{d}\rightrightarrows\mathbb{R}^{s}$ be a multifunction, $(\bar{u},\bar{v})\in{\rm gph\,}\Psi$ and $\kappa>0$ . Then

1. $\Psi$ is called metrically regular with modulus $\kappa$ near $(\bar{u},\bar{v})$ if there are neighborhoods $U$ of $\bar{u}$ and $V$ of $\bar{v}$ such that

$$d(u,\Psi^{-1}(v))\leq\kappa\,d(v,\Psi(u))\quad\forall(u,v)\in U\times V.$$

2. $\Psi$ is called metrically subregular with modulus $\kappa$ at $(\bar{u},\bar{v})$ if there is a neighborhood $U$ of $\bar{u}$ such that

$$d(u,\Psi^{-1}(\bar{v}))\leq\kappa\,d(\bar{v},\Psi(u))\quad\forall u\in U.$$

It is well known that metric regularity of the multifunction $\Psi$ near $(\bar{u},\bar{v})$ is equivalent to the Aubin property (also called Lipschitz-like or pseudo-Lipschitz property) of the inverse multifunction $\Psi^{-1}$, and that metric subregularity of $\Psi$ at $(\bar{u},\bar{v})$ is equivalent to calmness of its inverse.

Obviously, metric regularity of $\Psi$ near $(\bar{u},\bar{v})$ implies metric subregularity of $\Psi$ at $(\bar{u},\bar{v})$ .

Proposition 2 (cf.[14, Proposition 1]).

Let $\bar{x}$ belong to the set $\Omega$ given by (1). If the perturbation mapping

$$M(x):=F(x)-D\tag{14}$$

associated with the constraint system (1) is metrically subregular at $(\bar{x},0)$, then GACQ holds at $\bar{x}$.

Metric regularity of the mapping (14) can be verified by the so-called Mordukhovich criterion, see, e.g., [27, Example 9.44]. Tools for verifying metric subregularity of constraint systems can be found e.g. in [9].
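A one-dimensional example, ours and not from the paper, may help to see what metric subregularity of the perturbation mapping asks for. Take $D=\{0\}$ and compare $F(x)=x$ with $F(x)=x^{2}$ at $\bar{x}=0$. In both cases $M^{-1}(0)=\{0\}$, so $d(x,M^{-1}(0))=|x|$ and $d(0,M(x))=|F(x)|$, and subregularity at $(0,0)$ requires the ratio $|x|/|F(x)|$ to stay bounded near $0$. The sketch below evaluates this ratio on a sequence tending to zero; the function name and the sample points are assumptions.

```python
# Ratio d(x, M^{-1}(0)) / d(0, M(x)) for M(x) = F(x) - {0} near x = 0.
# Both example mappings vanish only at 0, so the ratio is |x| / |F(x)|.

def subregularity_ratio(F, x):
    """Smallest modulus kappa that makes the subregularity estimate hold at x."""
    return abs(x) / abs(F(x))

samples = [10.0 ** (-k) for k in range(1, 7)]

linear_ratios = [subregularity_ratio(lambda x: x, x) for x in samples]
quadratic_ratios = [subregularity_ratio(lambda x: x * x, x) for x in samples]

for x, r_lin, r_quad in zip(samples, linear_ratios, quadratic_ratios):
    print(f"x = {x:.0e}   F(x) = x: {r_lin:.1f}   F(x) = x^2: {r_quad:.3g}")
```

For $F(x)=x$ the ratio stays at $1$, so $\kappa=1$ works. For $F(x)=x^{2}$ the ratio behaves like $1/|x|$ and no modulus exists. This is consistent with Proposition 2: in the quadratic case $T_{\Omega}(0)=\{0\}$ while $T^{\rm lin}_{\Omega}(0)=\mathbb{R}$, so GACQ fails as well.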

The following theorem states some fundamental relations between the regular and the limiting normal cone.

Theorem 1.

Let $\Omega$ be given by (1) and let $\bar{x}\ \in\Omega$ . Then

$$\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))\subset\widehat{N}_{\Omega}(\bar{x}).\tag{15}$$

On the other hand, if the multifunction (14) is metrically subregular at $(\bar{x},0)$ then

$$N_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).\tag{16}$$

If $\nabla F(\bar{x})$ has full rank, then both inclusions (15) and (16) hold with equality.

Proof.

The inclusion (15) can be found in [27, Theorem 6.14], whereas (16) follows from [15, Theorem 4.1]. For the statement on equality in the inclusions we refer to [27, Exercise 6.7]. ∎

At the end of this section we consider different stationarity concepts.

Definition 3.

Let $\bar{x}$ be feasible for the program (2), where $\Omega$ is given by (1) and $f$ is assumed to be smooth.

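1. We say that $\bar{x}$ is B-stationary (Bouligand stationary) if

$$0\in\nabla f(\bar{x})+\widehat{N}_{\Omega}(\bar{x}).$$

2. We say that $\bar{x}$ is S-stationary (strongly stationary) if

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x})).$$

3. We say that $\bar{x}$ is M-stationary (Mordukhovich stationary) if

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$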

By the definition of the regular normal cone we have

$$\langle\nabla f(\bar{x}),u\rangle\geq 0\quad\forall u\in T_{\Omega}(\bar{x})$$

at a B-stationary point, which expresses that no feasible descent direction exists. Every local minimizer is known to be B-stationary. Conversely, if $\bar{x}$ is B-stationary then there exists some smooth mapping $\hat{f}:\mathbb{R}^{n}\to\mathbb{R}$ with $\nabla\hat{f}(\bar{x})=\nabla f(\bar{x})$ such that $\bar{x}$ is a global minimizer of the problem $\min_{x\in\Omega}\hat{f}(x)$, cf. [27, Theorem 6.11].

From (15) it is easy to see that every S-stationary point is also B-stationary, but the reverse statement is not true in general, unless we have equality in (15).

On the other hand, a B-stationary point $\bar{x}$ is also M-stationary provided that the perturbation mapping $M$ is metrically subregular at $(\bar{x},0)$ . However, M-stationarity does not preclude the existence of feasible descent directions, unless we have $\widehat{N}_{\Omega}(\bar{x})=N_{\Omega}(\bar{x})=\nabla F(\bar{x})^{T}N_{D}(F(\bar{x}))$ .
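The gap between M-stationarity and B-stationarity can be seen on a small example that is ours and not from the paper: minimize $f(x)=-x_{1}$ over the complementarity set $\Omega=D=\{x\in\mathbb{R}^{2}\,|\,x_{1}\geq 0,\ x_{2}\geq 0,\ x_{1}x_{2}=0\}$ with $F$ the identity and $\bar{x}=0$. The sketch below builds the regular normal cone as the polar of the tangent cone and the limiting normal cone as the union of the regular normal cones at the three kinds of nearby feasible points (the origin, the positive $x_{1}$-axis, the positive $x_{2}$-axis). All names are assumptions. Since $F$ is the identity, S- and B-stationarity coincide here.

```python
# Stationarity at the origin for min -x1 over the complementarity set in R^2
# (assumed illustrative example; F is the identity, so N_Omega = N_D).

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]

def in_polar(v, generators):
    """v is in the polar of the cone spanned by the given rays."""
    return all(dot(v, g) <= 0 for g in generators)

# Tangent directions at the origin and at points on the open positive axes.
TANGENT_AT_ORIGIN = [(1, 0), (0, 1)]
TANGENT_ON_X_AXIS = [(1, 0), (-1, 0)]
TANGENT_ON_Y_AXIS = [(0, 1), (0, -1)]

def in_regular_normal_cone(v):
    return in_polar(v, TANGENT_AT_ORIGIN)

def in_limiting_normal_cone(v):
    # Limits of regular normals at feasible points converging to the origin.
    return any(
        in_polar(v, gens)
        for gens in (TANGENT_AT_ORIGIN, TANGENT_ON_X_AXIS, TANGENT_ON_Y_AXIS)
    )

grad_f = (-1.0, 0.0)
neg_grad = tuple(-g for g in grad_f)

is_b_stationary = in_regular_normal_cone(neg_grad)
is_m_stationary = in_limiting_normal_cone(neg_grad)
descent_directions = [u for u in TANGENT_AT_ORIGIN if dot(grad_f, u) < 0]
print(is_b_stationary, is_m_stationary, descent_directions)
```

The origin is M-stationary but not B-stationary, and the tangent direction $(1,0)$ is a feasible descent direction, in line with the remark that M-stationarity does not preclude such directions.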

Since we have $\widehat{N}_{\Omega}(\bar{x})\subset N_{\Omega}(\bar{x})$ by the definition, we derive from Theorem 1 the inclusion

$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x}))$$

under the assumption of metric subregularity of (14) at $(\bar{x},0)$ . This relation can be strengthened by the following proposition.

Proposition 3.

Let $\Omega$ be given by (1), let $\bar{x}\ \in\Omega$ and assume that GGCQ is fulfilled, while the mapping $u\rightrightarrows\nabla F(\bar{x})u-T_{D}(F(\bar{x}))$ is metrically subregular at $(0,0)$ . Then

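$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}N_{T_{D}(F(\bar{x}))}(0)\subset\nabla F(\bar{x})^{T}N_{D}(F(\bar{x})).$$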

Proof.

By virtue of GGCQ we have $\widehat{N}_{\Omega}(\bar{x})=(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}=\widehat{N}_{T^{\rm lin}_{\Omega}(\bar{x})}(0)$ and since $u\rightrightarrows\nabla F(\bar{x})u-T_{D}(F(\bar{x}))$ is assumed to be metrically subregular at $(0,0)$ , we can apply Theorem 1 to obtain $\widehat{N}_{T^{\rm lin}_{\Omega}(\bar{x})}(0)\subset N_{T^{\rm lin}_{\Omega}(\bar{x})}(0)\subset\nabla F(\bar{x})^{T}N_{T_{D}(F(\bar{x}))}(0)$ . By [27, Proposition 6.27] we have $N_{T_{D}(F(\bar{x}))}(0)\subset N_{D}(F(\bar{x}))$ and this finishes the proof. ∎

If $T_{D}(F(\bar{x}))$ is the union of finitely many convex polyhedral cones, then the mapping $u\rightrightarrows\nabla F(\bar{x})u-T_{D}(F(\bar{x}))$ is a polyhedral multifunction and consequently metrically subregular at $(0,0)$ by Robinson’s result [25]. Hence we arrive at the following corollary which slightly improves [6, Theorem 7].

Corollary 1.

Let $\bar{x}$ be B-stationary for the program (2), where $\Omega$ is given by (1) and $f$ is assumed to be smooth. If GGCQ is fulfilled at $\bar{x}$ and $T_{D}(F(\bar{x}))$ is the union of finitely many convex polyhedral cones, then $\bar{x}$ is M-stationary and even the stronger condition

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}N_{T_{D}(F(\bar{x}))}(0)$$

holds.

3 Estimating the regular normal cone

Throughout this section we assume that the set $\Omega$ is given by (1), where $F:\mathbb{R}^{n}\to\mathbb{R}^{m}$ is continuously differentiable at the reference point $\bar{x}\in\Omega$ and $D\subset\mathbb{R}^{m}$ is closed. Further we assume that the objective $f:\mathbb{R}^{n}\to\mathbb{R}$ of the program (2) is continuously differentiable at $\bar{x}$ and GGCQ holds.

The main goal of this section is to provide a tight estimate for the regular normal cone $\widehat{N}_{\Omega}(\bar{x})$, which, thanks to GGCQ, amounts to $(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}$. To this end we discuss two possibilities. The first one is motivated by the paper of Pang and Fukushima [24] and is based on the following observation.

Theorem 2.

Let $Q_{1}$ and $Q_{2}$ denote two closed convex cones contained in $T_{D}(F(\bar{x}))$ . If

$$(\nabla F(\bar{x})^{-1}Q_{i})^{\circ}=\nabla F(\bar{x})^{T}Q_{i}^{\circ},\quad i=1,2,$$

then

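$$\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\big)=(\nabla F(\bar{x})^{T}Q_{1}^{\circ})\cap(\nabla F(\bar{x})^{T}Q_{2}^{\circ}).\tag{18}$$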

Further, if

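$$\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\big)\subset\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x})),\tag{19}$$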

then equality holds in (18).

Proof.

Since $\nabla F(\bar{x})^{-1}Q_{i}\subset\nabla F(\bar{x})^{-1}T_{D}(F(\bar{x}))=T^{\rm lin}_{\Omega}(\bar{x})$ , $i=1,2$ we have

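$$\widehat{N}_{\Omega}(\bar{x})=(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}\subset(\nabla F(\bar{x})^{-1}Q_{i})^{\circ}=\nabla F(\bar{x})^{T}Q_{i}^{\circ},\quad i=1,2,$$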

and (18) follows from Lemma 1. To show the sufficiency of condition (19) for equality in (18), note that condition (19) together with (18) implies $\widehat{N}_{\Omega}(\bar{x})\subset\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))$ . Now, equality in (18) follows from (15). ∎

The proper choice of $Q_{1}$ and $Q_{2}$ is crucial in order that (18) provides a good estimate for the regular normal cone. It is obvious that we want to choose the cones $Q_{i}$ , $i=1,2$ as large as possible in order that the inclusion (18) is tight. Further it is reasonable that a good choice of $Q_{1},Q_{2}$ fulfills

$$Q_{1}^{\circ}\cap Q_{2}^{\circ}=\widehat{N}_{D}(F(\bar{x})),$$

because then condition (19) holds whenever $\nabla F(\bar{x})$ has full rank.

Since $Q_{i}\subset T_{D}(F(\bar{x}))$ , we have $Q_{i}^{\circ}\supset(T_{D}(F(\bar{x})))^{\circ}=\widehat{N}_{D}(F(\bar{x}))$ , $i=1,2$ and consequently, $Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\supset Q_{1}^{\circ}\cap Q_{2}^{\circ}\supset\widehat{N}_{D}(F(\bar{x}))$ . Hence the inclusion (19) can never be strict.
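To see how the estimate (18) can be exact although the lower estimate (15) is strict, consider the following toy instance, which is our assumption and not an example from the paper: $n=1$, $F(x)=(x,x)$, $D$ the complementarity set in $\mathbb{R}^{2}$, $\bar{x}=0$, $Q_{1}=\mathbb{R}_{+}\times\{0\}$ and $Q_{2}=\{0\}\times\mathbb{R}_{+}$. Then $\Omega=\{0\}$, GGCQ holds and $\widehat{N}_{\Omega}(0)=\mathbb{R}$. Since all cones involved are polyhedral, Proposition 1 allows us to compute $\nabla F(\bar{x})^{T}Q_{i}^{\circ}$ as the polar of the preimage $\nabla F(\bar{x})^{-1}Q_{i}$. In the sketch, a cone in $\mathbb{R}$ is encoded by the subset of the directions $\{+1,-1\}$ it contains; this encoding and all names are hypothetical.

```python
# Theorem 2 on a toy instance: F(x) = (x, x), D = complementarity set, x = 0.
# A cone in R is encoded by the set of directions in {+1, -1} that it contains
# (the origin always belongs to a cone).

A = (1.0, 1.0)  # Jacobian of the assumed map F at 0, acting as u -> (u, u)

def in_Q1(t):
    """Q1 = R_+ x {0}."""
    return t[0] >= 0 and t[1] == 0

def in_Q2(t):
    """Q2 = {0} x R_+."""
    return t[0] == 0 and t[1] >= 0

def in_conv_tangent(t):
    """conv T_D(0) = R_+^2, whose polar is the regular normal cone of D at 0."""
    return t[0] >= 0 and t[1] >= 0

def preimage(in_cone):
    """Directions u in {+1, -1} whose image A*u lies in the given cone."""
    return frozenset(u for u in (1, -1) if in_cone((A[0] * u, A[1] * u)))

def polar(directions):
    """Polar in R: d belongs to it iff d*s <= 0 for every direction s of the cone."""
    return frozenset(d for d in (1, -1) if all(d * s <= 0 for s in directions))

upper_estimate = polar(preimage(in_Q1)) & polar(preimage(in_Q2))  # right side of (18)
lower_estimate = polar(preimage(in_conv_tangent))                 # left side of (15)
print(sorted(upper_estimate), sorted(lower_estimate))
```

The upper estimate contains both directions, i.e. it equals $\mathbb{R}=\widehat{N}_{\Omega}(0)$, whereas the lower estimate is only $(-\infty,0]$. For instance, $f(x)=-x$ is minimized at the only feasible point $\bar{x}=0$, which is therefore B-stationary but not S-stationary.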

The following definition is motivated by Theorem 2.

Definition 4.

Let $\cal Q$ denote some collection of pairs $(Q_{1},Q_{2})$ of closed convex cones fulfilling

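$$Q_{i}\subset T_{D}(F(\bar{x})),\quad(\nabla F(\bar{x})^{-1}Q_{i})^{\circ}=\nabla F(\bar{x})^{T}Q_{i}^{\circ},\quad i=1,2.$$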

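(i) Given $(Q_{1},Q_{2})\in{\cal Q}$ we say that $\bar{x}$ is ${\cal Q}$-stationary with respect to $(Q_{1},Q_{2})$ for the program (2), if

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\big).$$

(ii) We say that $\bar{x}$ is ${\cal Q}$-stationary for the program (2), if $\bar{x}$ is ${\cal Q}$-stationary with respect to some pair $(Q_{1},Q_{2})\in{\cal Q}$.

(iii) We say that $\bar{x}$ is ${\cal Q}_{M}$-stationary, if there exists a pair $(Q_{1},Q_{2})\in{\cal Q}$ such that

$$0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\cap N_{D}(F(\bar{x}))\big).$$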

The following corollary follows immediately from the definitions and Theorem 2.

Corollary 2.

Assume that $\bar{x}$ is B-stationary for the program (2). Then $\bar{x}$ is ${\cal Q}$ -stationary with respect to every pair $(Q_{1},Q_{2})\in{\cal Q}$ . Conversely, if $\bar{x}$ is ${\cal Q}$ -stationary with respect to some pair $(Q_{1},Q_{2})\in{\cal Q}$ fulfilling condition (19), then $\bar{x}$ is S-stationary and consequently, also B-stationary.

The following lemma follows immediately from (18) and the definition of ${\cal Q}$ -stationarity.

Lemma 2.

Let $(Q_{1},Q_{2})\in{\cal Q}$ . Then $\bar{x}$ is ${\cal Q}$ -stationary with respect to $(Q_{1},Q_{2})$ for the program (2) if and only if $-\nabla f(\bar{x})\in\nabla F(\bar{x})^{T}Q_{i}^{\circ}$ , $i=1,2$ .

Corollary 3.

Let $\bar{x}$ be S-stationary for the program (2). Then $\bar{x}$ is ${\cal Q}$ -stationary with respect to every $(Q_{1},Q_{2})\in{\cal Q}$ .

Proof.

Since $Q_{i}\subset T_{D}(F(\bar{x}))$ , we have $\widehat{N}_{D}(F(\bar{x}))\subset Q_{i}^{\circ}$ , $i=1,2$ . Hence S-stationarity of $\bar{x}$ implies

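$$-\nabla f(\bar{x})\in\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))\subset\nabla F(\bar{x})^{T}Q_{i}^{\circ},\quad i=1,2,$$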

and the assertion follows from Lemma 2.

∎

Remark 1.

Note that for $i=1,2$ the program

[TABLE]

is a convex program and therefore the first-order optimality condition

[TABLE]

is both necessary and sufficient in order that $u=0$ is a solution of $(P_{i})$. Hence $\bar{x}$ is ${\cal Q}$-stationary with respect to $(Q_{1},Q_{2})$ if and only if $u=0$ is a solution of both programs $(P_{1})$ and $(P_{2})$.

By the definition, a ${\cal Q}_{M}$ -stationary point is both M-stationary and ${\cal Q}$ -stationary. However, a B-stationary point is ${\cal Q}_{M}$ -stationary only under some additional condition. This is due to the fact that under the assumptions of Theorem 1 we have

[TABLE]

but in general

[TABLE]

Clearly, equality holds when $\nabla F(\bar{x})$ possesses full row rank, but in this case a B-stationary point is already S-stationary. In the following theorem we state three more sufficient conditions ensuring ${\cal Q}_{M}$-stationarity of a B-stationary point.

Theorem 3.

Assume that $\bar{x}$ is B-stationary for the program (2). Then $\bar{x}$ is ${\cal Q}_{M}$ -stationary if any of the following three conditions holds:

1. There exists a pair $(Q_{1},Q_{2})\in{\cal Q}$ such that

[TABLE]

2. $\bar{x}$ is M-stationary and for every $\lambda\in N_{D}(F(\bar{x}))$ there is some pair $(Q_{1},Q_{2})\in{\cal Q}$ with $\lambda\in Q_{1}^{\circ}$.

3. $T_{D}(F(\bar{x}))$ is the union of finitely many convex polyhedral sets and for every $t\in T_{D}(F(\bar{x}))$ there is some pair $(Q_{1},Q_{2})\in{\cal Q}$ satisfying $t\in Q_{1}$.

Proof.

Under the condition (22), ${\cal Q}_{M}$ -stationarity of $\bar{x}$ follows immediately from the definition and Corollary 2. Let us prove the second case. Since $\bar{x}$ is M-stationary, there exists some $\lambda\in N_{D}(F(\bar{x}))$ verifying $-\nabla f(\bar{x})=\nabla F(\bar{x})^{T}\lambda$ and by the assumption there is some $(Q_{1},Q_{2})\in{\cal Q}$ with $\lambda\in Q_{1}^{\circ}$ implying $-\nabla f(\bar{x})\in\nabla F(\bar{x})^{T}\big{(}Q_{1}^{\circ}\cap N_{D}(F(\bar{x}))\big{)}$ . By using that $\bar{x}$ is B-stationary and therefore also ${\cal Q}$ -stationary with respect to $(Q_{1},Q_{2})$ by Corollary 2, by virtue of Lemmas 2 and 1 we obtain

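$$-\nabla f(\bar{x})\in\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap N_{D}(F(\bar{x}))\big)\cap\nabla F(\bar{x})^{T}Q_{2}^{\circ}=\nabla F(\bar{x})^{T}\big(Q_{1}^{\circ}\cap N_{D}(F(\bar{x}))\cap(\ker\nabla F(\bar{x})^{T}+Q_{2}^{\circ})\big),\tag{23}$$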

showing ${\cal Q}_{M}$ -stationarity of $\bar{x}$ . Now let us prove the sufficiency of the third condition. By Corollary 1 there is some $\lambda\in N_{T_{D}(F(\bar{x}))}(0)$ with $-\nabla f(\bar{x})=\nabla F(\bar{x})^{T}\lambda$ and by using [7, Lemma 3.4], we can find some $t\in T_{D}(F(\bar{x}))$ with $\lambda\in\widehat{N}_{T_{D}(F(\bar{x}))}(t)$ . Our assumption guarantees that there is some pair $(Q_{1},Q_{2})\in{\cal Q}$ with $t\in Q_{1}\subset T_{D}(F(\bar{x}))$ and therefore $\lambda\in\widehat{N}_{T_{D}(F(\bar{x}))}(t)\subset\widehat{N}_{Q_{1}}(t)=Q_{1}^{\circ}\cap\{t\}^{\perp}\subset Q_{1}^{\circ}$ by convexity of $Q_{1}$ . By [27, Proposition 6.27] we obtain $\lambda\in N_{T_{D}(F(\bar{x}))}(0)\subset N_{D}(F(\bar{x}))$ and the same arguments as used just before yield (23) showing ${\cal Q}_{M}$ -stationarity of $\bar{x}$ . ∎

We summarize the relations between the various stationarity concepts in the following picture.

[TABLE]

Below we will work out the concepts of ${\cal Q}$ - and ${\cal Q}_{M}$ -stationarity for the special cases of mathematical programs with complementarity constraints, vanishing constraints and constraints involving a generalized equation, respectively, and in the first two cases we will present explicit expressions for the pair $(Q_{1},Q_{2})$ establishing ${\cal Q}_{M}$ -stationarity.

Now we consider another possibility to estimate the regular normal cone to $\Omega$ , which is an enhancement of the approach used in the recent paper [10]. For every nonempty convex cone $Q\subset\mathbb{R}^{m}$ we define

[TABLE]

i.e. $\bar{\cal T}(Q)$ is the collection of all $t\in T_{D}(F(\bar{x}))$ such that there are $u\in\mathbb{R}^{n}$ and $q\in Q$ with

[TABLE]

Further we define

[TABLE]

It is easy to see that both $\bar{\cal T}(Q)$ and $\bar{\cal C}(Q)$ are cones, that $\bar{\cal C}(Q)$ is convex and that $T_{D}(F(\bar{x}))\cap Q\subset\bar{\cal T}(Q)$ .

Theorem 4.

For every nonempty convex cone $Q\subset\mathbb{R}^{m}$ satisfying

[TABLE]

there holds

$$\widehat{N}_{\Omega}(\bar{x})=(\bar{\cal C}(Q))^{\circ}.$$

Proof.

We first show the inclusion $\widehat{N}_{\Omega}(\bar{x})\subset(\bar{\cal C}(Q))^{\circ}$ . Let $x^{\ast}\in\widehat{N}_{\Omega}(\bar{x})$ be arbitrarily fixed. In order to show $x^{\ast}\in(\bar{\cal C}(Q))^{\circ}$ we have to prove $\langle x^{\ast},u\rangle\leq 0\ \forall u\in\bar{\cal C}(Q)$ . Consider any $u\in\bar{\cal C}(Q)$ . Since $\nabla F(\bar{x})u\in{\rm conv\,}\bar{\cal T}(Q)$ , $\nabla F(\bar{x})u$ can be represented as convex combination $\sum_{i=1}^{N}\alpha_{i}t_{i}$ of elements $t_{i}\in\bar{\cal T}(Q)$ , $i=1,\ldots,N$ with coefficients $\alpha_{i}\in[0,1]$ , $\sum_{i=1}^{N}\alpha_{i}=1$ . By the definition of the set $\bar{\cal T}(Q)$ we can find for each $i=1,\ldots,N$ , elements $u_{i}\in\mathbb{R}^{n}$ and $q_{i}\in Q$ such that

[TABLE]

By taking into account that $x^{\ast}\in\widehat{N}_{\Omega}(\bar{x})=\{u\,|\,\nabla F(\bar{x})u\in T_{D}(F(\bar{x}))\}^{\circ}$ by GGCQ, we obtain $\langle x^{\ast},u_{i}\rangle\leq 0$ $\forall i$ . Further we have

[TABLE]

and therefore

[TABLE]

Since $Q$ is assumed to be convex, we conclude $\nabla F(\bar{x})(u-\sum_{i=1}^{N}\alpha_{i}u_{i})\in Q$ and hence, by using (24), we can argue $\langle x^{\ast},u-\sum_{i=1}^{N}\alpha_{i}u_{i}\rangle\leq 0$ . This yields

[TABLE]

and, since $u\in\bar{\cal C}(Q)$ was arbitrary, we derive the claimed inclusion $x^{\ast}\in(\bar{\cal C}(Q))^{\circ}$ . In order to show the reverse inclusion $\widehat{N}_{\Omega}(\bar{x})\supset(\bar{\cal C}(Q))^{\circ}$ consider $x^{\ast}\in(\bar{\cal C}(Q))^{\circ}$ . Then for arbitrary $u\in T^{\rm lin}_{\Omega}(\bar{x})$ we have

[TABLE]

showing $t\in\bar{\cal T}(Q)$ and $u\in\bar{\cal C}(Q)$ . Hence, $\langle x^{\ast},u\rangle\leq 0$ and, because $u\in T^{\rm lin}_{\Omega}(\bar{x})$ was chosen arbitrarily, we conclude $x^{\ast}\in(T^{\rm lin}_{\Omega}(\bar{x}))^{\circ}=\widehat{N}_{\Omega}(\bar{x})$ by GGCQ, and $(\bar{\cal C}(Q))^{\circ}\subset\widehat{N}_{\Omega}(\bar{x})$ follows. ∎

Remark 2.

In particular, condition (24) is fulfilled if $Q\subset T_{D}(F(\bar{x}))$ .

Of course, in practice it is a difficult task to compute $(\bar{\cal C}(Q))^{\circ}$ . In practical applications, for given $Q$ we try to find a cone $\tilde{\cal T}\subset\bar{\cal T}(Q)$ and then apply Proposition 1 to obtain

[TABLE]

provided there exists some $u$ with $\nabla F(\bar{x})u\in{\rm ri\,}{\rm conv\,}\tilde{\cal T}$ or $\tilde{\cal T}$ is polyhedral. Using (26) we obtain the following corollary from Theorem 4.

Corollary 4.

Assume that there exists some convex cone $Q\subset\mathbb{R}^{m}$ fulfilling (24) and some cone $\tilde{\cal T}\subset\bar{\cal T}(Q)$ such that $\widehat{N}_{D}(F(\bar{x}))=\tilde{\cal T}^{\circ}$ and either there is some $u\in\mathbb{R}^{n}$ with $\nabla F(\bar{x})u\in{\rm ri\,}{\rm conv\,}\tilde{\cal T}$ or $\tilde{\cal T}$ is polyhedral. Then

[TABLE]

Proof.

Using (15), Theorem 4 and (26) together with the assumptions of the corollary we obtain

[TABLE]

and the assertion follows. ∎

4 Application to MPCC

In this section we consider a mathematical program with complementarity constraints (MPCC) of the form

[TABLE]

where $f:\mathbb{R}^{n}\to\mathbb{R}$ , $h:\mathbb{R}^{n}\to\mathbb{R}^{m_{E}}$ , $g:\mathbb{R}^{n}\to\mathbb{R}^{m_{I}}$ , $G:\mathbb{R}^{n}\to\mathbb{R}^{m_{C}}$ and $H:\mathbb{R}^{n}\to\mathbb{R}^{m_{C}}$ are assumed to be continuously differentiable. There are several possibilities to write the constraints of (4) in the form (1); here we use the formulation with

[TABLE]

where

[TABLE]

In what follows we denote the feasible set of (4) by $\Omega_{C}$ . Given a feasible point $\bar{x}\in\Omega_{C}$ we introduce the following index sets of constraints active at $\bar{x}$ :

[TABLE]

Straightforward calculations yield that

[TABLE]

with $T_{\mathbb{R}^{m_{I}}_{-}}(g(\bar{x}))=\{v\in\mathbb{R}^{m_{I}}\,|\,v_{i}\leq 0,i\in I^{g}\}$ ,

[TABLE]

and consequently $T^{\rm lin}_{\Omega_{C}}(\bar{x})$ is the collection of all $u\in\mathbb{R}^{n}$ fulfilling the system

[TABLE]

Further we have

[TABLE]

$N_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))=\widehat{N}_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))$ for $i\in I^{+0}\cup I^{0+}$ and $N_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))=(\mathbb{R}_{+}\times\mathbb{R}_{+})\cup(\{0\}\times\mathbb{R})\cup(\mathbb{R}\times\{0\})$ for $i\in I^{00}$ ; cf. [5, 6, 29].
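The difference between the two cones at a biactive index can be illustrated by a minimal sketch. It assumes the regular normal cone at such an index is the polar $\mathbb{R}_{+}\times\mathbb{R}_{+}$ of $D_{C}=(\{0\}\times\mathbb{R}_{-})\cup(\mathbb{R}_{-}\times\{0\})$ ; the function names and the tolerance `tol` are hypothetical and not part of the paper.

```python
def in_regular_normal_cone_DC(a, b, tol=1e-12):
    """Membership of (a, b) in R_+ x R_+, the polar of D_C at the origin."""
    return a >= -tol and b >= -tol


def in_limiting_normal_cone_DC(a, b, tol=1e-12):
    """Membership of (a, b) in (R_+ x R_+) U ({0} x R) U (R x {0})."""
    return (a >= -tol and b >= -tol) or abs(a) <= tol or abs(b) <= tol


# A pair with one zero and one negative component is a limiting normal
# but not a regular normal at a biactive index.
print(in_regular_normal_cone_DC(0.0, -2.0))
print(in_limiting_normal_cone_DC(0.0, -2.0))
```

Pairs of this kind are exactly the ones that M-stationarity admits and S-stationarity excludes.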

Note that GACQ for MPCC is equivalent to MPEC-ACQ as introduced by Flegel and Kanzow [4]. Similarly, GGCQ for MPCC is equivalent to MPEC-GCQ [5].

In order to apply Theorem 2 and the concept of ${\cal Q}$ -stationarity we define for every partition $(\beta_{1},\beta_{2})$ of the biactive index set $I^{00}$ the convex polyhedral cone

[TABLE]

where $\tau^{\beta_{1},\beta_{2}}_{i}:=T_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))$ if $i\in I^{0+}\cup I^{+0}$ and

[TABLE]

Lemma 3.

For every partition $(\beta_{1},\beta_{2})\in{\cal P}(I^{00})$ the pair $(Q_{1},Q_{2})=(Q^{\beta_{1},\beta_{2}}_{CC},Q^{\beta_{2},\beta_{1}}_{CC})$ consists of two closed convex cones fulfilling (21) and (20).

Proof.

It is easy to see that both cones $Q_{j}$ , $j=1,2$ , are closed convex polyhedral cones fulfilling $Q_{j}\subset T_{D}(F(\bar{x}))$ , and by using Proposition 1 we conclude that $\big{(}\nabla F(\bar{x})^{-1}Q_{j}\big{)}^{\circ}=\nabla F(\bar{x})^{T}(Q_{j})^{\circ}$ . It remains to show that $(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}\cap(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ}=\widehat{N}_{D}(F(\bar{x}))$ . Since for every $i\in I^{00}=\beta_{1}\cup\beta_{2}$ we have $\tau^{\beta_{1},\beta_{2}}_{i}\cup\tau^{\beta_{2},\beta_{1}}_{i}=(\{0\}\times\mathbb{R}_{-})\cup(\mathbb{R}_{-}\times\{0\})=D_{C}=T_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))$ and for every $i\in I^{0+}\cup I^{+0}$ we have $\tau^{\beta_{1},\beta_{2}}_{i}\cup\tau^{\beta_{2},\beta_{1}}_{i}=T_{D_{C}}(-G_{i}(\bar{x}),-H_{i}(\bar{x}))$ by the definition, we obtain from (8) that

[TABLE]

and the lemma is proved. ∎

It is easy to see that $T_{D}(F(\bar{x}))$ is the union taken over all partitions $(\beta_{1},\beta_{2})\in{\cal P}(I^{00})$ of the cones $Q^{\beta_{1},\beta_{2}}_{CC}$ and therefore $\widehat{N}_{D}(F(\bar{x}))=\bigcap_{(\beta_{1},\beta_{2})\in{\cal P}(I^{00})}\big{(}Q_{CC}^{\beta_{1},\beta_{2}}\big{)}^{\circ}$ . We have shown in Lemma 3 that this intersection of $2^{|I^{00}|}$ many polar cones can be replaced by the intersection of two polar cones $(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}\cap(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ}$ . Since

[TABLE]

and under the assumption of GGCQ

[TABLE]

we expect that the replacement of the intersection of the $2^{|I^{00}|}$ many cones $\nabla F(\bar{x})^{T}\big{(}Q_{CC}^{\beta_{1},\beta_{2}}\big{)}^{\circ}$ by the intersection $(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}\cap(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ}$ of two cones can result in a tight inclusion which can be even exact under some reasonable assumptions.
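To make the count of $2^{|I^{00}|}$ partitions concrete, the following minimal sketch enumerates all ordered pairs $(\beta_{1},\beta_{2})$ for a small biactive index set. The helper name `partitions` and the sample set `{1, 2, 3}` are hypothetical and chosen only for illustration.

```python
from itertools import product


def partitions(biactive):
    """Yield every ordered pair (beta1, beta2) of disjoint sets whose union is biactive."""
    idx = sorted(biactive)
    for mask in product((True, False), repeat=len(idx)):
        beta1 = frozenset(i for i, m in zip(idx, mask) if m)
        beta2 = frozenset(idx) - beta1
        yield beta1, beta2


all_parts = list(partitions({1, 2, 3}))
print(len(all_parts))  # number of partitions, 2**3

# Swapping the two sets of a partition again gives a partition, so every
# biactive index lies in the first set of exactly one member of the pair.
beta1, beta2 = all_parts[1]
print((beta2, beta1) in all_parts)
```

The swapped pair is what Lemma 3 uses: between $(\beta_{1},\beta_{2})$ and $(\beta_{2},\beta_{1})$ every biactive index is treated once in each of its two branches.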

Note that

[TABLE]

In the sequel we will use the sets of multipliers

[TABLE]

and

[TABLE]

Note that

[TABLE]

and

[TABLE]

We now apply Theorem 2 to estimate the regular normal cone $\widehat{N}_{\Omega_{C}}(\bar{x})$ of the MPCC (4).

Proposition 4.

Let $\bar{x}$ belong to the feasible region $\Omega_{C}$ of the MPCC (4) and assume that GGCQ is fulfilled at $\bar{x}$ . Then for every partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ we have

[TABLE]

where

[TABLE]

Proof.

We apply (18) with $(Q_{1},Q_{2})=(Q^{\beta_{1},\beta_{2}}_{CC},Q^{\beta_{2},\beta_{1}}_{CC})$ . All we have to show is the equation (38). Obviously we have $(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}=\mathbb{R}^{m_{E}}\times N_{\mathbb{R}^{m_{I}}_{-}}(g(\bar{x}))\times\prod_{i=1}^{m_{C}}(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}$ and the set $(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ})$ consists of all $\lambda=(\lambda^{h},\lambda^{g},\lambda^{G},\lambda^{H})$ such that there exists $\eta=(\eta^{h},\eta^{g},\eta^{G},\eta^{H})\in(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ}$ and some $\mu=(\mu^{h},\mu^{g},\mu^{G},\mu^{H})\in\ker\nabla F(\bar{x})^{T}$ such that

[TABLE]

We proceed with an analysis of the different cases:

1. Equality constraints: We obtain $\lambda^{h}=\eta^{h}+\mu^{h}\in\mathbb{R}^{m_{E}}$ , $\mu^{h}\in\mathbb{R}^{m_{E}}$ , $\eta^{h}\in\mathbb{R}^{m_{E}}$ , i.e., $\lambda^{h},\mu^{h}\in\mathbb{R}^{m_{E}}$ .

2. Inequality constraints: For $i\in I^{g}$ we have $\lambda^{g}_{i}=\eta^{g}_{i}+\mu^{g}_{i}\geq 0$ , $\eta^{g}_{i}\geq 0$ or equivalently $\lambda^{g}_{i}\geq\max\{0,\mu^{g}_{i}\}$ , whereas for $i\in\{1,\ldots,m_{I}\}\setminus I^{g}$ we obtain $\lambda^{g}_{i}=\eta^{g}_{i}=0$ which yields $\mu^{g}_{i}=0$ .

3. $i\in I^{0+}$ : Since $(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}=(\tau^{\beta_{2},\beta_{1}}_{i})^{\circ}=\mathbb{R}\times\{0\}$ , we obtain $\lambda^{H}_{i}=\eta^{H}_{i}=0$ and consequently also $\mu^{H}_{i}=0$ .

4. $i\in I^{+0}$ : Similarly as in the previous case we obtain $\lambda^{G}_{i}=\mu^{G}_{i}=0$ .

5. $i\in\beta_{1}$ : Since $(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}=\mathbb{R}\times\mathbb{R}_{+}$ , $(\tau^{\beta_{2},\beta_{1}}_{i})^{\circ}=\mathbb{R}_{+}\times\mathbb{R}$ we have

[TABLE]

and $(\eta^{G}_{i},\eta^{H}_{i})\in\mathbb{R}_{+}\times\mathbb{R}$ . This can be written equivalently as $\lambda^{G}_{i}\geq\mu^{G}_{i}$ , $\lambda^{H}_{i}\geq 0$ .

6. $i\in\beta_{2}$ : Similarly as in the previous case we obtain $\lambda^{G}_{i}\geq 0$ , $\lambda^{H}_{i}\geq\mu^{H}_{i}$ .

We see that $\tilde{N}^{\beta_{1},\beta_{2}}_{CC}=(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+(Q^{\beta_{2},\beta_{1}}_{CC})^{\circ})$ and the claimed result follows from (18). ∎
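The componentwise conditions collected in the six cases can be transcribed into a small check. This is only a sketch under stated assumptions: the function name, the dictionary layout of `lam` and `mu` (keys `"g"`, `"G"`, `"H"` mapping an index to a value) and the tolerance are hypothetical, the equality multipliers are omitted because they are unrestricted, and the requirement $\mu\in\ker\nabla F(\bar{x})^{T}$ is not tested here and would have to be verified separately with the Jacobian.

```python
def satisfies_case_conditions(lam, mu, I_g, I_0p, I_p0, beta1, beta2, tol=1e-10):
    """Check the sign conditions on (lam, mu) derived in the case analysis."""
    # Inequality constraints: lam_g >= max(0, mu_g) if active, else both vanish.
    for i, lam_g in lam["g"].items():
        mu_g = mu["g"][i]
        if i in I_g:
            if lam_g < max(0.0, mu_g) - tol:
                return False
        elif abs(lam_g) > tol or abs(mu_g) > tol:
            return False
    # i in I^{0+}: the H-components of lam and mu vanish.
    for i in I_0p:
        if abs(lam["H"][i]) > tol or abs(mu["H"][i]) > tol:
            return False
    # i in I^{+0}: the G-components of lam and mu vanish.
    for i in I_p0:
        if abs(lam["G"][i]) > tol or abs(mu["G"][i]) > tol:
            return False
    # i in beta1: lam_G >= mu_G and lam_H >= 0.
    for i in beta1:
        if lam["G"][i] < mu["G"][i] - tol or lam["H"][i] < -tol:
            return False
    # i in beta2: lam_G >= 0 and lam_H >= mu_H.
    for i in beta2:
        if lam["G"][i] < -tol or lam["H"][i] < mu["H"][i] - tol:
            return False
    return True


# Illustrative data (not taken from the paper): one active inequality, one biactive index.
lam = {"g": {1: 1.0}, "G": {1: -0.5}, "H": {1: 0.0}}
mu = {"g": {1: 0.5}, "G": {1: -1.0}, "H": {1: 3.0}}
print(satisfies_case_conditions(lam, mu, {1}, set(), set(), {1}, set()))
print(satisfies_case_conditions(lam, mu, {1}, set(), set(), set(), {1}))
```

The same pair $(\lambda,\mu)$ passes for one partition and fails for the swapped one, which shows how the admissible multipliers depend on the choice of $(\beta_{1},\beta_{2})$ .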

Theorem 5.

Let $\bar{x}$ belong to the feasible region $\Omega_{C}$ of the MPCC (4) and assume that GGCQ is fulfilled at $\bar{x}$ . Further assume that there is some partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ such that for every $\mu\in{\cal N}_{CC}$ we have

[TABLE]

Then

[TABLE]

Proof.

Due to (38), (33) and Theorem 2 we only have to show that (19), i.e.

[TABLE]

holds. Consider $x^{\ast}\in M^{\beta_{1},\beta_{2}}_{CC}$ . Then we have the representation

[TABLE]

with $(\lambda^{h},\lambda^{g},\lambda^{G},\lambda^{H})\in\tilde{N}^{\beta_{1},\beta_{2}}_{CC}$ . If $\lambda^{G}_{i}\geq 0$ for every $i\in\beta_{1}$ and $\lambda^{H}_{i}\geq 0$ for every $i\in\beta_{2}$ , then the claimed inclusion $x^{\ast}\in\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))$ follows from (31). Otherwise, either there is some $j\in\beta_{1}$ such that $\lambda^{G}_{j}<0$ or some $j\in\beta_{2}$ such that $\lambda^{H}_{j}<0$ . We consider first the case when $\lambda^{G}_{j}<0$ for some $j\in\beta_{1}$ . Take the element $(\mu^{h},\mu^{g},\mu^{G},\mu^{H})\in{\cal N}_{CC}$ associated with $(\lambda^{h},\lambda^{g},\lambda^{G},\lambda^{H})$ according to (38) and set $(\tilde{\lambda}^{h},\tilde{\lambda}^{g},\tilde{\lambda}^{G},\tilde{\lambda}^{H}):=(\lambda^{h}-\mu^{h},\lambda^{g}-\mu^{g},\lambda^{G}-\mu^{G},\lambda^{H}-\mu^{H})$ . Then

[TABLE]

and

[TABLE]

by virtue of (38). Further, since $0>\lambda^{G}_{j}\geq\mu^{G}_{j}$ we deduce by the assumptions of the theorem that $\mu^{G}_{i}\leq 0\ \forall i\in\beta_{2}$ , $\mu^{H}_{i}\leq 0\ \forall i\in\beta_{1}$ and consequently $\tilde{\lambda}^{G}_{i}=\lambda^{G}_{i}-\mu^{G}_{i}\geq\lambda^{G}_{i}\geq 0\ \forall i\in\beta_{2}$ , $\tilde{\lambda}^{H}_{i}=\lambda^{H}_{i}-\mu^{H}_{i}\geq\lambda^{H}_{i}\geq 0\ \forall i\in\beta_{1}$ . Therefore $\tilde{\lambda}^{G}_{i}\geq 0$ and $\tilde{\lambda}^{H}_{i}\geq 0$ hold for every $i\in\beta_{1}\cup\beta_{2}$ and $x^{\ast}\in\nabla F(\bar{x})^{T}\widehat{N}_{D}(F(\bar{x}))$ follows. Similar arguments can be applied in the alternative situation when there exists some $j\in\beta_{2}$ with $\lambda^{H}_{j}<0$ . ∎

Let us compare our approach with the results of Pang and Fukushima [24]. In [24] the authors try to detect certain redundancies in the description of the linearized tangent cone and then analyze an equivalent representation of the linearized cone. In this paper we treat only so-called (non)singular inequalities; a more general approach goes beyond the scope of this work.

Given a linear system

[TABLE]

an inequality $a_{i}x\leq b_{i}$ is said to be nonsingular if there exists a feasible solution of this system which satisfies this inequality strictly. Here $a_{i}$ denotes the $i$-th row of the matrix $A$ . An inequality is called singular if it is not nonsingular.

Let us denote by $T^{\rm lin}_{\Omega_{C},R}(\bar{x})$ the set of all $u$ fulfilling the linear system

[TABLE]

which is obtained from (29) by relaxing the complementarity condition. Obviously we have $T^{\rm lin}_{\Omega_{C}}(\bar{x})\subset T^{\rm lin}_{\Omega_{C},R}(\bar{x})$ .

Now let $\beta^{G}$ denote the set consisting of all indices $i\in I^{00}$ such that the inequality $-\nabla G_{i}(\bar{x})u\leq 0$ is nonsingular in the system (39). Similarly, we denote by $\beta^{H}$ the nonsingular set pertaining to the inequalities $-\nabla H_{i}(\bar{x})u\leq 0$ . For notational convenience we introduce also the set $\beta^{GH}:=\beta^{G}\cap\beta^{H}$ .

Using the set $\beta^{GH}$ we arrive at the following description of the linearized cone:

[TABLE]

This can be seen from the fact that every $u$ belonging to the set on the right hand side of (40) also belongs to $T^{\rm lin}_{\Omega_{C},R}(\bar{x})$ and therefore for every $i\in I^{00}\setminus\beta^{GH}=(I^{00}\setminus\beta^{G})\cup(I^{00}\setminus\beta^{H})$ either the inequality $-\nabla G_{i}(\bar{x})u\leq 0$ or the inequality $-\nabla H_{i}(\bar{x})u\leq 0$ is singular and consequently fulfilled with equality, implying that complementarity holds. Now the representation (40) of the linearized cone has the same structure as the original representation (29) and we can apply Theorem 5 to (40) in order to obtain the following corollary.

Corollary 5.

Let $\bar{x}$ belong to the feasible region $\Omega_{C}$ of the MPCC (4) and assume that GGCQ is fulfilled at $\bar{x}$ . Further assume that there is some partition $(\beta^{GH}_{1},\beta^{GH}_{2})$ of the index set $\beta^{GH}$ such that for every $\mu\in{\cal N}_{CC}$ there holds

[TABLE]

Then

[TABLE]

Proof.

The representation (40) has the form $T^{\rm lin}_{\Omega_{C}}(\bar{x})=\{u\in\mathbb{R}^{n}\,|\,\nabla F(\bar{x})u\in T^{GH}\}$ with

[TABLE]

and from Theorem 5 we obtain $\widehat{N}_{\Omega_{C}}(\bar{x})=\nabla F(\bar{x})^{T}(T^{GH})^{\circ}$ . It is easy to see that $(T^{GH})^{\circ}=(T_{D}(F(\bar{x})))^{\circ}=\widehat{N}_{D}(F(\bar{x}))$ and thus the assertion follows. ∎

The statement of Corollary 5 was shown in [24, Theorem 2] under the assumption (A3), which reads in our notation that there exists a partition $(\beta^{GH}_{1},\beta^{GH}_{2})$ of the index set $\beta^{GH}$ such that for every $\mu\in{\cal N}_{CC}$ one has

[TABLE]

Since $\beta^{GH}_{2}=\beta^{GH}\setminus\beta^{GH}_{1}\subset\beta^{G}\setminus\beta^{GH}_{1}$ and $\beta^{GH}_{1}\subset\beta^{H}\setminus\beta^{GH}_{2}$ , our assumption (41) is not stronger than assumption (A3) used by Pang and Fukushima [24]. In case when $\beta^{G}\not=\beta^{GH}$ or $\beta^{H}\not=\beta^{GH}$ our assumption (41) is actually weaker, as the following example demonstrates.

Example 1.

Consider the system

[TABLE]

at $\bar{x}=(0,0,0,0)$ . Since all constraint functions are linear, GACQ is fulfilled, cf. also [4, Theorem 3.2], and consequently GGCQ holds as well. It is easy to see that $\beta^{G}=\{1,2\}$ and $\beta^{GH}=\beta^{H}=\{2\}$ and therefore condition (41) amounts to

[TABLE]

Since (43) is equivalent to $\mu^{H}_{1}=\mu^{g}_{2}$ , $\mu^{H}_{2}=\mu^{G}_{2}=-\mu^{G}_{1}=-\mu^{g}_{1}$ , (47) holds with any of the two partitions $\beta^{GH}_{1}=\{2\},\beta^{GH}_{2}=\emptyset$ and $\beta^{GH}_{1}=\emptyset,\beta^{GH}_{2}=\{2\}$ and therefore Corollary 5 is applicable. On the other hand, condition (42) reads as

[TABLE]

Taking $(\mu^{g}_{1},\mu^{g}_{2},\mu^{G}_{1},\mu^{G}_{2},\mu^{H}_{1},\mu^{H}_{2})=(1,1,1,-1,1,-1)$ we obtain that for the partition $\beta^{GH}_{1}=\emptyset,\beta^{GH}_{2}=\{2\}$ the condition $\mu_{1}^{G}\mu_{2}^{H}\geq 0$ is violated, whereas in case when $\beta^{GH}_{1}=\{2\},\beta^{GH}_{2}=\emptyset$ the inequality $\mu_{2}^{G}\mu_{1}^{G}\geq 0$ fails to hold. Thus [24, Assumption (A3)] does not hold for this example and therefore the assumption used in our Corollary 5 is strictly weaker.
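The arithmetic behind this counterexample can be replayed with a few lines of code. The sketch below builds the multiplier from the relations $\mu^{H}_{1}=\mu^{g}_{2}$ , $\mu^{H}_{2}=\mu^{G}_{2}=-\mu^{G}_{1}=-\mu^{g}_{1}$ stated in the example and evaluates the two products named in the text; the function name and the dictionary keys are hypothetical.

```python
def example1_multiplier(mu_g1, mu_g2):
    """Build the multiplier determined by the two free parameters mu_g1 and mu_g2."""
    return {
        "g1": mu_g1,
        "g2": mu_g2,
        "G1": mu_g1,    # mu^G_1 = mu^g_1
        "G2": -mu_g1,   # mu^G_2 = -mu^G_1
        "H1": mu_g2,    # mu^H_1 = mu^g_2
        "H2": -mu_g1,   # mu^H_2 = mu^G_2
    }


mu = example1_multiplier(1, 1)
print([mu[k] for k in ("g1", "g2", "G1", "G2", "H1", "H2")])

# Both products named in the text are negative for this multiplier.
print(mu["G1"] * mu["H2"])
print(mu["G2"] * mu["G1"])
```

Since both products are negative, neither of the two partitions of $\beta^{GH}=\{2\}$ can satisfy the sign requirements quoted from (A3).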

We introduce now the following stationarity concepts for MPCC which correspond to Definition 4 with ${\cal Q}={\cal Q}_{CC}$ , where

[TABLE]

Note that there is a one-to-one correspondence between the sets $(Q_{1},Q_{2})\in{\cal Q}_{CC}$ and partitions $(\beta_{1},\beta_{2})$ of the biactive index set $I^{00}$ .

Definition 5.

Let $\bar{x}\in\Omega_{C}$ .

1. We say that $\bar{x}$ is ${\cal Q}$ -stationary for the MPCC (4) with respect to the partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ if

[TABLE]

where $M^{\beta_{1},\beta_{2}}_{CC}$ is given by (33).

2. We say that $\bar{x}$ is ${\cal Q}$ -stationary for the MPCC (4) if it is ${\cal Q}$ -stationary with respect to some partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ .

3. We say that $\bar{x}$ is ${\cal Q}_{M}$ -stationary for the MPCC (4) if there is some partition $(\beta_{1},\beta_{2})$ of $I^{00}$ such that

[TABLE]

Theorem 6.

Assume that GGCQ is fulfilled at the point $\bar{x}\in\Omega_{C}$ . If $\bar{x}$ is B-stationary, then $\bar{x}$ is ${\cal Q}$ -stationary for the MPCC (4) with respect to every partition $(\beta_{1},\beta_{2})$ of $I^{00}$ and it is also ${\cal Q}_{M}$ -stationary. Conversely, if $\bar{x}$ is ${\cal Q}$ -stationary with respect to a partition $(\beta_{1},\beta_{2})$ of $I^{00}$ which also fulfills the assumptions of Theorem 5, then $\bar{x}$ is S-stationary and consequently B-stationary.

Proof.

In view of the definitions of B-stationarity and S-stationarity together with Proposition 4 and Theorem 5, it remains only to show the assertion about ${\cal Q}_{M}$ -stationarity. This follows easily from Theorem 3(3.) because $T_{D}(F(\bar{x}))=\bigcup_{(\beta_{1},\beta_{2})\in{\cal P}(I^{00})}Q^{\beta_{1},\beta_{2}}_{CC}$ is the union of finitely many convex polyhedral cones generating the collection ${\cal Q}$ . ∎

Remark 3.

Given a multiplier $\lambda\in N_{D}(F(\bar{x}))$ verifying the M-stationarity condition $0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}\lambda$ we can use the partition $(\beta_{1},\beta_{2})\in{\cal P}(I^{00})$ defined by

[TABLE]

for testing $\bar{x}$ on ${\cal Q}_{M}$ -stationarity, because this choice ensures $\lambda\in\big{(}Q^{\beta_{1},\beta_{2}}_{CC}\big{)}^{\circ}$ . The computation of such a multiplier $\lambda$ can be done by means of the algorithm presented in the proof of [8, Theorem 4.3].

We see that ${\cal Q}$ -stationarity is a first-order necessary condition for $\bar{x}$ being a local minimizer, provided GGCQ is fulfilled, which is to be considered a very weak constraint qualification. In order to verify ${\cal Q}$ -stationarity, only a system of linear equalities and linear inequalities has to be solved; the main difference from the usual first-order optimality conditions is that a second multiplier $\mu$ is involved.

Note that postulating GGCQ in our problem setting is equivalent to MPEC-GCQ as given in [5]. It was shown in [5] that under MPEC-GCQ any B-stationary point of MPCC is M-stationary. Theorem 6 improves this result by stating that even ${\cal Q}_{M}$ -stationarity holds.

Let us now turn our attention to the case when the gradients of the constraints active at the point $\bar{x}$ ,

[TABLE]

are linearly independent. This constraint qualification is usually named MPEC-LICQ in the literature. Then we obviously have ${\cal N}_{CC}=\{0\}$ and therefore the assumptions of Theorem 5 hold. Hence, under MPEC-LICQ ${\cal Q}$ -stationarity automatically implies S-stationarity and B-stationarity. This is remarkable because M-stationarity does not have this property: under MPEC-LICQ an M-stationary point is neither S-stationary nor B-stationary in general. On the other hand, when MPEC-LICQ does not hold, there also exist examples where a ${\cal Q}$ -stationary point is not M-stationary, so neither M-stationarity implies ${\cal Q}$ -stationarity nor vice versa. The following example shows, however, that ${\cal Q}_{M}$ -stationarity is strictly stronger than M-stationarity.

Example 2.

(cf. [8, Example 3]) Consider the MPCC

[TABLE]

Then $\bar{x}=(0,0,0)$ is not a local minimizer because for every $\alpha>0$ the point $x^{\alpha}=(0,\alpha,\alpha)$ is feasible and $f(x^{\alpha})=-\alpha<0=f(\bar{x})$ . GACQ is fulfilled because all constraints are linear and the linearized cone amounts to

[TABLE]

Straightforward calculations yield that $\bar{x}$ is M-stationary and $\lambda=(\lambda^{g}_{1},\lambda^{g}_{2},\lambda^{G}_{1},\lambda^{H}_{1})=(1,3,0,-2)$ is the unique multiplier fulfilling the M-stationarity conditions. However, we will now show that $\bar{x}$ is not ${\cal Q}_{M}$ -stationary. Assuming that $\bar{x}$ is ${\cal Q}_{M}$ -stationary, by taking $\beta_{1}=\emptyset$ , $\beta_{2}=\{1\}$ , there would exist some $\mu=(\mu^{g}_{1},\mu^{g}_{2},\mu^{G}_{1},\mu^{H}_{1})$ verifying

[TABLE]

But a solution of this system must fulfill

[TABLE]

which is obviously not possible. On the other hand, if we take $\beta_{1}=\{1\}$ , $\beta_{2}=\emptyset$ then $\lambda\not\in(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}$ . Hence $\bar{x}$ is not ${\cal Q}_{M}$ -stationary and we have demonstrated that ${\cal Q}_{M}$ -stationarity is a stronger property than M-stationarity.
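The claim $\lambda\not\in(Q^{\beta_{1},\beta_{2}}_{CC})^{\circ}$ for $\beta_{1}=\{1\}$ can be checked on the complementarity block alone. The sketch below uses the polars from the proof of Proposition 4, namely $\mathbb{R}\times\mathbb{R}_{+}$ for a biactive index in $\beta_{1}$ and $\mathbb{R}_{+}\times\mathbb{R}$ for one in $\beta_{2}$ ; the function name and the tolerance are hypothetical, and the inequality multipliers $(\lambda^{g}_{1},\lambda^{g}_{2})=(1,3)$ are not examined.

```python
def in_polar_tau(lam_G, lam_H, in_beta1, tol=1e-12):
    """Polar of tau_i at a biactive index: R x R_+ if i is in beta1, R_+ x R if i is in beta2."""
    return lam_H >= -tol if in_beta1 else lam_G >= -tol


# Complementarity components of the unique M-stationarity multiplier of the example.
lam_G1, lam_H1 = 0.0, -2.0

print(in_polar_tau(lam_G1, lam_H1, in_beta1=True))   # partition ({1}, {})
print(in_polar_tau(lam_G1, lam_H1, in_beta1=False))  # partition ({}, {1})
```

Only the partition $\beta_{1}=\emptyset$ , $\beta_{2}=\{1\}$ passes this test, which is why the example has to rule out that case by a separate argument.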

5 Application to MPVC

In this section we consider a mathematical program with vanishing constraints (MPVC) of the form

[TABLE]

where $f:\mathbb{R}^{n}\to\mathbb{R}$ , $h:\mathbb{R}^{n}\to\mathbb{R}^{m_{E}}$ , $g:\mathbb{R}^{n}\to\mathbb{R}^{m_{I}}$ , $G:\mathbb{R}^{n}\to\mathbb{R}^{m_{V}}$ and $H:\mathbb{R}^{n}\to\mathbb{R}^{m_{V}}$ are assumed to be at least continuously differentiable. To transform the constraints into the format (1) we use

[TABLE]

where

[TABLE]

Now we denote the feasible region of (53) by $\Omega_{V}$ and we introduce the following index sets of constraints active at a feasible point $\bar{x}\in\Omega_{V}$ :

[TABLE]

Straightforward calculations yield that

[TABLE]

with $T_{\mathbb{R}^{m_{I}}_{-}}(g(\bar{x}))=\{v\in\mathbb{R}^{m_{I}}\,|\,v_{i}\leq 0,i\in I^{g}\}$ ,

[TABLE]

and consequently, $T^{\rm lin}_{\Omega_{V}}(\bar{x})$ is the collection of all $u\in\mathbb{R}^{n}$ fulfilling the system

[TABLE]

Further note that $N_{D_{V}}(-H_{i}(\bar{x}),G_{i}(\bar{x}))=\widehat{N}_{D_{V}}(-H_{i}(\bar{x}),G_{i}(\bar{x}))$ , $i\not\in I^{00}$ and $N_{D_{V}}(-H_{i}(\bar{x}),G_{i}(\bar{x}))=(\mathbb{R}\times\{0\})\cup(\{0\}\times\mathbb{R}_{+})$ , $i\in I^{00}$ .
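As a minimal sketch, the limiting normal cone at a biactive index can be compared with the regular one. The regular normal cone is taken here to be $\mathbb{R}_{+}\times\{0\}$ , the intersection of the two polars $\mathbb{R}\times\{0\}$ and $\mathbb{R}_{+}\times\mathbb{R}_{+}$ that appear in the proof of Proposition 5; the function names and the tolerance are hypothetical.

```python
def in_regular_normal_cone_DV(p, q, tol=1e-12):
    """Membership of (p, q) in R_+ x {0}."""
    return p >= -tol and abs(q) <= tol


def in_limiting_normal_cone_DV(p, q, tol=1e-12):
    """Membership of (p, q) in (R x {0}) U ({0} x R_+)."""
    return abs(q) <= tol or (abs(p) <= tol and q >= -tol)


# (-1, 0) and (0, 1) are limiting normals but not regular normals.
for pair in [(-1.0, 0.0), (0.0, 1.0), (2.0, 0.0)]:
    print(pair, in_regular_normal_cone_DV(*pair), in_limiting_normal_cone_DV(*pair))
```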

Similar to MPCC we define for every partition $(\beta_{1},\beta_{2})$ of the set $I^{00}$ the cone

[TABLE]

where $\tau^{\beta_{1},\beta_{2}}_{i}:=T_{D_{V}}(-H_{i}(\bar{x}),G_{i}(\bar{x}))$ if $i\not\in I^{00}$ and

[TABLE]

Lemma 4.

For every partition $(\beta_{1},\beta_{2})\in{\cal P}(I^{00})$ the pair $(Q_{1},Q_{2})=(Q^{\beta_{1},\beta_{2}}_{VC},Q^{\beta_{2},\beta_{1}}_{VC})$ consists of two closed convex cones fulfilling (21) and (20).

Proof.

The proof follows the same lines as the proof of Lemma 3 and is therefore omitted. ∎

Similar to the case of MPCC we have

[TABLE]

Consider the following two sets of multipliers,

[TABLE]

and

[TABLE]

Note that

[TABLE]

and

[TABLE]

Proposition 5.

Let $\bar{x}$ belong to the feasible region $\Omega_{V}$ of the MPVC (53) and assume that GGCQ is fulfilled at $\bar{x}$ . Then for every partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ we have

[TABLE]

where

[TABLE]

Proof.

We can proceed similarly to the proof of Proposition 4. We have $(Q^{\beta_{1},\beta_{2}}_{VC})^{\circ}=\mathbb{R}^{m_{E}}\times N_{\mathbb{R}^{m_{I}}_{-}}(g(\bar{x}))\times\prod_{i=1}^{m_{V}}(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}$ and the set $\tilde{N}^{\beta_{1},\beta_{2}}_{VC}=(Q^{\beta_{1},\beta_{2}}_{VC})^{\circ}\cap(\ker\nabla F(\bar{x})^{T}+(Q^{\beta_{2},\beta_{1}}_{VC})^{\circ})$ consists of all $\lambda=(\lambda^{h},\lambda^{g},\lambda^{H},\lambda^{G})$ such that there exists $\eta=(\eta^{h},\eta^{g},\eta^{H},\eta^{G})\in(Q^{\beta_{2},\beta_{1}}_{VC})^{\circ}$ and some $\mu=(\mu^{h},\mu^{g},\mu^{H},\mu^{G})\in\ker\nabla F(\bar{x})^{T}$ such that

[TABLE]

As in the proof of Proposition 4, this yields

[TABLE]

Now consider $i\in\beta_{1}$ . Then $(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}=\mathbb{R}\times\{0\}$ and $(\tau^{\beta_{2},\beta_{1}}_{i})^{\circ}=\mathbb{R}_{+}\times\mathbb{R}_{+}$ . Hence

[TABLE]

and $(\eta^{H}_{i},\eta^{G}_{i})\in\mathbb{R}_{+}\times\mathbb{R}_{+}$ , or equivalently

[TABLE]

In case that $i\in\beta_{2}$ we have $(\tau^{\beta_{1},\beta_{2}}_{i})^{\circ}=\mathbb{R}_{+}\times\mathbb{R}_{+}$ and $(\tau^{\beta_{2},\beta_{1}}_{i})^{\circ}=\mathbb{R}\times\{0\}$ ,

[TABLE]

and $(\eta^{H}_{i},\eta^{G}_{i})\in\mathbb{R}\times\{0\}$ , which is equivalent to

[TABLE]

These arguments show that $\tilde{N}^{\beta_{1},\beta_{2}}_{VC}$ has the claimed representation and the assertion follows from (18). ∎

In the following theorem we give a sufficient condition for equality in (5).

Theorem 7.

Let $\bar{x}$ belong to the feasible region $\Omega_{V}$ of the MPVC (53) and assume that GGCQ is fulfilled at $\bar{x}$ . Further assume that there is a partition $(\beta_{1},\beta_{2})$ of $I^{00}$ such that

[TABLE]

Then

[TABLE]

Proof.

Under the assumption of the theorem we conclude that

[TABLE]

Now the claimed result follows from Theorem 2 together with Proposition 5 by taking $(Q_{1},Q_{2})=(Q^{\beta_{1},\beta_{2}}_{VC},Q^{\beta_{2},\beta_{1}}_{VC})$ . ∎

Next we establish an equivalent formulation of condition (58).

Lemma 5.

Let $(\beta_{1},\beta_{2})$ be a partition of $I^{00}$ . Then the following statements are equivalent:

(i) Condition (58) is fulfilled.

(ii) For every $j\in\beta_{1}$ there exists some $z^{j}$ such that

[TABLE]

and there is some $\bar{z}$ such that

[TABLE]

Proof.

Condition (58) is fulfilled if and only if for every $j\in\beta_{1}$ the linear program

[TABLE]

has a solution and the linear program

[TABLE]

has a solution. Since the feasible regions of these linear programs are not empty, by duality theory of linear programming this is equivalent to the statement that the feasible regions of the corresponding dual programs are not empty. Since the feasible regions of the dual programs to (61) and (62), respectively, are given by (59) and (60), respectively, the two statements (i) and (ii) are equivalent. ∎

The characterization of condition (58) by Lemma 5 resembles the well-known Mangasarian-Fromovitz constraint qualification of nonlinear programming. It appears to be not very restrictive, e.g. in case when $\beta_{1}=\emptyset$ , $\beta_{2}=I^{00}$ condition (58) is fulfilled when the system

[TABLE]

has a solution. Hence we think that Theorem 7 is likely to be applicable in many situations.

At the end of this section we consider ${\cal Q}$ -stationarity for MPVC with respect to ${\cal Q}={\cal Q}_{VC}$ , where

[TABLE]

Definition 6.

Let $\bar{x}\in\Omega_{V}$ .

1. We say that $\bar{x}$ is ${\cal Q}$ -stationary for the MPVC (53) with respect to the partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ if

[TABLE]

where $M^{\beta_{1},\beta_{2}}_{VC}$ is given by (5).

2. We say that $\bar{x}$ is ${\cal Q}$ -stationary for the MPVC (53) if it is ${\cal Q}$ -stationary with respect to some partition $(\beta_{1},\beta_{2})$ of the index set $I^{00}$ .

3. We say that $\bar{x}$ is ${\cal Q}_{M}$ -stationary for the MPVC (53) if there is some partition $(\beta_{1},\beta_{2})$ of $I^{00}$ such that

[TABLE]

It follows from the definition that

[TABLE]

Hence, if $\bar{x}$ is ${\cal Q}$ -stationary with respect to $(I^{00},\emptyset)$ , it is automatically ${\cal Q}_{M}$ -stationary and the following theorem follows from Proposition 5, Theorem 7 and Theorem 3(1.).

Theorem 8.

Assume that GGCQ is fulfilled at the point $\bar{x}\in\Omega_{V}$ . If $\bar{x}$ is B-stationary, then $\bar{x}$ is ${\cal Q}$ -stationary for the MPVC (53) with respect to every partition $(\beta_{1},\beta_{2})$ of $I^{00}$ and, in particular, it is ${\cal Q}$ -stationary with respect to the partition $(I^{00},\emptyset)$ , implying ${\cal Q}_{M}$ -stationarity. Conversely, if $\bar{x}$ is ${\cal Q}$ -stationary with respect to a partition $(\beta_{1},\beta_{2})$ of $I^{00}$ which also fulfills the assumptions of Theorem 7, then $\bar{x}$ is S-stationary and consequently B-stationary as well.

Further we have

[TABLE]

It was stated in [1, Theorem 4] that, under some weak constraint qualification, the condition $0\in\nabla f(\bar{x})+\nabla F(\bar{x})^{T}{\cal S}$ is a necessary condition for a local minimizer. Hence, if $\bar{x}$ is ${\cal Q}$ -stationary with respect to $(\emptyset,I^{00})$ , then it also fulfills the necessary conditions of [1, Theorem 5.3]. From Lemma 2 we obtain that $\bar{x}$ is ${\cal Q}$ -stationary with respect to $(\beta_{1},\beta_{2})$ if and only if it is ${\cal Q}$ -stationary with respect to $(\beta_{2},\beta_{1})$ . Hence we conclude that ${\cal Q}$ -stationarity with respect to $(I^{00},\emptyset)$ implies both ${\cal Q}_{M}$ -stationarity and the necessary optimality conditions of [1, Theorem 4].

Finally note that GGCQ for MPVC is equivalent to the condition MPVC-GCQ introduced in [17], where it is also shown in [17, Theorem 6.1.8] that under MPVC-GCQ any B-stationary point of MPVC is already M-stationary.

6 Application to generalized equations

Now we consider the problem

[TABLE]

where the mappings $f:\mathbb{R}^{n}\times\mathbb{R}^{m}\to\mathbb{R}$ , $G:\mathbb{R}^{n}\times\mathbb{R}^{m}\to\mathbb{R}^{m}$ are assumed to be continuously differentiable, $C$ is a closed subset of $\mathbb{R}^{n}$ and the set $\Gamma\subset\mathbb{R}^{m}$ is given by $C^{2}$ inequalities, i.e. $\Gamma:=\{y\in\mathbb{R}^{m}\,|\,g_{i}(y)\leq 0,i=1,\ldots,l\}$ , where $g:\mathbb{R}^{m}\to\mathbb{R}^{l}$ is twice continuously differentiable. The constraints fit into our general setting (1) with

[TABLE]

We denote the feasible region of (63) by $\Omega_{GE}$ . We consider a point $(\bar{x},\bar{y})\in\Omega_{GE}$ , fixed throughout this section, and we suppose the following assumptions:

Assumption 1.

1. The tangent cone $T_{C}(\bar{x})$ is convex and $T_{D}(F(\bar{x},\bar{y}))=T_{C}(\bar{x})\times T_{{\rm gph\,}\widehat{N}_{\Gamma}}(\bar{y},-G(\bar{x},\bar{y}))$ .

2. GGCQ holds at $(\bar{x},\bar{y})$ .

3. There is some $v\in\mathbb{R}^{m}$ such that

[TABLE]

i.e. MFCQ holds at $\bar{y}$ .

The first assumption is fulfilled, e.g., if $C$ is given by $C^{1}$ -inequalities $h_{i}(x)\leq 0$ , $i=1,\ldots,s$ , and MFCQ is fulfilled at $\bar{x}$ . Note that the third assumption, that MFCQ holds at $\bar{y}$ , is only made in order to ease the presentation. We claim that it can be weakened to metric regularity in the vicinity of $\bar{y}$ (cf. [10]) or to metric subregularity together with the bounded extreme point property as used in the recent paper [11].

In what follows we set $\bar{y}^{\ast}:=-G(\bar{x},\bar{y})$ and we define by

[TABLE]

the set of Lagrange multipliers associated with $(\bar{y},\bar{y}^{\ast})$ and by

[TABLE]

the critical cone to $\Gamma$ at $\bar{y}$ with respect to $\bar{y}^{\ast}$ . Thanks to the assumed MFCQ for the inequalities describing $\Gamma$ we have $T_{\Gamma}(\bar{y})=T_{\Gamma}^{\rm lin}(\bar{y})=\{v\,|\,\nabla g_{i}(\bar{y})v\leq 0,\ i\in\bar{\cal I}\}$ , $\widehat{N}_{\Gamma}(\bar{y})=\nabla g(\bar{y})^{T}\widehat{N}_{\mathbb{R}^{l}_{-}}(g(\bar{y}))$ and that $\bar{\Lambda}\not=\emptyset$ is compact. Note that we do not require that the gradients $\nabla g_{i}(\bar{y})$ , $i\in\bar{\cal I}$ are linearly independent and hence the set $\bar{\Lambda}$ can contain more than one element.

Given a multiplier $\lambda\in\widehat{N}_{\mathbb{R}^{l}_{-}}(g(\bar{y}))$ we introduce the index sets

[TABLE]

Apart from them we will be working with

[TABLE]

By convexity of the set $\bar{\Lambda}$ a multiplier $\lambda^{+}\in\bar{\Lambda}$ verifying $I^{+}(\lambda^{+})=\bar{I}^{+}$ exists. Further we have

[TABLE]

Indeed, if there existed numbers $\gamma_{i}$ , $i\in\bar{\cal I}$ , violating (65), then, by setting

[TABLE]

with $t>0$ sufficiently small, we would obtain the contradiction that $\bar{I}^{+}$ is strictly contained in $I^{+}(\tilde{\lambda})$ .

Note that $\bar{K}=\{v\,|\,\nabla g_{i}(\bar{y})v=0,i\in\bar{I}^{+},\ \nabla g_{i}(\bar{y})v\leq 0,i\in\bar{I}^{0}\}$ , cf. [10, Lemma 2] and therefore $\bar{K}^{\circ}=\{\sum_{i\in\bar{\cal I}}\mu_{i}\nabla g_{i}(\bar{y})\,|\,\mu_{i}\geq 0,i\in\bar{I}^{0}\}$ .

For a direction $v\in\bar{K}$ we further introduce the directional multiplier set

[TABLE]

Application of [27, Exercise 13.17, Corollary 13.43(a)] (see also [10, Theorem 1]) yields the representation

[TABLE]

A description of the regular normal cone $\widehat{N}_{{\rm gph\,}\widehat{N}_{\Gamma}}(\bar{y},\bar{y}^{\ast})$ can be found in [10, Theorem 2].

In general the structure of the tangent cone (66) is rather complicated. For instance, it is not known whether it can always be represented as the union of finitely many convex polyhedral cones or whether Assumption 1 is sufficient for M-stationarity of a B-stationary point.

In the following theorem we state a sufficient condition for the validity of the formula $\widehat{N}_{\Omega_{GE}}(\bar{x},\bar{y})=\nabla F(\bar{x},\bar{y})^{T}\widehat{N}_{D}(F(\bar{x},\bar{y}))$, i.e., for S-stationarity of $(\bar{x},\bar{y})$ provided it is B-stationary. We denote by ${\rm lin\,}T_{C}(\bar{x})$ the lineality space of $T_{C}(\bar{x})$, i.e., the largest linear space contained in $T_{C}(\bar{x})$. Since $T_{C}(\bar{x})$ is a closed convex cone by our assumption, we have ${\rm lin\,}T_{C}(\bar{x})=T_{C}(\bar{x})\cap(-T_{C}(\bar{x}))$.
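Before stating the theorem, the identity ${\rm lin\,}T=T\cap(-T)$ can be illustrated for a polyhedral cone $T=\{u\,|\,Au\leq 0\}$, where membership in the lineality space amounts to $Au=0$. The sketch below uses a hypothetical matrix `A` describing a half-plane in $\mathbb{R}^{2}$. It is not data from the paper, and the function names are illustrative only.

```python
# Hypothetical cone T = {u in R^2 | A u <= 0}; the matrix A is illustrative only.
A = [(-1, 0)]  # T = {u | u_1 >= 0}, a closed half-plane


def in_cone(u, rows=A):
    """True if every row a of the matrix satisfies a . u <= 0."""
    return all(sum(a * x for a, x in zip(row, u)) <= 0 for row in rows)


def in_lineality_space(u, rows=A):
    """u lies in lin T = T ∩ (-T) iff both u and -u belong to T."""
    neg_u = tuple(-x for x in u)
    return in_cone(u, rows) and in_cone(neg_u, rows)
```

For this half-plane the lineality space is the line $\{u\,|\,u_{1}=0\}$: the vector $(0,5)$ belongs to it, whereas $(1,0)$ lies in $T$ but not in ${\rm lin\,}T$.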

Theorem 9.

Assume that Assumption 1 holds and that for every $w\in\bar{K}$ , every $\lambda_{w}\in\bar{\Lambda}(w)$ and every $z\in\mathbb{R}^{m}$ verifying

[TABLE]

one has

[TABLE]

Further suppose that there exist some $\tilde{u}\in{\rm ri\,}T_{C}(\bar{x})$ , $\tilde{w}\in\bar{K}$ , $\tilde{\lambda}\in\bar{\Lambda}(\tilde{w})$ and some reals $\tilde{\mu}_{i}$ , $i\in\bar{\cal I}$ such that

[TABLE]

Then one has

[TABLE]

Proof.

By Assumption 1 we obtain that

[TABLE]

and, together with (66), that $Q:=T_{C}(\bar{x})\times\{0\}^{m}\times\bar{K}^{\circ}$ is a convex cone contained in $T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ . We shall apply Corollary 4 with this cone $Q$ by showing that $T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})=\bar{\cal T}(Q)$ and that there is some $(u,v)$ such that $\nabla F(\bar{x},\bar{y})(u,v)\in{\rm ri\,}{\rm conv\,}\bar{\cal T}(Q)$ . In a first step we show $T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})=\bar{\cal T}(Q)$ , i.e. we prove that for every $(t,w,w^{\ast})\in T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ there is some $q:=(t_{q},0,k^{\ast})\in Q$ and some $(u,v)\in\mathbb{R}^{n}\times\mathbb{R}^{m}$ such that

[TABLE]

Let $(t,w,w^{\ast})\in T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ be arbitrarily fixed and let $w^{\ast}=\nabla^{2}(\lambda_{w}^{T}g)(\bar{y})w+n^{\ast}$ with $\lambda_{w}\in\bar{\Lambda}(w)$ and $n^{\ast}\in\widehat{N}_{\bar{K}}(w)$ .

Denoting by $A$ the $|\bar{I}^{+}|\times m$ matrix, whose rows are given by $\nabla g_{i}(\bar{y})$ , $i\in\bar{I}^{+}$ , we obtain from (67) that

[TABLE]

Hence there is some $\tilde{k}^{\ast}\in{\rm Range\,}A^{T}=\mathop{\rm span\,}\limits\{\nabla g_{i}(\bar{y})\,|\,i\in\bar{I}^{+}\}$ and some $l\in{\rm lin\,}T_{C}(\bar{x})$ such that $(\nabla_{y}G(\bar{x},\bar{y})+\nabla^{2}(\lambda_{w}^{T}g)(\bar{y}))w=-\tilde{k}^{\ast}-\nabla_{x}G(\bar{x},\bar{y})l$ . Setting $t_{q}:=t-l$ , $u:=l$ , $v:=w$ and $k^{\ast}:=n^{\ast}-\tilde{k}^{\ast}$ and taking into account that $n^{\ast}\in\widehat{N}_{\bar{K}}(w)=\bar{K}^{\circ}\cap\{w\}^{\perp}\subset\bar{K}^{\circ}$ and that $\mathop{\rm span\,}\limits\{\nabla g_{i}(\bar{y})\,|\,i\in\bar{I}^{+}\}$ is exactly the lineality space of $\bar{K}^{\circ}$ , we have $t_{q}\in T_{C}(\bar{x})$ , $k^{\ast}\in\bar{K}^{\circ}$ and

[TABLE]

Thus

[TABLE]

verifying (70), and therefore $T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})=\bar{\cal T}(Q)$ holds.

In order to show that there are $(u,v)$ such that $\nabla F(\bar{x},\bar{y})(u,v)\in{\rm ri\,}{\rm conv\,}\bar{\cal T}(Q)$ , we observe first that

[TABLE]

where ${\cal S}_{1}:=\{(0,w,\nabla^{2}(\lambda^{T}g)(\bar{y})w)\,|\,w\in\bar{K},\lambda\in\bar{\Lambda}(w)\}$ and ${\cal S}_{2}:=T_{C}(\bar{x})\times\{0\}^{m}\times\bar{K}^{\circ}$ . Indeed, by Assumption 1 and (66) it can be easily seen that $T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})\subset{\cal S}_{1}+{\cal S}_{2}$ and by convexity of ${\cal S}_{2}$ the inclusion

[TABLE]

readily follows. On the other hand we have ${\cal S}_{1},{\cal S}_{2}\subset T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ implying ${\rm conv\,}{\cal S}_{1},{\cal S}_{2}\subset{\rm conv\,}T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ and, together with the fact that ${\rm conv\,}T_{D}(\bar{x},\bar{y},\bar{y}^{\ast})$ is a convex cone, the reverse inclusion

[TABLE]

follows as well and the validity of (73) is shown.

Now consider $(0,w,w^{\ast})\in{\rm ri\,}{\rm conv\,}{\cal S}_{1}$ . Then there are nonnegative coefficients $\alpha_{j}\geq 0$ , $j=1,\ldots,s$ , $\sum_{j=1}^{s}\alpha_{j}=1$ and elements $(0,w_{j},w_{j}^{\ast})\in{\cal S}_{1}$ such that $(0,w,w^{\ast})=\sum_{j=1}^{s}\alpha_{j}(0,w_{j},w_{j}^{\ast})$ . Then, by proceeding as before, for every $j=1,\ldots,s$ we can find $\tilde{k}_{j}^{\ast}\in\mathop{\rm span\,}\limits\{\nabla g_{i}(\bar{y})\,|\,i\in\bar{I}^{+}\}$ and $l_{j}\in{\rm lin\,}T_{C}(\bar{x})$ such that

[TABLE]

By setting $l:=\sum_{j=1}^{s}\alpha_{j}l_{j}$ , $u:=l+\tilde{u}$ , $v:=w+\tilde{w}$ , $\tilde{w}^{\ast}:=\nabla^{2}(\tilde{\lambda}^{T}g)(\bar{y})\tilde{w}$ , $k^{\ast}:=\sum_{j=1}^{s}\alpha_{j}\tilde{k}_{j}^{\ast}+\sum_{i\in\bar{\cal I}}\nabla g_{i}(\bar{y})\tilde{\mu}_{i}$ , we obtain

[TABLE]

Since $\sum_{i\in\bar{\cal I}}\nabla g_{i}(\bar{y})\tilde{\mu}_{i}\in{\rm ri\,}\bar{K}^{\circ}$ by [26, Theorem 6.6], $\sum_{j=1}^{s}\alpha_{j}\tilde{k}_{j}^{\ast}\in\mathop{\rm span\,}\limits\{\nabla g_{i}(\bar{y})\,|\,i\in\bar{I}^{+}\}\subset{\rm lin\,}\bar{K}^{\circ}$, $\tilde{u}\in{\rm ri\,}T_{C}(\bar{x})$ and $l\in{\rm lin\,}T_{C}(\bar{x})$, we conclude

[TABLE]

Further, since $(0,w,w^{\ast})\in{\rm ri\,}{\rm conv\,}{\cal S}_{1}$ , $(0,\tilde{w},\tilde{w}^{\ast})\in{\cal S}_{1}$ and ${\cal S}_{1}$ is a cone, we obtain $(0,w+\tilde{w},w^{\ast}+\tilde{w}^{\ast})\in{\rm ri\,}{\rm conv\,}{\cal S}_{1}$ . Thus, by taking into account [26, Corollary 6.6.2],

[TABLE]

and this finishes the proof. ∎

Remark 4.

Theorem 9 improves [10, Theorem 5], where the assumption

[TABLE]

is used. Note that this assumption is equivalent to $\{0\}^{m}=(\mathop{\rm span\,}\limits\{\nabla g_{i}(\bar{y})\,|\,i\in\bar{I}^{+}\})^{\perp}\cap\big(\nabla_{x}G(\bar{x},\bar{y})({\rm lin\,}T_{C}(\bar{x}))\big)^{\perp}$. Thus the only element $z$ with $\nabla g_{i}(\bar{y})z=0$, $i\in\bar{I}^{+}$, and $\nabla_{x}G(\bar{x},\bar{y})^{T}z\in({\rm lin\,}T_{C}(\bar{x}))^{\perp}$ is $z=0$, and therefore (67) trivially holds. Further, this assumption also implies (68): for arbitrary $u\in{\rm ri\,}T_{C}(\bar{x})$ and $\tilde{\mu}_{i}>0$, $i\in\bar{I}^{0}$, we can find $l\in{\rm lin\,}T_{C}(\bar{x})$ and $\tilde{\mu}_{i}$, $i\in\bar{I}^{+}$, with $\nabla_{x}G(\bar{x},\bar{y})l+\sum_{i\in\bar{I}^{+}}\tilde{\mu}_{i}\nabla g_{i}(\bar{y})=-\nabla_{x}G(\bar{x},\bar{y})u-\sum_{i\in\bar{I}^{0}}\tilde{\mu}_{i}\nabla g_{i}(\bar{y})$, and (68) follows with $\tilde{u}=u+l\in{\rm ri\,}T_{C}(\bar{x})$, $\tilde{w}=0$.

Next we consider ${\cal Q}$ -stationarity for the problem (63) under an additional assumption which allows a simplified description of the contingent cone $T_{{\rm gph\,}\widehat{N}_{\Gamma}}(\bar{y},\bar{y}^{\ast})$ as stated in [10, Theorem 3].

Theorem 10.

Assume that Assumption 1(3.) holds at $\bar{y}$ . Further assume that $\bar{\Lambda}(v_{1})=\bar{\Lambda}(v_{2})$ $\forall 0\not=v_{1},v_{2}\in\bar{K}$ and let $\bar{\lambda}$ be an arbitrary multiplier from $\bar{\Lambda}(v)$ for some $0\not=v\in\bar{K}$ , if $\bar{K}\not=\{0\}$ and $\bar{\lambda}\in\bar{\Lambda}$ otherwise. Then

[TABLE]

and

[TABLE]

The assumption $\bar{\Lambda}(v_{1})=\bar{\Lambda}(v_{2})$ $\forall\, 0\not=v_{1},v_{2}\in\bar{K}$ is fulfilled, for instance, if the inequalities $g_{i}(y)\leq 0$ satisfy the constant rank constraint qualification at $\bar{y}$, see e.g. [13, Corollary 3.2].

In what follows we will assume that the assumptions of Theorem 10 hold and that the tangent cone $T_{C}(\bar{x})$ is a convex polyhedral cone. For every index set $\beta\subset\bar{I}^{0}$ we define the convex polyhedral cone

[TABLE]

where

[TABLE]

Then we have

[TABLE]

and

[TABLE]

It is easy to see that under the assumptions of Theorem 10 we have

[TABLE]

and thus

[TABLE]

Note that for every pair $(\beta_{1},\beta_{2})\subset\bar{I}^{0}\times\bar{I}^{0}$ the cones $(Q^{\beta_{1}}_{GE},Q^{\beta_{2}}_{GE})$ fulfill (21) because they are convex polyhedral cones.

Proposition 6.

Let $(\bar{x},\bar{y})\in\Omega_{GE}$ and assume in addition to Assumption 1 that the contingent cone $T_{C}(\bar{x})$ is polyhedral and $\bar{\Lambda}(v_{1})=\bar{\Lambda}(v_{2})\ \forall 0\not=v_{1},v_{2}\in\bar{K}$ . Then for every pair $(\beta_{1},\beta_{2})\subset\bar{I}^{0}\times\bar{I}^{0}$ we have

[TABLE]

where

[TABLE]

and $\bar{\lambda}$ is an arbitrarily fixed multiplier from $\bar{\Lambda}(v)$ for some $0\not=v\in\bar{K}$ , if $\bar{K}\not=\{0\}$ and $\bar{\lambda}\in\bar{\Lambda}$ otherwise.

Proof.

The statement follows immediately from Theorem 2 if we can show

[TABLE]

Consider an element $(\eta_{C},q^{\ast},q)\in(Q^{\beta_{1}}_{GE})^{\circ}\cap(\ker\nabla F(\bar{x},\bar{y})^{T}+(Q^{\beta_{2}}_{GE})^{\circ})$ . Then there are elements $(\rho_{C},r^{\ast},r)\in\ker\nabla F(\bar{x},\bar{y})^{T}$ and $(\tilde{\eta}_{C},\tilde{q}^{\ast},\tilde{q})\in(Q^{\beta_{2}}_{GE})^{\circ}$ such that

[TABLE]

Since

[TABLE]

we obtain $\rho_{C}=\nabla_{x}G(\bar{x},\bar{y})^{T}r=\eta_{C}-\tilde{\eta}_{C}$ and thus $\eta_{C}=\nabla_{x}G(\bar{x},\bar{y})^{T}r+\tilde{\eta}_{C}\in\nabla_{x}G(\bar{x},\bar{y})^{T}r+\widehat{N}_{C}(\bar{x})$ verifying (86e). The relations (86a) and (86b) follow simply from the representation of $(Q^{\beta_{1}}_{GE})^{\circ}$ . By using the representations $\tilde{q}^{\ast}+\nabla^{2}(\bar{\lambda}^{T}g)(\bar{y})\tilde{q}=\sum_{i\in\bar{\cal I}}\mu_{i}^{\tilde{q}}\nabla g_{i}(\bar{y})$ with $\mu_{i}^{\tilde{q}}\geq 0$ , $i\in\bar{I}^{0}\setminus\beta_{2}$ , it follows that

[TABLE]

Since $r=q-\tilde{q}$ , $\nabla g_{i}(\bar{y})\tilde{q}=\nabla g_{i}(\bar{y})q=0,i\in\bar{I}^{+}$ we have

[TABLE]

where $\mu_{i}^{r}:=\mu_{i}^{q}-\mu_{i}^{\tilde{q}}$ , showing (86d). By taking into account $\nabla g_{i}(\bar{y})(q-r)=\nabla g_{i}(\bar{y})\tilde{q}\leq 0,i\in\beta_{2}$ , $\mu_{i}^{q}-\mu_{i}^{r}=\mu_{i}^{\tilde{q}}\geq 0,i\in\bar{I}^{0}\setminus\beta_{2}$ we obtain together with (89) that (86c) also holds. Hence, $(\eta_{C},q^{\ast},q)$ belongs to the set $\tilde{N}^{\beta_{1},\beta_{2}}_{GE}$ and the inclusion $(Q^{\beta_{1}}_{GE})^{\circ}\cap(\ker\nabla F(\bar{x},\bar{y})^{T}+(Q^{\beta_{2}}_{GE})^{\circ})\subset\tilde{N}^{\beta_{1},\beta_{2}}_{GE}$ follows.

To show the reverse inclusion consider $(\eta_{C},q^{\ast},q)\in\tilde{N}^{\beta_{1},\beta_{2}}_{GE}$ together with $r\in\mathbb{R}^{m},\mu_{i}^{q},\mu_{i}^{r},i\in\bar{\cal I}$ according to the definition. By setting $\rho_{C}:=\nabla_{x}G(\bar{x},\bar{y})^{T}r$ , $r^{\ast}:=\nabla_{y}G(\bar{x},\bar{y})^{T}r$ , $(\tilde{\eta}_{C},\tilde{q}^{\ast},\tilde{q}):=(\eta_{C},q^{\ast},q)-(\rho_{C},r^{\ast},r)$ it follows, by using the same arguments as above, that $(\rho_{C},r^{\ast},r)\in\ker\nabla F(\bar{x},\bar{y})^{T}$ and $(\tilde{\eta}_{C},\tilde{q}^{\ast},\tilde{q})\in(Q^{\beta_{2}}_{GE})^{\circ}$ . Since we obviously have $(\eta_{C},q^{\ast},q)\in(Q^{\beta_{1}}_{GE})^{\circ}$ , we obtain $(\eta_{C},q^{\ast},q)\in(Q^{\beta_{1}}_{GE})^{\circ}\cap(\ker\nabla F(\bar{x},\bar{y})^{T}+(Q^{\beta_{2}}_{GE})^{\circ})$ and this finishes the proof. ∎

Theorem 11.

Assume that the assumptions of Proposition 6 are fulfilled and assume that we are given a partition $(\beta_{1},\beta_{2})$ of $\bar{I}^{0}$ such that the following two conditions are fulfilled:

(i) For every $j\in\beta_{2}$ there are $l^{j}\in{\rm lin\,}T_{C}(\bar{x})$, $\tilde{\alpha}^{j}_{i}$, $i\in\bar{I}^{+}$, and $z^{j}\in\mathbb{R}^{m}$ with

[TABLE]

(ii) For every $k\in\beta_{1}$ there are $l^{k}\in{\rm lin\,}T_{C}(\bar{x})$, $\tilde{\alpha}^{k}_{i}$, $i\in\bar{I}^{+}$, and $z^{k}\in\mathbb{R}^{m}$ with

[TABLE]

Then

[TABLE]

Proof.

In view of Theorem 2 and Proposition 6 the statement follows if we can show $\tilde{N}^{\beta_{1},\beta_{2}}_{GE}\subset\widehat{N}_{D}(F(\bar{x},\bar{y}))$ . This inclusion holds true if for every $(\eta_{C},q,r)\in\mathbb{R}^{n}\times\mathbb{R}^{m}\times\mathbb{R}^{m}$ , $\mu_{i}^{q},\mu_{i}^{r},i\in\bar{\cal I}$ fulfilling the system

[TABLE]

we have $\nabla g_{i}(\bar{y})r\leq 0$ , $i\in\beta_{2}$ and $\mu_{i}^{r}\geq 0$ , $i\in\beta_{1}$ because then we have $\nabla g_{i}(\bar{y})q\leq 0$ , $\mu_{i}^{q}\geq 0$ , $i\in\beta_{1}\cup\beta_{2}=\bar{I}^{0}$ and thus the triple $(\eta_{C},q^{\ast},q)\in\tilde{N}^{\beta_{1},\beta_{2}}_{GE}$ with $q^{\ast}=-\nabla^{2}(\bar{\lambda}^{T}g)(\bar{y})q+\sum_{i\in\bar{\cal I}}\nabla g_{i}(\bar{y})\mu_{i}^{q}=-\nabla^{2}(\bar{\lambda}^{T}g)(\bar{y})q+\sum_{i\in\bar{\cal I}}\nabla g_{i}(\bar{y})\tilde{\mu}_{i}$ also belongs to $\widehat{N}_{D}(F(\bar{x},\bar{y}))$ .

The first condition $\nabla g_{i}(\bar{y})r\leq 0$ , $i\in\beta_{2}$ is equivalent to the requirement that for every $j\in\beta_{2}$ the optimization problem

[TABLE]

has a solution. Since the tangent cone $T_{C}(\bar{x})$ is assumed to be convex polyhedral, so is the regular normal cone, and therefore this program can be written as a linear program for which the trivial solution is obviously feasible. Hence, by the duality theory of linear programming, the program (91) has a solution if and only if its dual program has a feasible solution, i.e., there are multipliers $\alpha^{j}_{i},\tilde{\alpha}^{j}_{i}$, $i\in\bar{I}^{+}$, $\gamma^{j}_{i}\geq 0$, $\tilde{\gamma}^{j}_{i}\geq 0$, $i\in\beta_{1}$, $\delta^{j}_{i}\geq 0$, $\tilde{\delta}^{j}_{i}\geq 0$, $i\in\beta_{2}$, $z^{j}\in\mathbb{R}^{m}$ and $\tilde{l}^{j},l^{j}\in(\widehat{N}_{C}(\bar{x}))^{\circ}=T_{C}(\bar{x})$ such that

[TABLE]

Hence $l^{j}=-\tilde{l}^{j}\in T_{C}(\bar{x})\cap(-T_{C}(\bar{x}))={\rm lin\,}T_{C}(\bar{x})$ and by (65) we obtain $\gamma^{j}_{i}=0$ , $i\in\beta_{1}$ and $\delta^{j}_{i}=0$ , $i\in\beta_{2}$ . Now it is easy to see that the dual program to (91) is feasible if and only if condition (i) is fulfilled.
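The duality argument used above rests on a standard fact for linear programs with homogeneous constraints, for which the origin is always feasible. A generic sketch follows; the symbols $A$, $c$, $x$ and $\nu$ are chosen only for illustration and are unrelated to the data of (91).

```latex
% Generic homogeneous LP; A, c, x, \nu are illustrative symbols only.
\begin{align*}
&\max_{x\in\mathbb{R}^{n}}\ \{c^{T}x \mid Ax\leq 0\}\ \text{has a solution}
\iff \{\nu\geq 0 \mid A^{T}\nu=c\}\neq\emptyset .\\
&\text{Example } (n=2,\ A=I):\quad c=(1,1):\ \nu=(1,1)\ \text{is dual feasible, and } x=0 \text{ is optimal};\\
&\phantom{\text{Example } (n=2,\ A=I):\quad} c=(-1,0):\ \nu_{1}=-1<0,\ \text{dual infeasible, and } c^{T}x=t\to\infty \text{ along } x=(-t,0).
\end{align*}
```

In the first case the objective is bounded above by zero on the feasible cone. In the second case the feasible ray $x=(-t,0)$, $t\geq 0$, makes the objective unbounded, so no solution exists.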

The second requirement $\mu_{i}^{r}\geq 0$ , $i\in\beta_{1}$ is equivalent to the condition that for every $k\in\beta_{1}$ the program

[TABLE]

has a solution. Using similar arguments as above we obtain that this is equivalent to the existence of multipliers $\tilde{\alpha}^{k}_{i}$, $i\in\bar{I}^{+}$, $\tilde{\gamma}^{k}_{i}\geq 0$, $i\in\beta_{1}$, $\tilde{\delta}^{k}_{i}\geq 0$, $i\in\beta_{2}$, $z^{k}\in\mathbb{R}^{m}$ and $l^{k}\in{\rm lin\,}T_{C}(\bar{x})$ verifying

[TABLE]

and it is easy to see that this is equivalent to condition (ii). ∎

In order to introduce a suitable ${\cal Q}$ -stationarity concept for generalized equations, let us define

[TABLE]

and

[TABLE]

Note that if a subset $\beta\subset\bar{I}^{0}$ does not belong to ${\cal B}_{GE}$, then the set $\bar{\beta}:=\{i\in\bar{I}^{0}\,|\,\nabla g_{i}(\bar{y})z=0\ \forall z\in K_{\beta}\}$ fulfills $\beta\subset\bar{\beta}\in{\cal B}_{GE}$ and $K_{\beta}=K_{\bar{\beta}}$. It follows that $K_{\beta}^{\ast}\subset K_{\bar{\beta}}^{\ast}$ and consequently $Q^{\beta}_{GE}\subset Q^{\bar{\beta}}_{GE}$. Since we want to consider closed convex cones $Q$ which are as large as possible, we can discard $Q^{\beta}_{GE}$ from our analysis.

It follows immediately from the definition that $\bar{I}^{0}\in{\cal B}_{GE}$ . Further, by [10, Lemma 2] we have $\emptyset\in{\cal B}_{GE}$ .

In contrast to MPCC and MPVC the condition $Q_{1}^{\circ}\cap Q_{2}^{\circ}=\widehat{N}_{D}(F(\bar{x},\bar{y}))$ does not hold automatically for every pair $(Q_{1},Q_{2})\in{\cal Q}_{GE}$ , but it holds for instance for the pair $(Q^{\bar{I}^{0}}_{GE},Q^{\emptyset}_{GE})$ .

Definition 7.

Let $(\bar{x},\bar{y})\in\Omega_{GE}$ .

1. We say that $(\bar{x},\bar{y})$ is ${\cal Q}$-stationary for the program (63) with respect to the pair $(\beta_{1},\beta_{2})\in{\cal B}_{GE}\times{\cal B}_{GE}$ satisfying $\beta_{1}\cup\beta_{2}=\bar{I}^{0}$ if

[TABLE]

where $M^{\beta_{1},\beta_{2}}_{GE}$ is given by (85).

2. We say that $(\bar{x},\bar{y})$ is ${\cal Q}$-stationary for the program (63) if it is ${\cal Q}$-stationary with respect to some pair $(\beta_{1},\beta_{2})\in{\cal B}_{GE}\times{\cal B}_{GE}$ with $\beta_{1}\cup\beta_{2}=\bar{I}^{0}$.

3. We say that $(\bar{x},\bar{y})$ is ${\cal Q}_{M}$-stationary for the program (63) if there is some pair $(\beta_{1},\beta_{2})\in{\cal B}_{GE}\times{\cal B}_{GE}$ with $\beta_{1}\cup\beta_{2}=\bar{I}^{0}$ such that

[TABLE]

By using Proposition 6, Theorem 11 and Theorem 3(3.) we obtain the following theorem.

Theorem 12.

Assume that the assumptions of Proposition 6 hold at the B-stationary point $(\bar{x},\bar{y})\in\Omega_{GE}$ . Then $(\bar{x},\bar{y})$ is ${\cal Q}$ -stationary with respect to every pair $(\beta_{1},\beta_{2})\in{\cal B}_{GE}\times{\cal B}_{GE}$ with $\beta_{1}\cup\beta_{2}=\bar{I}^{0}$ and $(\bar{x},\bar{y})$ is also ${\cal Q}_{M}$ -stationary. Conversely, if $(\bar{x},\bar{y})$ is ${\cal Q}$ -stationary with respect to some pair $(\beta_{1},\beta_{2})\in{\cal B}_{GE}\times{\cal B}_{GE}$ fulfilling the assumptions of Theorem 11, then $(\bar{x},\bar{y})$ is S-stationary and consequently B-stationary as well.

Acknowledgements

The research was supported by the Austrian Science Fund (FWF) under grant P26132-N25. The authors would like to express their gratitude to the reviewers for their careful reading and numerous important suggestions.

Bibliography

[1] W. Achtziger, C. Kanzow, Mathematical programs with vanishing constraints: Optimality conditions and constraint qualifications, Math. Program., 114 (2008), pp. 69–99.
[2] J. F. Bonnans, A. Shapiro, Perturbation analysis of optimization problems, Springer, New York, 2000.
[3] D. Dorsch, V. Shikhman, O. Stein, Mathematical programs with vanishing constraints: critical point theory, J. Glob. Optim., 52 (2012), pp. 591–605.
[4] M. L. Flegel, C. Kanzow, An Abadie-type constraint qualification for mathematical programs with equilibrium constraints, J. Optim. Theory Appl., 124 (2005), pp. 595–614.
[5] M. L. Flegel, C. Kanzow, A direct proof for M-stationarity under MPEC-GCQ for mathematical programs with equilibrium constraints, in Optimization with Multivalued Mappings: Theory, Applications and Algorithms, S. Dempe and V. Kalashnikov, eds., Springer, New York, NY, 2006, pp. 111–122.
[6] M. L. Flegel, C. Kanzow, J. V. Outrata, Optimality conditions for disjunctive programs with application to mathematical programs with equilibrium constraints, Set-Valued Anal., 15 (2007), pp. 139–162.
[7] H. Gfrerer, On directional metric subregularity and second-order optimality conditions for a class of nonsmooth mathematical programs, SIAM J. Optim., 23 (2013), pp. 632–665.
[8] H. Gfrerer, Optimality conditions for disjunctive programs based on generalized differentiation with application to mathematical programs with equilibrium constraints, SIAM J. Optim., 24 (2014), pp. 898–931.


This is an Accepted Manuscript of an article published by Taylor & Francis in Optimization on 31 October 2016, available online: http://www.tandfonline.com/10.1080/02331934.2016.1252915
