Sufficient Conditions for Metric Subregularity of Constraint Systems with Applications to Disjunctive and Ortho-Disjunctive Programs
Mat\'u\v{s} Benko, Michal \v{C}ervinka, Tim Hoheisel

TL;DR
This paper investigates metric subregularity constraint qualification for nonconvex optimization problems, introducing directional pseudo- and quasi-normality notions, and applies these to disjunctive and ortho-disjunctive programs to develop verification tools.
Contribution
It introduces new directional normality concepts and applies them to disjunctive and ortho-disjunctive programs, enhancing the understanding and verification of metric subregularity.
Findings
Characterization of pseudo-normality via extremal conditions
Development of tools to verify MSCQ in complex programs
Extension of existing conditions to new classes like ortho-disjunctive programs
Abstract
This paper is devoted to the study of the metric subregularity constraint qualification (MSCQ) for general optimization problems, with the emphasis on the nonconvex setting. We elaborate on notions of directional pseudo- and quasi-normality, recently introduced by Bai et al. (SIAM J. Opt., 2019), which combine the standard approach via pseudo- and quasi-normality with modern tools of directional variational analysis. We focus on applications to disjunctive programs, where (directional) pseudo-normality is characterized via an extremal condition. This, in turn, yields efficient tools to verify pseudo-normality and MSCQ, which include, but are not limited to, Robinson's result on polyhedral multifunctions and Gfrerer's second-order sufficient condition for metric subregularity. Finally, we refine our study by defining the new class of ortho-disjunctive programs which comprises prominent…
| Polyn. -OSC | Polyn. -OSC | |
| Dir. -OSC | Pseudo-normality | |
| -OSC | -OSC |
| Polyn. -OSC | Polyn. -OSC | |
| Dir. -OSC | Pseudo-normality | |
| -OSC | -OSC |
| Robinson SC | Polyn. -OSC | |
| Def. | Pseudo-normality | |
| Polyn. -OSC | Polyn. -OSC |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Sufficient conditions for metric subregularity of constraint systems with applications to disjunctive and ortho-disjunctive programs
Matúš Benko*†,‡*
,
Michal Červinka*♭,♯*
and
Tim Hoheisel⋄
Abstract.
This paper is devoted to the study of the metric subregularity constraint qualification (MSCQ) for general optimization problems, with the emphasis on the nonconvex setting. We elaborate on notions of directional pseudo- and quasi-normality, recently introduced by Bai et al. (SIAM J. Opt., 2019), which combine the standard approach via pseudo- and quasi-normality with modern tools of directional variational analysis. We focus on applications to disjunctive programs, where (directional) pseudo-normality is characterized via an extremal condition. This, in turn, yields efficient tools to verify pseudo-normality and MSCQ, which include, but are not limited to, Robinson’s result on polyhedral multifunctions and Gfrerer’s second-order sufficient condition for metric subregularity. Finally, we refine our study by defining the new class of ortho-disjunctive programs which comprises prominent optimization problems such as mathematical programs with complementarity, vanishing or switching constraints.
Key words and phrases:
metric subregularity, error bound property, pseudo-/quasi-normality, MPCC, MPVC, disjunctive programs, ortho-disjunctive programs
*†*Institute of Computational Mathematics, Johannes Kepler University Linz, A-4040 Linz, Austria, e-mail: [email protected]
*‡*Faculty of Mathematics, University of Vienna, 1090 Vienna, Austria
♭Institute of Economic Studies, Faculty of Social Sciences, Charles University, Opletalova 26, 110 00, Prague 1, Czech Republic, e-mail: [email protected]
♯Institute of Information Theory and Automation, Czech Academy of Sciences, Pod Vodarenskou vezi 4, 180 00 Prague 8, Czech Republic, e-mail: [email protected]
⋄Institute of Mathematics and Statistics, McGill University, 805 Sherbrooke St West, Room 1114 Montréal, Québec, Canada H3A 0B9
1 Introduction
In this paper we study constraint qualifications (CQs) for a general mathematical program (GMP) given by
[TABLE]
where and are continuously differentiable and is closed. Constraint qualification are regularity conditions on the feasible set of an optimization problem and play a crucial role for stationarity and optimality conditions, sensitivity analysis or exact penalization, as well as the convergence analysis of numerical algorithms.
At the center of our attention is the metric subregularity constraint qualification (MSCQ). Known also under other monikers such as error bound property or calmness constraint qualification (in general, calmness is equivalent to metric subregularity of the inverse mapping), MSCQ is, to the best of our knowledge, the weakest known CQ to ensure the full calculus for (limiting) normal cones and subdifferentials, see [33, 43]. In particular, MSCQ guarantees that local minimizers of (1) are Mordukhovich (M)-stationary [33]. Moreover, MSCQ also yields exactness of the penalty function
[TABLE]
see, e.g., [15, 16, 18, 46], which is an important tool for establishing necessary optimality conditions, as well as for numerical methods [16].
Apart from the area of optimality conditions and exact penalization, MSCQ turns out to be essential also in second-order variational analysis and closely related areas of stability and sensitivity, cf. e.g., [8, 9, 30, 31] and references therein.
The main drawback of MSCQ is the difficulty with efficient verification of this property. There exist two main approaches to ensure MSCQ. The first one makes use of the stronger property of metric regularity, which is closely related to other concepts such as the Aubin property, (generalized) Mangasarian-Fromovitz constraint qualification (GMFCQ), no nonzero abnormal multiplier constraint qualification (NNAMCQ). Metric regularity can be characterized via co-derivatives [56, Theorem 9.40] (known as the Mordukhovich criterion) or via graphical derivatives [21, Theorem 4B.1]. For more information as to metric regularity we refer to the monographs [21, 42, 47, 52, 56].
The second approach corresponds to Robinson’s famous result on polyhedral multifunctions [55, Proposition 1] and is, in turn, restricted to this special case. There are many situations, however, in which metric subregularity is provably satisfied, yet can not be detected by either of these approaches.
Therefore, a lot of attention has been given to conditions that lie between metric regularity and metric subregularity. A very common approach is to provide sufficient conditions for subregularity/calmness in terms of various derivative-like objects [23, 33, 34, 43, 48, 54, 59]. An exception is the early and very interesting attempt by Klatte and Kummer [46, Theorem 3.6], where calmness of an intersection mapping is studied. For further details on metric subregularity and related notions as well as more bibliographical pointers on the topic, we refer to the paper and the textbook by Dontchev and Rockafellar [20, 21] and to the textbook by Ioffe [42].
We will further focus on the following two strategies: the first one is obtained by the pseudo- and quasi-normality, first introduced for nonlinear programming in [10], and later extended to MPCCs in [45, 60] as well as to general programs of the form (1), see [32]. The second one, based on the directional approach recently developed by Gfrerer and co-authors [7, 25, 26, 27, 29, 30], was established and utilized in [25, 26, 29] under the name first/second-order sufficient condition for metric subregularity (FOSCMS/SOSCMS). FOSCMS can be viewed as a directional, less restrictive counterpart of the Mordukhovich criterion. The main advantage of these conditions is their point-based nature, which makes it possible to verify them efficiently. The point-based character of these conditions can be justified by the existence of suitable calculus rules for the (directional) limiting normal cones, despite the fact that these objects are defined with information taken from the neighborhood by using a limiting process.
In this paper, we synthesize the concepts of pseudo- and quasi-normality with the above mentioned directional approach, which also serves as our main workhorse throughout the paper. Hence, we study constraint qualifications called directional pseudo-/quasi-normality, which are milder than both pseudo-/quasi-normality and FOSCMS, and imply MSCQ (cf. Theorems 3.2 and 3.5).
We would like to point out, that despite working on this combined approach independently of Bai, Ye, and Zhang [3], the exact same definitions of directional pseudo- and quasi-normality were first published in said paper. Here, we present alternative or simpler proofs of certain common results using different techniques, which can further illuminate these novel tools for the reader. Moreover, we present a thorough investigation of applicability of these new CQs, which goes beyond the material in [3].
Although the core material of our study is valid for general programs (1) with an arbitrary closed set , we are particularly interested in situations when is not convex. Optimization problems with inherently nonconvex structures induced by imposing logical or combinatorial conditions on otherwise smooth or convex data [57] has been of increasing interest in recent years. Among the prominent examples are mathematical programs with complementarity constraints (MPCCs) [49, 53], or mathematical programs with vanishing constraints (MPVCs) [37], etc. For these optimization problems there are several applications in the natural and social sciences, economics and engineering. Moreover, they are very challenging from both a theoretical and numerical perspective. More examples of such programs are discussed in Section 4, where we apply our results to disjunctive programs in which is a finite union of polyhedra. In Section 5 we introduce the new notion of ortho-disjunctive programs. Ortho-disjunctive programs are disjunctive programs with an additional product structure of which allows us to address some issues that cannot be resolved in the general disjunctive setting. Both disjunctive and ortho-disjunctive programs provide a unified framework for the above mentioned particular problem classes.
The main contributions of the paper are as follows:
- •
Pseudo-normality for disjunctive programs: For disjunctive programs, we observe that pseudo-normality can be cast in a simpler way which is, in fact, a proper extension of the definition of pseudo-normality that has already been used for NLPs and MPCCs in the literature. This new definition, however, reveals an interpretation of pseudo-normality via an extremal condition, see (26), which is neither visible from the general definition for (1) nor from the specially tailored ones for NLPs and MPCCs, respectively. This extremal condition then yields efficient tools to verify pseudo-normality. Indeed, apart from recovering the Robinson’s result and the Gfrerer’s SOSCMS, employing higher-order analysis yields a variety of new milder point-based sufficient conditions for pseudo-normality and MSCQ, see Section 4.3.
- •
Quasi-normality for ortho-disjunctive programs: A similar approach as the one to pseudo-normality can be made for (directional) quasi-normality if one moves from the disjunctive to even more specialized ortho-disjunctive setting, designed to utilize an underlying product structure exhibited by the standard examples of disjunctive programs (MPCCs, MPVCs). The corresponding extremal condition characterizing quasi-normality leads to a surprising connection between quasi-normality and multi-objective optimization. Again, sufficient conditions of second- or higher-order are readily available.
- •
PQ-normality: In Section 3 we established the new notion of (directional) PQ-normality, which includes both pseudo- and quasi-normality as extreme cases. This unified notion puts us in a position to better understand and to exploit certain product structures for which neither quasi- nor pseudo-normality is suitable.
The rest of the paper is organized as follows. In Section 2 we present some preliminary results and notions from variational analysis as well as key results regarding constraint qualifications. Section 3 contains fundamental results of our study dealing with CQs for the general program (1). In Section 4, we study disjunctive programs and obtain full results on pseudo-normality. Section 5 deals with disjunctive programs with additional product structures often present in the problems of interest (MPCCs, MPVCs, etc.). In particular, the notion of ortho-disjunctive programs is introduced and complete results on quasi-normality are obtained.
Notation: Most of the notation used is standard: The closed ball in with center at and radius is denoted by and we use for the closed unit ball. The extended real line is given by . For its epigraph is given by . For a nonempty set we define the (Euclidean) distance function through . The projection mapping associated with is defined by . We write for a sequence of scalars and for a sequence of vectors. For a mapping its Jacobian at is denoted by . In particular, for , the Jacobian at is a row vector, and we denote its Hessian at by . Moreover, for the scalarized function is given by . Note that for we have and we often use the latter notation. For a matrix , its range or image is . For some vector we set and .
2 Preliminaries
This section is divided into two parts. First, we introduce some basic notions from variational analysis. The second part is devoted to constraint qualifications for the general mathematical program (1).
2.1 Variational analysis
Given a closed set and , the tangent cone to at is defined by
[TABLE]
The regular normal cone to at is given as the polar cone of the tangent cone, i.e.
[TABLE]
The limiting normal cone to at is given by
[TABLE]
If we set . Observe that holds. In case is a convex set, regular and limiting normal cone coincide with the classical normal cone of convex analysis, i.e.,
[TABLE]
and we will use the notation in this case. Finally, given a direction , the limiting normal cone to at in direction is defined by
[TABLE]
Note that, by definition, we have . Furthermore, observe that for all and if .
For and such that is finite (hence ) the sets
[TABLE]
denote the regular and limiting subdifferential of at , respectively. Observe that, in particular, for the indicator function of a set , given by
[TABLE]
we have The distance function enjoys a rich subdifferential calculus briefly summarized in the next result.
Proposition 2.1** (Subdifferentiation of distance function).**
Let be closed and continuously differentiable. Then the following hold:
- (i)
(**[56, Example 8.53]**) \partial{\rm d}_{S}(y)=\left\{\begin{array}[]{ll}N_{S}(y)\cap\mathbb{B}&\text{if}\;y\in S,\\ \frac{y-P_{S}(y)}{{\rm d}_{S}(y)}&\text{if}\;y\notin S;\end{array}\right.
- (ii)
(**[56*, Theorem 10.6]**) *
2.2 Constraint qualifications
The purpose of this section is to recall several well-established CQs for the general program (1) and to highlight some basic relations between them. We commence with the CQ that is most important to our study.
Definition 2.2** (MSCQ).**
Let be feasible for (1). We say that the metric subregularity constraint qualification (MSCQ) holds at if there exists a neighborhood of and such that
[TABLE]
Note that MSCQ is exactly metric subregularity in the set-valued sense of the feasibility mapping for the constraint system which is given by , see e.g. [29].
The stronger property of metric regularity holds for around if and only if there are neighborhoods of and of [math] and such that
[TABLE]
It is well-known that metric regularity of a multifunction is equivalent to the Aubin property of the inverse multifunction [56, Theorem 9.43]. Applying the Mordukhovich criterion to the feasibility mapping yields a condition that there is no nonzero multiplier such that
[TABLE]
which is often known as generalized Mangasarian-Fromovitz constraint qualification (GMFCQ) at . In the rest of the paper, we mainly stick to the GMFCQ terminology, but sometimes refer to GMFCQ also as the Mordukhovich criterion. Thanks to the calculus rules for limiting normal cones and subdifferentials, the Mordukhovich criterion often provides an efficient tool for verifying metric regularity. There are still plenty of situations, however, where GMFCQ is not fulfilled but MSCQ is. It is therefore an important and worthwhile endeavor to fill the gap between GMFCQ and MSCQ, ideally with verifiable conditions at that. Let us proceed with the next list of constraint qualifications for (1) relevant for our study, see, e.g., [32, 29].
Definition 2.3** (Constraint qualifications).**
Let be feasible for (1).We say that
- (i)
pseudo-normality holds at if there is no nonzero such that (4) holds and that satisfies the following condition: There exists a sequence with
[TABLE]
- (ii)
quasi-normality* holds at if there is no nonzero such that (4) holds and that satisfies the following condition: There exists a sequence with*
[TABLE]
- (iii)
first-order sufficient condition for metric subregularity (FOSCMS) holds at if for every with one has
[TABLE]
- (iv)
second-order sufficient condition for metric subregularity (SOSCMS)* holds at if is twice differentiable at , is the union of finitely many convex polyhedra, and for every with one has*
[TABLE]
We point out that imposing that the (nonexisting) multiplier is in in the definition of pseudo-/quasi-normality is, clearly, redundant, since it follows from . Nevertheless, in order to be consistent with the literature and to emphasize the connection to GMFCQ and other CQs, we stick to the original definition. In particular, it is obvious from the definition that GMFCQ implies both pseudo- and hence quasi-normality. The concepts of pseudo- and quasi-normality are well-established in the literature. Note that in [32], the condition in (i) and (ii) is replaced by . In order to see that no difference arises, consider the following elementary lemma which follows readily from the definitions of continuity and of the limiting normal cone, respectively.
Lemma 2.4**.**
Let be closed, and let be continuous. Then for every there exist and such that .
Corollary 2.5**.**
Under the assumptions of Definition 2.3 let . Then the following hold:
- (i)
If and for all then there exists such that and for all .
- (ii)
If and for all then there exists such that and for all .
Proof.
We only prove part (i); part (ii) can be shown analogously. To this end, define the continuous maps
[TABLE]
and set . Applying Lemma 2.4 then generates the desired sequences. ∎
Corollary 2.5 guarantees that using instead of in the definition of pseudo- and quasi-normality does not matter. We note that this is also true for the directional versions of these CQs to be established in Definition 3.4.
The obvious drawback of pseudo- and quasi-normality is that they are expressed via sequences, which makes it is quite difficult to check their validity and apply them. Another way of relaxing GMFCQ is provided by FOSCMS and SOSCMS, which can be easier to verify due to the calculus for directional limiting objects [7].
In order to simplify the notation, given feasible for (1), we define
[TABLE]
and set
[TABLE]
i.e., the directional normal cone is replaced by the standard one. With these conventions, GMFCQ at reads
[TABLE]
while FOSCMS now reads
[TABLE]
The fact that GMFCQ implies FOSCMS is clear from the inclusion
[TABLE]
The following example shows that this implication can be strict. In addition, it also illustrates that MSCQ is strictly weaker than quasi-normality, cf. Proposition 2.7(i).
Example 2.6**.**
Let and set . Clearly and , hence and the Mordukhovich criterion (GMFCQ) is violated at .
Moreover, setting , and we obtain , showing that also quasi-normality is violated at .
On the other hand, since for all , FOSCMS and hence MSCQ are satisfied at .
We point out that the set in Example 2.6 is convex, thus illustrating that even in the convex case one may not be able to verify MSCQ using the non-directional conditions (GMFCQ, pseudo- and quasi-normality), but one may invoke a directional one (here FOSCMS).
Although the directional conditions FOSCMS and SOSCMS are similar in flavor, we point out that SOSCMS is only applicable in the case where has disjunctive structure. In this setting, there is yet another condition due to Robinson [55] that ensures MSCQ.
The following proposition summarizes several important sufficient conditions for MSCQ, other than GMFCQ, which have already been established in the literature and that are important to our study. We point out, however, that the validity of these results will be a simple corollary of our refined analysis in Section 3.
Proposition 2.7** (Sufficient conditions for MSCQ).**
Let be feasible for (1). Then under either of the following conditions MSCQ holds at .
- (i)
(**[32, Theorem 5.2]**)quasi-normality (or pseudo-normality) holds at ;
- (ii)
(**[29, Corollary 1]**) FOSCMS holds at ;
- (iii)
(**[29, Corollary 1]**) SOSCMS holds at ;
- (iv)
(**[55, Proposition 1]**) is affine and is the union of finitely many convex polyhedra.
As we can see, two of these conditions are applicable for the general program (1) and are strictly milder than GMFCQ. The other two are restricted to the special structure of disjunctive constraints and hence are in general not comparable with GMFCQ. Interestingly, all four conditions are mutually incomparable and were obtained by different approaches. The only available comparison is for the disjunctive constraints, where FOSCMS clearly implies SOSCMS.
We will refer to (iv) in Proposition 2.7 as Robinson’s result. We point out, however, that [55, Proposition 1] in fact contains a stronger statement.
3 New constraint qualifications for GMP
In this section we are primarily concerned with constraint qualifications for the general mathematical program (1). In particular, we investigate directional counterparts of pseudo- and quasi-normality introduced in [3], and introduce a new CQ called PQ-normality that unifies pseudo- and quasi-normality. We then show that each of these CQs implies MSCQ, and hence recover statements (i) and (ii) of Proposition 2.7. Afterwards, we propose various sufficient conditions for these CQs under some additional structural assumptions. Hence, when applied to the disjunctive constraints in Section 4, these conditions also recover statements (iii) and (iv) of Proposition 2.7.
3.1 Directional constraint qualifications and PQ-normality
In [3, Corollary 4.1], it was shown that metric subregularity is implied by directional quasi-normality, see Definition 3.4. Here, we propose a different proof that follows the techniques used, e.g., in [45, Lemma 4.4] and [32, Lemma 5.1.]. Note that our main tools are the Ekeland’s variational principle and the rich subdifferential calculus for the distance function from Proposition 2.1, and this approach is novel even is the nondirectional setting. We start with the following observation, where we invoke definitions of and .
Lemma 3.1**.**
Let be feasible for (1) such that MSCQ is violated at . Then there exist sequences and as well as with such that
[TABLE]
Proof.
Violation of MSCQ at readily yields a sequence with . We put and find that is an -minimizer of for all . Hence by Ekeland’s variational principle [56, Proposition 1.43] with , there exists a sequence such that and for all . This implies as well as for all by applying a nonsmooth Fermat’s rule (cf. [56, Theorem 10.1]) and invoking a sum rule for locally Lipschitz functions (cf. [56, Exercise 10.10]). In particular, there exists a sequence . As , w.l.o.g. we may assume that with . Now let for all . Then
[TABLE]
As minimizes for all , we find that . Hence we infer that the first term on the right in (7) satisfies
[TABLE]
The second term on the right in (7) goes to zero by differentiability of and we conclude from (7) that . Finally, as for all , we have . ∎
Theorem 3.2**.**
Let be feasible for (1) and assume that the following holds: For every with and there does not exist a nonzero that satisfies the following condition: There exists a sequence such that for all we have
[TABLE]
Then MSCQ is fulfilled at .
Proof.
Assume that MSCQ is not satisfied at . Consider sequences , and with provided by Lemma 3.1. Recall that
[TABLE]
see Proposition 2.1 (ii). Moreover, by Proposition 2.1 (i), it holds that
[TABLE]
since . Consequently, there exists such that with
[TABLE]
we have
[TABLE]
Moreover, by the definition of in (8) and the fact that , [56, Example 6.16] implies that
[TABLE]
Since is bounded, we may assume w.l.o.g. that for some . Then from (8) we infer that . Hence, passing to the limit in (9) we obtain
[TABLE]
Now, if then w.l.o.g. and hence . Analogously, we argue for . Altogether, we find that
[TABLE]
Finally, Lemma 3.1 yields that , showing , which establishes a contradiction. ∎
Instead of directly extracting directional versions of quasi- and pseudo-normality from Theorem 3.2, we introduce the notion of PQ-normality which serves as a bridge between pseudo- and quasi-normality, which are then identified as the two extreme cases of PQ-normality. We strongly emphasize that introducing PQ-normality does not merely serve the academic purpose of unifying the two concepts. In fact, it has important consequences for the class of programs in Section 5 where the set possesses an underlying product structure in addition to its disjunctive nature.
First, we introduce additional notation. For we denote by its scalar components. More generally, suppose that is expressed via factors as and introduce the multi-indices with . Note that there is a one-to-one correspondence between such multi-indices and factorizations of . The components of some we denote as for , where is some (abstract) index set of elements. Note that we do not identify with in order to avoid ambiguity of notation, e.g., stands only for the first, scalar, component of . Moreover, we use a Greek letter to indicate the vector components of and a Latin letter to indicate the scalar components .
Given a multi-index fix . The component , vector in general, can also be written via its scalar components, i.e., there exists an index set, denoted by , such that . Note that . Finally, given two multi-indices with , we say that is a refinement of and write , provided for every there exists an index set such that
[TABLE]
Note that the special multi-indices and are in fact maximal and minimal in the sense that for any multi-index with one has .
The following example illustrates the use of the above notation.
Example 3.3**.**
Let , and consider a multi-index corresponding to the factorization . Consider also an element . Since has three components, we may set, e.g., yielding , and . Clearly, we have , and .
Moreover, the multi-index is a refinement of , since we may set
[TABLE]
to obtain
[TABLE]
We now proceed with the definition of PQ-normality which embeds quasi- and pseudo-normality as extremal cases in a whole family of constraint qualifications.
Definition 3.4** (PQ-normality).**
Let be feasible for (1), consider with , and let be a multi-index such that . We say that
- (i)
PQ-normality w.r.t. * holds at , if there is no nonzero that satisfies the following condition: There exists a sequence with and*
[TABLE]
- (ii)
PQ-normality w.r.t. in direction * holds at , if there is no nonzero that satisfies the following condition: There exists a sequence with , (10) and*
[TABLE]
We say that directional PQ-normality w.r.t. holds at , if PQ-normality w.r.t. in direction holds at for all with . In particular, we refer to PQ-normality w.r.t. (in direction ) as pseudo-normality (in direction ), while PQ-normality w.r.t. we call quasi-normality.
It is clear from the definition that PQ-normality w.r.t. implies PQ-normality w.r.t. provided . In particular, since for all with , we conclude that pseudo-normality implies PQ-normality w.r.t. any and this further implies quasi-normality. Naturally, all of the above comments remain true for the corresponding directional CQs.
For the sake of completeness, we reformulate Theorem 3.2 in terms of directional PQ-normality.
Theorem 3.5**.**
Let be feasible for (1) and let the directional PQ-normality w.r.t. any , in particular directional pseudo- or quasi-normality, hold at . Then MSCQ is fulfilled at .
We point out that directional quasi-normality is strictly weaker than both FOSCMS (clear from the definition of the respective CQs) as well as quasi-normality, see Example 2.6. Hence it constitutes, to the best of our knowledge, one of the weakest conditions to imply MSCQ for the general optimization problem (1), which can still be efficiently verified in some very important cases as shown in Section 3.2 and Section 3.3 below.
The same directional versions of pseudo- and quasi-normality were independently introduced in a recent paper by Bai et al. [3]. In order to show that these imply MSCQ, Bai et al. build on results from [28, Corollary 1, Remarks 1 and 2]. Hence, we believe our alternative proof can provide additional insight on the role of pseudo- and quasi-normality in verifying MSCQ. More importantly, in what follows, we focus on simplifying these conditions under specific structural assumptions on the feasible set, which is a crucial step to facilitate their use.
3.2 Simplified CQs and second-order sufficient conditions: The standard case
For some important instances of the general program (1), the concepts of pseudo- and quasi-normality were introduced without the undesirable additional sequence , see [10] for standard NLPs and [45] for MPCCs. In the remaining part of this section, we address the question as to when this is possible for more general instances of (1), working with the generalized notion of PQ-normality. In turn, in the remainder of this section, denotes a multi-index in for some with unless stated otherwise. As a result of dropping the sequence , we obtain a characterization of PQ-normality via an extremal condition, which, in turn, yields several sufficient conditions for PQ-normality.
For clarity of exposition, we split our analysis into the standard (non-directional) and the directional case.
We begin our study of the non-directional case by the following straightforward result, which follows readily from definition of PQ-normality using the sequences and , taking also into account Lemma 2.4 and the arguments in the proof of Corollary 2.5.
Lemma 3.6**.**
Let be feasible for (1). If PQ-normality w.r.t. holds at then there is no nonzero that satisfies the following condition: There exists a sequence with
[TABLE]
Note that in case of MPCCs, by the geometry of the feasible set and the resulting normal cones, one always has . Thus the conditions used in [45] simplify to and if , respectively. However, in the general setting of problem (1), as well as in the case of general disjunctive constraints, we cannot make this simplification. In order to obtain the reverse implication, however, we have to impose some additional assumptions on the constraints of (1).
Assumption 3.7**.**
Let be a multi-index and let be feasible for (1). Assume that for every and every sequence with , there exists a subsequence such that
[TABLE]
Theorem 3.8** (Simplified PQ-normality under Ass. 3.7).**
Let be feasible for (1) and such that Assumption 3.7 holds. Then PQ-normality w.r.t. at is equivalent to the following simplified PQ-normality w.r.t. at , i.e.
There is no nonzero such that there exists a sequence fulfilling (12).
Proof.
The fact that PQ-normality implies the simplified PQ-normality follows from Lemma 3.6.
In turn, if PQ-normality w.r.t. is violated, there exist and with and for all . Relabeling by only using the indices and then summing up the above expression with (13) for all shows that the simplified PQ-normality is then violated as well. ∎
As the above theorem shows, under Assumption 3.7, the simplified PQ-normality is equivalent to PQ-normality, hence sufficient for MSCQ. Without Assumption 3.7 this is, in general, false, see Example 3.13. In the following sections, however, we deal with various types of optimization problems which automatically satisfy Assumption 3.7 for a suitable multi-index, including and , at every feasible point.
As we will now show, Theorem 3.8 also reveals an interesting connection between PQ-normality and vector optimization. This, in turn, paves the way to a variety of sufficient conditions for PQ-normality, hence also for MSCQ.
Let us recall some standard terminology from multiobjective optimization [22, 44]. Given , a point is called a local weak efficient solution of the unconstrained vector optimization problem if there exists a neighborhood of such that no satisfies for all . Given and , we define the function
[TABLE]
Theorem 3.9**.**
Let be feasible for (1) and let Assumption 3.7 for some be fulfilled. Then PQ-normality w.r.t. holds at if and only if for every , the vector is a local weak efficient solution of the unconstrained vector optimization problem for given by (14).
Proof.
If there exists such that is not a local weak efficient solution of , then and there exists such that for all and all . This shows that PQ-normality w.r.t. is violated due to Theorem 3.8.
In turn, if PQ-normality w.r.t. is violated, there exists and a sequence such that for all and all , which shows that is not a local weak efficient solution of . ∎
This simple observation has some significant consequences. In particular, it allows us to use the standard sufficient conditions for a local weak efficient solution to obtain the following point-based sufficient condition for PQ-normality.
Corollary 3.10** (Sufficient condition for PQ-normality).**
Let be feasible for (1) with twice differentiable at and let Assumption 3.7 for some be fulfilled. Then PQ-normality w.r.t. , in particular MSCQ, holds at under the following condition: For every , every with for all and every with one has
[TABLE]
Proof.
Consider and given by (14) and let be arbitrary. Then
[TABLE]
since . Hence, every with for all in fact fulfills for all . The result thus follows from [11, Theorem 4] and Theorem 3.9. ∎
Remark 3.11**.**
Note that implies the first-order necessary conditions for local efficient solution, for all , as can be seen from (16).
The above corollary motivates the following definition.
Definition 3.12**.**
Given a feasible point for (1) and a multi-index , we say that the second-order sufficient condition for PQ-normality w.r.t. , SOSCPQN() for short, holds at provided: For every , every with for all and every with one has (15).
Moreover, we refer to SOSCPQN() and SOSCPQN() as second-order sufficient condition for pseudo-/quasi-normality (SOSCPN and SOSCQN), respectively.
Naturally, one can also consider higher-order sufficient conditions. We do so in Section 4, where we focus on pseudo-normality. Note that pseudo-normality is connected to standard maximality since is a scalar function in that case.
The following example shows that SOSCPN on its own, i.e., without Assumption 3.7 for , does not guarantee pseudo-normality, not even MSCQ.
Example 3.13**.**
Consider given by and defined by and let . Clearly and Thus, for every and every we have , where is such that , showing that SOSCPN holds at . On the other hand, for a sequence we obtain , while
[TABLE]
showing the violation of MSCQ and consequently of pseudo-normality as well.
We point out that the set in Example 3.13 equals and is therefore convex, yet SOSCPN still does not imply MSCQ.
Theorem 3.14**.**
Let be feasible for (1) with twice differentiable at and consider two multi-indices with . Then SOSCPQN() implies SOSCPQN(). In particular, we have SOSCPN SOSCPQN() SOSCQN.
Proof.
Consider , and with for all and . For any , we find some index set such that by . Summing up over yields . Thus, we can apply SOSCPQN() in order to infer the existence of such that
[TABLE]
This yields, however, that SOSCPQN() is fulfilled.
The second statement now follows from the obvious relation valid for any . ∎
The above theorem holds regardless of Assumption 3.7. If one seeks to use any of the sufficient conditions to get metric subregularity, however, one clearly needs it, see Examples 3.13 and 3.16. Note also that if Assumption 3.7 holds for , then it also holds for any .
The following example shows that SOSCQN is, in fact, strictly milder than SOSCPN. Moreover, it demonstrates that one can effectively verify MSCQ by means of SOSCQN even when pseudo-normality is not fulfilled.
Example 3.15**.**
Let for two convex polyhedral sets and let for and and let . In particular, Assumption 3.7 for is fulfilled by Corollary 5.1. Clearly, , and hence
SOSCQN is fulfilled since for any for some and for one has , . In particular, quasi-normality and MSCQ follows.
On the other hand, let and consider a sequence . We obtain
[TABLE]
showing the violation of pseudo-normality.
The next example shows that, without Assumption 3.7 for , the simplified form of quasi-normality from Lemma 3.6 does not imply MSCQ even if is a convex polyhedral set.
Example 3.16**.**
Let be convex polyhedral set given by and given by and let . Clearly and we find that For every for some and every close to we have if and if , showing that the simplified form of quasi-normality holds at . On the other hand, for a sequence we obtain , while
[TABLE]
showing the violation of MSCQ.
3.3 Simplified CQs and second-order sufficient conditions: The directional case
In this subsection, we consider the directional case, where the situation is slightly different.
Theorem 3.17**.**
Let be feasible for (1) and consider with . Then under Assumption 3.7 for , PQ-normality w.r.t. at in direction holds if: there is no nonzero such that there exists a sequence with fulfilling (12).
Proof.
The proof follows by the same arguments as used in the proof of Theorem 3.8. ∎
In contrast to the standard case, the following example shows that the reverse implication in the above theorem is not true in general.
Example 3.18**.**
Consider given by and defined by and let and . Clearly and Set and note that any sequence fulfills as well as (12) for , since .
On the other hand, for arbitrary sequence with we have . Hence, for any one has , showing that Assumption 3.7 for is fulfilled. Moreover yields . Then, however, we obtain
[TABLE]
showing that pseudo-normality at in direction is fulfilled.
Nevertheless, the previous theorem still allows us to use sufficient conditions. Consider the following second-order sufficient condition for directional PQ-normality w.r.t. , SOSCdirPQN() for short.
Proposition 3.19** (SOSCdirPQN()).**
Let be feasible for (1) with twice differentiable at and let Assumption 3.7 for some be fulfilled. Then directional PQ-normality w.r.t. , in particular MSCQ, holds at if the following SOSCdirPQN() is fulfilled: For every with , every with , for all and every with condition (15) is fulfilled.
Proof.
Assume that directional PQ-normality w.r.t. is violated. Theorem 3.17 yields the existence of , and a sequence with such that for all with as in (14). Hence, by passing to a subsequence if necessary, we can assume that with and , where for simplification we dropped the upper index from .
By Taylor expansion, we have
[TABLE]
where denotes the vector in with components for . If there exists a subsequence such that , we conclude from (17) that
[TABLE]
where for . Passing to a subsequence if necessary, and taking into account that with being a closed set, we conclude that , i.e., there exists with . This is, however, a contradiction with , since we obtain that by and (16), which clearly holds due to .
Consequently, remains bounded and by passing to a subsequence if necessary we assume that . Note also that in this case we get by (17). By similar arguments as before, (17) now yields the existence of such that
[TABLE]
Moreover, we can clearly take with since is the direct sum of the span of and its orthogonal complement and . The assumed SSOSCdirPQN() (15) implies the existence of with , a contradiction. This completes the proof. ∎
Remark 3.20**.**
Note that includes the condition .
As before, we will refer to SOSCdirPQN() and SOSCdirPQN() as second-order sufficient condition for directional pseudo/quasi-normality (SOSCdirPN and SOSCdirQN).
The following directional counterpart of Theorem 3.14 follows by the same arguments.
Theorem 3.21**.**
Let be feasible for (1) with twice differentiable at and consider two multi-indices with . Then SOSCdirPQN() implies SOSCdirPQN(). In particular, we have SOSCdirPN SOSCdirPQN() SOSCdirQN.
We point out here that, unlike in the non-directional case, we could not find an example to show that the above implications can be indeed strict, so this remains an open question.
3.4 Summary
We now summarize our findings of this section. We studied the directional versions of pseudo- and quasi-normality, first established in the paper by Bai et al. [3]. In addition, we introduced the new concept of PQ-normality, together with its directional counterpart, that unifies the two standard CQs. As a result, we obtained novel and improved results for the metric subregularity constraint qualification and we established interesting connections among the well-known CQs and the new ones.
In the following diagram, we summarize the relations between the various constraint qualifications weaker than GMFCQ that imply MSCQ. The point-based conditions are naturally of primary interest and are hence emphasized in double-framed boxes. Note that pseudo- and quasi-normality are included as special cases of PQ-normality for and .
4 Programs with disjunctive constraints
In this section we study a special case of problem (1) in which the set is disjunctive, that is it can be written as a union of finitely many convex polyhedra, i.e.,
[TABLE]
where we refer the reader to Section 4.1 for a definition of convex polyhedral sets. Subsequently, we call problem (1) with disjunctive (in the sense of (18)) as a (mathematical) program with disjunctive constraints or simply a disjunctive program.
Disjunctive programs have been systematically studied for decades, see, e.g., [58] and the references therein. For more recent works on disjunctive programs, which are also more related to our approach, we refer to the papers [5, 24, 27, 50] and the thesis [6].
The most prominent examples of disjunctive programs are the aforementioned classes of MPCCs, MPVCs, as well as mathematical programs with relaxed cardinality constraints (MPrCCs), mathematical programs with relaxed probabilistic constraints (MPrPCs), and the recently introduced mathematical programs with switching constraints (MPSCs).
For the mathematical background and several applications we refer the reader to the textbooks [49, 53] for MPCCs as well as to the book [19] on the closely related class of bilevel programs. As for MPVCs we refer to the paper [1] and the thesis [37] and the references therein. For relaxed cardinality constrained problems we point to the papers [14, 17]. For MPrPCs see [2], and for MPSCs see [51].
Dropping standard constraints for brevity, all of these programs exhibit the general form
[TABLE]
where are continuously differentiable, is a finite index set and is given by
- (a)
(complementarity constraints)
[TABLE]
- (b)
(vanishing constraints)
[TABLE]
- (c)
(relaxed cardinality constraints)
[TABLE]
- (d)
(relaxed probabilistic constraints)
[TABLE]
- (e)
(switching constraints)
[TABLE]
Clearly, , , , and are disjunctive, rendering the resulting optimization problem a disjunctive program. We point out that there is generally not a unique way to write the disjunctive sets in (a)-(e) as a union of convex polyhedral sets. For instance, can be alternatively written as .
The main finding of this section is to show that the crucial Assumption 3.7 is automatically fulfilled for disjunctive programs. In addition, we also prove that directional pseudo-normality does not only imply, but is, in fact, equivalent to its simplified form from Theorem 3.17, which suggests that our sufficient conditions are not too restrictive. Recall that Example 3.18 shows that, in general, the simplified form is strictly stronger. For these purposes, we commence our study with a preliminary section on the variational geometry of convex polyhedral sets and how these extend to a more general setting.
4.1 Key properties of convex polyhedral sets
Recall that a set is said to be convex polyhedral (or a convex polyhedron) if it is the intersection of finitely many closed half-spaces. In particular, for a convex polyhedron there exist and such that
[TABLE]
Clearly, every convex polyhedron is closed. Due to convexity of , the regular and limiting normal cone to coincide with the classical normal cone of convex analysis, see (3). Given , we have
[TABLE]
where , i.e., the normal cone of at is the convex cone generated by , see e.g. [36, p. 67]. Therefore, there is only a finite number of different normal cones induced by a convex polyhedral set, in fact, this number is bounded by (as there can be at most active sets in ).
We will make use of two essential properties of convex polyhedra. The first one is the well-known exactness of tangent approximation, see [56, Exercise 6.47]: Given a convex polyhedron , for any there exists a neighborhood of such that
[TABLE]
In particular, taking into account [56, Exercise 6.44], one has
[TABLE]
The second property is closely related to Assumption 3.7 as stated in the following lemma.
Lemma 4.1**.**
Let be closed and convex, let and . Then there exists a subsequence such that the following hold:
- (i)
We have for all ;
- (ii)
Moreover, if is polyhedral then for all .
Proof.
(i) Taking the limit in yields . In particular, as we get .
(ii) Recall from the discussion above, that for a convex polyhedral set there are only finitely many different normal cones. Hence, there exists a subsequence such that for all and some closed convex cone . Consequently, from we obtain and hence due to convexity of and . ∎
The above lemma immediately yields that Assumption 3.7 for the multi-index is fulfilled at every feasible point for program (1) with convex polyhedral , regardless of the constraint mapping . However, since we are not primarily interested in this convex polyhedral setting, we now state the desirable properties from (20) and Lemma 4.1 (ii) in a general form. To this end, given an arbitrary closed set and , consider the following condition:
[TABLE]
where denotes a neighborhood of . Moreover, given also a multi-index with and , consider the condition:
[TABLE]
where is a subsequence of . Note that (P2) is automatically fulfilled if . We will repeatedly refer to these conditions in the subsequent study and hence we formulated it for an arbitrary multi-index . Clearly, if is feasible for (1) and satisfies (P2) for , and every multiplier , then Assumption 3.7 for is fulfilled at .
Motivated by the disjunctive setting in (18), for the remainder of our study we deal with sets generated by unions and, in addition, Cartesian products of convex polyhedra (see the product setting in Section 5). Hence, we now examine properties (P1) and (P2) under these set operations on convex polyhedra.
Consider first a collection of closed sets for and set . We start with some elementary observations about tangent and normal cones. To this end, for , let us denote and observe that, by the definition of the tangent cone, we have
[TABLE]
hence, by polarization
[TABLE]
This yields the following elementary estimate
[TABLE]
which can be derived, e.g., from the more general result [7, Proposition 3.1].
On the other hand, consider now , where is closed for and let . By [56, Proposition 6.41], we have
[TABLE]
Note that for the tangent cones, [56, Proposition 6.41] in general yields only the inclusion . It can be easily seen, however, that
[TABLE]
holds, provided satisfies (P1) at for all . Indeed, for we readily obtain from (P1) for every the existence of such that holds for all . Taking yields for all and follows.
Next we show that conditions (P1) and (P2) are preserved under unions and products, provided the obvious adjustments of multi-index, point and multiplier are made if needed.
Proposition 4.2**.**
Let with closed and let .
- (i)
If satisfies (P1) at for all , then also satisfies (P1) at .
- (ii)
If satisfies (P2) for some multi-index , the point and some for all , then also satisfies (P2) for , and .
Proof.
Denoting for the neighborhoods given by the assumption (i) and taking into account (21), the first statement follows easily by setting , where is a neighborhood of such that . Clearly, the existence of is guaranteed by the closedness of .
In order to prove (ii), consider sequences and . From (22), closedness of and finiteness of one easily obtains the existence of and a subsequence such that
[TABLE]
The assumption now yields the existence of a subsequence such that for and . ∎
Recall that if for some , then automatically satisfies (P2).
Proposition 4.3**.**
Let with closed and .
- (i)
If satisfies (P1) at for all , then satisfies (P1) at .
- (ii)
If satisfies (P2) for multi-index with , the point and for all , then satisfies (P2) for , and .
Proof.
Denoting by the neighborhoods given by the assumption in (i), the first statement follows by simply setting and applying (25).
In order to prove (ii), consider sequences and . By (24), we have for every and . By assumption, there exists a subsequence with . Consequently, by assumption, there exists a subsequence such that . Repeating this argument another times, we find that there exists a subsequence such that for all . This proves the statement. ∎
We conclude this subsection by showing that the program (1), with satisfying properties (P1) and (P2), automatically satisfies the crucial Assumption 3.7, and in addition, that directional PQ-normality is equivalent to its simplified counterpart in this case. We point out that this result is the very foundation for all remaining results of the paper.
Proposition 4.4**.**
Let be feasible for (1) with closed and satisfying (P1) at as well as (P2) for some multi-index , the point and every multiplier . Then Assumption 3.7 for is fulfilled at and, moreover, (directional) PQ-normality w.r.t. at is equivalent to its simplified form (12) from Theorem 3.8 (Theorem 3.17).
Proof.
Assumption 3.7 for at follows from (P2) for with at . Hence, the statement for the nondirectional version follows from Theorem 3.8. Similarly, the implication from the directional simplified form to directional PQ-normality follows from Theorem 3.17.
It remains to show that PQ-normality w.r.t. in direction implies its simplified form. We do this by contraposition, so let us assume that there exists and such that and
[TABLE]
By the definition of the directional normal cone, there exists and as well as . Taking into account (P1) together with [56, Exercise 6.44] we obtain
[TABLE]
for any sufficiently small. Hence by setting we conclude . Moreover, (P2) for yields that, by passing to a subsequence if necessary, we may take such that , for all and . Consequently, we obtain
[TABLE]
Finally, , showing the violation of PQ-normality w.r.t. in direction and the proof is complete. ∎
4.2 Pseudo-normality for disjunctive programs
The desired results for the disjunctive setting (18) can be viewed as a corollary of our analysis in Section 4.1. Indeed, Lemma 4.1 and Proposition 4.2 yield that a disjunctive set satisfies properties (P1) and (P2) for the multi-index . In particular, due to (P1), the endeavor of computing the normal cone to disjunctive at some point can be reduced to computing the normal cone to a union of finitely many polyhedral cones at zero, i.e.,
[TABLE]
see [35, p. 59]. More importantly, the following corollary is a consequence of Proposition 4.4.
Corollary 4.5**.**
Let be disjunctive in the sense of (18). Then satisfies (P1) at every point as well as (P2) for the multi-index at every point and every . In particular, Assumption 3.7 for is fulfilled at every feasible point for disjunctive programs. Moreover, (directional) pseudo-normality at is equivalent to its simplified form: (for any with ) there is no nonzero () such that there exists a sequence (with ) fulfilling
[TABLE]
We emphasize that Corollary 4.5 clarifies that the various definitions of pseudo-normality used in the literature stem from the same concept. In the general setting (1), pseudo-normality contains the additional sequence , but in the special cases of disjunctive programs it reduces to the simplified version without .
Corollary 4.5 also allows us to use all the sufficient conditions for pseudo-normality, hence also for MSCQ, studied in Section 3. These conditions now take on simpler forms since the vector optimization techniques reduce to standard optimization in the disjunctive setting. This can be seen from (26), which yields that pseudo-normality of is equivalent to being a local maximizer of for all , cf. Theorem 3.9. In particular, the second-order sufficient conditions from Corollary 3.10 and Proposition 3.19 read as follows.
Corollary 4.6**.**
Let be feasible for (1) with disjunctive and twice differentiable at . Consider the following two conditions:
- (i)
second-order sufficient condition for pseudo-normality (SOSCPN): For every and every one has
[TABLE]
- (ii)
second-order sufficient condition for directional pseudo-normality (SOSCdirPN): For every with and every one has (27).
Then condition (i) (condition (ii)) implies (directional) pseudo-normality at . In particular, either of the two conditions implies MSCQ at .
Clearly, an affine can never fulfill the strict inequality of SOSCPN. The required maximality of expressed in (26) can be secured nonetheless.
Corollary 4.7**.**
Let be feasible for (1) with disjunctive. If is affine then pseudo-normality, and consequently also MSCQ, holds at .
Proof.
For affine we have for all . Hence, taking into account we find that
[TABLE]
showing that is a local maximizer of and pseudo-normality thus follows. ∎
We point out that the sufficiency of SOSCdirPN for MSCQ established in Corollary 4.6 corresponds to the sufficiency of Gfrerer’s SOSCMS for MSCQ (Proposition 2.7 (iii)). In turn, Corollary 4.7 corresponds to Robinson’s result (Proposition 2.7 (iv)). Hence, by employing the notion of (directional) pseudo-normality and its sufficiency for MSCQ, we found new proofs for these interesting results. Moreover, the notion of directional quasi-normality unifies all sufficient conditions for MSCQ from Proposition 2.7.
Note that the analogous results were obtained also in [3, Theorem 4.1., Proposition 4.2.]. What was not noticed there, however, is the underlying maximality principle (26), which provides a nice understanding and makes things much simpler. In particular, it enables us to extend the above results by means of higher-order analysis.
4.3 Higher-order conditions
In order to proceed, we rely once more on the notion of multi-indices. First, we introduce the following standard notation: Given and we set
[TABLE]
Given a function , -times differentiable at , and with we set
[TABLE]
Corollary 4.8**.**
Let be feasible for a disjunctive program with -times differentiable at . Consider the following two conditions:
- (i)
for every , , and all one has
[TABLE]
- (ii)
for every with , , and all , where denotes a neighbourhood of , one has (28).
Then condition (i) (condition (ii)) implies (directional) pseudo-normality at . In particular, either of the two conditions implies MSCQ at .
Proof.
Both statements follows from the same arguments, namely, given and and setting , Taylor expansion together with (28) yield
[TABLE]
∎
Similarly as in the case of affine , the strict inequality of the above higher-order sufficient conditions does not have to be fulfilled, as long as has polynomial structure, i.e., for every and every we have
[TABLE]
for some , denoting the degree of , and . We point out that one actually has and (29) can be equivalently rewritten as
[TABLE]
for arbitrary .
Corollary 4.9**.**
Let be feasible for a disjunctive program with being polynomial of degree , i.e., given by (29). Consider the following two conditions:
- (i)
for every , and for all one has
[TABLE]
- (ii)
for every with , , and all , where denotes a neighbourhood of , one has (31).
Then condition (i) (condition (ii)) implies (directional) pseudo-normality at . In particular, either of the two conditions implies MSCQ at .
Proof.
Denoting and taking into account (30), for any , one has
[TABLE]
for every . Hence, given and , both statements follows from (31) since
[TABLE]
∎
Of course the above higher-order conditions are sufficient for pseudo-normality and MSCQ also for general programs (1) fulfilling Assumption 3.7 for .
4.4 Summary and example for the disjunctive case
For the sake of completeness, we summarize the sufficient conditions for pseudo-normality and MSCQ in the disjunctive setting in the following theorem.
Theorem 4.10** (Sufficient conditions for pseudo-normality and MSCQ).**
Consider (1) with disjunctive in the sense of (18) and a feasible point . Then any of the conditions from Corollaries 4.5, 4.6, 4.7, 4.8 and 4.9 implies (directional) pseudo-normality and MSCQ at .
The following parametric example demonstrates the usefulness of our conditions based on pseudo-normality.
Example 4.11**.**
Let be given by , defined by for some parameters and let . Clearly,
[TABLE]
and for any . Note also that for all directions with since and . Moreover, observe that
The most crucial parameter is . Indeed, if , then locally around , the set is the singleton , and thus it can be seen that sequence shows violation of MSCQ. On the other hand, if , MSCQ holds and can be verified by SOSCMS. Hence, suppose now that .
Next, let us look into parameter . If , again shows violation of MSCQ regardless of other parameters. Note that if , the sequence satisfies , but for we get
[TABLE]
for sufficiently large , showing violation of pseudo-normality in direction , hence we cannot use any of the stronger conditions to verify MSCQ. Clearly, a similar problem occurs if .
We conjecture that MSCQ holds in this case, but as the direct proof appears fairly technical and since for our purposes it is more interesting to see the limitations of sufficient conditions in this case (rather than determine if MSCQ holds), we skip the details for and only prove MSCQ in the simpler case below.
Let us mention, however, that if , we may use the directional version of the fourth-order sufficient condition based on Corollary 4.8 (ii) to verify MSCQ, even if .
Next, we prove that MSCQ holds if , regardless of parameters and . Since the feasible set , locally around , equals , we get for any close enough to . On the other hand, for with and we clearly have . Given , let be sufficiently close to [math] so that . One computes that
[TABLE]
Thus, setting yields
[TABLE]
for all close enough to , and hence MSCQ follows.
In order to better illustrate the results of this paper, in the following tables corresponding to and , respectively, we provide sufficient conditions ensuring MSCQ for given parameters. Recall from above that, for and , pseudo-normality-based conditions are not applicable, and thus we restrict ourselves to the case . As mentioned above, the case can be handled by the directional fourth-order sufficient condition while for the case we provided the direct proof. We point out, however, that in both cases, unless , one can use also other sufficient conditions as indicated in the table. In particular, one can see the meaning of parameter , which does not seem to influence the validity of MSCQ, but it influences which sufficient conditions can be invoked to verify it. Note also that if , depending on other parameters, conditions other that SOSCMS can be used as well. Nevertheless, the only condition that can never be used in case is SOSCPN, which is applicable if . Hence, we further detail only the case .
To illustrate the difference between directional and non-directional approach, observe that the mildest non-directional sufficient condition for MSCQ, pseudo-normality, characterized by the maximality condition (26), is satisfied if and only if and provided and if . On the other hand, on top of the above situations, directional pseudo-normality can be applied whenever or also in case and .
The power of our new sufficient conditions is nicely demonstrated for , when Gfrerer’s SOSCMS can never be used. Similarly, Robinson’s result can not be applied unless all the parameters are zero.
5 Disjunctive programs with product structures
The simplified form of quasi-normality is not sufficient for metric subregularity even in case the set under consideration is a general convex polyhedral set, see Example 3.16. On the other hand, we realize that the set in all cases (19) (a)-(e) is a union of products of closed intervals. This additional product structure motivates our study of ortho-disjunctive programs in Section 5.1, which enables us to recover and extend several known quasi-normality results for MPCCs and MPVCs and obtain new corresponding results for MPSCs, MPrCCs and MPrPCs.
In order to clarify the role of product structures in a broader context, consider first an instance of GMP (1), where
[TABLE]
for some multi-index with and , i.e., is the Cartesian product of disjunctive sets. Note that all the prototypical disjunctive programs from (19) (a)-(e) exhibit such “outer” product structure.
We emphasize that given by (32) is still a disjunctive set in the sense of (18). Indeed, denoting , for the set is convex polyhedral and Regardless, it turns out to be advantageous to exploit the underlying product structure of rather than just treating as a disjunctive set. One of the reasons is that we deal with the unions of only sets, which is typically a small number ( for all in all cases (19) (a)-(e)), instead of dealing with the union of sets. We point out that the newly developed concept of -stationarity from [4, 5] takes advantage of this observation.
On the basis of Propositions 4.2 and 4.3 we readily infer that, on top of property (P1), given by (32) satisfies also (P2) for multi-index . Proposition 4.4 thus yields that in this case the (directional) PQ-normality w.r.t. coincides with its simplified form. In particular, standard NLPs, where for some , fit into (32) with the multi-index and hence we can readily handle quasi-normality for NLPs with ease.
Utilizing the “outer” product structure on its own, however, does not enable one to analyze the quasi-normality for programs from (19) (a)-(e), where the factors are two-dimensional. To overcome this, consider the GMP (1) with the “inner” product structure, i.e., where
[TABLE]
for some multi-index .
By the same arguments as before, again satisfies (P2) for and PQ-normality w.r.t. attains the simplified form. Moreover, the choice of multi-index now offers richer setting.
5.1 Ortho-disjunctive constraints and quasi-normality
Motivated by the above discussion, we now introduce the new subclass of disjunctive programs containing the “inner” product structure with one-dimensional factors. To this end, consider the mathematical program of the form
[TABLE]
where , with and we also allow symbols and to include unbounded intervals. Note that we do not work with extended real numbers, i.e., given , stands for . This simply means that is a product of closed convex subsets of , i.e., closed intervals. We refer to such sets as ortho-disjunctive and to such programs as mathematical programs with ortho-disjunctive constraints or briefly ortho-disjunctive programs.
Naturally, one can combine the “inner” and “outer” products and consider the Cartesian product of ortho-disjunctive sets, a setting that indeed fits the problem class (19) best. As before, it can be easily shown that such sets are still ortho-disjunctive. Moreover, only the “inner” products are important for our remaining analysis. Hence, we proceed without the “outer” product, which is also more consistent with the notion of disjunctive sets.
On the basis of Propositions 4.2, 4.3 and 4.4 we obtain the following analogon of Corollary 4.5.
Corollary 5.1**.**
Set given by (34) satisfies (P1) at every point as well as (P2) for multi-index at every and every . In particular, for ortho-disjunctive program (34), Assumption 3.7 for is fulfilled at every feasible point and, moreover, the (directional) quasi-normality at is equivalent to its simplified form: (for any ) there is no nonzero () such that there exists a sequence (with ) fulfilling
[TABLE]
Just as in the case of pseudo-normality, cf. the comments after Corollary 4.5, we have now clarified that, in fact, there is only one concept of quasi-normality which, in general, contains the additional sequence , but in special cases, such as NLPs or MPCCs, simplifies to the known versions without . Moreover, the above corollary provides the definition of quasi-normality for all other ortho-disjunctive programs.
Before we state the main result of this subsection that parallels Theorem 4.10 for pseudo-normality, we write down explicitly the conditions from Theorem 3.9, Proposition 3.19 and Corollary 3.10 for multi-index corresponding to quasi-normality.
Given , from (14) reads as
[TABLE]
Moreover, assuming that is twice differentiable at , the second-order sufficient conditions from Corollary 3.10 and Proposition 3.19, respectively, read as follows:
- •
Second-order sufficient condition for quasi-normality (SOSCQN): For every , every with for all and every with one has
[TABLE]
- •
Second-order sufficient condition for directional quasi-normality (SOSCdirQN): For every with , every with for all and every with one has (37).
Moreover, for a closed interval and we have
[TABLE]
where and denotes the negative and the positive part of any number , respectively, extended to symbols by the natural convention . Thus, depending on which norm we consider for the products, the penalty function now reads as
[TABLE]
Theorem 5.2** (Sufficient conditions for quasi-normality and MSCQ).**
Consider an ortho-disjunctive program (34) and a feasible point . Then each of the following conditions implies (directional) quasi-normality and MSCQ at : (i) the weak efficiency of for from (36), (ii) SOSCQN from (37) (SOSCdirQN).
Let us briefly comment on the importance of the previous theorem (together with Corollary 5.1). First, consider only the statement that the (simplified form of) quasi-normality (35) implies MSCQ and hence M-stationarity and exactness of the penalty function (5.1) at local minimizers. For MPCCs, we thus recover the following results: [45, Theorem 3.3] (quasi-normality implies M-stationarity), [45, Lemma 4.3 and 4.4] (pseudo-normality implies MSCQ), [45, Theorem 4.5 and Corollary 4.6] (pseudo-normality implies exactness of and penalty function), as well as [60, Theorem 3.1] (quasi-normality implies MSCQ). Similarly, for MPVCs we recover and improve [41, Theorem 3.1] (pseudo-normality implies exactness of the penalty function) and the fact that quasi-normality implies M-stationarity, which is not stated in the paper, but follows directly from [41, Theorem 2.1 and Definition 2.3]. Moreover, to the best of our knowledge, pseudo- and quasi-normality were not yet introduced for MPSCs, MPrCCs and MPrPCs and all our results are hence new when applied to these problem classes.
Second, we also provide verifiable sufficient conditions for quasi-normality, together with sufficient conditions for pseudo-normality (higher-order conditions, polynomiality of ) from Section 4, which enhances the applicability of our results.
Finally, we open a path for a refined analysis using directional quasi-normality as well as all the corresponding sufficient conditions (SOSCdirQN etc.).
In order to illuminate and compare our results with the literature, we conclude this section with application to MPCCs. The same exercise could be executed for other classes (19) (b)-(e). Recall that, omitting standard equality and inequality constraints, an MPCC is given as
[TABLE]
The constraints of MPCCs fit the general setting with , and , where is clearly ortho-disjunctive. As we mentioned, itself is also ortho-disjunctive, but we choose to rather keep the “outer” product as well, noting that the impact is only visible at the penalty function. We point out that the standard approach to MPCCs is to consider and in order to work with nonnegative signs of certain multipliers, while in our case we obtain the opposite sign restrictions.
A simple computation yields that for we have
[TABLE]
Hence, denoting
[TABLE]
for some feasible point , we conclude that if and only if
[TABLE]
Consequently, Corollary 5.1 yields that satisfies quasi-normality provided there is no nonzero fulfilling
[TABLE]
together with (39) such that there exists a sequence with
[TABLE]
On the other hand, is M-stationary provided there exists satisfying (39) and
[TABLE]
Moreover, using first the -norm to handle the “outer” product we get
[TABLE]
Next, using the -norm for the “inner” product, for arbitrary we have . Note that this agrees with the corresponding expression from (5.1), which reads as \min\big{\{}\max\{(G)^{-},|H|\},\max\{|G|,(H)^{-}\}\big{\}}. Consequently, we obtain
[TABLE]
Conclusion
Building on recently developed directional techniques from variational analysis, this paper contains a complex and self-contained study of the metric subregularity constraint qualification (MSCQ) for broad classes of nonconvex optimization problems including, most importantly, disjunctive programs. Our findings reveal a common denominator of several prominent sufficient conditions for MSCQ occurring in the literature. Thus, our study improves understanding of these seemingly independent approaches and provides an additional insight. Moreover, it offers a wider spectrum of sufficient conditions for MSCQ, including point-based ones, and consequently also improves existing sufficient conditions. Furthermore, by introducing the new notion of ortho-disjunctive programs we established an appropriate framework for a unified study of several nonconvex optimization problems such as mathematical programs with complementarity, vanishing or switching constraints. These ortho-disjunctive programs hence provide an intriguing area for future research.
Acknowledgments
The authors also thank two anonymous referees for their comments which helped improve the presentation of the material.
Dedication
The authors would like to dedicate this paper to Helmut Gfrerer in honor of his 60th birthday.
Funding
The research of the first author was supported by the Austrian Science Fund (FWF) under grant P29190-N32. The work on the revised version was supported by the FWF grant P32832-N. The research of the second author was supported by the Grant Agency of the Czech Republic (Grant No. 18-04145S). Part of this work was done while the second author was visiting McGill University, partially supported by H2020-MSCA-RISE project GEMCLIME-2020 under GA No. 681228. The research of the third author was supported by an NSERC discovery grant.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Achtziger, W., Kanzow, C.: Mathematical programs with vanishing constraints: Optimality conditions and constraint qualifications. Math. Program. 114 , 69–99 (2008)
- 2[2] Adam, L., Branda, M.: Nonlinear Chance Constrained Problems: Optimality Conditions, Regularization and Solvers. J. Optim. Theory Appl. 170 (2), 419–436 (2016)
- 3[3] Bai, K., Ye, J. J., Zhang, J.: Directional quasi-/pseudo-normality as sufficient conditions for metric subregularity. SIAM J. Optim., 29 (4), 2625–2649 (2019)
- 4[4] Benko, M., Gfrerer, H.: On estimating the regular normal cone to constraint systems and stationary conditions. Optimization 66 , 61–92 (2017)
- 5[5] Benko, M., Gfrerer, H.: New verifiable stationary concepts for a class of mathematical programs with disjunctive constraints. Optimization 67 , 1–23 (2018)
- 6[6] Benko, M.: Numerical methods for mathematical programs with disjunctive constraints. Ph D-thesis, Univ. Linz (2016).
- 7[7] Benko, M., Gfrerer, H., Outrata, J. V.: Calculus for directional limiting normal cones and subdifferentials. Set-Valued Var. Anal., 27 (3), 713–745 (2019)
- 8[8] Benko, M., Gfrerer, H., Mordukhovich, B. S.: Characterizations of tilt-stable minimizers in second-order cone programming. SIAM J. Optim., 29 (4), 3100–3130 (2019)
