Intersection disjunctions for reverse convex sets
Eli Towle, James Luedtke

TL;DR
This paper introduces a new framework for generating valid inequalities for reverse convex sets, extending intersection cuts to intersection disjunctions and multi-term disjunctions, applicable in optimization models like bilevel and polynomial optimization.
Contribution
It develops a generalized approach to derive valid inequalities from solutions outside the convex set, expanding the classical intersection cut methodology with multi-term disjunctions.
Findings
Introduces intersection disjunctions for reverse convex sets.
Generalizes to multi-term disjunctions using recession directions.
Provides a cut-generating linear program for valid inequalities.
Abstract
We present a framework to obtain valid inequalities for a reverse convex set: the set of points in a polyhedron that lie outside a given open convex set. Reverse convex sets arise in many models, including bilevel optimization and polynomial optimization. An intersection cut is a well-known valid inequality for a reverse convex set that is generated from a basic solution that lies within the convex set. We introduce a framework for deriving valid inequalities for the reverse convex set from basic solutions that lie outside the convex set. We first propose an extension to intersection cuts that defines a two-term disjunction for a reverse convex set, which we refer to as an intersection disjunction. Next, we generalize this analysis to a multi-term disjunction by considering the convex set's recession directions. These disjunctions can be used in a cut-generating linear program to obtain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Intersection disjunctions for reverse convex sets
Eli Towle [email protected]
James Luedtke [email protected]
(Department of Industrial and Systems Engineering, University of Wisconsin – Madison)
Abstract
We present a framework to obtain valid inequalities for a reverse convex set: the set of points in a polyhedron that lie outside a given open convex set. Reverse convex sets arise in many models, including bilevel optimization and polynomial optimization. An intersection cut is a well-known valid inequality for a reverse convex set that is generated from a basic solution that lies within the convex set. We introduce a framework for deriving valid inequalities for the reverse convex set from basic solutions that lie outside the convex set. We first propose an extension to intersection cuts that defines a two-term disjunction for a reverse convex set, which we refer to as an intersection disjunction. Next, we generalize this analysis to a multi-term disjunction by considering the convex set’s recession directions. These disjunctions can be used in a cut-generating linear program to obtain valid inequalities for the reverse convex set.
Keywords: Mixed-integer nonlinear programming; valid inequalities; reverse convex sets; disjunctive programming; intersection cuts
- Acknowledgments
This material is based upon work supported by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing Research (ASCR) under Contract DE-AC02-06CH11357. The authors acknowledge partial support through NSF grant SES-1422768.
1 Introduction
A reverse convex set is a set of the form , where is a polyhedron and is an open convex set. This is a general set structure arising in the context of mixed-integer nonlinear programming (MINLP). In this setting, is a linear programming relaxation of the MINLP feasible region, and contains no solutions feasible to the problem. We are motivated by cases where is either non-polyhedral or is defined by a large number of linear inequalities, because if is a polyhedron defined by a small number of inequalities, we can optimize and separate over efficiently using the disjunctive programming techniques of Balas [3, 4]. We study valid inequalities for reverse convex sets. These inequalities can be used to strengthen the convex relaxation of any problem for which an open convex set containing no feasible points can be identified; such sets are known as convex -free sets (e.g., Conforti et al. [13]).
Intersection cuts are valid inequalities for . Intersection cuts were introduced in the context of concave minimization by Tuy [44] and later by Balas [1] for integer programming. The inequalities of Tuy [44] are often referred to as “concavity cuts” or “-valid” cuts. For ease of exposition, we refer to such inequalities as “intersection cuts” due to their similarity to the inequalities of Balas [1]. An intersection cut is generated from a basic solution of that satisfies . A basic solution of corresponding to basis forms the apex of a translated simplicial cone defining a relaxation of . For each extreme ray of this cone, a point on that intersects the extreme ray is found. A hyperplane is formed, such that the hyperplane passes through all of these points and satisfies . The intersection cut is valid for . For a detailed review of intersection cuts, see Section 2.
Our main contribution in this work is to show how valid inequalities for can be obtained from a basic solution of in the case where . Because , intersection cuts generated using the cone are not valid or even well-defined in general. However, under the assumption that each extreme ray of that does not intersect lies in the recession cone of , we present two linear inequalities that form a two-term disjunction that contains . If intersected with one of these inequalities is empty, the inequality defining the other disjunctive term is valid for . We call inequalities obtained in this manner external intersection cuts. If both disjunctive terms are nonempty, we can generate valid inequalities for using the standard cut-generating linear program (CGLP) for disjunctive programming of Balas [3, 4]. We refer to these disjunctions as intersection disjunctions.
Glover [21] exploits the recession structure of to strengthen the intersection cut. This motivates us to extend our analysis by considering , the recession cone of . We present a relaxation of that incorporates the structure of . We provide a class of valid inequalities for the relaxation, the size of which grows exponentially with the dimension of the problem. We derive a polynomial-size extended formulation that captures the full strength of this exponential family of inequalities. We then prove that the proposed relaxation of is equivalent to the union of at most possibly nonconvex sets, thereby forming a disjunction that contains the reverse convex set. Under some assumptions, we propose a polyhedral relaxation of each disjunctive term individually. Given these polyhedral relaxations, we can use a CGLP to generate disjunctive cuts for .
This paper is organized as follows. In Section 1.1, we review related literature. In Section 1.2, we provide motivating examples of reverse convex sets. In Section 2, we review intersection cuts. In Section 3, we present a two-term disjunction that contains and is generated by basic solutions of that lie outside of . In Section 4, we extend this analysis by presenting a multi-term disjunction for by considering . We propose extended formulations that can be used to define polyhedral relaxations of the disjunctive terms.
1.1 Related literature
The problem of optimizing a linear function over a reverse convex set is known as linear reverse convex programming (LRCP). Tuy [46] shows that any convex program with multiple reverse convex constraints can be reduced to one with a single reverse convex constraint with the introduction of an additional variable and an additional convex constraint. By reduction from a concave minimization problem, optimizing a linear function over a reverse convex set is NP-hard. Matsui [35] shows that this holds even in special cases restricting the structure of the linear constraints or the convex set . Reverse convex optimization problems were first studied from a global optimality perspective in the 1970s (e.g., Bansal and Jacobsen [6, 7], Hillestad [28], and Ueing [47]). Hillestad and Jacobsen [29, 30] presented one of the first cutting plane algorithms for LRCP, though it does not always converge to an optimal solution (e.g., Gurlitz [25]). Numerous algorithms for solving LRCP have been developed. Gurlitz and Jacobsen [26] present a partial enumeration procedure, Thuong and Tuy [43] propose sequentially solving concave minimization problems, and Fülöp [20] proposes a cutting plane algorithm that cuts off edges of the polyhedron that do not contain an optimal solution. Ben Saad and Jacobsen [10] present a cutting plane algorithm to solve LRCP based on level sets, but later showed that it does not converge to a globally optimal solution [11]. Branch-and-bound methods from concave minimization literature have also been adapted to solve reverse convex optimization problems (e.g., Horst [31], Horst et al. [32], Muu [37], and Ueing [47]).
Hillestad and Jacobsen [30] define the concept of a basic solution for LRCP. They show that the convex hull of the feasible region of LRCP is a polytope if the linear constraints form a polytope and the functions defining the reverse convex constraints are differentiable. Sen and Sherali [42] extend this result, showing that the closure of the convex hull of any polyhedron intersected with a finite number of reverse convex constraints is a polyhedron.
In cutting plane algorithms for MINLP problems, intersection cuts may be constructed from the basis corresponding to an optimal solution to an LP relaxation of the problem. Infeasible basic solutions of within are also candidates for generating intersection cuts for and may yield intersection cuts that are not dominated by those generated from feasible basic solutions. Gomory mixed-integer (GMI) cuts for mixed-integer linear programming behave similarly. In particular, Nemhauser and Wolsey [38] showed that the intersection of GMI cuts from all basic solutions is equivalent to the split closure. This is not true when considering only GMI cuts from basic feasible solutions (e.g., Cornuéjols and Li [15]).
Intersection cuts can be generated from any convex set that does not contain feasible points in its interior. In integer programming, these are maximal lattice-free convex sets. More generally, these types of sets are convex -free sets. When considering a fixed basis, intersection cuts generated using a convex set can only be stronger than those produced using a subset of . Balas and Margot [5] generalize intersection cuts such that inequalities for can be obtained using a more general polyhedron rather than a translated simplicial cone. Glover [21] proposes improved intersection cuts for the special case where is a polyhedron. Intersection cuts omit from the inequality variables corresponding to extreme rays of that lie within the recession cone of . Glover [21] uses the polyhedron’s recession information to include these terms with negative coefficients, thereby strengthening the cut. A similar strengthening is proposed for polynomial optimization problems by Bienstock et al. [12].
The idea of improving the intersection cut by considering has been studied in the context of minimal valid functions. Dey and Wolsey [16] consider minimal valid functions for a polyhedral and note that the minimal valid function for is the uniquely defined intersection cut if and . Results from this work were extended by Basu et al. [8] and Basu et al. [9]. Fukasawa and Günlük [19] use the nonnegativity of integer variables to derive minimal valid inequalities for a mixed-integer set. These inequalities consider recession directions of a relevant convex lattice-free set and thus may contain negative variable coefficients.
Ultimately, standard approaches to generating intersection cuts for require a basic solution of that lies within the convex set . In this paper, we present a framework for constructing valid inequalities for using a basic solution that lies outside .
1.2 Motivating examples
In many MINLP problems, a reverse convex set can be derived via a problem reformulation. In this case, the polyhedron is a relaxation of the MINLP feasible region, and the set is an open convex set which is known to contain no solutions feasible to the MINLP. We can use the set to derive valid inequalities for the original problem. We motivate our study of reverse convex sets by showing how this structure appears in a variety of MINLP contexts. For all of the examples that follow, the closure of the set we derive is non-polyhedral, or possibly defined by a large number of linear inequalities.
One example of this reverse convex structure appears in difference of convex (DC) functions (e.g., Tuy [45]). A function is a DC function if there exist convex functions such that for all . A DC set can be written as
[TABLE]
Equivalently, we can write (1) as , where . The convex set contains no points feasible to . Hartman [27] shows the class of DC functions is broad, subsuming all twice continuously differentiable functions.
Reverse convex sets also appear in the context of polynomial optimization. Bienstock et al. [12] consider the set of symmetric matrices representable as the outer-product of a vector with itself: . Polynomial optimization problems can be reformulated to include the constraint that a square matrix of variables is outer-product representable. Bienstock et al. [12] construct non-polyhedral outer-product-free sets that do not contain any matrices representable as an outer-product of some vector, and as such are not feasible to the problem. Accordingly, they present families of cuts for , where is formed by the linear constraints of the problem reformulation. They characterize sets that are maximal outer-product-free, that is, not contained in any other outer-product-free sets. For the specific case of quadratically constrained programs (QCPs), Saxena et al. [40] use disjunctive programming techniques to derive valid inequalities for a reverse convex set in an extended variable space. In a companion paper, Saxena et al. [41] suggest the following eigen-reformulation of the quadratic constraint :
[TABLE]
where denote the eigenvalues of and the corresponding eigenvectors. The convex set does not contain any points feasible to QCP.
Reverse convex sets can also be used to define relaxations of bilevel optimization problems. Bilevel programs include constraints of the form , where is the value function of the lower-level problem for a fixed top-level decision :
[TABLE]
The function is convex. The set is defined by a reverse convex inequality and does not contain any points feasible to the bilevel program. The closure of this set is polyhedral, but may be defined by a large number of linear inequalities. Fischetti et al. [18] propose intersection cuts for a specific class of bilevel integer programming problems.
2 Intersection cut review
We briefly review intersection cuts, following the presentation of Conforti et al. [14]. Let be a matrix with full row rank and let . Let be a polyhedron. Let be an open convex set. We are interested in valid inequalities for the reverse convex set .
For a basis of , let be the nonbasic variables. For some and , we can rewrite as
[TABLE]
The basic solution corresponding to basis is , where if , and [math] if . By removing the nonnegativity constraints on variables , , we obtain , the cone admitted by the basis . The basic solution forms the apex of . There is an extreme ray of for each :
[TABLE]
The conic hull of the extreme rays forms the recession cone of . Together, the basic solution and these extreme rays provide a complete internal representation of , namely, .
Intersection cuts are valid inequalities for constructed from basic solutions of that lie within . These cuts are transitively valid for . Assume . For each , let be defined as
[TABLE]
The set is the set of points where the extreme rays of emanating from leave the set . Because is open, for all . If , lies in the recession cone of .
The following inequality is valid for (Balas [1]):
[TABLE]
We refer to (3) as the standard intersection cut. Here, and throughout the paper, we use the convention that .
Notation. Let be the extended real numbers. For a nonzero vector and , we define the line segment . Closed brackets (e.g., ) denote the inclusion of one or both endpoints of the line segment. The set is unbounded if and only if or . We remark that is equivalent to . However, we use the notation for consistency.
3 Intersection disjunctions and external intersection cuts
In this paper, we consider a fixed basis and corresponding basic solution . Let be defined as in Section 2. For the remainder of this paper, we assume the basic solution lies outside of . Recall is a translated simplicial cone with apex and linearly independent extreme rays . For all , let be defined as in (2), and let
[TABLE]
For , we use the convention and if the set does not intersect . If is polyhedral and intersects , and can be obtained by solving a linear program. If is non-polyhedral, a convex program may be required to obtain these parameters. An exception is the case where is bounded and a point in is known a priori, in which case a binary search can be performed to find the values of and .
We partition into the following three sets:
[TABLE]
For , the halfline does not intersect . Observe for .
Throughout Section 3, we make the following assumption.
Assumption 1**.**
It holds that for all .
We say a disjunction is valid for a set if the disjunction contains the set. Theorem 1 proposes a valid disjunction for .
Theorem 1**.**
Under Assumption 1, for every , either
[TABLE]
Proof.
If , then for all , and all trivially satisfy . We prove the result for . Assume satisfies and . We show .
Because and , there exists such that . For all , let . It holds that if and only if or . Therefore, . We write as
[TABLE]
Consider satisfying . If , then . Similarly, if , then . In both cases, .
By (5), is a convex combination of points in plus an element of . Thus, . ∎
Remark 1*.*
The two-term disjunction (4) can be used in a disjunctive framework to generate valid inequalities for . Specifically, the set is a subset of , where and . The sets and are polyhedral, because is a polyhedron and the inequalities (4) are linear. We can obtain valid inequalities for by generating valid inequalities for using the disjunctive programming approach of Balas [3, 4].
Remark 2*.*
We consider the relationship between the two-term disjunction (4) and the standard intersection cut. The two-term disjunction (4) assumes that the basic solution does not lie within . If , then (trivially, every extreme ray of emanating from intersects ) and for all . Because for all , the inequality of (4) is ill-defined. Instead, we can show that all points in lie in either or . However, because , we can conclude that the inequality is valid for . This is precisely the standard intersection cut of Balas [1].
Example 1*.*
Let and . Consider generated by the (only) basic solution of , . In this case, . The feasible region is the disconnected set shaded in Figure 1(a). The inequalities (4) form a valid disjunction for , shown in Figure 1(b).
Proposition 1 states that if is bounded, then the inequality defining each term of (4) is sufficient to define the convex hull of the points in satisfying that inequality. This is not true if the interior of is nonempty, as shown by Dey and Wolsey [16] for the standard intersection cut (3).
Proposition 1**.**
Under Assumption 1, if is bounded, then
[TABLE]
Proof.
We show only that , as the second statement can be shown using similar techniques. Under Assumption 1, bounded implies .
Because and the set is convex, . Next, let satisfy . Then
[TABLE]
Because for all , we have . For , let . For any , equals if , and [math] otherwise. Hence, for all , . Continuing from (6), we have . ∎
The disjunction presented in Theorem 1 can be particularly useful if is empty when intersected with one of the inequalities (4). In this case, the inequality defining the other disjunctive term is valid for .
Definition 1**.**
If , we refer to the inequality as an external intersection cut. We say the same for the inequality if .
External intersection cuts are valid for . We provide an example where intersection cuts are insufficient to define , but the facet-defining inequality for can be obtained from an external intersection cut. In order to derive the inequalities (4), we must first translate our polyhedral set to standard form, using additional slack variables as necessary. We then select a basis and calculate and for all . In this example and all that follow, we intentionally omit the intermediate steps required to obtain these inequalities, presenting them in the original variable space.
Example 2*.*
Let
[TABLE]
As can be seen in Figure 2(a), no standard intersection cut is able to generate the inequality that is facet-defining for . However, the basic solution corresponding to the constraints and generates this inequality as an external intersection cut. For this basic solution, the set is empty, implying that the inequality is valid for . Figure 2(b) shows the inequalities (4) for this example.
We note that in this example, there does exist an open convex set such that and the intersection cut defined by with respect to generates the facet-defining inequality for . For instance, one such set is . Methods for enlarging the set to generate intersection cuts stronger than those produced by are outside the scope of this work, though this topic has been studied by Balas [2].
Example 3*.*
Figure 3 depicts another example of an external intersection cut. The extreme rays of enter into the convex set and remain within on an unbounded interval, so . The external intersection cut is valid for .
Definition 2**.**
If and , we say the disjunction (4) is an intersection disjunction for .
If (4) is an intersection disjunction for , we can use a disjunctive CGLP to generate valid inequalities for using the techniques of Balas [3, 4].
We provide an example of why Assumption 1 is necessary for the validity of the two-term disjunction of Theorem 1.
Example 4*.*
Consider the reverse convex set shown in Figure 4(a). The extreme ray does not intersect the bounded convex set , so (4) is not a disjunction for . Figure 4(b) shows the same example but with the halfline added to the set . Because Assumption 1 holds, Theorem 1’s disjunction is valid for .
Our final example of this section motivates considering how the recession cone can be used to derive more general valid disjunctions for .
Example 5*.*
Let , and
[TABLE]
Figure 5 provides a graphical representation of , where is generated from the basic solution . Assumption 1 does not hold; namely, , but . However, there exists a valid two-term disjunction for that cannot be obtained with the theory of this section.
4 Valid inequalities and intersection disjunctions using
In this section, we generalize the results of Section 3 by considering the full recession cone of . In Section 4.1, we construct an inner approximation of and analyze its relationship to . We derive inequalities to define a polyhedral relaxation of in Section 4.2. In Section 4.3, we generalize the two-term disjunction of Theorem 1 to a multi-term disjunction that uses the recession cone of . We propose polyhedral relaxations of these disjunctive terms in Section 4.4.
4.1 An inner approximation of
Let . We define and as follows:
[TABLE]
Both and are subsets of . Additionally, we define and as follows:
[TABLE]
Note that . We derive inequalities valid for . We illustrate the sets and graphically in the example that follows.
Example 6*.*
Let and . Let be the basic solution of , corresponding to basis . The sets and are shown in Figure 6(a). Figures 6(b) and 6(c) show the sets and , respectively. Figure 7(a) shows the set . The set is depicted in Figure 7(b).
We motivate the study of and by showing that each set retains the strength of under the convex hull operator.
Theorem 2**.**
It holds that
[TABLE]
Proof.
Observe . By definition, , which implies
[TABLE]
To complete the proof, we show . Let . Assume , or we have nothing to prove. Because , we have , where for all . Let .
To begin, assume . Assume also that , otherwise is a convex combination of the points :
[TABLE]
Let . We rewrite as
[TABLE]
We have for all . Additionally, for all . For , , because and for a sufficiently large . The coefficients on the vectors are nonnegative and sum to one. Then .
Next, assume . Because for all , we have . Then there exists satisfying . Let . For all , it holds that . Furthermore, . It follows from the above analysis that . Consequently, .
Finally, assume . For , let be the following:
[TABLE]
Consider a fixed . It holds that , because
[TABLE]
It must be the case that . If not, we have for some , implying and contradicting .
It holds that , because the coefficients on the terms are nonnegative:
[TABLE]
Then we have . For a sufficiently large , , because . Because , it follows that . Let be defined as follows:
[TABLE]
For any , let be the following convex combination of and :
[TABLE]
We have , and for all . Then
[TABLE]
Rearranging the definition of from (8), we have . Thus,
[TABLE]
Theorem 2 supports our selection of as a relaxation of , as we do not lose anything when considering . For the remainder of Section 4, we make the following assumption.
Assumption 2**.**
The recession cone of is contained in the recession cone of .
If Assumption 2 does not hold, we can consider the convex set instead of . Indeed, . Our analysis only requires the set to be relatively open in , not necessarily open. By Corollary 1, replacing with does not change the strength of our relaxation of with respect to the convex hull operator.
Corollary 1**.**
It holds that .
Proof.
The statement follows directly from Theorem 2:
[TABLE]
If in Corollary 1 is replaced with , the statement is no longer true in general. This is relevant, because we present a valid disjunction for in Section 4.3. When we add the constraints of to the disjunctive formulation of , the cuts obtained from a CGLP could be stronger than those obtained from a CGLP built from a valid disjunction for . Thus, substituting with in order to satisfy Assumption 2 has the potential to weaken the generated disjunctive cuts.
4.2 Polyhedral relaxation of
We present valid inequalities for , where is fixed and
[TABLE]
In this section, we are interested in deriving valid inequalities for when , in which case . Because , inequalities valid for are also valid for . We consider the more general set to be able to apply this analysis in Section 4.4. Observe that . Furthermore, if and only if .
For , let be the following:
[TABLE]
It holds that if . In all other cases, is finite and its supremum is attained, because is a closed convex cone. The parameter depends on , but we suppress this dependence for notational simplicity.
Let be defined as follows:
[TABLE]
Let be the cone formed by extreme rays and (). The set is composed of indices corresponding to extreme rays of that exhibit the following property: for every , the cone contains a nontrivial element of (i.e., anything outside of ).
Proposition 2**.**
Let . For any , .
Proof.
If , then , and the point lies in . Assume . The point is a convex combination of and , both of which lie in :
[TABLE]
For and , we define to be
[TABLE]
The parameter also depends on . We again omit this dependence for notational simplicity.
Theorem 3 presents a family of valid inequalities for .
Theorem 3**.**
Let . The inequality
[TABLE]
is valid for .
Prior to proving Theorem 3, we use Farkas’ lemma to derive a result on the existence of a solution to a particular family of linear equations.
Lemma 1**.**
Let be two finite index sets. Let and satisfy . Then there exists such that
[TABLE]
Proof.
Let . If , then any satisfying for all is a solution to system (11). Therefore, assume .
Assume for contradiction (11) does not have a solution. By Farkas’ lemma, there exist and such that
[TABLE]
We multiply (12a) by to obtain
[TABLE]
Summing this expression over and produces the inequality
[TABLE]
By assumption, , which implies . Combining (12b) with the inequality , we conclude that . Thus, (13) implies
[TABLE]
Inequality (14) contradicts (12b). Therefore, (11) has a solution. ∎
Proof of Theorem 3..
The statement is trivially true if . Therefore, assume . For ease of notation, let for . Let and . Note if and only if . Let satisfy . We show .
We apply Lemma 1 with , , for , and for . Thus, there exists satisfying
[TABLE]
By Proposition 2, for all and , because . Consequently,
[TABLE]
We use (16) and from (15) to rewrite , :
[TABLE]
We have . Substituting (17) into the definition of , we have:
[TABLE]
By (15a), the sum of the weights on the terms , in (18) are greater than :
[TABLE]
By (15b), each individual coefficient on , in (18) is nonnnegative. Together with (19), we have
[TABLE]
Continuing from (20), , because . Furthermore, because recession cone membership is preserved under addition,
[TABLE]
This holds because for all and , for all by (9), and for all . By (20) and (21), we have . ∎
As a result of Theorem 3 and Proposition 2, for any and satisfying for all , the inequality
[TABLE]
is valid for . This is useful when calculating approximately (e.g., via binary search), as it is sufficient to calculate a positive lower bound on . However, our choice of yields a stronger inequality than inequalities corresponding to smaller choices of .
Example 6* (continued).*
Consider . Figures 8(a) and 8(b) provide a graphical representation of Theorem 3 applied to Example 6. The selection of is shown in Figure 8(a), where . The vector lies on the boundary of . We note . Figure 8(b) shows the valid inequality of Theorem 3 using this selection of .
We next consider the problem of selecting a subset of that yields the most violated inequality of the form (10) to cut off a candidate solution . That is, we are interested in the separation problem
[TABLE]
For each , we define the function to be if , and [math] otherwise. The value is the contribution of index to the objective function (22) for a given .
Proposition 3**.**
The maximization problem (22) is a supermodular maximization problem.
Proof.
The separation problem (22) can be equivalently written as
[TABLE]
From standard properties of the operator, the objective function of (23) is the sum of supermodular (and modular) functions. ∎
By Proposition 3, the separation problem (22) can be solved in strongly polynomial time (e.g., Grötschel et al. [22], Grötschel et al. [23], Orlin [39]).
We next propose an extended formulation for the relaxation of defined by inequality (10) for all :
[TABLE]
Let and . For each , let be ordered such that . Similarly, for any , let satisfy . For all , let , , , , , and . We define to be the set of such that
[TABLE]
Theorem 4 establishes the relationship between and the relaxation .
Theorem 4**.**
The polyhedron is an extended formulation of :
[TABLE]
Proof.
We first argue that the following linear program solves the separation problem (22) for a fixed :
[TABLE]
The constraint matrix of is totally unimodular. To see this, we complement the variables with for all to obtain an equivalent problem. The resulting constraint matrix has entries, and each row contains no more than one and one .
We show (24) correctly models the separation problem (22). First, let be the optimal solution of (22). We construct feasible to (24) with objective function value equal to . If , then the optimal objective value of the separation problem is [math]. In this case, set for all and , and set for all . Then is feasible to (24) with objective value [math]. If , set if , and [math] otherwise. For all , let be the smallest index satisfying ; that is, . For each , set for all , and set for all . By construction, satisfies (24b)–(24f). For a fixed , we have
[TABLE]
and the objective function (24a) evaluates to the desired value of
[TABLE]
Now, let be an optimal solution to (24). Set . It remains to show is not less than the optimal objective value of (24). Recall the constraint matrix of (24) is totally unimodular, so is [math]– valued. If for all , then the separation problem objective evaluated at is [math] and the optimal objective value of (24) is nonpositive, as desired. Next, assume . By constraints (24c), given , for all . By constraints (24b) and (24d), for each , there exists such that for and for . Then the optimal objective value of (24) is
[TABLE]
Consider a fixed . By constraints (24c), for . Then . Due to the ordering , we have . Therefore, the optimal objective value of the separation problem evaluated at is at least as large as (25):
[TABLE]
Hence, (24) models the separation problem (22) for a fixed .
We conclude by relating to (24). The point lies in if and only if the primal objective (24a) does not exceed . The linear program (24) is feasible and bounded, so strong duality applies. Let , , and be the linear program’s dual variables, as labeled in (24). By strong duality, if and only if the dual of (24) has objective value less than or equal to . Because the dual of (24) is a minimization problem, we enforce this condition with the constraint . We also replace the fixed in the dual of (24) with the nonnegative variable . Thus, if and only if there exists satisfying the dual constraints of (24) and the aforementioned dual objective cut. These constraints define . ∎
Within the proof of Theorem 4, we show that the linear program (24) can be used to solve the separation problem (22). The remainder of the proof uses the separation linear program (24) and duality theory to derive an extended formulation, a technique that was first proposed by Martin [34].
For a fixed , the constraints (24b), (24d), and (24e) form an instance of the mixing set, first studied by Günlük and Pochet [24]. The proof’s derivation of the extended formulation follows results from Luedtke and Ahmed [33] and Miller and Wolsey [36].
Proposition 4 states that if no cuts of the form (10) exist, then there exist no valid inequalities for other than those defining .
Proposition 4**.**
Under Assumption 2, if , then .
Proof.
It suffices to show for .
We first show for . Assume for contradiction there exists and such that . By the definition of in (9), there exists , , and such that for all , , and
[TABLE]
Equivalently, we have . Because the vectors are linearly independent and there exists such that , it holds that . This contradicts Assumption 2, which states .
Now, consider , , and . Because , there exists such that . Then for a sufficiently large , . We have that is a convex combination of and . Thus, for all , so . ∎
In the case where () and , Theorem 2 and Proposition 4 together give us .
4.3 A multi-term disjunction valid for
The set has the potential to be a tighter relaxation of than the two-term disjunction of Theorem 1, because it considers the full structure of . In this section, we derive a valid disjunction for that contains terms. These terms are defined by nonconvex sets, but in Section 4.4 we derive polyhedral relaxations of each term. Given the valid disjunction we derived for , these polyhedral relaxations can be used with other inequalities defining to obtain a union of polyhedra that contains . The disjunctive programming approach of Balas can be applied to construct a CGLP to find a valid inequality that separates a candidate solution from .
Let be defined as follows:
[TABLE]
We define the following sets for :
[TABLE]
The sets and () are the foundation of our multi-term valid disjunction for .
In Proposition 5, we present an equivalent construction of ().
Proposition 5**.**
For , can be written as
[TABLE]
Proof.
For , it holds that . For , we also have . This gives us:
[TABLE]
Observe that for all . This allows us to rewrite from (29) as
[TABLE]
Finally, note that . Hence, for all , we can replace in the convex hull operator of (30) with :
[TABLE]
Theorem 5 presents a disjunctive representation of . Throughout, let .
Theorem 5**.**
It holds that
[TABLE]
Before proving Theorem 5, we prove a consequence of Farkas’ lemma [17].
Lemma 2**.**
Let , where . There exists such that
[TABLE]
Proof.
By Farkas’ lemma, either system (32) has a solution, or there exists such that
[TABLE]
Assume for contradiction there exists a satisfying (33). The nonnegativity of , (33b), and (33c) imply for all . The vector is nonnegative and by assumption sums to a strictly positive value. We conclude , contradicting (33a). ∎
Proof of Theorem 5..
It suffices to show . If , we have by (7) and (26), and the result holds. Therefore, assume . By construction, for all , implying .
Let . By ’s membership in , there exist , , , and such that for all , , and
[TABLE]
If for all , then by (34) and we have nothing left to prove. We therefore assume . From (28), implies that for all , there exist , , , and such that for all , , and
[TABLE]
Because and , it holds that . We apply Lemma 2 with , for , and for . Then there exists such that and for all . We use this as convex combination multipliers on (34) and (35) to rewrite as
[TABLE]
For every and , . The coefficients on the terms (, ) in (36) are nonnegative and sum to one:
[TABLE]
It follows that . Lastly, we have . Thus, by (36), . ∎
The multi-term disjunction (31) is a generalization of the two-term disjunction of Theorem 1. Recall that this two-term disjunction does not account for the recession structure of beyond the property that for all and the assumption for all . If is bounded and Assumption 1 holds, it can be shown that the multi-term disjunction reduces to the simple two-term disjunction of Theorem 1. In particular, we have
[TABLE]
In the remainder of this paper, we derive polyhedral relaxations for each of the terms in the disjunction (31). Given a polyhedral relaxation of each disjunctive term, we can obtain valid inequalities for using a disjunctive approach analogous to the method outlined in Remark 1.
Remark 3*.*
The multi-term disjunction (31) for can be extended to the case . Specifically, if for all , the set defined in (26) contains every point in except for . Because , we know that (one of the terms of the disjunction (31)) is empty.
Example 5* (continued).*
Using the two-term disjunction from Section 3, we were unable to derive meaningful cuts for from Example 5. In contrast, Theorem 5 provides a disjunction for . A graphical representation of the relationship for this example is shown in Figures 9(a)–9(c). In this example, . The disjunction of Theorem 5 can be seen in Figure 10.
Based on the disjunction (31), the inequalities (10), which are valid for , are also valid for for all .
The sets , are nonconvex in general. In Section 4.4, we derive polyhedral relaxations of these sets. Together, these relaxations form polyhedra whose union contains the feasible region .
4.4 Polyhedral relaxation of ,
In this section, we describe a polyhedral relaxation of the set for .
To begin, we consider the set . The set is equivalent to from Section 4.2 when . As such, the theory of Section 4.2 can be applied to the specific case to obtain an exponential family of inequalities for and a polynomial-size extended formulation of the polyhedron defined by these inequalities.
Example 5* (continued).*
Let from Section 4.2 equal . Consider and defined in Example 5. Figure 11(a) shows the selection of for . This is then used to construct the inequality of Theorem 3 in Figure 11(b).
Now, let be fixed. For the remainder of this section, we describe a polyhedral relaxation of . Let be defined as follows:
[TABLE]
Because , we have .
Observation 1*.*
It holds that .
Proposition 6**.**
The index is not in .
Proof.
Assume for contradiction . By Observation 1, there exists and such that , which implies . This is a contradiction; , so the halfline extending from intersects on a finite interval. ∎
Proposition 7 characterizes the points where intersects each edge of .
Proposition 7**.**
Let . If Assumption 2 holds, then
[TABLE]
Proof.
Let . By Observation 1, there exists such that . Consider any . We have . Because , we have . Thus, .
Next, let . By the construction of in (27), . Assume for contradiction . There exists , , , and such that , for , and
[TABLE]
Observe for all and ; if not, from (37), contradicting Assumption 2. Therefore,
[TABLE]
Because , we have . This contradicts .
Finally, let . Assume for contradiction . We follow the definitions in the previous case () to obtain
[TABLE]
Again, we obtain , contradicting . ∎
The proof of Proposition 7 shows that without Assumption 2, it may be the case that for some . This is due to the addition of to .
Corollary 2 follows from Proposition 7.
Corollary 2**.**
If , then there exists such that for all .
By Corollary 2, if , lies in the relative interior of . We can construct a polyhedral relaxation of by using intersection cuts generated by the cone . Methods for strengthening intersection cuts (e.g., Glover [21]) can be used to obtain a strengthened polyhedral relaxation. For this reason, we present inequalities only for the case .
Assumption 3**.**
There exists such that , i.e., .
For and , let
[TABLE]
We define to be the indices of that satisfy the following property:
[TABLE]
For any and , intersected with the cone contains something other than the trivial directions .
The proof of Proposition 8 is similar to that of Proposition 2.
Proposition 8**.**
Let . For any , we have .
For and , define to be
[TABLE]
By Proposition 8, if , for all pairs .
Theorem 6**.**
Let . The inequality
[TABLE]
is valid for .
Proof.
Assume , or the result trivially holds. By construction, . Let satisfy . We show . For ease of notation, let .
By Proposition 8, for , there exists such that . Then
[TABLE]
By Lemma 1, there exists such that
[TABLE]
This result is obtained with , , for all , and for all . With the satisfying (39), we have
[TABLE]
Using (40), is equivalent to
[TABLE]
By (39b), the coefficients on the terms , are nonnegative. Observe that
[TABLE]
It follows that . ∎
We next consider the separation problem for , where
[TABLE]
In particular, given some , we are interested in finding a subset of that maximizes the violation of an inequality of the form (38):
[TABLE]
Proposition 9**.**
The separation problem (41) is a supermodular maximization problem.
Similar to the derivation of in Section 4.2, we derive an extended formulation for the relaxation of defined by inequality (38) for all . Let , where . Let . For all , let be ordered to satisfy . For , let be the unique integer satisfying . For all , let , , , , , and . We define to be the set of such that
[TABLE]
Theorem 7**.**
It holds that .
The proof of Theorem 7 is left out, because it mirrors that of Theorem 4. We can use the extended formulation to construct a polyhedral relaxation of from the multi-term disjunction (31).
The nontrivial inequalities of Theorem 6 are predicated on the existence of a nonempty . We end this section by stating that if no such subset exists (i.e., ), then no nontrivial inequalities exist for .
Proposition 10**.**
Under Assumption 3, if , then .
Proof.
By Assumption 3, . We show for all . Observe by Proposition 7.
Consider any and . We show . By Proposition 7, is finite. Then for a sufficiently large , . We have that is a convex combination of and . Hence, .
Now, consider , , and . Because , there exists such that . Then there exists such lies outside of . Therefore, is a convex combination of and . This holds for an arbitrary , so . ∎
5 Discussion and future work
Our analysis requires the basic solution to lie outside . We showed in Section 3 that if , we obtain the standard intersection cut of Balas. It remains to discuss how we can derive valid inequalities for when .
Under Assumption 1, our analysis still applies if . To demonstrate this, assume for simplification that (this is a more restrictive version of Assumption 1). It follows that for all . Similar to the observation made in Remark 2 for the case , we can show that every point in lies in or . We can generate inequalities for in a disjunctive CGLP using the two polyhedra defined by the constraints of added to each of these two sets. Similarly, if and Assumption 1 holds, the term of the multi-term disjunction (31) is equal to . We can again use disjunctive programming to generate cuts for with the knowledge that . Polyhedral relaxations for the remaining disjunctive terms can still be generated using the methods discussed in Section 4.4.
We conclude with some ideas for future work. One direction is to study the computational strength of cuts obtained using these ideas. Another possibility is to generalize this disjunctive framework to allow for cuts to be generated by bases of rank less than (i.e., bases that do not admit a basic solution). Additionally, the strength of relative to could be analyzed. Specifically, it remains to be seen if , which by Theorem 4 would imply that we have a polynomial-size extended formulation of . The same applies to the strength of relative to for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Balas [1971] Balas E (1971) Intersection cuts – A new type of cutting planes for integer programming. Oper. Res. 19(1):19–39.
- 2Balas [1972] Balas E (1972) Integer programming and convex analysis: intersection cuts from outer polars. Math. Program. 2(1):330–382.
- 3Balas [1979] Balas E (1979) Disjunctive programming. Ann. Discrete Math. 5:3–51.
- 4Balas [1998] Balas E (1998) Disjunctive programming: Properties of the convex hull of feasible points. Discrete Appl. Math. 89(1–3):3–44, originally MSRR no. 348, Carnegie-Mellon University, 1974.
- 5Balas and Margot [2013] Balas E, Margot F (2013) Generalized intersection cuts and a new cut generating paradigm. Math. Program. 137(1-2):19–35.
- 6Bansal and Jacobsen [1975 a] Bansal PP, Jacobsen SE (1975 a) An algorithm for optimizing network flow capacity under economies of scale. J. Optim. Theory Appl. 15(5):565–586.
- 7Bansal and Jacobsen [1975 b] Bansal PP, Jacobsen SE (1975 b) Characterization of local solutions for a class of nonconvex programs. J. Optim. Theory Appl. 15(5):549–564.
- 8Basu et al. [2010] Basu A, Conforti M, Cornuéjols G, Zambelli G (2010) Minimal inequalities for an infinite relaxation of integer programs. SIAM J. on Discrete Math. 24(1):158–168.
