This paper explores the structure of maximal lattice-free convex sets within affine subspaces, demonstrating their polyhedral nature and extending classical results to broader contexts in integer programming.
Contribution
It extends Lovász's theorem by characterizing maximal lattice-free convex sets as polyhedra in affine subspaces, relevant for integer programming inequalities.
Findings
01
All irredundant inequalities derive from these sets.
02
Maximal lattice-free convex sets are polyhedra.
03
Extension of Lovász's theorem to affine subspaces.
Abstract
We consider a model that arises in integer programming, and show that all irredundant inequalities are obtained from maximal lattice-free convex sets in an affine subspace. We also show that these sets are polyhedra. The latter result extends a theorem of Lov\'asz characterizing maximal lattice-free convex sets in Rn.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Maximal lattice-free convex sets in linear subspaces
Supported by NSF grant CMMI0653419,
ONR grant N00014-03-1-0188 and ANR grant BLAN06-1-138894.
(March 30, 2009, Revised April 21, 2010.)
Abstract
We consider a model that arises in integer programming, and show that all irredundant inequalities are obtained from maximal lattice-free convex sets in an affine subspace. We also show that these sets are polyhedra. The latter result extends a theorem of Lovász characterizing maximal lattice-free convex sets in Rn.
1 Introduction
The study of maximal lattice-free convex sets dates back to
Minkowski’s work on the geometry of numbers. Connections between
integer programming and the geometry of numbers were investigated in
the 1980s starting with the work of Lenstra [22]. See
Lovász [23] for a survey. Recent work in cutting plane
theory [1],[2],[3],[4],[5],[8],[10],[13],[14],[15],[17],[19],[25]
has generated renewed interest in the study of maximal lattice-free
convex sets. In this paper we further pursue this line of research.
In the first part of the paper we consider convex sets in an affine
subspace of Rn that are maximal with the property of not
containing integral points in their relative interior. When this
affine subspace is rational, these convex sets are characterized by
a result of Lovász [23]. The extension to irrational
subspaces appears to be new, and it has already found an application
in the proof of a key result in [10]. It is also used to
prove the main result in the second part of this paper: We consider
a model that arises in integer programming, and show that all
irredundant inequalities are obtained from maximal lattice-free
convex sets in an affine subspace.
Let W be an affine subspace of Rn. Assume that W contains
an integral point, i.e. W∩Zn=∅. We say that a set B⊂Rn is a maximal
lattice-free convex set in W if B⊂W, B is convex, B
has no integral point in its interior with respect to the topology
induced on W by Rn, and B is inclusionwise maximal with
these three properties. This definition implies that either B
contains no integral point in its relative interior or B has
dimension strictly less than W.
The subspace W is said to be rational if it is generated by
the integral points in W. So, if we denote by V the affine hull
of the integral points in W, V=W if and only if W is rational.
If W is not rational, then the inclusion V⊂W is strict.
When W is not rational, we will also say that W is irrational. An example of an irrational affine subspace W⊆R3 is the set of points satisfying the equation x1+x2+2x3=1. The affine hull V of W∩Z3 is the set of
points satisfying the equations x1+x2=1,x3=0.
Theorem 1**.**
Let W⊂Rn be an affine space containing an integral point and V the affine hull of W∩Zn. A set S⊂W is a maximal lattice-free convex set of W if and only if one of the following holds:
(i)
S* is a polyhedron in W whose dimension equals dim(W),
S∩V is a maximal lattice-free convex set of V whose
dimension equals dim(V), the facets of S and S∩V are in
one-to-one correspondence and for every facet F of S, F∩V
is the facet of S∩V corresponding to F;*
(ii)
S* is an hyperplane of W of the form v+L, where v∈S and L∩V is an irrational hyperplane of V;*
(iii)
S* is a half-space of W that contains V on its boundary.*
A characterization of maximal lattice-free convex sets of V, needed in (i) of the previous theorem, is given by the following.
Theorem 2**.**
(Lovász [23])* Let V be a rational affine subspace
of Rn containing an integral point. A set S⊂V is a maximal lattice-free convex set of V if and only if one of the following holds:*
(i)
S* is a polyhedron of the form S=P+L where P is a polytope,
L is a rational linear space, dim(S)=dim(P)+dim(L)=dim(V), S does not
contain any integral point in its relative interior and there is an integral point in the relative interior of each facet of S;*
(ii)
S* is an affine hyperplane of V of the form v+L, where v∈S and L is an irrational hyperplane of V;*
The polyhedron S=P+L in Theorem 2(i) is
called a cylinder over the polytopeP and can be shown to
have at most 2dim(P) facets [16].
Theorem 1 is new and it is used in the proof of
our main result about integer programming,
Theorem 3 below. It is also used to prove the
last theorem in [10]. Theorem 2
is due to Lovász ([23] Proposition 3.1). Lovász only
gives a sketch of the proof and it is not clear how case (ii) in
Theorem 2 arises in his sketch or in the
statement of his proposition. Therefore in
Section 2 we will prove both theorems.
Figure 1 shows examples of maximal lattice-free convex
sets in a 2-dimensional affine subspace W of R3. We denote by
V the affine space generated by W∩Z3. In the first picture
W is rational, so V=W, while in the second one V is a subspace
of W of dimension 1.
We now give an example of Theorem 1(ii). Let
W⊆R4 be the set of points satisfying the equation x1+x2+x3+2x4=1. The affine hull V of W∩Z4
is the set of points satisfying the equations x1+x2+x3=1,x4=0. The set S⊂W defined by the equations x1+x2+x3+2x4=1,
x1+2x2=1 satisfies
Theorem 1(ii). Indeed, dim(W)=dim(S)+1=3. Furthermore, dim(V)=2 and S∩V is the line satisfying
the equations x1+x2+x3=1,x1+2x2=1,x4=0 and it is an irrational subspace since the only integral point it
contains is (1,0,0,0).
Next we highlight the relation between lattice-free convex sets and
valid inequalities in integer programming. This was first observed
by Balas [6].
Suppose we consider q rows of the optimal tableau of the LP
relaxation of a given MILP, relative to q basic integer variables
x1,…,xq. Let s1,…,sk be the nonbasic variables,
and f∈Rq be the vector of components of the optimal basic
feasible solution. The tableau restricted to these q rows is of
the form
[TABLE]
where rj∈Rq, j=1,…,k, and I denotes the set of integer nonbasic variables. Gomory [18] proposed to consider the relaxation of the above problem obtained by dropping the nonnegativity conditions x≥0. This gives rise to the so called corner polyhedron. A further relaxation is obtained by also dropping the integrality conditions on the nonbasic variables, obtaining the mixed-integer set
[TABLE]
Note that, since x∈Rq is completely determined by s∈Rk, the above is equivalent to
[TABLE]
We denote by Rf(r1,…,rk) the set of points s satisfying
(1). The above relaxation was studied by Andersen
et al. [1] in the case of two rows and Borozan and
Cornuéjols [10] for the general case. In these papers
they showed that the irredundant valid inequalities
for Rf(r1,…,rk) correspond to maximal lattice-free convex
sets in Rq. In [1, 10] data are assumed to be
rational. Here we consider the case were f,r1,…,rk may
have irrational entries.
Let W=⟨r1,…,rk⟩ be the linear space generated
by r1,…,rk. Note that, for every s∈Rf(r1,…,rk), the point
f+∑j=1krjsj∈(f+W)∩Zq, hence we assume f+W
contains an integral point. Let V be the affine hull of (f+W)∩Zq. Notice that f+W and V coincide if and only if W is a
rational space. Borozan and Cornuéjols [10] proposed to
study the following semi-infinite relaxation, which is a
special case of Gomory and Johnson’s group problem [20]. Let
Rf(W) be the set of points s=(sr)r∈W of RW
satisfying
[TABLE]
where W is the set of all s∈RW with
finite support, i.e. the set {r∈W∣sr>0} has finite cardinality. Notice that Rf(r1,…,rk)=Rf(W)∩{s∈W∣sr=0\mboxforallr=r1,…,rk}.
Given a function ψ:W→R and α∈R, the linear inequality
[TABLE]
is valid for Rf(W) if it is satisfied by every s∈Rf(W).
Note that, given a valid inequality (3) for Rf(W), the inequality
[TABLE]
is valid for Rf(r1,…,rk).
Hence a characterization of valid linear inequalities for Rf(W)
provides a characterization of valid linear inequalities for
Rf(r1,…,rk).
Next we observe how maximal lattice-free convex sets in f+W give
valid linear inequalities for Rf(W). Let B be a maximal
lattice-free convex set in f+W containing f in its interior.
Since, by Theorem 1, B is a polyhedron and since
f is in its interior, there exist a1,…,at∈Rq such
that B={x∈f+W∣ai(x−f)≤1,i=1…,t}. We define
the function ψB:W→R by
[TABLE]
Note that the function ψB is subadditive, i.e.
ψB(r)+ψB(r′)≥ψB(r+r′), and positively
homogeneous, i.e. ψB(λr)=λψB(r) for every
λ≥0. We claim that
[TABLE]
is valid for Rf(W).
Indeed, let s∈Rf(W), and x=f+∑r∈Wrsr. Since
x∈Zn and B is lattice-free, x∈/int(B). Then
[TABLE]
where the first equation follows from positive homogeneity, the
first inequality follows from subadditivity of ψB and the last
one follows from the fact that x∈/int(B).
We will show that all nontrivial irredundant valid linear
inequalities for Rf(W) are indeed of the type described above.
Furthermore, if W is irrational, we will see that Rf(W) is
contained in a proper affine subspace of W, so each inequality
has infinitely many equivalent forms. Note that, by definition of
ψB, ψB(r)>0 if r is not in the recession cone of B,
ψB(r)<0 when r is in the interior of the recession cone of
B, while ψB(r)=0 when r is on the boundary of the
recession cone of B. We will show that one can always choose a
form of the inequality so that ψB is a nonnegative function.
We make this more precise in the next theorem.
Given a point s∈Rf(W), then f+∑r∈Wrsr∈Zq∩(f+W). Recall that we denote by V the affine hull of Zq∩(f+W). Thus Rf(W) is contained in the affine subspace V of W defined as
[TABLE]
Observe that, given
C∈Rℓ×q and d∈Rℓ such that V={x∈f+W∣Cx=d}, we have
[TABLE]
Given two valid inequalities ∑r∈Wψ(r)sr≥α
and ∑r∈Wψ′(r)sr≥α′ for Rf(W), we say
that they are equivalent if there exist ρ>0 and
λ∈Rℓ such that ψ(r)=ρψ′(r)+λTCr
and α=ρα′+λT(d−Cf). Note that, if two valid
inequalities ∑r∈Wψ(r)sr≥α and ∑r∈Wψ′(r)sr≥α′ for Rf(W) are equivalent, then
V∩{s∣∑r∈Wψ(r)sr≥α}=V∩{s∣∑r∈Wψ′(r)sr≥α}.
A linear inequality ∑r∈Wψ(r)sr≥α that is
satisfied by every element in {s∈V∣sr≥0\mboxforeveryr∈W} is said to be trivial.
We say that inequality ∑r∈Wψ(r)sr≥αdominates inequality ∑r∈Wψ′(r)sr≥α if
ψ(r)≤ψ′(r) for all r∈W. Note that, for any sˉ∈W such that sˉr≥0 for all r∈W, if sˉ
satisfies the first inequality, then sˉ also satisfies the
second. A valid inequality ∑r∈Wψ(r)sr≥α for
Rf(W) is minimal if it is not dominated by any valid linear
inequality ∑r∈Wψ′(r)sr≥α for Rf(W) such
that ψ′=ψ. It is not obvious that nontrivial valid linear
inequalities are dominated by minimal ones. We will show that this
is the case. Note that it is not even obvious that minimal valid
linear inequalities exist.
We will show that, for any maximal lattice-free convex set B of
f+W with f in its interior, the inequality ∑r∈WψB(r)sr≥1 is a minimal valid inequality for Rf(W).
The main result is a converse, stated in the next
theorem. We need the notion of equivalent inequalities, which define
the same region in V.
Theorem 3**.**
*Every nontrivial valid linear inequality for Rf(W) is dominated by a nontrivial minimal valid linear inequality for Rf(W).
Every nontrivial minimal valid linear inequality for Rf(W) is equivalent to an inequality of the form*
[TABLE]
such that ψB(r)≥0 for all r∈W and B is a maximal lattice-free convex set in f+W with f in its interior.
This theorem generalizes earlier results of Borozan and
Cornuéjols [10]. In their setting it is immediate that
all valid linear inequalities are of the form ∑r∈Wψ(r)sr≥1 with ψ nonnegative. From this, it follows
easily that ψ must be equal to ψB for some maximal
lattice-free convex set B. The proof is much more complicated for
the case of Rf(W) when W is an irrational space. In this case,
valid linear inequalities might have negative coefficients. For
minimal inequalities, however, Theorem 3 shows
that there always exists an equivalent one where all coefficients
are nonnegative. The function ψB is nonnegative if and only if
the recession cone of B has empty interior. Although there are
nontrivial minimal valid linear inequalities arising from maximal
lattice-free convex sets whose recession cone is full dimensional,
Theorem 3 states that there always exists a
maximal lattice-free convex set whose recession cone is not full
dimensional that gives an equivalent inequality. A crucial
ingredient in showing this is a new result about sublinear functions
proved in [9].
In light of Theorem 3, it is a natural question to
ask what is the subset of W obtained by intersecting the set of
nonnegative elements of V with all half-spaces defined by
inequalities ∑r∈Wψ(r)sr≥1 as in Theorem 3. In a
finite dimensional space, the intersection of all half-spaces
containing a given convex set C is the closure of C. Things are
more complicated in infinite dimension. First of all, while in
finite dimension all norms are topologically equivalent, and thus
the concept of closure does not depend on the choice of a specific
norm, in infinite dimension different norms may produce different
topologies. Secondly, in finite dimensional spaces linear functions
are always continuous, while in infinite dimension there always
exist linear functions that are not continuous. In particular,
half-spaces (i.e. sets of points satisfying a linear inequality) are
not always closed in infinite dimensional spaces (see
Conway [12] for example).
To illustrate this, note that if W is endowed
with the Euclidean norm, then 0=(0)r∈W belongs to
the closure of conv(Rf(W)) with respect to this norm, as shown
next. Let xˉ be an integral point in f+W
and let sˉ be defined by
[TABLE]
Clearly, for every choice of k, sˉ∈Rf(W), and for k that goes to infinity the point sˉ is
arbitrarily close to 0 with respect to the Euclidean
distance. Now, given a valid linear inequality ∑r∈Wψ(r)sr≥1
for conv(Rf(W)), since ∑r∈Wψ(r)0=0 the hyperplane
H={s∈W:∑r∈Wψ(r)sr=1} separates strictly
conv(Rf(W)) from 0 even though 0 is in the closure of conv(Rf(W)). This implies that H is not a closed
hyperplane of W, and in particular the function s↦∑r∈Wψ(r)sr is not
continuous with respect to the
Euclidean norm on W.
A nice answer to our question is given by considering
a different norm on W. We endow W with
the norm ∥⋅∥H defined by
[TABLE]
It is straightforward to show that ∥⋅∥H is indeed a norm. Given A⊂W, we denote by Aˉ the closure of
A with respect to the norm ∥⋅∥H.
Let BW be the family of all maximal lattice-free
convex sets of W with f in their interior.
Theorem 4**.**
[TABLE]
Note that Theorems 3 and 4 are
new even when W=Rq.
A valid inequality ∑r∈Wψ(r)sr≥1 for Rf(W) is
said to be extreme if there do not exist distinct functions
ψ1 and ψ2 satisfying ψ≥21(ψ1+ψ2), such that ∑r∈Wψi(r)sr≥1, i=1,2, are
both valid for Rf(W). The above definition is due to Gomory and
Johnson [20]. Note that, if an inequality is not extreme, then
it is not necessary to define conv(Rf(W)).
The next theorem exhibits a correspondence between extreme
inequalities for the infinite model Rf(Rq) and extreme
inequalities for some finite problem Rf(r1,…,rk) where
r1,…,rk∈Rq. The theorem is very similar to a result
of Dey and Wolsey [15].
Theorem 5**.**
Let B be a maximal lattice-free convex set in Rq with f in
its interior. Let L=lin(B) and let P=B∩(f+L⊥). Then
B=P+L, L is a rational space, and P is a polytope. Let
v1,…,vk be the vertices of P, and
rk+1,…,rk+h be a rational basis of L. Define
rj=vj−f for j=1,…,k.
Then the inequality ∑r∈RqψB(r)sr≥1 is extreme
for Rf(Rq) if and only if the inequality ∑j=1ksj≥1 is extreme for conv(Rf(r1,…,rk+h)).
Even though the data in integer programs are typically rational and
studying the infinite relaxation (1) for W=Qq seems
natural [10, 13], some of its extreme inequalities arise
from maximal lattice-free convex sets that are not rational
polyhedra [13].
For example, the irrational triangle B defined by the inequalities
x1+x2≤2,x2≥1+2x1,x2≥0 is a
maximal lattice-free convex set in the plane, and it gives rise to
an extreme valid inequality ∑r∈Q2ψB(r)sr≥1 for
Rf(Q2) for any rational f in the interior of B. In fact,
every maximal lattice-free triangle gives rise to an extreme
valid inequality for Rf(Q2) [13]. Therefore, even when W=Qq in (1), irrational coefficients are needed to
describe some of the extreme inequalities for Rf(Qq). Indeed,
it follows from Theorem 3 and
from [10] that the extreme inequalities for Rf(Qq) are
precisely the restrictions to Qq of the extreme inequalities for
Rf(Rq). This suggests that the more natural setting
for (1) might in fact be W=Rq.
The paper is organized as follows. In Section 2 we
will state and prove the natural extensions of
Theorems 1 and 2 for
general lattices. In Section 3 we prove
Theorem 3, while in Section 4
we prove Theorem 4 and in Section 5
we prove Theorem 5.
2 Maximal lattice-free convex sets
Given X⊂Rn, we denote by ⟨X⟩ the linear
space generated by the vectors in X. The underlying field is
R in this paper. The purpose of this section is to prove
Theorems 1 and 2. For
this, we will need to work with general lattices.
Definition 6**.**
An additive group Λ of Rn is said to be finitely
generated if there exist vectors a1,…,am∈Rn such that
Λ={λ1a1+…+λmam∣λ1,…,λm∈Z}.
If a finitely generated additive group Λ of Rn can be
generated by linearly independent vectors a1,…,am,
then Λ is called a lattice of the linear space
⟨a1,…,am⟩. The set of vectors
a1,…,am is called a basis of the lattice Λ.
Definition 7**.**
Let Λ be a lattice of a linear space V of Rn. Given a linear subspace L of V, we say that L is a Λ-subspace of V if there exists a basis of L contained in Λ.
For example, in R2, consider the lattice Λ generated by
vectors (0,1) and (1,0). The line x2=2x1 is a
Λ-subspace, whereas the line x2=2x1 is not.
Given y∈Rn and ε>0, we will denote by Bε(y) the open ball centered at y of radius ε.
Given an affine space W of Rn and a set S⊆W, we denote by intW(S) the interior of S with respect to the topology induced on W by Rn, namely intW(S) is the set of points x∈S such that Bε(x)∩W⊂S for some ϵ>0. We denote by relint(S) the relative interior of S, that is relint(S)=intaff(S)(S).
Definition 8**.**
Let Λ be a lattice of a linear space V of Rn, and let W be a linear space of Rn containing V. A set S⊂Rn is said to be a Λ-free convex set of W if S⊂W, S is convex and Λ∩intW(S)=∅, and S is said to be a maximal Λ-free convex set of W if it is not properly contained in any Λ-free convex set.
The next two theorems are restatements of Theorems 1 and 2 for general lattices.
Theorem 9**.**
Let Λ be a lattice of a linear space V of Rn, and let W be a linear space of Rn containing V. A set S⊂Rn is a maximal Λ-free convex set of W if and only if one of the following holds:
(i)
S* is a polyhedron in W, dim(S)=dim(W), S∩V is a maximal Λ-free convex set of V,
the facets of S and S∩V are in one-to-one correspondence and
for every facet F of S, F∩V is the facet of S∩V
corresponding to F;*
(ii)
S* is an affine hyperplane of W of the form S=v+L where v∈S and L∩V is a hyperplane of V that is not a lattice subspace of V;*
(iii)
S* is a half-space of W that contains V on its boundary.*
Theorem 10**.**
Let Λ be a lattice of a linear space V of Rn. A set S⊂Rn is a maximal Λ-free convex set of V if and only if one of the following holds:
(i)
S* is a polyhedron of the form S=P+L where P is a polytope, L is a Λ-subspace of V, dim(S)=dim(P)+dim(L)=dim(V), S does not contain any point of Λ in its interior and there is a point of Λ in the relative interior of each facet of S;*
(ii)
S* is an affine hyperplane of V of the form
S=v+L where v∈S and L is not a Λ-subspace of V.*
We assume Theorem 10 holds. Its proof will be given in the next section.
(⇒) Let S be a maximal Λ-free convex set of W. We show that one of (i)−(iii) holds.
If V=W, then (iii) cannot occur and either (i) or (ii) follows from Theorem 10. Thus we assume V⊂W.
Assume first that dim(S)<dim(W). Then there exists a hyperplane
H of W containing S, and since intW(H)=∅, then
S=H by maximality of S. Since S is a hyperplane of W, then
either V⊆S or S∩V is a hyperplane of V. If
V⊆S, then let K be one of the two half spaces of W
separated by S. Then intW(K)∩Λ=∅,
contradicting the maximality of S. Hence S∩V is a
hyperplane of V. We show that P=S∩V is a maximal
Λ-free convex set of V. Indeed, let K be a convex set in
V such that intV(K)∩Λ=∅ and P⊆K.
Since conv(S∪K)∩V=K, then intW(conv(S∪K)∩Λ)=∅. By maximality of S, S=conv(S∪K),
hence P=K.
Given v∈P, S=v+L for some hyperplane L of
W, and P=v+(L∩V). Applying Theorem 10 to
P, we get that L∩V is not a lattice subspace of V, and
case (ii) holds.
So we may assume dim(S)=dim(W). Since S is convex, then intW(S)=∅. We consider two cases.
Case 1. intW(S)∩V=∅.
Since intW(S) and V are nonempty disjoint convex sets, there
exists a hyperplane separating them, i.e. there exist α∈Rn and β∈R such that αx≥β for every
x∈S and αx≤β for every x∈V. Since V is a
linear space, then αx=0 for every x∈V, hence β≥0. Then the half space H={x∈W∣αx≥0} contains
S and V lies on the boundary of H. Hence H is a maximal
Λ-free convex set of W containing S, therefore S=H by
the maximality assumption, so (iii) holds.
Case 2. intW(S)∩V=∅.
We claim that
[TABLE]
To prove this claim, notice that the direction intW(S)∩V⊆intV(S∩V) is straightforward. Conversely, let
x∈intV(S∩V). Then there exists ε>0 such that
Bε(x)∩V⊆S. Since intW(S)∩V=∅, there exists y∈intW(S)∩V. Then there
exists ε′>0 such that Bε′(y)∩W⊆S. We may assume y=x since otherwise the result
holds. Since x∈int(Bε(x)), there exists z∈intW(Bε(x)) such that x is in the relative interior
of the segment yz. Since x,y∈V, z∈V and therefore z∈S. The ball Bδ(x) with radius δ=ε′∥y−z∥∥x−z∥ is contained in
conv({z}∪Bε′(y)). By convexity of S,
conv({z}∪(Bε′(y)∩W))⊆S and therefore
(Bδ(x)∩W)⊆S. Thus x∈intW(S). Since x∈V,
it follows that x∈intW(S)∩V.
Let P=S∩V. By (6) and because
intW(S)∩Λ=∅, we have
intV(P)∩Λ=∅. We show that P is a maximal
Λ-free convex set of V. Indeed, let K be a convex set in
V such that intV(K)∩Λ=∅ and P⊆K.
Since conv(S∪K)∩V=K, Claim (6) implies that
intW(conv(S∪K))∩Λ=∅. By maximality,
S=conv(S∪K), hence P=K.
Since dim(P)=dim(V), by Theorem 10 applied
to P, P is a polyhedron with a point of Λ in the
relative interior of each of its facets. Let F1,…,Ft be the
facets of P. For i=1,…,t, let zi be a point in
relint(Fi)∩Λ. By (6), zi∈/intW(S).
By the separation theorem, there exists a half-space Hi of W
containing intW(S) such that zi∈/intW(Hi). Notice
that Fi is on the boundary of Hi. Then S⊆∩i=1tHi. By construction intW(∩i=1tHi)∩Λ=∅, hence by maximality of S, S=∩i=1tHi. For every j=1,…,t, intW(∩i=jHi) contains zj. Therefore Hj defines a facet of S for
j=1,…,t.
(⇐) Let S be a set in Rn satisfying one
of (i),(ii),(iii). Clearly S is a convex set in W and
intW(S)∩Λ=∅, so we only need to prove
maximality. If S satisfies (iii), then this is immediate. This
is also immediate when S satisfies (i) and V=W. So we may assume that either
S satisfies (i) and dim(V)<dim(W), or S satisfies (ii).
Suppose that there exists a closed
convex set K⊂W strictly containing S such that
intW(K)∩Λ=∅. Let w∈K∖S. Then
conv(S∪{w})⊆K. To conclude the proof of the
theorem, it suffices to prove that
S∩V is strictly contained in conv(S∪{w})∩V. Indeed, by
maximality of S∩V, this claim implies that the set
intV(conv(S∪{w})∩V) contains a point in Λ.
Now conv(S∪{w})⊆K implies that intW(K)
contains a point of Λ, a contradiction.
It only remains to prove that S∩V⊂conv(S∪{w})∩V. This is clear when S is a
hyperplane satisfying (ii). The statement is also clear if w∈V. Assume now that S is a polyhedron
satisfying (i) and dim(V)<dim(W) and that w∈/V.
Let F be a facet of S that separates w from
S. If F∩V is contained in a proper face of F, then F∩V is contained in at least two facets of S, a contradiction to
the one-to-one correspondence property. So F∩V is not
contained in proper face of F. Therefore, there exists p∈relint(F)∩V. Choose ε>0 such that
Bε(p)∩aff(F)⊆F. Note that F⊆V since otherwise F∩V=F but this is a
contradiction since dim(S∩V)=dim(V)<dim(W)=dim(S),
and dim(F∩V)=dim(S∩V)−1, dim(F)=dim(S)−1.
Let W′=aff(V∪{w}). Note that V and aff(F)∩W′ are distinct affine hyperplanes of W′. Let H,H′ be the two open half-spaces of W′ defined by V, and assume w.l.o.g. that w∈H′. Since p∈V, H∩aff(F)∩Bε(p) contains some point t. Since Bε(p)∩aff(F)⊆F, it follows that t∈H∩F. Let T be the line segment joining w and t. Since t∈H and w∈H′ it follows that T∩V contains exactly one point, say wˉ. Note that wˉ=t,w. Since w∈/S and t∈F, we have that wˉ∈conv(S∪{w})∩V but wˉ∈/S.
∎
To simplify notation, given S⊆Rn, we denote
intV(S) simply by int(S).
The following standard result in lattice theory provides a useful
equivalent definition of lattice (see Barvinok [7], p. 284
Theorem 1.4).
Theorem 11**.**
Let Λ be the additive group generated by vectors
a1,…,am∈Rn. Then Λ is a lattice of the linear
space ⟨a1,…,am⟩ if and only if there exists
ε>0 such that ∥y∥≥ε for every
y∈Λ∖{0}.
In this paper we will only need the “only if” part of the
statement, which is easy to prove (see [7], p. 281 problem
5). Theorem 11 implies the following result
(see [7], p. 281 problem 3).
Corollary 12**.**
Let Λ be a lattice of a linear space of Rn. Then every
bounded set in Rn contains a finite number of points in
Λ.
Throughout this section, Λ will be a lattice of a linear space
V of Rn. The following lemma proves the “only if” part of
Theorem 10 when S is bounded and
full-dimensional.
Lemma 13**.**
Let S⊂V be a bounded maximal Λ-free convex set with
dim(S)=dim(V). Then S is a polytope with a point of Λ
in the relative interior of each of its facets.
Proof.
Since S is bounded, there exist integers L,U such that S is
contained in the box B={x∈Rn∣L≤xi≤U}. For each
y∈Λ∩B, since S is convex there exists a closed
half-space Hy of V such that S⊆Hy and
y∈/int(Hy). By Corollary 12, B∩Λ is finite, therefore ⋂y∈B∩ΛHy is
a polyhedron. Thus P=⋂y∈B∩ΛHy∩B is a
polytope and by construction Λ∩int(P)=∅. Since
S⊆B and S⊆Hy for every y∈B∩Λ,
it follows that S⊆P. By maximality of S, S=P,
therefore S is a polytope. We only need to show that S has a
point of Λ in the relative interior of each of its facets.
Let F1,…,Ft be the facets of S, and let Hi={x∈V∣αix≤βi} be the closed half-space defining Fi,
i=1,…,t. Then S=⋂i=1tHi. Suppose, by
contradiction, that one of the facets of S, say Ft, does not
contain a point of Λ in its relative interior. Given
ε>0, the polyhedron S′={x∈V∣αix≤βi,i=1,…,t−1,αtx≤βt+ε} is
a polytope since it has the same recession cone as S. The polytope
S′ contains points of Λ in its interior by the maximality
of S. By Corollary 12, int(S′) has a finite
number of points in Λ, hence there exists one minimizing
αtx, say z. By construction, the polytope S′={x∈V∣αix≤βi,i=1,…,t−1,αtx≤αtz} does not contain any point of Λ in its interior and
properly contains S, contradicting the maximality of S.
∎
We will also need the following famous theorem of Dirichlet.
Theorem 14** (Dirichlet).**
Given real numbers α1,…,αn,ε with 0<ε<1, there exist integers p1,…,pn and q such that
[TABLE]
The following is a consequence of Dirichlet’s theorem.
Lemma 15**.**
Given y∈Λ and r∈V∖{0}, then for every
ε>0 and λˉ≥0, there exists a point of
Λ∖{y} at distance less than ε from
the half line {y+λr∣λ≥λˉ}.
Proof.
First we show that, if the statement holds for λˉ=0, then it holds for arbitrary λˉ.
Given ε>0, let Z be the set of points of Λ at
distance less than ε from {y+λr∣λ≥0}. Suppose, by contradiction, that no point in Z has distance
less than ε from {y+λr∣λ≥λˉ}. Then Z is contained in
Bε(0)+{y+λr∣0≤λ≤λˉ}.
By Corollary 12, Z is finite, thus there exists
an εˉ>0 such that every point in Z has distance
greater than εˉ from {y+λr∣λ≥0}, a contradiction. So we only need to show that, given
ε>0, there exists at least one point of
Λ∖{y} at distance at most ε from
{y+λr∣λ≥0}. We may assume ε<1.
Without loss of generality, assume ∥r∥=1. Let m=dim(V) and
a1,…,am be a basis of Λ. Then there exists
α∈Rm such that r=α1a1+…+αmam.
Denote by A the matrix with columns a1,…,am, and define
∥A∥=supx:∥x∥≤1∥Ax∥ where, for a vector v,
∥v∥ denotes the Euclidean norm of v. Choose δ>0 such
that δ<1 and δ≤ε/(∥A∥m). By
Dirichlet’s theorem, there exist p∈Zm and λ≥1 such
that
[TABLE]
Let z=Ap+y. Since p∈Zm, then z∈Λ. Note that p=0 since ∥α∥≥∥A∥∥Aα∥=∥A∥∥r∥>∥A∥λε, where the
first inequality follows from the definition of ∥A∥ and the last
one follows from the assumptions on ∥r∥, ε and
λ. Therefore, z∈Λ∖{y}. Furthermore
[TABLE]
∎
Lemma 16**.**
Let S be a Λ-free convex set, and let C=rec(S). Then also S+⟨C⟩ is Λ-free.
Proof.
Let r∈C, r=0. We only need to show that S+⟨r⟩ is Λ-free. Suppose there exists y∈int(S+⟨r⟩)∩Λ.
We show that y∈int(S)+⟨r⟩. Suppose not. Then (y+⟨r⟩)∩int(S)=∅, which implies that there is a hyperplane H separating the line y+⟨r⟩ and S+⟨r⟩. This contradicts y∈int(S+⟨r⟩).
This shows y∈int(S)+⟨r⟩. Thus there exists λˉ such that yˉ=y+λˉr∈int(S), i.e. there exists ε>0 such that Bε(yˉ)∩V⊂S. Since y∈Λ, then y∈/int(S), and thus, since yˉ∈int(S) and r∈C, we must have λˉ>0.
Since r∈C, then Bε(yˉ)+{λr∣λ≥0}⊂S. Since y∈Λ, by Lemma 15 there exists z∈Λ at distance less than ε from the half line {y+λr∣λ≥λˉ}. Thus z∈Bε(yˉ)+{λr∣λ≥0}, hence z∈int(S), a contradiction.
∎
Given a linear subspace L of Rn, we denote by L⊥ the orthogonal complement of L. Given a set S⊆Rn, the orthogonal projection of S onto L⊥ is the set
[TABLE]
We will use the following result (see Barvinok [7], p. 284 problem 3).
Lemma 17**.**
Given a Λ-subspace L of V, the orthogonal projection of
Λ onto L⊥ is a lattice of L⊥∩V.
Lemma 18**.**
If a linear subspace L of V is not a Λ-subspace of V,
then for every ε>0 there exists y∈Λ∖L at
distance less than ε from L.
Proof.
The proof is by induction on k=dim(L). Assume L is a linear subspace of V that is not a Λ-subspace, and let ε>0. If k=1, then, since the origin [math] is contained in Λ, by Lemma 15 there exists y∈Λ at distance
less than ε from L. If y∈L, then L=⟨y⟩, thus L is a Λ-subspace of V, contradicting our assumption.
Hence we may assume that k≥2 and the statement holds for spaces of dimension k−1.
Case 1:L contains a nonzero vector r∈Λ. Let
[TABLE]
By Lemma 17, Λ′ is a lattice of ⟨r⟩⊥∩V. Also,
L′ is not a lattice subspace of ⟨r⟩⊥∩V with respect to Λ′,
because if there exists a basis a1,…,ak−1 of L′ contained in Λ′, then
there exist scalars μ1,…,μk−1 such that a1+μ1r,…,ak−1+μk−1r∈Λ,
but then r,a1+μ1r,…,ak−1+μk−1r is a basis of L contained in Λ, a contradiction.
By induction, there exists a point y′∈Λ′∖L′ at distance less than ε from L′.
Since y′∈Λ′, there exists a scalar μ such that y=y′+μr∈Λ, and y
has distance less than ε from L.
Case 2:L∩Λ={0}. By Lemma 15,
there exists a nonzero vector y∈Λ at distance less than
ε from L. Since L does not contain any point in
Λ other than the origin, y∈/L.
∎
Lemma 19**.**
Let L be a linear subspace of V with dim(L)=dim(V)−1, and let v∈V. Then v+L is a maximal Λ-free convex set if and only if L is not a lattice subspace of V.
Proof.
(⇒) Let S=v+L and assume that S is a maximal
Λ-free convex set. Suppose by contradiction that L is a
Λ-subspace. Then there exists a basis a1,…,am of
Λ such that a1,…,am−1 is a basis of L. Thus
S={∑i=1mxiai∣xm=β} for some β∈R. Then, K={∑i=1mxiai∣⌈β−1⌉≤xm≤⌈β⌉} strictly contains S and
int(K)∩Λ=∅, contradicting the maximality of
S.
(⇐) Assume L is not a Λ-subspace of V. Since
S=v+L is an affine hyperplane of V, int(S)=∅, thus
int(S)∩Λ=∅, hence we only need to prove that
S is maximal with such property. Suppose not, and let K be a
maximal convex set in V such that int(K)∩Λ=∅
and S⊂K. Then by maximality K is closed. Let w∈K∖S. Since K is closed and convex, conv({w}∪S)⊆K. Since conv({w}∪S)=conv({w}∪(v+L))=conv({v,w})+L, we have that K⊇conv({v,w})+L. Let
ε be the distance between v+L and w+L, and δ
be the distance of conv({v,w})+L from the origin. By
Lemma 18, since L is not a Λ-subspace
of V, there exists a vector y∈Λ∖L at distance
εˉ<ε from L. Moreover, either y or −y
has distance strictly less than δ from conv({v,w})+L. We
conclude that either (⌊εˉδ⌋+1)y or
−(⌊εˉδ⌋+1)y is strictly between
v+L and w+L, and therefore is in the interior of K. Since
these two points are integer multiples of y∈Λ, this is a
contradiction.
∎
(⇐) If S satisfies (ii), then by
Lemma 19, S is a maximal Λ-free convex
set. If S satisfies (i), then, since
int(S)∩Λ=∅, we only need to show that S is
maximal. Suppose not, and let K be a convex set in V such that
int(K)∩Λ=∅ and S⊂K. Given y∈K∖S, there exists a hyperplane H separating y from S such that
F=S∩H is a facet of S. Since K is convex and S⊂K, then conv(S∪{y})⊆K. Since dim(S)=dim(V),
F⊂S hence the relint(F)⊂int(K). By assumption,
there exists x∈Λ∩relint(F), so x∈int(K), a
contradiction.
(⇒) Let S be a maximal Λ-free convex set. We show that S satisfies either (i) or (ii).
Observe that, by maximality, S must be closed.
If dim(S)<dim(V), then S is contained in some affine hyperplane H. Since int(H)=∅, we have S=H by maximality of S, therefore S=v+L where v∈S and L is a hyperplane in V. By Lemma 19, (ii) holds.
Therefore we may assume that dim(S)=dim(V). In particular, since S is convex, int(S)=∅.
By Lemma 13, if S is bounded, (i) holds. Hence we may assume that S is unbounded. Let C be the recession cone of S and L the lineality space of S. By standard convex analysis, S is unbounded if and only if C={0} (see for example Proposition 2.2.3 in [21]).
Claim 1.L=C*. *
By Lemma 16, S+⟨C⟩ is Λ-free. By maximality of S this implies that S=S+⟨C⟩, hence ⟨C⟩⊆L. Since L⊆⟨C⟩, it follows that L=C. ⋄
Let P=projL⊥(S) and Λ′=projL⊥(Λ). By Claim 1, S=P+L and
P⊂L⊥∩V is a bounded set. Furthermore, dim(S)=dim(P)+dim(L)=dim(V) and dim(P)=dim(L⊥∩V). Notice that int(S)=relint(P)+L, hence relint(P)∩Λ′=∅. Furthermore P is inclusionwise maximal among the convex sets of L⊥∩V without points of Λ′ in the relative interior: if not, given a convex set K⊆L⊥∩V strictly containing P and with no point of Λ′ in its relative interior, we have S=P+L⊂K+L, and K+L does not contain any point of Λ in its interior, contradicting the maximality of S.
Claim 2.L* is a Λ-subspace of V.*
By contradiction, suppose L is not a Λ-subspace of V.
Then, by Lemma 18, for every ε>0,
there exists a point in Λ∖L whose distance from L
is at most ε. Therefore, its projection onto L⊥
is a point y∈Λ′∖{0} such that
∥y∥<ε. Let Vε be the linear subspace of
L⊥∩V generated by the points in {y∈Λ′∣∥y∥<ε}. Then dim(Vε)>0.
Notice that, given ε′>ε′′>0, then Vε′⊇Vε′′⊃{0}, hence there exists ε0>0 such that Vε=Vε0 for every ε<ε0. Let U=Vε0.
By definition, Λ′ is dense in U (i.e. for every ε>0 and every x∈U there exists y∈Λ′ such that ∥x−y∥<ε). Thus, since relint(P)∩Λ′=∅, we also have relint(P)∩U=∅. Since dim(P)=dim(L⊥∩V), it follows that relint(P)∩(L⊥∩V)=∅, so in particular U is a proper subspace of L⊥∩V.
Let Q=proj(L+U)⊥(P) and Λ′′=proj(L+U)⊥(Λ′). We show that relint(Q)∩Λ′′=∅. Suppose not, and let y∈relint(Q)∩Λ′′. Then, y+w∈Λ′ for some w∈U. Furthermore, we claim that y+w′∈relint(P) for some w′∈U. Indeed, suppose no such w′ exists. Then (y+U)∩(relint(P)+U)=∅. So there exists a hyperplane H in L⊥∩V separating y+U and P+U. Therefore the projection of H onto (L+U)⊥ separates y and Q, contradicting y∈relint(Q). Thus
z=y+w′∈relint(P) for some w′∈U.
Since z∈relint(P), there exists εˉ>0 such that Bεˉ(z)∩(L⊥∩V)⊂relint(P). Since Λ′ is dense in U and y+w∈Λ′, it follows that Λ′ is dense in y+U.
Hence, since z∈y+U, there exists xˉ∈Λ′ such that ∥xˉ−z∥<εˉ, hence xˉ∈relint(P), a contradiction. This shows relint(Q)∩Λ′′=∅.
Finally, since relint(Q)∩Λ′′=∅, then int(Q+L+U)∩Λ=∅. Furthermore P⊆Q+U, therefore S⊆Q+L+U. By the maximality of S, S=Q+L+U hence the lineality space of S contains L+U, contradicting the fact that L is the lineality space of S and U={0}. ⋄
Since L is a Λ-subspace of V, Λ′ is a lattice
of L⊥∩V by Lemma 17. Since P is a
bounded maximal Λ′-free convex set, it follows from
Lemma 13 that P is a polytope with a point of
Λ′ in the relative interior of each of its facets, therefore
S=P+L has a point of Λ in the relative interior of each of
its facets, and (i) holds.
∎
From the proof of Theorem 10 we get the following.
Corollary 20**.**
Every Λ-free convex set of V is contained in some maximal Λ-free convex set of V.
Proof.
Let S be a Λ-free convex set of V. If S is bounded, the proof of Lemma 13 shows that the corollary holds. If S is unbounded, Claim 1 in the proof of Theorem 10 shows that S+⟨C⟩ is Λ-free, where C is the recession cone of S. Hence we may assume that the lineality space L of S is equal to the recession cone of S. The projection P of S onto L⊥ is bounded. If L is a Λ-subspace, then Λ′=projL⊥Λ is a lattice and P is Λ′-free, hence it is contained in a maximal Λ′-free convex set B of L⊥∩V, and B+L is a maximal Λ-free convex set of V containing S. If L is not a Λ-subspace, then we may define a linear subspace U of L⊥∩V and sets Q and Λ′′ as in the proof of Claim 2. Then proof of Claim 2 shows that Q is a bounded Λ′′-free convex set of V∩(L+U)⊥ and Λ′′ is a lattice, thus Q is contained in a maximal Λ′′-free convex set B of V∩(L+U)⊥, and B+(L+U) is a maximal Λ-free convex set of V containing S.
∎
3 Minimal Valid Inequalities
In this section we will prove Theorem 3.
For ease of notation, we denote Rf(W) simply by Rf in this section.
A linear function Ψ:W→R is of the form
[TABLE]
for some ψ:W→R. Throughout the rest of the paper, capitalized Greek letters indicate linear functions from W to R, while the corresponding lowercase letters indicate functions from W to R as defined in (8).
Definition 21**.**
A function σ:W→R is positively
homogeneous if σ(λr)=λσ(r) for every r∈W and scalar λ≥0, and it is subadditive if
σ(r1+r2)≤σ(r1)+σ(r2) for every r1,r2∈W. The function σ is sublinear if it is positively
homogeneous and subadditive.
Note that if σ is sublinear, then σ(0)=0. One can
easily show that a function is sublinear if and only if it is
positively homogeneous and convex. We also recall that convex
functions are continuous on their domain, so if σ is
sublinear it is also continuous [21].
Definition 22**.**
Inequality ∑r∈Wψ(r)sr≥α
dominates inequality ∑r∈Wψ′(r)sr≥α if
ψ(r)≤ψ′(r) for all r∈W.
Lemma 23**.**
Let Ψ(s)≥α be a valid linear inequality for Rf. Then Ψ(s)≥α is dominated by a valid linear inequality Ψ′(s)≥α for Rf such that ψ′ is sublinear.
Proof: We first prove the following.
Claim 1.For every s∈W such that ∑r∈Wrsr=0 and sr≥0, r∈W, we have ∑r∈Wψ(r)sr≥0.
Suppose not. Then there exists s∈W such that ∑r∈Wrsr=0, sr≥0\mboxforallr∈W and ∑r∈Wψ(r)sr<0. Let xˉ be an integral point in W. For any λ>0, we define sλ∈W by
[TABLE]
Since f+∑r∈Wrsrλ=xˉ, it follows that sλ is in Rf. Furthermore ∑r∈Wψ(r)srλ=ψ(xˉ−f)+λ(∑r∈Wψ(r)sr). Therefore ∑r∈Wψ(r)srλ goes to −∞ as λ goes to +∞. ⋄
We define, for all rˉ∈W,
[TABLE]
By Claim 1, ∑r∈Wψ(r)sr≥−ψ(−rˉ) for all s∈W such that rˉ=∑r∈Wrsr and sr≥0 for all r∈W. Thus the infimum in the above equation is finite and the function ψ′ is well defined. Note also that ψ′(rˉ)≤ψ(rˉ) for all rˉ∈W, as follows by considering s∈W defined by srˉ=1, sr=0 for all r∈W, r=rˉ.
Claim 2.The function ψ′ is sublinear
Note first that ψ′(0)=0. Indeed, Claim 1 implies ψ′(0)≥0, while choosing sr=0 for all r∈W shows ψ′(0)≤0.
Next we show that ψ′ is positively homogeneous. To prove this, let rˉ∈W and s∈W such that rˉ=∑r∈Wrsr and sr≥0 for all r∈W. Let γ=∑r∈Wψ(r)sr. For every λ>0, λrˉ=∑r∈Wr(λsr), λsr≥0 for all r∈W, and ∑r∈Wψ(r)(λsr)=λγ. Therefore ψ′(λrˉ)=λψ′(r).
Finally, we show that ψ′ is convex. Suppose by contradiction that there exist r′,r′′∈W and 0<λ<1 such that ψ′(λr′+(1−λ)r′′)>λψ′(r′)+(1−λ)ψ′(r′′)+ϵ for some positive ϵ. By definition of ψ′, there exist s′,s′′∈W such that r′=∑r∈Wrsr′, r′′=∑r∈Wrsr′′, sr′,sr′′≥0 for all r∈W, ∑r∈Wψ(r)sr′<ψ′(r′)+ϵ and
∑r∈Wψ(r)sr′′<ψ′(r′′)+ϵ. Since ∑r∈Wr(λsr′+(1−λ)sr′′)=λr′+(1−λ)r′′, it follows that
ψ′(λr′+(1−λ)r′′)≤∑r∈Wψ(r)(λsr′+(1−λ)sr′′)<λψ′(r′)+(1−λ)ψ′(r′′)+ϵ, a contradiction. ⋄
Claim 3.The inequality ∑r∈Wψ′(r)sr≥α is valid for Rf.
Suppose there exists sˉ∈Rf such that ∑r∈Wψ′(r)sˉr≤α−ϵ for some positive ϵ. Let {r1,…,rk}={r∈W∣sˉr>0}. For every i=1,…,k, there exists si∈W such that ri=∑r∈Wrsri, sri≥0, r∈W, and ∑r∈Wψ(r)sri<ψ′(ri)+ϵ/(ksˉri).
Let s~=∑i=1ksˉrisi. Then
[TABLE]
hence s~∈Rf. Therefore ∑r∈Wψ(r)s~r≥α since ∑r∈Wψ(r)sr≥α is valid for Rf. Now
[TABLE]
a contradiction.
□
Recall the following definitions from the introduction.
Definition 24**.**
A valid inequality ∑r∈Wψ(r)sr≥α for
Rf is minimal if it is not dominated by any valid linear
inequality ∑r∈Wψ′(r)sr≥α for Rf such that
ψ′=ψ.
Definition 25**.**
Let V be the affine hull of (f+W)∩Zq.
Let C∈Rℓ×q and d∈Rℓ be such that V={x∈f+W∣Cx=d}. Given two valid inequalities ∑r∈Wψ(r)sr≥α and ∑r∈Wψ′(r)sr≥α′
for Rf(W), we say that they are equivalent if there exist
ρ>0 and λ∈Rℓ such that
ψ(r)=ρψ′(r)+λTCr and
α=ρα′+λT(d−Cf).
Lemma 26**.**
*Let Ψ(s)≥α and Ψ′(s)≥α′ be two equivalent valid linear inequalities for Rf.
(i) The function ψ is sublinear if and only if ψ′ is sublinear.
(ii) Inequality Ψ(s)≥α is dominated by a minimal valid linear inequality if and only if Ψ′(s)≥α′ is dominated by a minimal valid linear inequality. In particular, Ψ(s)≥α is minimal if and only if Ψ′(s)≥α′ is minimal.*
Proof.
Since Ψ(s)≥α and Ψ′(s)≥α′ are equivalent, by definition there exist ρ>0 and
λ∈Rℓ, such that ψ(r)=ρψ′(r)+λTCr and
α=ρα′+λT(d−Cf). This proves (i).
Point (ii) follows from the fact that, given a function ψˉ′ such that ψˉ′(r)≤ψ′(r) for every r∈W, then the function ψˉ defined by ψˉ(r)=ρψˉ′(r)+λTCr, r∈W, satisfies ψˉ(r)≤ψ(r) for every r∈W. Furthermore ψˉ(r)<ψ(r) if and only if ψˉ′(r)<ψ′(r).
∎
Given a nontrivial valid linear inequality Ψ(s)≥α for Rf such that ψ is sublinear, we consider the set
[TABLE]
Since ψ is continuous, Bψ is closed. Since
ψ is convex, Bψ is convex. Since ψ defines a valid inequality, Bψ is lattice-free. Indeed the interior of Bψ is int(Bψ)={x∈f+W:ψ(x−f)<α}. Its boundary is bd(Bψ)={x∈f+W:ψ(x−f)=α}, and its recession cone is rec(Bψ)={x∈f+W:ψ(x−f)≤0}. Note that f is in the interior of Bψ if and only if α>0 and f is on the boundary if and only if α=0.
Remark 27**.**
Given a linear inequality of the form Ψ(s)≥1 such that ψ(r)≥0 for all r∈W,
[TABLE]
Proof.
Let r∈W. If ψ(r)>0, let t be the minimum positive number such that f+t−1r∈Bψ. Then f+t−1r∈bd(Bψ), hence ψ(t−1r)=1 and by positive homogeneity ψ(r)=t. If ψ(r)=0, then r∈rec(Bψ), hence f+t−1r∈Bψ for every t>0, thus the infimum in the above equation is [math].∎
This remark shows that, if ψ is nonnegative, then it is the gauge of the convex set Bψ−f (see [21]).
Before proving Theorem 3, we need the following general theorem about sublinear functions. Let K be a closed, convex set in W with the origin in its interior. The polar of K is the set K∗={y∈W∣ry≤1\mboxforallr∈K}. Clearly K∗ is closed and convex, and since 0∈int(K), it is well known that K∗ is bounded. In particular, K∗ is a compact set. Also, since 0∈K, K∗∗=K (see [21] for example). Let
[TABLE]
Note that K^ is contained in the relative boundary of K∗. Let ρK:W→R be defined by
Let K⊂W be a closed convex set containing the origin in its interior. Then K={r∈W∣ρK(r)≤1}. Furthermore, for every sublinear function σ such that K={r∣σ(r)≤1}, we have ρK(r)≤σ(r) for every r∈W.
Remark 29**.**
Let K⊂W be a polyhedron containing the origin in its interior. Let a1,…,at∈W such that K={r∈W∣air≤1,i=1,…,t}. Then ρK(r)=maxi=1,…,tair.
Proof.
The polar of K is K∗=conv{0,a1,…,at} (see Theorem 9.1 in Schrijver [24]). Furthermore, K^ is the union of all the facets of K∗ that do not contain the origin, therefore
[TABLE]
for all r∈W.
∎
Remark 30**.**
Let B be a closed lattice-free convex set in f+W with f in its interior, and let K=B−f. Then the inequality ∑r∈WρK(r)sr≥1 is valid for Rf.
Proof: Let s∈Rf. Then x=f+∑r∈Wrsr is integral, therefore x∈/int(B) because B is lattice-free. By Theorem 28, ρK(x−f)≥1. Thus
[TABLE]
where the second inequality follows from the subadditivity of ρK and the last from the positive homogeneity.
□
Lemma 31**.**
Given a maximal lattice-free convex set B of f+W containing f in its interior, ΨB(s)≥1 is a minimal valid inequality for Rf.
Proof.
Let Ψ(s)≥1 be a valid linear inequality for Rf such that ψ(r)≤ψB(r) for all r∈W. Then Bψ⊃B and Bψ is lattice-free. By maximality of B, B=Bψ. By Theorem 28 and Remark 29, ψB(r)≤ψ(r) for all r∈W, proving ψ=ψB.
∎
Let Ψ(s)≥α be a nontrivial valid linear inequality for Rf. By Lemma 23, we may assume that ψ is sublinear.
Claim 1**.**
If int(Bψ)∩V=∅, then Ψ(s)≥α is trivial.
Suppose int(Bψ)∩V=∅ and let s∈V such that sr≥0 for every r∈W. Let x=f+∑r∈Wrsr. Since s∈V, x∈V, so x∈/int(Bψ). This implies
[TABLE]
where the last inequality follows from the sublinearity of ψ. ⋄
Claim 2**.**
If f∈V and α≤0, then int(Bψ)∩V=∅.
Suppose f∈V, α≤0 but int(Bψ)∩V=∅. Then dim(int(Bψ)∩V)=dim(V), hence int(Bψ)∩V contains a set X of dim(V)+1 affinely independent points. For every x∈X and every λ>0, ψ(λ(x−f))=λψ(x−f)<0, where the last inequality is because x∈int(Bψ). Hence the set Γ=f+cone{x−f∣x∈X} is contained in int(Bψ). Since Γ has dimension equal to dim(V) and V is the convex hull of its integral points, Γ∩Zq=0, contradicting the fact that Bψ has no integral point in its interior. ⋄
Claim 3**.**
If f∈/V, then there exists a valid linear inequality Ψ′(s)≥1 for Rf equivalent to Ψ(s)≥α.
Since f∈/V, Cf=d, hence there exists a row ci of C such that di−cif=0. Let λ=(1−α)(di−cif)−1, and define ψ′(r)=ψ(r)+λcir for every r∈W. The inequality Ψ′(s)≥1 is equivalent to Ψ(s)≥α. ⋄
Thus, by Claims 1, 2 and 3 there exists a valid linear inequality Ψ′(s)≥1 for Rf equivalent to Ψ(s)≥α. By Lemma 26, ψ′ is sublinear and Ψ(s)≥α is dominated by a minimal valid linear inequality if and only if Ψ′(s)≥α′ is dominated by a minimal valid linear inequality. Therefore we only need to consider valid linear inequalities of the form Ψ(s)≥1 where ψ is sublinear. In particular the set Bψ={x∈W∣ψ(x−f)≤1} contains f in its interior.
Let K={r∈W∣ψ(r)≤1}, and let K^ be defined as in (9).
Claim 4**.**
The inequality ∑r∈WρK(r)sr≥1 is valid for Rf and ψ(r)≥ρK(r) for all r∈W.
Note that Bψ=f+K. Thus, by Remark 30, ∑r∈WρK(r)sr≥1 is valid for Rf. Since ψ is sublinear, it follows from Theorem 28 that ρK(r)≤ψ(r) for every r∈W. ⋄
By Claim 4, since ρK is sublinear, we may assume that ψ=ρK.
Claim 5**.**
There exists a valid linear inequality Ψ′(s)≥1 for Rf dominating Ψ(s)≥1 such that ψ′ is sublinear, Bψ′ is a polyhedron, and rec(Bψ′∩V)=lin(Bψ′∩V).
Since Bψ is a lattice-free convex set, it is contained in some maximal lattice-free convex set S by Corollary 20. The set S satisfies one of the statements (i)-(iii) of Theorem 9. By Claim 1, int(S)∩V=∅, hence case (iii) does not apply. Case (ii) does not apply because dim(S)=dim(Bψ)=dim(W). Therefore case (i) applies. Thus S is a polyhedron and S∩V is a maximal lattice-free convex set in V. In particular, by Theorem 10, rec(S∩V)=lin(S∩V). Since S is a polyhedron containing f in its interior, there exists A∈Rt×q and b∈Rt such that bi>0, i=1,…,t, and S={x∈f+W∣A(x−f)≤b}. Without loss of generality, we may assume that supx∈Bψai(x−f)=1 where ai denotes the ith row of A, i=1,…,t. By our assumption, supr∈Kair=1. Therefore ai∈K∗, since air≤1 for all r∈K. Furthermore ai∈cl(K^), since supr∈Kair=1.
Let Sˉ={x∈f+W∣A(x−f)≤e}, where e denotes the vector of all ones. Then Bψ⊆Sˉ⊆S.
Let Q={r∈W∣Ar≤e}. By Remark 29,
ρQ(r)=maxi=1,…,tair for all r∈W. Since Sˉ⊆S, Sˉ is lattice-free, by Remark 30 the inequality ∑r∈WρQ(r)sr≥1 is valid for Rf. Furthermore, since {a1,…,at}⊂cl(K^), by Claim 4 we have
[TABLE]
for all r∈W. Let ψ′=ρ(Q). Note that Bψ′=Sˉ. So, rec(Bψ′)=rec(Sˉ)={r∈W∣Ar≤0}=rec(S). Since rec(S∩V)=lin(S∩V), then rec(Bψ′∩V)=lin(Bψ′∩V). ⋄
By Claim 5, we may assume that Bψ={x∈f+W∣A(x−f)≤e}, where A∈Rt×q and e is the vector of all ones, and that rec(Bψ∩V)=lin(Bψ∩V). Let a1,…,at denote the rows of A. By Claim 4 and Remark 29,
[TABLE]
Let G be a matrix such that W={r∈Rq∣Gr=0}.
Claim 6**.**
There exists λ∈Rℓ such that ψ(r)+λTCr≥0 for all r∈W.
Given λ∈Rℓ, then by (11) ψ(r)+λTCr≥0 for every r∈W if and only if minr∈W(maxi=1,…,tair+λTCr)=0. The latter holds if
and only if
[TABLE]
By LP duality,
this holds if and only if the following system is feasible
[TABLE]
Clearly the latter is equivalent to
[TABLE]
Note that rec(Bψ∩V)={r∈Rq∣Ar≤0,Cr=0,Gr=0} and lin(Bψ∩V)={r∈Rq∣Ar=0,Cr=0,Gr=0}.
Since rec(Bψ∩V)=lin(Bψ∩V), the system
[TABLE]
is infeasible. By Farkas Lemma, this is the case if and only if there exists γ≥0, λ, μ~, and τ such that
[TABLE]
If we let y=γ+eτ and μ=−μ~, then (y,λ,μ) satisfies (12). By the previous argument, λ satisfies the statement of the claim. ⋄
Let λ as in Claim 6, and let ψ′ be the function defined by ψ′(r)=ψ(r)+λTCr for all r∈W. So ψ′(r)≥0 for every r∈W. Let α′=1+λT(d−Cf). Then the inequality Ψ′(s)≥α′ is valid for Rf and it is equivalent to Ψ(s)≥α. If α′≤0, then Ψ′(s)≥α′ is trivial. Thus α′>0. Let ρ=1/α′ and let ψ′′=ρψ′. Then Ψ′′(s)≥1 is equivalent to Ψ(s)≥1. By Lemma 26(i), ψ′′ is sublinear.
Let B be a maximal lattice-free convex set of f+W containing Bψ′′. Such a set B exists by Corollary 20.
Claim 7**.**
ψ′′(r)≥ψB(r)* for all r∈W.*
Let r∈rec(Bψ′′). Since ψ′′ is nonnegative, ψ′′(r)=0. Since rec(Bψ′′)⊆rec(B), ψB(r)≤0=ψ′′(r). Let r∈/rec(Bψ′′). Then f+τr∈bd(Bψ′′) for some τ>0, hence ψ′′(τr)=1 and, by positive homogeneity, ψ′′(r)=τ−1. Because Bψ′′⊂B, f+τr∈B. Since B={x∈f+W∣ψB(x−f)≤1}, it follows that ψB(τr)≤1, implying ψB(r)≤τ−1=ψ′′(r). ⋄
Claim 7 shows that Ψ′′(s)≥1 is dominated by ΨB(s)≥1, which is minimal by Lemma 31. By Lemma 26(ii), Ψ(s)≥1 is dominated by a minimal valid linear inequality which is equivalent to ΨB(s)≥1.
∎
*Example. *
We illustrate the end of the proof in an example. Suppose W={x∈R3∣x2+2x3=0}, and let f=(21,0,0). Note that f+W=W. All integral points in W are of the form (k,0,0), k∈Z, hence V={x∈W∣x2=0}. Thus V={s∈W∣∑r∈Wr2sr=0}.
Consider the function ψ:W→R defined by
ψ(r)=max{−4r1−4r2,4r1−4r2}. The set Bψ={x∈W∣−4(x1−21)−4x2≤1,4(x1−21)−4x2≤1}
does not contain any integral point, hence Ψ(s)≥1 is valid
for Rf. Note that Bψ is not maximal (see
Figure 2).
Setting λ=4 in Claim 6, let
ψ′(r)=ψ(r)+λr2 for all r∈W. Note that
ψ′(r)=max{−4r1,4r1}≥0 for all r∈W. The set
Bψ′={x∈W∣−4(x1−21)≤1,4(x1−21)≤1} is contained in the maximal lattice-free convex set
B={x∈W∣−2(x1−21)≤1,2(x1−21)≤1},
hence ψ′ is pointwise larger than the function ψB defined
by ψB(r)=max{−2r1,2r1} and ΨB(s)≥1 is valid for
Rf. This completes the illustration of the proof.
Note that ΨB(s)≥1 does not dominate Ψ(s)≥1.
However the inequality ΨB(s)≥1 is equivalent to a valid
inequality Ψ(s)≥1 which dominates Ψ(s)≥1. We show how to construct ψˉ in our example. The function
ψˉ is defined by ψB(r)−λr2 for all r∈W
is pointwise smaller than ψ and Ψˉ(s)≥1 is valid
for Rf. Moreover, Bψˉ={x∈W∣−2(x1−21)−4x1≤1,2(x1−21)−4x1≤1} is a maximal
lattice-free convex set containing Bψ. Note that the recession
cones of Bψ and Bψˉ are full dimensional, hence
ψ and ψˉ take negative values on elements of the
recession cone. For example ψ(0,−1,21)=ψˉ(0,−1,21)=−4. The recession cones of
Bψ′ and B coincide and are not full dimensional, thus
ψ′(0,−1,21)=ψB(0,−1,21)=0, since the vector (0,−1,21) is in
the recession cone of B.
4 The intersection of all minimal inequalities
In this section we prove Theorem 4. First we need the following.
Lemma 32**.**
Let ψ:W→R be a continuous function that is
positively homogeneous. Then the function Ψ:W→R, defined by Ψ(s)=∑r∈Wψ(r)sr, is continuous with respect to
(W,∥⋅∥H).
Proof:
Define γ=sup{∣ψ(r)∣:r∈W,∥r∥=1}. Since the
set {r∈Rf(W):∥r∥=1} is compact and ψ is
continuous, γ is well defined (that is, it is finite). Given
s,s′∈W, we will show ∣Ψ(s′)−Ψ(s)∣≤γ∥s′−s∥H, which implies that Ψ is continuous. Indeed
“⊆” By Lemma 32, ΨB is continuous in
(W,∥⋅∥H) for every B∈BW,
therefore {s∈W:ΨB(s)≥1} is a closed
half-space of (W,∥⋅∥H). It is immediate to
show that also {s∈W:sr≥0,r∈W} is a
closed set in (W,∥⋅∥H). Since V={s∈W∣∑r∈W(Cr)sr=d−Cf}, and since for each row ci of C the function r↦cir is positive homogeneous, then by Lemma 32V is also closed.
Thus
[TABLE]
is an intersection of closed sets, and is
therefore a closed set of (W,∥⋅∥H). Thus,
since it contains conv(Rf(W)), it also contains
conv(Rf(W)).
“⊇” We only need to show that, for every
sˉ∈V such that sˉ∈/conv(Rf(W))
and sˉr≥0 for every r∈W, there exists B∈BW
such that ∑r∈WψB(r)sˉr<1.
The theorem of Hahn-Banach implies the following.
Given a closed convex set A in (W,∥⋅∥H) and a
point b∈/A, there exists a continuous linear function
Ψ:W→R that strictly separates A
and b, i.e. for some α∈R, Ψ(a)≥α for
every a∈A, and Ψ(b)<α.
Therefore, there exists a linear function
Ψ:W→R such that Ψ(sˉ)<α and Ψ(s)≥α for every s∈conv(Rf).
By the first part of Theorem 3, we may assume that Ψ(s)≥α is a nontrivial minimal
valid linear inequality. By the second part of Theorem 3, this
inequality is equivalent to an inequality of the form ∑r∈WψB(r)sr≥1 for some maximal lattice-free convex set B of
W with f in its interior.
∎
By Theorem 10, L:=lin(B) is a rational space
and P is a polytope. Also, by construction, ψB(rj)=1 for
j=1,…,k, and ψB(rj)=0 for j=k+1,…,k+h. Thus
∑j=1ksj≥1 is a valid inequality for
Rf(r1,…,rk+h). We recall that, by
Theorem 3, ∑r∈RqψB(r)sr≥1
is a minimal valid inequality for Rf(Rq).
We first show that Rf(r1,…,rk+h) is nonempty. Since
rk+1,…,rk+h are rational, there exists a positive
integer N such that Nrj is integral for j=k+1,…,k+h.
Since f∈int(B), it follows that, for every r∈Rq, there
exists s∈Rk+h such that r=∑j=1k+hrjsj and
sj≥0, j=1,…,k. Thus, given xˉ∈Zn, there
exists sˉ such that xˉ−f=∑j=1k+hrjsˉj and
sj≥0, j=1,…,k. Let λ be a positive integer
such that sˉ+λN∑j=k+1k+hej≥0, where ej
denotes the jth unit vector in Rk+h. Then sˉ+λN∑j=k+1k+hej≥0∈Rf(r1,…,rk+h).
“⇐” Let us assume that ∑j=1ksj≥1 is an
extreme inequality for Rf(r1,…,rk+h). We show that
∑r∈RqψB(r)sr≥1 is extreme for Rf(Rq).
Let ∑r∈Rqψi(r)sr≥1, i=1,2, be valid
inequalities for Rf(Rq) such ψB≥21(ψ1+ψ2). We will show that ψB=ψ1=ψ2. Since
ψB is minimal and ψB≥21(ψ1+ψ2), then
ψ1,ψ2 are both minimal. Thus, given Bi={x∈Rq:ψi(x−f)≤1}, i=1,2, B1 and B2 are maximal
lattice-free convex sets. Furthermore, since ∑j=1ksj≥1 is extreme for Rf(r1,…,rk+h), then
ψ1(rj)=ψ2(rj)=ψB(rj) for j=1,…,k+h. In
particular, ψ1(rj)=ψ2(rj)=1 for j=1,…,k and
ψ1(rj)=ψ2(rj)=0 for j=k+1,…,k+h. Hence B1 and
B2 contain B, since they contain the vertices of P and their
lineality space contains rk+1,…,rk+h. By the maximality
of B, B1=B2=B, therefore ψB=ψ1=ψ2, proving that
∑r∈RqψB(r)sr≥1 is extreme.
“⇒” Let us assume that ∑r∈RqψB(r)sr≥1 is an extreme inequality for Rf(Rq).
We prove that ∑j=1ksj≥1 is an extreme inequality
for Rf(r1,…,rk+h).
Let α,β∈Rk+h be vectors such that αs≥1 and βs≥1 are valid for Rf(r1,…,rk+h), and
21(αj+βj)≤ψB(rj), j=1,…,k+h.
We will show that it must follow that αj=βj=ψB(rj)
for j=1,…,k+h.
We first observe that, for j=k+1,…,k+h, αj=βj=0.
If not, since 21(αj+βj)≤ψB(rj)=0 for
j=k+1,…,k+h, then we may assume that αj<0 for some
j∈{k+1,…,k+h}. Given sˉ∈Rf(r1,…,rk+h)
it now follows that sˉ+λNej∈Rf(r1,…,rk+h) for every positive integer λ.
However, limλ→+∞α(sˉ+λNej)=αsˉ+limλ→+∞λNαj=−∞, contradicting the fact that αs≥1 is
valid for Rf(r1,…,rk+h).
Define, for every r∈Rq,
[TABLE]
Note that, for every r∈Rq, the above linear program is
feasible. We also observe that, for every xˉ∈Zq,
ψα(xˉ−f)≥1. Indeed, given sˉ∈Rk+h
such that ψα(xˉ−f)=αsˉ and
∑j=1k+hrjsˉj=xˉ−f, sj≥0 for
j=1,…,k, then there exists a positive integer λ such
that s~=sˉ+λN∑j=k+1k+hej∈Rf(r1,…,rk+h). Since αj=0, j=k+1,…,k+h,
then ψα(xˉ−f)=αsˉ=αs~≥1.
The above fact also implies that the linear
program (13) admits a finite optimum for every
r∈Rq.
We show that ∑r∈Rqψα(r)sr≥1 is a valid
inequality for Rf(Rq). The function ψα is sublinear
(the proof is similar to the one of Lemma 23).
Therefore, for every s∈Rf(Rq), given xˉ=f+∑r∈Rqrsr, it follows that
[TABLE]
We may define ψβ similarly. It now follows that the sets
Bψα and Bψβ are lattice-free convex
sets with f in their interior.
By definition, ψα(rj)≤αj and
ψβ(rj)≤βj, j=1,…,k+h.
Let ψ=21(ψα+ψβ). We will show that
ψ=ψB. Indeed, ψ(rj)≤21(αj+βj)≤ψB(rj), j=1,…,k+h. Thus
ψ(rj)≤1 for j=1,…,k and ψ(rj)≤0 for
j=k+1,…,k+h. In particular, f+r1,…,f+rk∈Bψ
and rk+1,…,rk+h∈rec(Bψ). Thus Bψ⊇B. Since ψ is a convex combination of ψα and
ψβ, it follows that ∑r∈Rqψ(r)sr≥1 is a
valid inequality for Rf(Rq). Thus Bψ is a lattice-free
convex set. Since B is maximal, it follows that Bψ=B. Hence
ψ=ψB.
Since ψB is extreme, it follows that
ψα=ψβ=ψB. Hence
αj=βj=ψB(rj), j=1,…,k+h. □
Bibliography25
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] K. Andersen, Q. Louveaux, R. Weismantel, L. A. Wolsey, Cutting Planes from Two Rows of a Simplex Tableau , Proceedings of IPCO XII, Ithaca, New York (June 2007), Lecture Notes in Computer Science 4513 , 1–15.
2[2] K. Andersen, Q. Louveaux, R. Weismantel, An Analysis of Mixed Integer Linear Sets Based on Lattice Point Free Convex Sets , Mathematics of Operations Research 35 (2010), 233–256.
3[3] K. Andersen, Q. Louveaux, and R. Weismantel. Certificates of Linear Mixed Integer Infeasibility , Operations Research Letters 36 (2008), 734–738.
4[4] K. Andersen, Q. Louveaux and R. Weismantel, Mixed-Integer Sets Associated from Two Rows of Two Adjacent Simplex Bases , Mathematical Programming B 124 (2010) 455–480.
5[5] K. Andersen, C. Wagner, R. Weismantel, On an Analysis of the Strength of Mixed Integer Cutting Planes from Multiple Simplex Tableau Rows , SIAM Journal on Optimization 20 (2009) 967–982.
6[6] E. Balas, Intersection Cuts – A New Type of Cutting Planes for Integer Programming , Operations Research 19 (1971), 19–39.
7[7] A. Barvinok, A Course in Convexity , Graduate Studies in Mathematics 54 , American Mathematical Society, Providence, Rhode Island, 2002.
8[8] A. Basu, P. Bonami, G. Cornuéjols, F. Margot, On the Relative Strength of Split, Triangle and Quadrilateral Cuts , to appear in Mathematical Programming. DOI: 10.1007/s 10107-009-0281-x