On the extension complexity of scheduling
Hans Raj Tiwary, Victor Verdugo, Andreas Wiese

TL;DR
This paper investigates the extension complexity of linear programming formulations for the minimum makespan scheduling problem, revealing exponential lower bounds for natural formulations and proposing more efficient approximate models.
Contribution
It establishes exponential lower bounds on the extension complexity of canonical and configuration formulations for scheduling, and introduces polynomial-size approximate formulations.
Findings
Canonical formulation has exponential extension complexity.
Configuration integer program has even larger lower bounds.
Polynomial-size approximate formulations exist for general settings.
Abstract
Linear programming is a powerful method in combinatorial optimization with many applications in theory and practice. For solving a linear program quickly it is desirable to have a formulation of small size for the given problem. A useful approach for this is the construction of an extended formulation, which is a linear program in a higher dimensional space whose projection yields the original linear program. For many problems it is known that a small extended formulation cannot exist. However, most of these problems are either -hard (like TSP), or only quite complicated polynomial time algorithms are known for them (like for the matching problem). In this work we study the minimum makespan problem on identical machines in which we want to assign a set of given jobs to machines in order to minimize the maximum load over the machines. We prove that the canonical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScheduling and Optimization Algorithms · Advanced Graph Theory Research · Constraint Satisfaction and Optimization
On the extension complexity of scheduling
††thanks: This work was partially supported by GA ČR project 17-09142S and FONDECYT Regular grant 1170223.
Hans Raj Tiwary Department of Applied Mathematics, Charles University. [email protected]
Victor Verdugo Department of Mathematics, London School of Economics and Political Science. [email protected] of Engineering Sciences, Universidad de O’Higgins. [email protected]
Andreas Wiese Department of Industrial Engineering, Universidad de Chile. [email protected]
Abstract
Linear programming is a powerful method in combinatorial optimization with many applications in theory and practice. For solving a linear program quickly it is desirable to have a formulation of small size for the given problem. A useful approach for this is the construction of an extended formulation, which is a linear program in a higher dimensional space whose projection yields the original linear program. For many problems it is known that a small extended formulation cannot exist. However, most of these problems are either -hard (like TSP), or only quite complicated polynomial time algorithms are known for them (like for the matching problem). In this work we study the minimum makespan problem on identical machines in which we want to assign a set of given jobs to machines in order to minimize the maximum load over the machines. We prove that the canonical formulation for this problem has extension complexity , even if each job has size 1 or 2 and the optimal makespan is 2. This is a case that a trivial greedy algorithm can solve optimally! For the more powerful configuration integer program we even prove a lower bound of . On the other hand, we show that there is an abstraction of the configuration integer program admitting an extended formulation of size . In addition, we give an -approximate integral formulation of polynomial size, even for arbitrary processing times and for the far more general setting of unrelated machines.
1 Introduction
In order to solve a linear program quickly one is interested in a formulation with as few variables and constraints as possible. A useful technique for this are extended formulations. A polytope is said to be an extended formulation or extension of a polytope if is a linear projection of . There are many examples of polytopes that require many constraints to be described but that admit extended formulations that are much smaller. For instance, the convex hull of all characteristic vectors of spanning trees in a graph with vertex needs inequalities to be described [4], but it admits an extended formulation of size [13]. The extension complexity of a polytope is the minimum number of inequalities needed to describe an extended formulation of it, see [2, 11, 20, 21] for surveys on the topic. We study the classical scheduling problem of assigning jobs on identical machines to minimize the makespan, also known as in the scheduling literature. We are given a set of jobs and a set of identical machines and every job has a processing time . The goal is to assign each job on a machine in order to minimize the maximum load over the machines, where the load of a machine is the sum of the processing times of the jobs assigned to it.
1.1 Our Contribution
A natural formulation for , known as the assignment integer program, uses a variable for each combination of a machine and a job , modelling whether is assigned to . The makespan is then modeled by an additional variable . Then, its linear relaxation is given by
[TABLE]
Lower bounds on the extension complexity. We prove that there are instances with jobs and machines such that the convex hull of all integral solutions to the above linear program has an extension complexity of . The optimal solutions form a face of and our bound also holds for this face and hence for all polytopes containing it as a face. Our instances satisfy that for each job and the optimal makespan is 2. Such instances can be solved optimally by a simple greedy algorithm in time . Our key insight is that there are faces of in which some jobs cannot be assigned to certain machines, e.g., defined via equalities of the form . Hence, an extended formulation of also yields such a formulation for the polytope of any instance of the *restricted assignment problem *in which we have the same input as in and additionally for each job there is a set of machines such that must be assigned to a machine in . Using a result in [6] we show that there are 3-Bounded-2-SAT instances such that the polytope describing all feasible solutions for them has extension complexity . This might be of independent interest, in particular since 2-SAT can be solved easily in polynomial time. We reduce these instances to the restricted assignment problem using a reduction from 3-Bounded-2-SAT to the restricted assignment problem such that for each job and the optimal makespan is 2 [3]. Hence, the polytope of all optimal solutions has extension complexity . By the above this also holds for the polytope of the corresponding instance of where we ignore the sets . Moreover, we show that this holds also for each subpolytope containing all optimal solutions.
Then we consider the variation of the above formulation in which we omit the variable and the second constraint. The set of integral points is simply the set of *all *schedules and the resulting (polynomial size) LP is integral. Note that hence in this space there exists an integral linear program of small complexity containing all optimal schedules. However, we show that if we restrict this polytope to the convex hull of all *optimal *schedules then resulting polytope again has extension complexity . We show that our bounds are almost tight by giving an extended formulation of size for the convex hull of all optimal solutions, also for arbitary processing times and for the formulation in which the makespan is a variable.
*Approximate schedules: Lower bound. *For there is a polynomial time -approximation algorithm known and even an EPTAS, see e.g., [8, 9]. Therefore, one might wonder whether we can obtain a polynomial size extended formulations that contains all optimal schedules and possibly also some -approximate schedules, e.g., for . We show that this is not possible for any . Even more, if we ask for a small polytope that contains all -approximate schedules for some value we show that this is does not exist for any with . Moreover, this is tight in the sense that the set of all -approximate schedules is simply the set of all schedules which admits the polynomial size formulation mentioned above.
Approximate schedules: Upper bound. Despite this negative result, we show that there is a polynomial size formulation for a polytope that contains all optimal schedules and *some *-approximate schedules if the target makespan is fixed. Our construction works for arbitrary job processing times and even in the more complex setting of unrelated machines, i.e., , where for each combination of a machine and a job there is a value denoting the processing time of when it is assigned to machine . Key to our extended formulation is to construct an instance of bipartite matching in which there is a vertex for each job which can be matched to vertices representing slots on the machines. Then we prove that in the space with the makespan as a variable such a formulation cannot exist which yields a separation between the two spaces even though they might appear very similar at first glance.
Configuration integer program. Finally, we study the configuration integer program which is a popular approach in scheduling, e.g., [19, 10], with connections to bin packing, see e.g., [7, 15]. There is a variable for each combination of a machine and a configuration where , contains all sets of jobs whose total processing size does not exceed an upper bound on the optimal makespan. While for large already the number of variables can be exponential, for constant this number is only polynomial and could potentially admit a small extended formulation. However, we prove that already for the case that for each job and there are instances with jobs and machines such that this linear program has extension complexity . To show this, we establish the maybe surprising connection that there are such instances for which the corresponding polytope is an extended formulation of the perfect matching polytope in a graph with vertices and the latter has extension complexity [16]. On the other hand, there is an abstraction of the configuration integer program which instead of assigning a configuration to each machine only assigns a pattern that describes how many jobs of each size are assigned to but without specifying the actual jobs. We prove that in contrast to the configuration integer program this abstraction admits an extended formulation of size for some function .
1.2 Related work
Lower bounds. There are many examples known of polytopes that do not admit small extended formulations, i.e., formulations of polynomial size. For instance, Yanakakis [22] proved that for TSP there can be no such formulation that is symmetric, i.e., stays invariant under permutation of cities. Recently, it became an active field of research to prove such lower bounds. For instance, Fiorini et al. [5] extended the above result for TSP to arbitrary (possibly non-symmetric) formulations and Avis et al. [1] showed that neither for 3-SAT, subset sum, 3D-matching, nor for MaxCut for suspensions of cubic planar graphs, there can be small extended formulations. Note that all these problems are -hard and hence a polynomial size extended formulation for any of them would be very surprising.
Easy problems with large extension complexity. There are only few problems in for which the corresponding polytope is known to have large extension complexity. The most famous example is probably the perfect matching polytope for which Rothvoss showed in his celebrated result that it has exponential extension complexity [16]. While the matching problem is in , the polynomial time algorithm for it is highly complicated. Also, Rothvoss showed that there exists a family of matroids whose associated polytopes have exponential extension complexity [14]. This contrasts with the fact that we can optimize over any matroid in polynomial time using the greedy algorithm [17].
2 Extension complexity: Lower bound
Suppose that we are given an instance of . We consider the linear program defined by (1)-(4) in Section 1.1. Denote by the convex hull of all its integral solutions. In the remainder of this section we prove the following theorem.
Theorem 1**.**
For every there exists an instance of with jobs, machines, and such that .
Let . For any given SAT formula with variables we define the polytope as the convex hull of all satisfying assignments, i.e., . We use the following theorem that follows easily from [6].
Theorem 2**.**
For every there exists a 2-SAT formula with variables and clauses such that . Each clause of contains exactly two literals.
Proof of Theorem 2.
Let . In [6] it is shown that there exists a graph with vertices such that for its independent set polytope it holds that , i.e., is the convex hull of all incidence vectors of independent sets of . Moreover, the degree of is bounded by a global constant that is independent of . Based on we construct a 2-SAT formula . For each node we introduce a variable , the intuition being that is true if is in the independent set. For each edge we introduce a clause , modelling that not both and can be in the independent set. The number of variables is and since has bounded degree each variable appears in at most clauses. Hence, the number of clauses is also . Each satisfying assignment of corresponds to an independent set of and vice versa. Therefore, . ∎
Let denote the formula due to Theorem 2. We transform into an equivalent 3-Bounded-2-SAT formula using a standard reduction i.e., is a 2-SAT formula in which each variable appears at most three times. Let be a variable in and assume that occurs times. We introduce new variables and for each we replace the -th occurrence of with . Additionally, we add the clauses \Big{(}\neg x_{i}^{(\ell)}\vee x_{i}^{(\ell+1)}\Big{)} for each and the clause \Big{(}\neg x_{i}^{(k)}\vee x_{i}^{(1)}\Big{)}. Hence, in any satisfying assignment of the resulting formula, either for each or for each . We do this transformation with each variable in . Let denote the resulting formula. By construction, we have that has variables and clauses, using that both quantities are linear in the number of literals in and the latter is bounded by . By construction, each variable appears exactly three times and at most two times positively and at most two times negatively. Also, each clause contains exactly two literals.
2.1 Reduction to the restricted assignment problem
Construction. Next, based on we construct an instance of the restricted assignment problem. We invoke the reduction from [3]. For each variable in we introduce a machine , a machine , and a job . The job has processing time and it can be assigned only on and , i.e., . The intuition behind is that if is scheduled on then is true and if is scheduled on then is false. For each clause we introduce one machine . For each variable that occurs in we introduce a job with . If occurs positively in then we define , otherwise we define . Finally, for each clause we introduce a job with and .
Since the total number of variables and clauses is , we introduced jobs and machines. Let denote the set of jobs and let denote the number of machines defined so far. We want that in solutions with makespan 2 each machine has a load of exactly 2. To this end, we introduce a set of dummy jobs of length 1 each. For each dummy job , it can be assigned to any machine, i.e., .
Correctness. For each satisfying assignment of there is a schedule of makespan 2: If a variable is assigned to be in the satisfying assignment then we schedule on machine , otherwise we schedule on machine . Consider a clause . There must be at least one variable that satisfies . For each such variable , if occurs positively in then we assign on , otherwise we assign on . For each variable that does not satisfy we assign to . Also, we assign to . Using that each variable appears at most twice positively and at most twice negatively, one can check that the resulting makespan is 2. Finally, we assign the dummy jobs to the machines such that each machine still has a makespan of at most 2. This is the optimal solution since the largest processing time is 2. One can also easily show that if there is a solution of makespan 2 then there exists a satisfying assignment for , see [3] for details.
Faces of scheduling polytope.
Let and denote the set of jobs and machines in the construction above, respectively. Also, let denote the set of allowed machines for each job . We consider the polyhedron . Note that ignores the sets of allowed machines for each job . We argue that there is a face of such that each vertex of corresponds to a schedule in which each job is assigned on a machine in and the makespan is 2. Observe that the inequalities and are valid inequalities for . Hence, the set
[TABLE]
is a face of . Also, .
Proof of Theorem 1.
We describe now a linear projection of to . Given a point we define that the component of corresponding to the variable equals to . By the construction above and the proof of correctness of the reduction, for each integral point there exists a feasible schedule for with makespan 2 in which each job is assigned to a machine in . Additionally, for each variable in it holds that if then is assigned on machine in this schedule. Thus, there exists an integral point such that . Similarly, for each integral point we have that . Thus, and therefore, . Finally we give a linear projection . Recall that for each variable in we introduced a set of new variables in . By construction, in each satisfying assignment for all these variables have the same value. Therefore, we define that the component of corresponding to equals for each variable in . We obtain . Therefore, . Hence, . This completes the proof of Theorem 1. ∎
Since already the face of containing the optimal solutions to has extension complexity we obtain the following corollary.
Corollary 1**.**
For every there exists an instance of with jobs, machines, and such that for any integral polyhedron that contains all optimal and possibly also some other solutions to it holds that .
2.2 Extensions
Above we proved that has large extension complexity. The polyhedron is defined using the variable which represents an upper bound on the makespan of the respective solution. This raises the question whether there exist compact extended formulations in the space defined only via the variables , without the variable . Formally, we consider the polyhedron
[TABLE]
describing the set of all schedules, including all optimal ones. One can easily show that its extension complexity is by simply taking its linear relaxation. Observe that Corollary 1 rules out such a polyhedron in the space lifted with the variable .
Proposition 1**.**
The polyhedron has extension complexity .
This follows by observing that the extreme points of the linear relaxation are integral. Let be a feasible point in the relaxation such that for some machine and some job . Due to Equality (2) there must be another machine such that . For sufficiently small we define a new solution , where is the canonical vector with value at entry and zero otherwise. Similarly, we define a new solution . We can find such that and , and hence is not an extreme point. Therefore, the extension complexity of is .
However, when we minimize the makespan, we are interested only in the set of all integral points in that correspond to optimal solutions. Note that their convex hull corresponds to the face of containing all points with . Due to Corollary 1 this polytope has extension complexity . We can strengthen this statement. In the instance constructed above the optimal makespan is 2 and any non-optimal solution has a makespan of at least 3. Thus, all integral polytopes that contain all optimal solutions to and possibly some -approximate solutions have large extension complexity. Note that this contrasts the fact that admits an EPTAS [9].
Corollary 2**.**
Let with . For every there exists an instance of with jobs, machines, and such that for any integral polytope whose vertices consist of all optimal solutions and possibly some -approximate solutions to it holds that .
Next, we show that there cannot be an integral polytope of polynomial extension complexity containing *all *-approximate solutions of for any .
Corollary 3**.**
*For every and every there exists an instance of with jobs and machines such that for any integral polytope whose vertices are all -approximate solutions to it holds that . In particular, if for some then the instance has jobs and machines and . *
Proof.
Assume by contradiction that such a polytope exists and suppose that . Take the instance defined above. We add dummy jobs of length 1 each, denote them by . Also, we add dummy machines, denote them by . Note that the optimal makespan is still 2 and thus any -approximate solution has makespan . Let denote the resulting instance and observe that it has jobs and machines. We consider . Then for each dummy machine the inequality is a valid inequality. Also, consider an assignment of the jobs in to the machines such that each machine gets a load of in this way. Formally, we define a map such that for each . Then for each job the inequality is a valid inequality. Therefore, the set defined as
[TABLE]
is a face of . Therefore, contains exactly the solutions in which the dummy jobs are assigned as described by the map and the non-dummy jobs are assigned such that they give a load of 2 on each machine. Thus, there is a linear projection of to the polytope as defined in Theorem 2. This implies that . ∎
Finally, we can show that our lower bounds from Theorem 1 and Corollary 2 are almost tight by giving an upper bound of .
Theorem 3**.**
Let be an instance of with jobs and machines. Let denote the convex hull of the vertices corresponding to all optimal solutions to in . It holds that and .
In the following we prove Theorem 3. Given an instance , we first describe a dynamic program that computes a solution to in time for some given target makespan , assuming that such a solution exists. Then based on it we define an extended formulation of of size .
Dynamic program. We introduce a cell for each subset of jobs and each integer . In we want to store a schedule for the jobs on the machines such that each machine has a makespan of at most , assuming that such a schedule exists. Consider the case , that is, we look for a schedule in the single machine . For each set of jobs with we store in the cell the schedule that assigns all jobs in to machine . For each set of jobs with we store in the cell , indicating that no feasible schedule exists for . Now suppose we are given a cell with . If there is a subset with such that in the cell we stored a schedule (and not ) then in we store the schedule that assigns on machine and the schedule in the cell on the machines . If no such set exists then we store in . Finally, if we stored a schedule then we output this schedule, otherwise contains and we output that there is no schedule for on machines with makespan at most . This dynamic program table has cells and evaluating one cell takes time, which yields a total running time of .
In the transition above, for each cell we took an *arbitrary *subset such that and does not contain . In fact, we can construct *any *schedule of makespan at most in this way if for each cell we choose for the set of jobs that the respective schedule assigns on machine . In the sequel, we define a graph with vertices including two special vertices and such that any path from to corresponds to a solution that the above dynamic program might compute for suitable choices for . Then we define a linear program whose vertices are exactly these paths which then yield an extended formulation to if we choose .
Construction of the graph. Let . Let be the set of cells of the dynamic program table such that there exists a schedule for the jobs on machines of makespan at most . For convenience we add a dummy element to . Then, in our graph the set of vertices corresponds to . For each we introduce an arc if and only if and there is a set with such that . Let be the source and the sink. We call the set of arcs, and then .
Lemma 1**.**
*Every - path in the graph corresponds to a schedule of makespan at most . Furthermore, every schedule of makespan at most induces an - path in . *
Proof.
Let be an - path with vertices where and . Then, our construction of guarantees that for each . Thus, for each arc we assign the jobs in to machine which yields a schedule of makespan at most . Conversely, consider a schedule of makespan at most and for each let be the jobs that are assigned to machine in . Then, there is a path that for each visiting the vertex . Observe that for each we have that since and . ∎
Extended formulation. We define a linear program with a variable for each arc . We add constraints that describe a flow in such that we send exactly one unit of flow from to . In particular, we require that one unit of flow leaves , one unit of flow enters , and on all other vertices there is flow conservation. For every job , let which are the arcs such that if one of them is contained in an -path then in the schedule corresponding to job is assigned on machine . Then, consider the following linear program
[TABLE]
where for each vertex , we have that denote the out-going and in-going edges at , respectively.
Proof of Theorem 3.
For , we project the polytope of the linear program above to by defining according to (6) for each job and each machine . By Lemma 1 this yields an extended formulation of size . We describe now how to extend the above to an extended formulation for of size . For that, we show how to modify the dynamic program used for constructing the extended formulation of .
Note that for the makespan of any feasible solution there are only options since the makespan equals the sum of the processing times of the jobs on some machine and there are only options for this quantity. Let be a set containing all these options. We define a cell for each subset of jobs, each integer with and for each . For each such cell in which the previous dynamic program above does not store we define a vertex like above and also arcs as before. Note that there are no arcs between two vertices , with . We define a new source vertex and introduce an arc for each , assuming that the vertex exists. Similarly, we define a new sink vertex and an arc for each . A path from to represents the choice of a makespan and defining a schedule with makespan at most . As before, consider a linear program computing - paths in this new graph, using a flow formulation and variables as before, that is, constraints (5) and (8). We lift this linear program by adding a variable and introducing a new constraint
[TABLE]
Then we project each point in the resulting polyhedron to the point by setting for each and each . That concludes the proof of the theorem. ∎
3 Approximate polynomial size extended formulation
In the previous section we showed in Corollary 3 that there can be no polynomial size extended formulation of whose vertices are exactly all -approximate solutions, for essentially any . However, in this section we show that there is a small extended formulation that contains all optimal schedules and *some *approximate schedules. Our formulation works even in the more general setting of unrelated machines, i.e., , where the processing time of a job can depend on the machine that it is assigned to and for each such combination the input contains a value . In this setting the polytope defined exactly in the same as way as before, however, now a solution is optimal if for each machine .
Construction of the extended formulation. In the sequel let . The intuition behind our construction is the following. We define a bipartite graph in which we have one vertex for each job and vertices * *for each machine . Each vertex corresponds to a *slot *for machine . We search for a matching that assigns each job vertex to some slot vertex and we allow this assignment if and only if , i.e., we introduce an edge if and only if . The intuition is that in for each machine there can be at most one job assigned to with , at most two jobs with and more general at most jobs with . Hence, there is a matching in that corresponds to . On the other hand, in any matching the total processing time of the jobs on any machine is bounded by which is at most if (e.g., set or set to be the makespan found by a 2-approximation algorithm for [12, 18]).
We define an integral polytope that models this bipartite matching. For each let be the subset of jobs whose vertex has an edge incident to . For each edge we have a variable that intuitively indicates whether job is assigned to the slot . On top of this, we project the resulting polytope to the space of the variables of which yields .
[TABLE]
Theorem 4**.**
Given an instance of there exists an extended formulation of size for an integral polytope whose vertices correspond to all optimal solutions and some -approximate solutions to .
Proof of Theorem 4.
Consider an optimal schedule and let be its corresponding solution in . For each machine , let be the subset of job vertices in such that , that is, , and consider the bipartite subgraph of that is induced by the vertices . We check that there exists a matching in that covers . Consider . There exists at least one job in with processing time at most , otherwise would exceed the makespan in machine . Therefore, job is connected to for every , and its degree in is at least . In particular, is connected to at least slots in the bipartite subgraph . By Hall’s theorem we conclude that exists a matching in covering every job in . For every job in we define if is connected to in the matching, and otherwise. By construction the solution satisfies the constraints in the program above.
On the other hand, consider a vertex solution . The integrality of comes from the fact that the linear program restricted to variables is a bipartite matching formulation. We bound the makespan of the schedule obtained from . For each machine , we have
[TABLE]
For every , we have that . This, together with constraint (10) allows us to upper bound the last summation above by
[TABLE]
4 Extension complexity of configuration-LP
An alternative way to formulate as an integer linear program is via a configuration integer program. For a given target makespan we define the set of all *configurations *to be all subsets of jobs whose total processing time is at most , i.e., . For each machine and each configuration we introduce a variable that models whether machine gets exactly the jobs in configuration assigned to it. Denote by the convex hull of all solutions to the integer program below.
[TABLE]
For arbitrary values of the number of variables of can be exponential in the input length. However, for constant there are only a polynomial number of variables (and constraints) since the number of possible configurations is bounded by . This raises the questions whether admits a small extended formulation for such values of . If and for each Theorem 1 implies a lower bound of since there is an easy projection of the configuration-LP to . We strengthen this to the case where for each and to a lower bound of .
Theorem 5**.**
For every there is an instance of with machines and jobs such that the polytope has an extension complexity of . It holds that and for each .
Proof of Theorem 5.
For every we construct an instance of by , contains machines, and contains jobs with for each . The set is thus the set of all pairs of jobs. We show that there is a linear map that projects each solution to to a matching in a complete graph on vertices. Let be the complete graph on vertices, i.e., . Consider the perfect matching polytope of , given by
[TABLE]
We define a linear map as follows: for each edge we define . For each vertex of it holds that and therefore . On the other hand, let be a vertex of , i.e., represents a perfect matching in . Then we can construct a vertex of such that as follows: assume that the machines are numbered . Then for each machine we define and for each . Then . Rothvoss showed that the extension complexity of is [16]. Our construction implies that is an extended formulation for . Therefore, the extension complexity of is also . ∎
5 An extended formulation with small parameterized size
In this section we consider an abstraction of the configuration integer program that exhibits low extension complexity parameterized by . Instead of encoding configurations as subsets of jobs, we consider how many jobs in the configuration have a certain processing time . That is, in this context a configuration is a multiset of , and let be the multiplicity of in , namely, the number of times that appears in . For every , let be the number of jobs in with processing time equal to . We denote by the set of configurations having total processing time at most . The key fact about this encoding is that two or more machines can be scheduled in the same configuration and there are fewer solutions overall than for the configuration-LP. As before, we propose a formulation where for every combination of machine and configuration we have variable indicating whether is scheduled according to . That is,
[TABLE]
Theorem 6**.**
The extension complexity of the integer hull for the above formulation is for some function .
We define a program deciding for each configuration how many machines get configuration such that we have enough slots for all jobs, that is,
[TABLE]
We denote the convex hull of the solutions to the integer program above. Since the number of processing times ranges in , the total number of configurations in is upper bounded by .
Lemma 2**.**
There exists a function such that the number of vertices of the polytope is upper bounded by .
Observe that the lemma above holds even when number of machines and jobs on the right hand side of the integer program above are not necessarily bounded by a function of , which is in general the case. Now we can prove Theorem 6 via a Dantzig-Wolfe reformulation with a variable for each of the vertices of . Since the size of is bounded by , we have at most that many variables of this type. Then, we model the values of the variables as the solution to a suitable transshipment problem.
Proof of Lemma 2.
Let be a vertex of the polytope . We argue that there are at most possibilities for for some suitable function . For a value to be defined later define a new point by setting if and otherwise for each configuration . Define , which is feasible for the following integer linear program
[TABLE]
One can easily show that is a vertex of the integer hull of the solutions to the program above, since otherwise is non-trivial convex combination of feasible solutions of . Let denote the configurations in the support of . We identify each configuration with a vector which is the column in the matrix of the program above corresponding to , i.e., the first entry of each such vector is a 1 and the other entries are given by the values . If the vectors in are linearly independent then the above IP has a unique solution. Hence, there is at most one vertex of of the form where is a solution to the above IP. In the sequel we argue that it cannot be that the vectors in are linearly dependent if is sufficiently large. Then the claim of the lemma follows since the vector satisfies that for each and the number of such vectors can be bounded by a value .
If the vectors in are not linearly independent then there exists a configuration and a set of linearly independent vectors and values such that . Let denote the smallest value such that such that for each . If and for each then we can write as the convex combination of two integral vectors and hence is not a vertex. To ensure the former we define to be the maximum over all values and that we can obtain in this way, i.e., by selecting one , expressing it as a linear combination of a set of linearly independent configurations , and finding the smallest value such that . Note that the number of values obtained in this way is finite and hence is well-defined (and finite). ∎
Proof of Theorem 6.
Let be the set of vertices in the polytope . By Lemma 2, there exists a function such that the size is bounded by . Since we can restrict the optimization problem to its set of vertices, we consider the linear program obtained by lifting the integer program above by the Dantzig-Wolfe reformulation using the set of vertices . For each vertex , consider a variable indicating whether we pick or not the vertex solution . In addition, for each configuration we consider as before a variables indicating how many times the configuration is used. Finally, for each combination of machine and configuration we have a variable indicating whether machine is scheduled according to .
The idea behind the extended formulation is to first select a vertex by using the Dantzig-Wolfe reformulation in the variables , which provides for each configuration the number of times, , that is used. Then we formulate a transportation problem between machines and configurations satisfying the offer for each configuration.
More specifically, consider a complete bipartite graph where we have one vertex for each machine , and we have a vertex for each configuration . For each vertex we have an offer , and every vertex has a demand of 1. The variable indicates whether the demand of the machine vertex is satisfied by the configuration vertex . More specifically, consider
[TABLE]
Observe that constraints (26) and (29) guarantees that the total demand equals the total offer in the transportation problem over . It holds that a vertex of the linear program above is integral. If not, suppose that is fractional, otherwise the integrality of implies the integrality of and in turns the integrality of since the transportation program in the graph given by constraints (28)-(29) is integral. If is fractional, is a non-trivial convex combination of the vertices , and each of these vertices is feasible for the constraints (26)-(27), which implies that is a convex combination of the vectors , where is the canonical vector that is 1 for entry and zero otherwise. For each such that , the constraints (28)-(29) solve a transportation problem between the machine vertices and the configurations vertices , where the offer for is equal to . The vertices with positive offer are given by , and then any solution to this problem is a convex combination of the integral solutions to the transportation problem over . Since every solution is supported over , and therefore we contradicted the fact that is a vertex of the polytope. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] David Avis and Hans Raj Tiwary. On the extension complexity of combinatorial polytopes. Math. Program. , 153(1):95–115, 2015.
- 2[2] Michele Conforti, Gérard Cornuéjols, and Giacomo Zambelli. Extended formulations in combinatorial optimization. Annals OR , 204(1):97–143, 2013.
- 3[3] Tomáš Ebenlendr, Marek Krčál, and Jiří Sgall. Graph balancing: A special case of scheduling unrelated parallel machines. Algorithmica , 68(1):62–80, 2014.
- 4[4] Jack Edmonds. Matroids and the greedy algorithm. Mathematical programming , 1(1):127–136, 1971.
- 5[5] Samuel Fiorini, Serge Massar, Sebastian Pokutta, Hans Raj Tiwary, and Ronald de Wolf. Exponential lower bounds for polytopes in combinatorial optimization. J. ACM , 62(2):17:1–17:23, 2015.
- 6[6] Mika Göös, Rahul Jain, and Thomas Watson. Extension complexity of independent set polytopes. SIAM Journal on Computing , 47(1):241–269, 2018.
- 7[7] Rebecca Hoberg and Thomas Rothvoss. A logarithmic additive integrality gap for bin packing. In Proceedings of the Twenty-Eighth Annual ACM-SIAM Symposium on Discrete Algorithms , pages 2616–2625. SIAM, 2017.
- 8[8] Dorit S Hochbaum and David B Shmoys. Using dual approximation algorithms for scheduling problems theoretical and practical results. Journal of the ACM (JACM) , 34(1):144–162, 1987.
