Bridging between 0/1 and Linear Programming via Random Walks
Joshua Brakensiek, Venkatesan Guruswami

TL;DR
This paper introduces a random-walk based algorithm that interpolates between 0/1 integer programming and linear programming, providing a spectrum of solutions with varying computational complexity.
Contribution
It presents a novel algorithmic framework that smoothly transitions from exponential to polynomial time for solving linear programs over subsets of [0,1], extending to CSPs with threshold polymorphisms.
Findings
Runs in $O_E((2- ext{measure}(E))^n ext{poly}(n,m))$ time for feasible solutions in $E^n$
Recovers a randomized algorithm for $k$-SAT with runtime $(2-2/k)^n ext{poly}(n)$
Incorporates new techniques like a multiplicative potential and LP-based distribution evolution
Abstract
Under the Strong Exponential Time Hypothesis, an integer linear program with Boolean-valued variables and equations cannot be solved in time for any constant . If the domain of the variables is relaxed to , the associated linear program can of course be solved in polynomial time. In this work, we give a natural algorithmic bridging between these extremes of - and linear programming. Specifically, for any subset (finite union of intervals) containing , we give a random-walk based algorithm with runtime that finds a solution in to any -variable linear program with constraints that is feasible over . Note that as expands from to , the runtime improves smoothly from to polynomial. Taking in our result…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Bridging between 0/1 and Linear Programming via Random Walks
Joshua Brakensiek Department of Computer Science, Stanford University, Stanford, CA. Email: [email protected]. Some of this work was done when the author was at Carnegie Mellon University. Research supported in part by NSF CCF-1526092, and an NSF Graduate Research Fellowship.
Venkatesan Guruswami Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213. Email: [email protected]. Research supported in part by NSF grants CCF-1422045 and CCF-1526092.
Abstract
Under the Strong Exponential Time Hypothesis, an integer linear program with Boolean-valued variables and equations cannot be solved in time for any constant . If the domain of the variables is relaxed to , the associated linear program can of course be solved in polynomial time. In this work, we give a natural algorithmic bridging between these extremes of [math]- and linear programming. Specifically, for any subset (finite union of intervals) containing , we give a random-walk based algorithm with runtime that finds a solution in to any -variable linear program with constraints that is feasible over . Note that as expands from to , the runtime improves smoothly from to polynomial.
Taking in our result yields as a corollary a randomized time algorithm for -SAT. While our approach has some high level resemblance to Schöning’s beautiful algorithm, our general algorithm is based on a more sophisticated random walk that incorporates several new ingredients, such as a multiplicative potential to measure progress, a judicious choice of starting distribution, and a time varying distribution for the evolution of the random walk that is itself computed via an LP at each step (a solution to which is guaranteed based on the minimax theorem). Plugging the LP algorithm into our earlier polymorphic framework yields fast exponential algorithms for any CSP (like -SAT, -in--SAT, NAE -SAT) that admit so-called “threshold partial polymorphisms.”
1 Introduction
The study of exponential time algorithms for NP-hard problems has been a thriving area of research, and clever algorithms much faster than naive brute-force methods have been devised for many problems. For instance, the canonical NP-complete problem -SAT admits an elegant time randomized algorithm [Sch99] (we use to hide factors, and is the number of variables). The runtime of Schöning’s algorithm is for -SAT, and the algorithm has been derandomized to achieve similar runtime bounds deterministically [MS11]. Even faster algorithms running in time and are known for -SAT and -SAT respectively [Her14]. A survey (albeit not very recent) of exact exponential time algorithms for NP-complete problems appears as [Woe03].
However, there are problems such as general CNF-SAT for which no algorithm with runtime is known for any constant . The strong exponential time hypothesis (SETH) [IP01] asserts that in fact no such algorithm exists, and indeed that for every , there exists some such that -SAT can’t be solved in time. Such “SETH-hardness” is also known for problems such as set cover and Not-all-Equal-SAT [CDL*+*16]. The class of [math]- integer linear programs, which includes satisfiability, set cover, and most natural problems with Boolean-valued variables, also doesn’t admit a time algorithm for any . On the other hand, linear programs where variables are allowed to take values in can of course be solved in polynomial time.
Our main result stated below bridges between these extremes, giving an algorithm with progressively better runtimes as one allows more and more relaxed values for the Boolean variables. As the allowed set of values shrinks from to , the runtime of our algorithm degrades gracefully from polynomial to .
Theorem 1.1** (Main).**
Let be a finite union of intervals that contains which is given explicitly: . Let be its measure (total length). There is a randomized algorithm that on input an -variable linear program with constraints that is feasible over , runs in time , where is a constant depending on the structure of , and with high probability finds a feasible solution to the LP that belongs to .
Fix a constant and consider . For this , we get a time algorithm. Solving the natural LP associated111For each variable in the -SAT instance, specify that . For each clause specify that . For any variable that’s negated in a clause, replace with . with -SAT in this way and rounding variables in to [math] and those in to , we get a -SAT algorithm that matches the runtime of Schöning’s celebrated algorithm [Sch99], although better algorithms for -SAT are known (e.g., [PPSZ05]). The formulation of the problem of finding a relaxed solution in is, in our opinion, a valuable contribution in itself, and it further enables a unified treatment of algorithms for -SAT as well as a more general class of constraint satisfaction problems (CSPs) (to be described shortly).
The algorithm claimed in Theorem 1.1 is based on a random walk approach inspired by Schöning’s algorithm.
Even for the case of , many new ideas are needed to bring the algorithm to fruition. In particular we use a novel global criterion which guides for making local changes to an evolving assignment. See Section 1.1 for an overview of our approach.
We note that the work of Impagliazzo, Lovett, Paturi and Schneider [ILPS14] shows that integer programming can be solved in time if the number of constraints is . The guarantee is incomparable to ours: on the one hand, we allow for many more (even sub-exponentially many) constraints, but on the other hand, we do not find a [math]- valued solution.
1.1 Overview of algorithm and analysis
Schöning’s algorithm for -SAT [Sch99] consists of the following simple procedure. First, one starts with uniformly random assignment to the variables. Then, for steps one checks if the instance is satisfied (in which case terminate) or else one picks any violated clause and picks a uniformly random variable to flip in that clause. Schöning’s analysis used the fact that during each random step, the current assignment’s Hamming distance to a “reference” solution decreases by with probability at least and otherwise increases by at most . A simple combinatorial argument can then show that the probability of success is at least , yielding the -time randomized algorithm.
Our algorithm for bridging integer and linear programming has Schöning’s algorithm as a blueprint, but many additional insights were needed at both the conceptual and technical levels. A primary challenge is that, one needs to resolve that the variables for our integer/linear program can take on a continuum of values compared to only (or a finite number) in Schöning’s algorithm. To get around this, we use the fact that our target domain is expressed as the union of finitely many intervals, and thus we consider the discrete problem of determining which interval each variable belongs to. Since the Cartesian product of intervals is a convex region, when one has a set of intervals selected, one can check if there exists some assignment in those intervals by solving a simple modified linear program.
Even with this reduction to a discrete problem, the “naive” extension of Schöning’s algorithm and analysis still runs into many issues. For instance, the part of Schöning’s random walk where he finds a “bad” clause and then flips a uniformly random variable needs to be modified greatly. In particular, it is not clear what a “bad clause” with respect to our linear program , as a particular choice of intervals could be invalid, even though no individual inequality of rules out the entire region. Note that this shows that local methods cannot suffice for this problem (unlike Schöning’s algorithm). Thus, we need to use the entire problem as our failed constraint. Furthermore, when modifying the choice of intervals for the next step of the random walk as we no longer have sparsity, it is not clear that uniformly changing some variable will help at all.
To get around these issues, we employ a potential function , where represents an integral solution to while represents the current choice of intervals in our random walk. The potential function is multiplicative = , where takes on the value if is in the interval and decreases depending on how far is from the interval . This choice of potential function was inspired by the analysis of the biased random walks corresponding to the “Gambler’s Ruin” problem (e.g., [Fel68, EK02, LL10]). The choice of the potential needs to be precisely tailored to the set , see (1, 2), but intuitively it captures the probability of success of the random walk from the position . Note that since the whole point is to determine the feasibility of the integer program, we cannot explicitly compute . Instead, we simultaneously optimize over all fractional solutions to .
This optimization involves thinking of the random walk as a two-player game involving Alice and Bob. Imagine that Alice has some adversarial distribution of integral solutions to . Bob needs to compute a distribution of states which are changes to in one position such that . By minimax, it suffices to show for any such distribution of strategies, Bob has a deterministic response which causes the potential to increase or stay the same, which results in a simple calculation. This proves that Bob has a random strategy to keep the expected value of the potential as a monovariant no matter what solution Alice is thinking about. We show that this strategy can be explicitly computed by solving a convex optimization problem with a separation oracle.
By itself, knowing that the potential is non-decreasing is not too helpful. The key observation is that is not staying constant during the random walk, sort of like how in Schöning’s algorithm there is at least a chance of making progress. In fact for some we call the traction. This “traction” is all we need to guarantee that the random walk, originating from state , will succeed with probability at least in polynomial time.
Finally, one needs to be careful with picking an appropriate starting distribution. A judicious choice is made to ensure a stating potential of at least which then leads to the running time. See Section 3.4 for more details.
Using this linear programming paradigm, we can then obtain a variety of exponential time algorithms for many CSPs by (1) rewriting the CSP as an integer program, (2) picking a set corresponding to the CSP based on its partial polymorphisms, (3) solving the linear program corresponding to the set using our algorithm, (4) “round” the linear program solution back to a solution to the CSP using partial polymorphism. See Section 4 for more details.
In summary, by overcoming a few conceptual and technical hurdles, we were able to generalize Schöning’s algorithm to solve a much wider range of feasibility problems.
1.2 Connection to constraint satisfaction
We now describe our original motivation, relating to fast exponential algorithms for constraint satisfaction problems (CSPs), that led us toward Theorem 1.1. (In hindsight, we view our main result to be of intrinsic interest from an optimization perspective, well beyond the intended application to CSPs.) In the algebraic theory of constraint satisfaction, the tractability of a CSP has been shown to be intimately tied to the polymorphisms associated with its defining relations [BJK05]. A polymorphism for a relation is a function that preserves membership in . That is, if , a function is a polymorphism if applying component-wise to an arbitrary sequence of -tuples from always leads to a -tuple in . The resulting rich theory can not only explain and predict polynomial time decidability of CSPs, but with appropriate variations of polymorphisms, also explain the complexity of counting, optimization, and promise versions (c.f., the surveys [Che09, BKW17]).
In fact, rather remarkably the framework can also shed light on the complexity of CSPs that are NP-complete. Partial polymorphisms, which are partial functions that preserve membership in the CSP relations whenever they are defined, can explain the runtime of fast exponential algorithms for CSPs, i.e., the smallest222Assuming the ETH, one cannot have a time algorithm for arbitrarily small . for which one can get a time algorithm [JLNZ13]. In particular, they show that 1-in-3-SAT, a variant of 3-SAT in which a clause is satisfied only if exactly one of the variables is true, is essentially the easiest NP-hard problem in terms of the efficiency of an exponential algorithm.
In a recent paper [BG19], we presented a general framework to deduce polynomial time algorithms for CSPs (in fact, the more general promise CSPs) that admit as polymorphisms a family of threshold functions. An example of such a function satisfies if and only if the Hamming weight of is at least some parameter . More generally, such functions can take on more than two values and can have multiple phase transitions at different hamming weights. The Promise CSP algorithm was based on rounding the solution to an associated linear program using the polymorphism. This framework can be applied with our fast exponential algorithm for finding an LP solution in in order to obtain fast exponential algorithms for CSPs that have a sequence of threshold functions as partial polymorphisms. Here represents the interval of input (fractional) Hamming weights on which the polymorphism is defined. (In the case when polymorphisms are defined everywhere, so one can just solve the LP efficiently.) The details of this connection appear in Section 4.
The CSPs for which our method leads to fast exponential algorithms, such as -SAT or -in--SAT (which has an algorithm [Wah07]), often admit even faster exponential time algorithms optimized for the specific CSP. Our LP solver identifies a fairly general sufficient condition that implies fast exponential algorithms for certain CSPs, giving a principled reason for their existence even if the exact runtime is not optimized.
Very recently, [LW18] constructed the first Boolean CSP which has a quantiative lower bound (assuming SETH) and has a nontrivial (e.g., not ) upper bound. The constraints of the CSP do not have a finite description but are rather all possible constraints which have a prescribed partial polymorphism known as a “2-edge operator.” Their upper bound algorithm involves a “meet in the middle strategy” to get a runtime. A related problem they explore (corresponding to “-near-unanimity” operators) gives an nontrivial upper bound involving a variant of Schöning’s algorithm, although the correctness of the algorithm is conditioned on the sunflower conjecture.
These methods also connect to the -SAT problem of [AGH17]. Succinctly, the -SAT problem, for , is the following: given an instance of -SAT in which there exists an assignment which satisfies at least literals in every clause, find a solution to the ordinary -SAT instance. For , there is a reduction of -SAT to finding a solution belonging to for the basic LP of the -SAT instance. Note that this implies the existence of a time algorithm for -SAT. Since -SAT was shown to be NP-hard for every [AGH17], this reduction also shows that our LP problem is NP-hard for for each fixed .
1.3 The road ahead
This algorithm for linear programming spurs many questions for further investigation. The following are a sample of potential directions of exploration.
- •
For succinctness, this article only describes testing feasibility of a linear programs. One may also consider the optimization version of the question where one seeks to maximize subject to and . Assume that is the optimal value. One way to phrase such an inquiry is to desire to find such that and . This can be achieved with essentially the same complexity by performing binary search on candidate values and adding to the linear program.
- •
Although the focus of this paper is optimizing in powers of subsets , one can also investigate product sets . As the relevant potentials are multiplicative, the analysis of this paper can generalize in a straightforward manner, as long as we have a uniform bound on the traction (see 3).
Another extension would be product sets , where each for some constant . Such reductions are relevant in reducing from non-Boolean CSPs.
- •
Also for succinctness, we restrict the exposition to having the bridging between and to be a product set , but our method seems to have the capability, with several additional technical ideas, to generalize to more complex regions . An interesting example is
[TABLE]
where and is any norm on . Note that with covers the case is the norm.
- •
Another important question is finding a derandomization of the random walk algorithm. Moser and Scheder [MS11] did successfully derandomize Schöning’s algorithm, so we envision that similar methods should be able to derandomize our algorithm.
- •
This article shows how this LP algorithm can give exponential time random walk algorithms for a variety of Boolean CSPs and Promise CSPs where the “promise domain” is Boolean. It is possible to extend to higher domains by noting that any CSP can be expressed a -integer program by having indicator variables which represent if the th variable in the CSP is equal to .
Paper Organization. In Section 2, we formally state the main result as well as sketch the algorithm. In Section 3, we prove that this algorithm is correct and has the claimed run-time. In Section 4, we prove some applications of the main result, including recovery of Schöning’s random walk algorithm. In Appendix A, we include the proofs (mainly some calculations) omitted in the body of the paper.
2 Random walk algorithm
Consider any linear program , where . Assume that is bounded by a subexponential function of . Treat this as a [math]- integer program, so we desire . Unless one refutes the Strong Exponential Time Hypothesis (SETH), determining such an requires at least time. That said, we show in this section, that one can get significantly better runtimes if one accepts an approximate solution. We now define what we mean by “approximate.”
Let be a closed subset with which is the union of disjoint intervals of nonzero length. As stated below, these intervals of are explicitly given as part of the problem statement. The set tracks which errors are allowed. That is, we define an approximate solution to be any which satisfies the linear program. This leads to the following theorem.
Theorem 2.1**.**
Let be a linear program with . Let be a sequence of intervals with rational endpoints with
[TABLE]
Assume the promise that there exists which satisfies the linear program. There exists a randomized algorithm computing a solution to the linear program in time
[TABLE]
where is the sum of the lengths of the intervals of , and depends only on .
At a high level, think of the problem as a two-player game. Alice has in mind some solution to while Bob is trying to find such that and is as similar to as possible. To keep track of this similarity, we have a “score” (potential function) for the game which states how similar Bob’s is to . At first Bob’s guess, say does not satisfy , but over time Bob can use the score as a heuristic to guide a random walk so that he gets close enough in similarity to to solve .
2.1 Potential Function
Consider a game in which Alice has in mind and Bob guesses an interval in with and . We give Bob a score satisfying the following rules
- •
Correct guesses have full potential and .
- •
The potential changes monotonically, for any we have
[TABLE]
The precise function which satisfies this condition is as follows:
[TABLE]
where the empty product is equal to .
The potential functions are chosen so that during each step of the random walk, the expected value of the potential stays the same over time. The quantitative justification is in Claim 3.2.
Now, in the actual game, Alice has in mind an -bit vector (the 0-1 integer program solution) and Bob has interval guesses . The potential in this case is just the product of the coordinate-wise potentials.
[TABLE]
Note that we still have that for all and .
In order to make quantitative guarantees about the runtime of our algorithm, we need some parameters which quantify how “well-conditioned” our potential function is. Define the quanta of to be
[TABLE]
In other words, captures how small the potential of one coordinate can be.
Define the traction of to be
[TABLE]
Roughly speaking, measures the minimum amount the potential can go up by when it increases. The latter is quite important in order to ensure that our potential does not get “stuck” by increasing only a negligible amount on each step.
2.2 Sketch of Random Walk Algorithm
With the potential function defined, we can now describe the random walk algorithm at a high level.
Algorithm 2.1: A randomized algorithm for solving approximate integer programs.
Input: , . , .
The algorithm uses a parameter governing the number of iterations.
For each . Sample according to the distribution.
[TABLE]
where and .
- 2.
For in
Check if there exists such that If so, Output .
- 4.
Find , the probability that changes to , by solving the following feasibility problem.
[TABLE]
- 5.
Sample according to the probability . Set except .
- 6.
Output “Fail.”
We claim the following
Theorem 2.2**.**
Assume that there exist such that . Then, if Bob starts his random walk at , he will find a solution to with with probability at least
[TABLE]
as long as
[TABLE]
Furthermore, computing the steps of this random walk can be done in polynomial time.
3 Analysis of Random Walk Algorithm
The proof of Theorem 2.2 consists of three parts.
Prove that Algorithm 2.1 is well-defined. That is, the probability distribution described in Step 4 exists. 2. 2.
Prove that Algorithm 2.1 can be implemented in randomized polynomial time. That is, the probability distribution described in Step 4 can be computed in polynomial time. 3. 3.
Prove that the probability of success of Algorithm 2.1 is correctly stated.
The following three subsections establish each of these parts.
3.1 Algorithm 2.1 is well-defined
In this subsection, we show that the probability distribution described in Step 4 exists. To do this, we use a special case of the minimax theorem [vN28].
Theorem 3.1** (Minimax Theorem).**
Let be compact, convex sets. Let be a function which is affine in both coordinates. Then,
[TABLE]
To apply this, let
[TABLE]
That is, is in some sense the convex hull of Alice’s strategies, and is the distribution of Bob’s potential modifications to .
We then define to be precisely the functional which appears in Step 4 of Algorithm 2.1
[TABLE]
Note that this expression is affine in and is also affine in (the other terms are constant within Step 4 of the algorithm). Note that exists as expected in Algorithm 2.4 if and only if
[TABLE]
That is, at each step, Bob has a strategy that works universally over all possible strategies for Alice. By the Minimax Theorem, it suffices to show that That is, it suffices to prove the following.
Claim 3.2**.**
For all there exists such that .
Proof.
Fix this . We’ll show that there exists which has exactly one nonzero element, and this element is equal to .
Since in Step 3, the algorithm verified that is disjoint from , we know that there exists such that . For notational simplicity, let . Thus, either or .
If , then as but . Thus, consider such that and otherwise. Now we have that
[TABLE]
Note that so we are not dividing by [math].
The case is handled by having and otherwise. The analysis is analogous. ∎
Thus, the distribution needed by Step 4 does indeed exist.
3.2 Algorithm 2.1 has polynomial time complexity
Next, we show that Algorithm 2.1 can be implemented to run in (randomized) polynomial time. It is obvious that Steps 1, 5, 6 run in polynomial time. The number of possible values of in Step 2 is polynomial time as . Step 3 runs in polynomial time as it is equivalent to checking if a linear program is feasible. The non-trivial step to justify is Step 4.
Let be the region of which satisfy the conditions stipulated in Step 4. It is clear that is convex, and we showed that it is non-empty in the previous subsection. Since the set is a compact, convex polytope, it suffices to check the condition (4) for on the vertices of this polytope. By standard results in linear programming, the “representation complexity” of the vertices of is some polynomial in the representation complexity of the system . In particular, this implies that is itself a polytope as its finitely many facets are described by (4) and these have bounded complexity as we can restrict to the vertices of . Thus, [GLS93] describes as a “well-defined” polytope. By Theorem 6.4.1 of [GLS93], finding a point (or asserting that none exists), can be done in polynomial time as long as there exists a strong separation oracle that itself can be computed in polynomial time.333See also https://www.cs.cmu.edu/afs/cs.cmu.edu/academic/class/15859-f11/www/notes/lecture09.pdf. In other words, for any , one need to be able to efficiently compute a hyperplane such that is strictly on one side and is strictly on the other side.
To go about doing this, we first compute the which performs the worst against. We can do this by solving the following linear program.
[TABLE]
Since , we know that . Thus the hyperplane
[TABLE]
can be efficiently computed and strictly separates and . Thus, the random walk itself can be performed efficiently.
3.3 Algorithm 2.1 succeeds with decent probability
We need the following important claim about random walks.
Lemma 3.3**.**
Let be a sequence of random variables and such that
- •
If then
- •
If then surely.
- •
The sequence is a submartingale:
Let . Then, for all whenever
[TABLE]
we have that
[TABLE]
Proof.
Fix . Define . We claim that performs even better than a submartingale. We prove this in Appendix A.
Claim 3.4**.**
[TABLE]
where
[TABLE]
Now, observe that iff . Thus, by taking the expectation of the inequality in the claim over all (subject to ), we get
[TABLE]
Therefore,
[TABLE]
Fix . If , we are done. Otherwise, for all we have that so
[TABLE]
Thus,
[TABLE]
Thus by induction
[TABLE]
Since clearly . We must have that
[TABLE]
a contradiction. This concludes the proof. ∎
Recall that is Alice’s solution to the integer program . When Bob does his random walk, he starts with some and computes random variables .If Bob succeeds at time step , then assume that for all . Step 4 of Algorithm 2.1 ensures that
[TABLE]
so
[TABLE]
Now define the random variable
[TABLE]
In particular, if and only if Bob “wins” on step . This sequence (with being the “traction” ) satisfied the conditions of Lemma 3.3with . Note that is at least .
In particular satisfies (5). Therefore, Bob will have “won” by step with probability at least as desired.
This concludes the proof of Theorem 2.2. Now, we show that this implies Theorem 2.1.
3.4 Proof of Theorem 2.1
Consider a distribution which maximizes the following quantity.
[TABLE]
Recall that is the starting distribution for each coordinate in our algorithm (and different coordinates are sampled independently according to ). The expression captures the initial potential in that coordinate, which by Lemma 3.3 is tied to the ultimate success probability.
Note that and can be computed in time using a simple linear program. By Theorem 2.2, we have that if each coordinate of is sampled independently from the distribution , then the probability of success, in terms of some integral solution to , is at least
[TABLE]
Now consider the following bounded on whose proof is in Appendix A. It says that can be larger than (and thus the success probability better than random guessing) when has positive measure.
Claim 3.5**.**
For all with ,
[TABLE]
where .
Thus, each run of Algorithm 2.1 runs in polynomial time at succeeds with probability at least . Thus, after iterations, the algorithm will succeed with high probability. Since each iteration runs for steps, we have the desired runtime of . This completes the proof.
Remark*.*
Although this bound for is tight in certain cases, such as is the union of two intervals. In other cases, it is not. Consider . Then, the Claim shows that . But, if one considers the distribution which always picks the middle interval, then note that since
[TABLE]
then one has an improved bound that .
4 Applications to CSPs
In this section, we show that Theorem 2.1 can be applied to give fast exponential time algorithms for many Constraint Satisfaction Problems (CSPs).
4.1 CSPs and Partial Polymorphisms
Recall from the CSP literature that a Boolean constraint can be viewed as , where is known as the “arity” of the constraint. A template is a set of these constraints. is the decision problem of deciding if a CNF with constraints from is satisfiable.
Consider a positive integer and a function such that for every where and every sequence satisfying
[TABLE]
it holds that . Such are precisely the partial polymorphisms of . Note that these differ from normal polymorphisms because we are allowed to output .
As an example, for -SAT, a suitable template is and one can verify that for all not divisible by , the following is a partial polymorphism
[TABLE]
This family of partial polymorphisms actually falls under a whole family known as threshold partial polymorphisms
Definition 4.1**.**
Let be a union of a sequence of intervals such that and
[TABLE]
Let such that and is constant within each subinterval. Let be a positive integer such that is non-integral for and is non-integral for . We say that -threshold partial function if
[TABLE]
In particular, -SAT has a threshold partial function with and if and if .
Theorem 4.1**.**
Let be a Boolean CSP template and let and be such that for infinitely many positive integers , has a partial polymorphism which is an -threshold partial function. Then, can be solved in randomized time, where is the number of variables.
Proof.
Given an instance of let be its integer programming relaxation where (e.g., see the Basic LP reduction of [BG19]). Use Theorem 2.1 to find which also satisfies , where we define
[TABLE]
This takes time444Note the change in the hidden constant depending on is a function of , , and , all of which change by a negligible amount if is small., which is at most if we take .
We claim that satisfies the original instance of the . The proof is essentially the same as that of Theorem 4.1 of [BG19]. In essence, for each clause , the fact that solves the LP relaxation means that there exist assignments and weights such that
[TABLE]
Consider such that is an -threshold partial function. Then as in [BG19], pick weights such that so then
[TABLE]
Because or or , we have that for all and will be in the same interval of as . By plugging in into with weights , we then get that which shows that is a satisfying assignment. ∎
Appendix A Omitted Proofs
A.1 Proof of Claim 3.4
First, we need to prove a technical inequality.
Proposition A.1**.**
For all and , we have that
[TABLE]
Proof.
This requires proving two separate inequalities.
[TABLE]
By multiplying both equations by and moving all the terms to the right side, it suffices to verify that the following polynomial-like expressions are nonnegative for (for any fixed ).
[TABLE]
Note that , so it suffices to show that for all .
Note that , but by Jensen’s inequality applied to the convex function , we have that . Thus, for all , so (7) follows.
For , observe that
[TABLE]
Note that and . Thus, since it is clear that for all , we have that for all . Thus, (8) follows, too. ∎
Now we can prove the claim.
Proof of Claim 3.4.
Clearly if , this is trivially true, so we now condition that .
Fix Let . By the convexity of , we know that
[TABLE]
Let . Thus,
[TABLE]
Now as long as , we have that
[TABLE]
Note that the function . Is strictly convex in and takes on minimum value at . We know that , we have that
[TABLE]
where we used Proposition A.1. In combination with (9), we have the desired inequality. ∎
A.2 Proof of Claim 3.5
Proof.
Let and . Then, for all choose as our distribution
[TABLE]
In words, each is proportional to the length of the interval plus the length of the surrounding “space” on either side. In particular, each interval of “space” is double-counted, so the sum of the ’s is . Thus, the ’s correspond to a well-defined probability distribution.
Next, observe that
[TABLE]
Now define
[TABLE]
Note that we can include the terms since We now claim by induction that
[TABLE]
For the first sum, the base case of follows immediately, and for the second sum, the base case of also follows. Then, by the induction hypothesis
[TABLE]
and similarly
[TABLE]
Thus, and . Therefore , as desired. ∎
Acknowledgments
We thank Brian Axelrod, Dima Kogan and anonymous reviewers for helpful feedback on the manuscript.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AGH 17] Per Austrin, Venkatesan Guruswami, and Johan Håstad. (2+ ϵ italic-ϵ \epsilon )-sat is np-hard. SIAM J. Comput. , 46(5):1554–1573, 2017.
- 2[BG 19] Joshua Brakensiek and Venkatesan Guruswami. An Algorithmic Blend of L Ps and Ring Equations for Promise CS Ps. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms , SODA ’19, pages 436–455, Philadelphia, PA, USA, 2019. Society for Industrial and Applied Mathematics.
- 3[BJK 05] A. Bulatov, P. Jeavons, and A. Krokhin. Classifying the Complexity of Constraints Using Finite Algebras. SIAM Journal on Computing , 34(3):720–742, January 2005.
- 4[BKW 17] Libor Barto, Andrei A. Krokhin, and Ross Willard. Polymorphisms, and how to use them. In Andrei A. Krokhin and Stanislav Zivny, editors, The Constraint Satisfaction Problem: Complexity and Approximability , volume 7 of Dagstuhl Follow-Ups , pages 1–44. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2017.
- 5[CDL + 16] Marek Cygan, Holger Dell, Daniel Lokshtanov, Dániel Marx, Jesper Nederlof, Yoshio Okamoto, Ramamohan Paturi, Saket Saurabh, and Magnus Wahlström. On Problems As Hard As CNF-SAT. ACM Trans. Algorithms , 12(3):41:1–41:24, May 2016.
- 6[Che 09] Hubie Chen. A Rendezvous of Logic, Complexity, and Algebra. ACM Comput. Surv. , 42(1):2:1–2:32, December 2009.
- 7[EK 02] SN Ethier and Davar Khoshnevisan. Bounds on Gambler’s Ruin Probabilities in Terms of Moments. Methodology and Computing in Applied Probability , 4(1):55–68, 2002.
- 8[Fel 68] William Feller. An Introduction to Probability Theory and Its Applications , volume 1. John Wiley & Sons, 1968.
