Automating Resolution is NP-Hard
Albert Atserias, Moritz M\"uller

TL;DR
This paper proves that finding optimal Resolution refutations is NP-hard, indicating that Resolution proof systems cannot be efficiently automated unless P equals NP, with implications for proof complexity and automated theorem proving.
Contribution
It establishes the NP-hardness of approximating Resolution refutations within polynomial bounds, highlighting fundamental computational limitations in proof automation.
Findings
Resolution is NP-hard to approximate within polynomial bounds
Distinguishing formulas with polynomial vs. subexponential refutations is NP-hard
Resolution cannot be automatized in subexponential or quasi-polynomial time unless NP is in SUBEXP or QP
Abstract
We show that the problem of finding a Resolution refutation that is at most polynomially longer than a shortest one is NP-hard. In the parlance of proof complexity, Resolution is not automatizable unless P = NP. Indeed, we show it is NP-hard to distinguish between formulas that have Resolution refutations of polynomial length and those that do not have subexponential length refutations. This also implies that Resolution is not automatizable in subexponential time or quasi-polynomial time unless NP is included in SUBEXP or QP, respectively.
| (R1) | or , but not both; | |
| (R2) | if , then both and ; | |
| (R3) | and ; | |
| (R4a) | if and , then ; | |
| (R4b) | if and , then ; | |
| (R5a) | if , , and , then ; | |
| (R5b) | if , , and , then ; | |
| (R6) | if and appears in , then ; | |
| (R7) | or ; | |
| (R8) | . |
| (H1) | is injective, | |
| (H2) | and , | |
| (H3) | if , then , | |
| (H4) | if with , then . |
| (C1) | , | |
| (C2) | . |
| (A1) | , | ||
| (A2) | , | ||
| (A3) | , | ||
| (A4) | , | ||
| (A5) | , , , | ||
| (A6) | , , , | ||
| (A7) | , , , | ||
| (A8) | , , , | ||
| (A9) | , | ||
| (A10) | , | ||
| (A11) | , | ||
| (A12) | , | ||
| (A13) | , , | ||
| (A14) | , , | ||
| (A15) | , , | ||
| (A16) | , , | ||
| (A17) | , , , , | ||
| (A18) | , , , , | ||
| (A19) | , , , | ||
| (A20) | , , | ||
| (A21) | , . |
| (A1) | , | ||
| (A2) | , | ||
| (A3) | , | ||
| (A4) | , | ||
| (A5) | , , , | ||
| (A6) | , , , | ||
| (A7) | , , , | ||
| (A8) | , , , | ||
| (A9) | , | ||
| (A10) | , | ||
| (A11) | , | ||
| (A12) | , | ||
| (A13) | , , | ||
| (A14) | , , | ||
| (A15) | , , | ||
| (A16) | , , | ||
| (A17) | , , , , | ||
| (A18) | , , , , | ||
| (A19) | , , , | ||
| (A20) | , , | ||
| (A21) | , , | ||
| (A22) | , , | ||
| (A23) | , , | ||
| (A24) | . |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Automating Resolution is NP-Hard
Albert Atserias and Moritz Müller
Universitat Politècnica de Catalunya
Abstract
We show that the problem of finding a Resolution refutation that is at most polynomially longer than a shortest one is NP-hard. In the parlance of proof complexity, Resolution is not automatizable unless P = NP. Indeed, we show that it is NP-hard to distinguish between formulas that have Resolution refutations of polynomial length and those that do not have subexponential length refutations. This also implies that Resolution is not automatizable in subexponential time or quasi-polynomial time unless NP is included in SUBEXP or QP, respectively.
1 Introduction
The proof search problem for a given proof system asks, given a tautology, to find an approximately shortest proof of it. Clearly, the computational complexity of such problems is of fundamental importance for automated theorem proving. In particular, among the proof systems for propositional logic, Resolution deserves special attention since most modern implementations of satisfiability solvers are based on it.
We say that the proof search problem for Resolution is solvable in polynomial time if there is an algorithm that, given a contradictory CNF formula as input, outputs a Resolution refutation of in time polynomial in , where is the size of , and is the length of a shortest Resolution refutation of . More succinctly, we say that Resolution is automatizable [11]. It is clear that the concept of automatizability applies not only to Resolution but to any refutation or proof system, and one can ask for automating algorithms that run in quasi-polynomial time, subexponential time, etc..111The time of the automating algorithm is not measured in but in because can be much larger than . We use both and , and not just , because a Resolution refutation need not use all clauses in , but the algorithm should be given the opportunity to at least read all of .
In this paper we show that Resolution is not automatizable unless . The assumption is clearly optimal since implies that it is. To prove our result we give a direct and efficient reduction from , the satisfiability problem for 3-CNF formulas. The reduction is so efficient that it also rules out quasi-polynomial and subexponential time automating algorithms for Resolution under the corresponding hardness assumptions. More precisely, let QP and SUBEXP denote the classes of problems that are decidable in quasi-polynomial time , and in subexponential time , respectively. Then our main result reads:
Theorem 1**.**
Resolution is not automatizable in subexponential time unless . 2. 2.
Resolution is not automatizable in quasi-polynomial time unless . 3. 3.
Resolution is not automatizable in polynomial time unless .
That Resolution is not automatizable in polynomial time has been known under a stronger assumption from parameterized complexity theory, using a more contrived reduction [1]: we review the literature below. The first two statements in Theorem 1 give the first evidence that Resolution is not automatizable in quasi-polynomial or subexponential time. As in the third statement, their assumptions are also optimal in that and imply that Resolution can be automated in quasi-polynomial and subexponential time, respectively.
The main result as stated in Theorem 1 is a direct consequence of the fact, which we also prove, that the problem of non-trivially approximating minimum proof length for Resolution is NP-hard. If for a CNF formula we write for the size of , and for the length of a shortest Resolution refutation of , then we show:
Theorem 2**.**
There are reals and and a polynomial-time computable function that maps any 3-CNF formula to a CNF formula such that, for and :
- (a)
if is satisfiable, then ; 2. (b)
if is unsatisfiable, then .
Moreover, and can be chosen arbitrarily close to and , respectively, which means that it is NP-hard to approximate the minimal Resolution refutation length to within for any .
Proof idea
An idea of how a map as in Theorem 2 could be defined is implicit in [36]. Pudlák [36, Theorem 2] maps a formula to , for some suitable for his context, where is a CNF formula whose clauses describe, in a natural way, the Resolution refutations of of length . He used this function to show that the canonical pair of Resolution is symmetric. In particular, he showed that, if is satisfiable, then has a short Resolution refutation. This refutation proceeds naturally by using a satisfying assignment for as a guide to find a true literal in each line of the alleged refutation, line by line one after another, until it gets stuck at the final empty clause. Conversely, we would like to show that, if is unsatisfiable, then is hard for Resolution. Intuitively, this should be the case: refuting means proving a lower bound and “our experience rather suggests that proving lower bounds is difficult” – this is what Pudlák [36, Section 3] states about a similar formula for strong proof systems.
However, even after considerable time and effort, we failed to prove a Resolution length lower bound for . We bypass the issue by considering a harder version of . The harder is obtained from relativizing seen as the propositional encoding of a first-order formula with a built-in linear order, following the general relativization technique of Dantchev and Riis [19]. When is satisfiable, Pudlák’s upper bound for goes through to , and the linear order is crucial in this. On the other hand, a random restriction argument in the style of [19] reduces a length lower bound for to a certain width lower bound for . The bulk of the current work is in establishing this width lower bound for , when is unsatisfiable. It is proved by showing that, even if has (not too slow) polynomial growth, the formulas and are indistinguishable by inferences of bounded width, where is the number of variables in . Since every unsatisfiable CNF formula with variables has a refutation of length , the formula is satisfiable, from which it follows that does not have bounded-width refutations.
The technical device that we use in the indistinguishability argument is a variant of the conditions from [34], a particular formalization of a Prover-Adversary argument as, e.g., in [19]. The wording is meant to point out some analogy with forcing conditions [6]. This is not straightforward. The main obstacle overcome by our variant is the presence of the built-in linear order in . In fact, Dantchev and Riis [19, Section 5] point out explicitly that their arguments fail in the presence of a built-in linear order.
History of the problem
The complexity of the proof search problem has been extensively investigated. Krajíček and Pudlák [33] showed that Extended Frege systems222We refer to the textbook [28, Chapter 4] for a definition of this and the following systems. All notions relevant to state and prove our results are going to be defined later. are not automatizable assuming RSA is secure against . Subsequently, Bonet et al. showed this for Frege [11] and bounded depth Frege systems [12] assuming the Diffie-Hellman key exchange is secure against polynomial or, respectively, subexponential size circuits.
In fact, these results rule out feasible interpolation, an influential concept introduced to proof complexity by Krajíček [27, 29]. We refer to [32, Chapters 17, 18] for an account. If a system with feasible interpolation has short refutations of the contradictions that state that a pair of NP problems are not disjoint, then the pair can be separated by small circuits. Hence, feasible interpolation can be ruled out by finding short proofs of the disjointness of an NP pair that is hard to separate. Such hardness assumptions turn up naturally in cryptography [23] which explains the type of assumptions that were used in the results above.
The failure of feasible interpolation for a natural system implies (cf. [4, Theorem 3]) that is not even weakly automatizable in the sense that it would be polynomially simulated (see [18]) by an automatizable system. Hence, the above results left open whether weak proof systems, in particular those having feasible interpolation such as Resolution [29], were (weakly) automatizable. We refer to [3] for a survey, and focus from now on on Resolution.
Pudlák showed [36, Corollary 2] that the weak automatizability of a proof system is equivalent to the (polynomial time) separability of its, so-called, canonical NP-pair [37]. This is, informally, the feasibility of distinguishing between satisfiable formulas and those with short refutations. Hence, to rule it out it suffices to reduce some inseparable disjoint NP pair to it. Atserias and Maneva [5] found in this respect useful pairs associated to two player games. The two NP sets collect the games won by the respective players, and separation means deciding the game. Following [5, 24], Beckmann et al. [9] showed that Resolution is not weakly automatizable unless parity games are decidable in polynomial time. Note, however, that this might well be the case, in fact, parity games are decidable in quasi-polynomial time [14].
Moreover, some non-trivial automating algorithms are known. Beame and Pitassi [8] observed that treelike Resolution is automatizable in quasi-polynomial time. For general Resolution there is an algorithm that, when given a 3-CNF formula with variables that has a Resolution refutation of length at most , computes a refutation in time . This follows from the size-width trade-off of Ben-Sasson and Wigderson [10]. Indeed, it is trivial to find a refutation of width at most in time if there is one (and, in general, time is necessary [7]). When is subexponential the runtime of this algorithm is the non-trivial .
However, the automatizability of Resolution is unlikely. First, Alekhnovich et al. [2] showed, assuming only , that automatization is not possible in linear time. In fact, they proved more. They considered the optimization problem of finding, given a contradictory CNF, a Resolution refutation that is as short as possible. They reduced to it the optimization problem MMCSA of finding, given a monotone circuit, a satisfying assignment that has Hamming weight as small as possible. Known PCP theorems imply that this problem is not approximable with superconstant but sublinear ratio , so the same holds for finding short Resolution refutations. This argument can be adapted to many other refutation systems (see [2]).
But the main convincing evidence that Resolution is not automatizable, before the result of this paper, was achieved by Alekhnovich and Razborov [1]. By a different and ingenious reduction they showed that if Resolution, or even treelike Resolution, were automatizable, then MMCSA would have, in the terminology of parameterized complexity theory (see [16, Proposition 5]), an fpt algorithm with constant approximation ratio. Now, the same paper [1] also established “the first nontrivial parameterized inapproximability result” [20, p.9] by further deriving a randomized fpt algorithm for the parameterized decision version of MMCSA, a well-known W[P]-complete problem (see e.g. [21, Theorem 3.14]). The randomized fpt algorithm has subsequently been derandomized by Eickmeyer et al. [20], hence Resolution is not automatizable unless . Very recently, Mertz et al. [26] showed that Resolution is not automatizable in time unless ETH fails; this follows the same line of argument as [1] but is based on a more recent parameterized inapproximability result due to Chen and Lin [17].
Since these results apply not only to Resolution but even to treelike Resolution, which is automatizable in quasipolynomial time, Alekhnovich and Razborov stated that the “main problem left open” [1, Section 5] is whether general Resolution is automatizable in quasi-polynomial time. We consider Theorem 1 as an answer to this question.
The computational problem of computing minimal proof lengths also has a long history. For first-order logic, the problem dates back to Gödel’s famous letter to von Neumann; we refer to [35] for a historical discussion, to [13] for a proof of Gödel’s claim in the letter, and to [15] for some more recent results. In propositional logic, the problem has been shown to be NP-hard for a particular Frege system by Buss [13], and for Resolution by Iwama [25]. Alekhnovich et al. [2] showed that the minimal Resolution refutation length cannot be approximated to within any fixed polynomial unless : for every there are functions and , computable in non-uniform polynomial time, such that for every CNF formula of sufficiently large size we have either or depending on the satisfiability . This falls short to rule out automatizability because has exponential growth. Earlier, Iwama [25] found uniformly computable such functions with polynomially bounded but his gap was only versus for a constant , so also falls short to rule out automatizability.
Outline
In Section 2 we introduce some notation and basic terminology from propositional logic. Section 3 presents Resolution refutations as finite structures. Section 4 is devoted to and proves the width lower bound when is unsatisfiable (Lemma 4). Section 5 discusses the relativized formula , the refutation length upper bound when is satisfiable (Lemma 11), and the refutation length lower bound when is unsatisfiable (Lemma 10). Theorems 2 and 1 are derived from these lemmas in Section 6. In Section 7 we discuss some open issues. Finally, for easiness of reference, in Appendix A we give the detailed lists of clauses for the formulas and .
2 Preliminaries
For we let and understand that . A partial function from a set to a set is a function with domain included in and image included in . We view partial functions from to as sets of ordered pairs . For any set , the restriction of to is . The restriction of with image is .
We fix some notation for propositional logic. Let be a set of propositional variables that take truth values in , where [math] denotes false and denotes true. A literal is a variable or its negation , also denoted . We also write for and for . A clause is a set of literals, that we write as a disjunction of its elements. A clause is non-tautological if it does not contain both a variable and its negation. The size of a clause is the number of literals in it. A CNF formula, or CNF, is a set of clauses, that we write as a conjunction of its elements. A -CNF, where , is a CNF in which all clauses have size at most . The size of a CNF is the sum of the sizes of its clauses. We use to denote the size of .
An assignment, or restriction, is a partial map from the set of variables to . If is an assignment and is a literal, then satisfies if and ; it falsifies if and . If is a clause, then satisfies if it satisfies some literal of ; it falsifies if it falsifies every literal of . The restriction of by , denoted , is if satisfies and [math] if falsifies ; if neither satisfies nor falsifies , then is the clause obtained from by removing all the falsified literals of , i.e., . If is a CNF, then is the CNF that contains for those which are neither satisfied nor falsified by , and that contains the empty clause if some is falsified by .
A clause is a weakening of clause if . A clause is a resolvent of clauses and if there is a variable such that and , and ; we then speak of the resolvent of and on , that we denote by . We also say that is obtained from and by a cut on .
Let be a CNF. A Resolution proof from is a sequence of non-tautological clauses, where and, for all , it holds that is a weakening of a clause in , or there are such that is a weakening of a resolvent of and . The length of the proof is ; each is a line. A Resolution refutation of is a proof from that ends with the empty clause, i.e., . We let denote the minimal such that has a Resolution refutation of length ; if is satisfiable, we let . For a sequence of clauses let be obtained from by removing 1’s and replacing 0’s by the empty clause. It is clear that if is a Resolution refutation of of length , then is a Resolution refutation of of length at most .
3 Refutations as structures
For this section we fix a CNF with variables and clauses . We view Resolution refutations of of length as finite structures with a ternary relation and four unary functions :
[TABLE]
The meaning of is that the literal is in . For each exactly one of or is non-zero. The meaning of is that is a weakening of the resolvent of and on , where and , and and . The meaning of is that is a weakening of the clause of . Formally, a structure of type (1) is a refutation of of length if the following hold for all , , , and :
In words, (R1) determines, for every line , whether it is a weakening of an initial clause, i.e., , or a weakening of a resolvent, i.e., . In the first case by (R6). In the second case, by (R4) and (R5), with (R2) and (R3) ensuring that and are earlier lines in the sequence. Finally, (R7) ensures no is tautological, and (R8) ensures is empty.
We give an example that will play a crucial role in the proof of the width lower bound.
Example 3**.**
We use to denote the full-tree Resolution refutation of . It has length
[TABLE]
and its clauses are arranged in the form of a full binary tree of height with internal nodes and leaves. This tree has one node at level for every that is labelled by the clause
[TABLE]
that is, the unique clause in these variables falsified by the assignment that maps to . In particular, the root of the tree is labelled by the empty clause and, for and , the clause that labels node is the resolvent of the clauses and that label the children nodes and on the variable , i.e., . Since is unsatisfiable, every clause that labels a leaf is a weakening of some clause of .
To view this refutation as a structure of type (1) we have to identify the nodes with numbers in . We first identify the leafs, i.e., the nodes with , with the numbers , then we identify the nodes on level , i.e., the nodes with , with the numbers in and so on, with the root getting .
Let for . We set if , and if . We set if , and if and is, say, smallest such that is a weakening of . We set if . If we set and . Finally, if and only if and .
4 Non-relativized formula REF
Given a CNF with variables and non-tautological clauses , and a natural number , we describe a CNF formula that is satisfiable if and only if has a refutation of length . Its variables are:
- •
for , , indicating that .
- •
for , indicating that .
- •
for , indicating that .
- •
for , indicating that .
- •
for , indicating that .
Clearly, any assignment to these variables describes a ternary relation and binary relations , , and . The clauses of are listed in Table LABEL:fig:refclauses of Appendix A. This set of clauses is satisfied precisely by those assignments that describe refutations of of length . Conversely, given a structure as in (1) the associated assignment satisfies if and only if is a refutation of of length ; this assignment maps variables , , , , and to 1 or 0 depending on whether, respectively, , , , , and or not.
The index is mentioned in the variables
[TABLE]
Observe that if , then is not mentioned in or . The index-width of a clause is the number of indices mentioned by some variable occurring in the clause. Observe that all clauses of have index-width at most two. The index-width of a Resolution refutation is the maximum index-width of its clauses.
Lemma 4**.**
For all integers with and every unsatisfiable CNF with variables, every Resolution refutation of has index-width at least .
Proof.
Fix an unsatisfiable CNF with variables and clauses. For this proof let denote the formula and let denote the formula , where is the length of the full-tree Resolution refutation of from Example 3, which exists for because it is unsatisfiable. Let be the assignment associated to .
Let be an integer such that and note that since and . We partition into intervals where
[TABLE]
In the notation of Example 3, is the set of many nodes at the top levels of the full binary tree. For , the -th block is the set of nodes at level of the full binary tree. In particular, is the set of leaves.
Likewise, we partition into intervals where
[TABLE]
Observe that ; let be the bijection defined by so that for all it holds that
[TABLE]
Observe that for all :
[TABLE]
with the second following from and .
Let be the collection of partial functions such that:
In words, condition (H4) says that preserves membership in matching intervals, and (H3) says that the 0-intervals are kept intact through the fixed bijection . Preserving the intervals has the following important consequence:
Claim 5**.**
For every and the following hold:
* and ,* 2. 2.
if , then , 3. 3.
if , then .
Proof.
Property 1 follows from (H1) and (H2). To prove 2 we distinguish several cases: If , then by (H4), hence and by (H2), which is smaller than . If for some , then by (H4), hence , and by (H4) again, which is smaller than . If , then first note that by (H3). We distinguish the cases whether or not. In case , we have . Since , by (2) we have . In case , we have , so by (H4), which is smaller than . The proof of 3 is analogous to that of 2. ∎
For a set , let
[TABLE]
A condition is a pair , where and are functions in , such that
We say a condition extends if , i.e., extends as a function. Observe, since ,
[TABLE]
We define a partial truth assignment that sets the variables of as follows. Note that if , , and are variables of , then , , and are variables of which are evaluated by . The assignment is defined precisely on the variables of that mention some . For such it maps
- –
to , for all and ; 2. –
to , for all ; 3. –
to , for all ; 4. –
to or [math] indicating whether , for all ; 5. –
to or [math] indicating whether , for all .
Note that and belong to for every , so is defined in the last two cases.
Clearly, if a condition extends , then . For , the restriction of to , denoted , is the pair where is the restriction of to , and is the restriction of with image .
Claim 6**.**
If is a condition and , then is a condition and .
Proof.
The requirement that and belong to is obviously satisfied since (H1)-(H4) are preserved by restrictions to subsets that contain [math]. (C1) and (C2) are clear, so is a condition. The inclusion is clear since extends . ∎
Claim 7**.**
If is a condition with and , then there exists a condition that extends and such that .
Proof.
We assume (otherwise we take ) and set for chosen as follows: if , take ; otherwise for some and we choose . Note there exists as desired because by (3) or (4), so by (5)
[TABLE]
It is clear that . Write and . We have to find such that . Assume at least one of is not in . Then it is distinct from [math] (i.e., ), say it is in . If , we find as the pre-images of under . Otherwise and we choose such that is injective. This can be done because by (3) or (4), so by (5)
[TABLE]
It is clear that . ∎
Claim 8**.**
If is a condition and is a clause of , then .
Proof.
Let , write and assume is defined on all variables of . Then is defined on all indices mentioned by . We distinguish by cases according to the type (A1)-(A21) of .
- –
In case is of type (A1), i.e., for some , then equals and this is 1 because is a clause of . Case (A2) is similar. 2. –
In case (A3), ( and) satisfies for . Note , so is well-defined. Hence . Case (A4) is similar. 3. –
In case (A5), equals and this is 1 because is a clause of . Case (A6) is similar. 4. –
In case (A7), or is distinct from and then, respectively, or is falsified by . Hence . Case (A8) is similar. 5. –
In case (A9), equals . But this is 1 since is a clause of . Case (A10) is similar. 6. –
In case (A11), note implies , so by Claim 5 (1). Then is a leaf and . Hence , so . Case (A12) is similar. 7. –
In case (A13), note implies . But by (C1) and Claim 5 (2). Case (A14) is similar. 8. –
In case (A15), implies and . Hence (by (C1)) and . Further, implies and . Hence falsifies the clause of , a contradiction. Cases (A16)-(A18) are similar. 9. –
In case (A19), implies that falsifies the clause of , a contradiction. Case (A20) is similar. 10. –
In case (A21), implies and falsifies the . But this is a clause of since by (H3) – contradiction.
This finishes the proof of Claim 8. ∎
We are ready to finish the proof of the lemma. Let be the set of conditions with . Assume that there exists a Resolution refutation of of index-width smaller than . Let where and note that , so . The assignment is empty and falsifies the empty clause, the last clause of the refutation. Let be the earliest clause in the refutation such that for some condition . In particular, is defined on all variables of . By Claim 8, is not a weakening of a clause from . Hence, is obtained by a cut of earlier clauses and on some variable. Let be the index mentioned by this variable. Choose according to Claim 7. Then is defined on all variables in , , and and extends the partial assignment , so falsifies . By soundness it falsifies or , say, it falsifies . Let be the restriction of to the indices mentioned in . Then falsifies and by Claim 6. This contradicts the choice of . ∎
Remark 9**.**
The width lower bound in the previous lemma does not have much to do with Resolution; a more general version can be formulated using the notions of semantic refutations and Poizat width from [5]. The notion of a Poizat tree is straightforwardly adapted to the many-sorted structures coding refutations. Define index Poizat width like Poizat width but using the index height of a Poizat tree: the maximum over its branches of the number of indices from appearing in queries of the branch. Then, the conclusion of the above lemma can be strengthened to: every semantic refutation of contains a formula of index Poizat width at least .
5 Relativized formula RREF
Given a CNF formula with variables and clauses, and a natural number , we define the CNF formula as follows. We again write for the variables and for the clauses of . The CNF formula has the same variables as plus
- •
for indicating that is an “active” index.
The clauses of are very similar to those of with a few additional literals in each clause, and three additional types of clauses. For easiness of future reference, we explicitly listed the new set of clauses in Table LABEL:fig:rrefclauses of Appendix A. In words, says that the lines indexed by its at most active indices describe a Resolution refutation of , and it does not put any restriction on the structure of the lines on inactive indices.
First we prove the lower bound:
Lemma 10**.**
There is an integer such that for all integers and with and and every unsatisfiable CNF formula with variables, every Resolution refutation of has length bigger than .
Proof.
Let be an unsatisfiable CNF with variables and clauses and . Assume is a Resolution refutation of of length where . We derive a contradiction assuming at various places that is large enough and this determines the constant . It will be clear that it does not depend on or .
We define a random restriction to (a subset of) the variables of by the following random experiment:
independently for every , map to or [math] each with probability ; 2. 2.
let be the set of for which is mapped to ; 3. 3.
for every and , map both and to 0; 4. 4.
independently for every and every variable that mentions , map the variable to or [math] each with probability .
A literal that mentions evaluates to 1 under with probability at least , namely in the event that is mapped to [math] in step 1 and the right value is chosen in step 4. Thus, the probability that a clause of index-width at least is not satisfied by is at most . By the union bound, the probability that contains a clause of index-width at least is at most , which is strictly less than for (here we use that ). Note the clauses of use variables of , so index-width is well-defined.
The cardinality of the random subset is a symmetric binomial random variable with expectation . By the Chernoff bound there is a real , independent of and , such that with probability at most . For large enough this is strictly less than . Further, is mapped to with probability . Thus, for large enough , by the union bound, there exists a restriction in the support of the above distribution, say, with associated set , such that:
- (i)
has index-width smaller than ; 2. (ii)
; 3. (iii)
maps to , so .
By (iii), satisfies (A24). Also, for a clause of type (A22) or (A23) because this holds for every restriction in the support of the distribution.
Let and let be defined as except that we use instead as index set, with in the role of . More precisely, is obtained from by a copy of variables: a variable is replaced by the variable (of ) obtained by changing its index (and ) to the -th (and the -th) member of .
We claim that for every clause we either have or . We already checked this for (A22)-(A24) and are left with (A1)-(A21). For example, if is a clause of type (A3), then if , and otherwise is a clause in . The case that is of type (A4) is similar. The remaining cases are obvious. Thus, is a Resolution refutation of of length at most . By (i) and (ii), if is large enough, this contradicts Lemma 4 (note ). ∎
The next lemma gives a polynomial upper bound on the length of Resolution refutations of when is satisfiable. In fact, its second statement gives an upper bound that is possibly sublinear in the size of . This second statement is not needed to prove Theorems 1 and 2.
Lemma 11**.**
There is a polynomial such that for all integers and every satisfiable CNF formula with variables and clauses, there exists a Resolution refutation of of length at most . In fact, .
Proof.
Let be a satisfiable CNF with variables and clauses . Let be an assignment that satisfies . We derive the clauses
[TABLE]
for in order. Then many cuts with (A21) and one cut with (A24) yield the empty clause.
First, we derive, for all and , as many weakenings of clauses of , the auxiliary clauses
[TABLE]
Since satisfies we can choose for every some such that appears in . Then is a clause of , namely (A19). But is a weakening of this.
We derive for through many cuts. Through a sequence of many cuts, starting at (A4) and using (A14) for all , get . Cut this with (A12) to get . Cut this with (A2), followed by a sequence of many cuts with all for to get .
Now assume and have been derived for all . First, we derive for every the auxiliary clause
[TABLE]
We treat the case , the case is analogous. Let . Cut (A15) with of type (A20) on to get
[TABLE]
Cut this with on to get
[TABLE]
Cut this with (A17) on for every , and then with with (A22) on to get
[TABLE]
Now cut (A3) with this formula for all , and with (A13) for all to get the following subclause of
[TABLE]
For every , the clause (6) is derived with cuts. Thus, is derived with many cuts. Doing this for all amounts to many cuts.
Having derived the auxiliary clauses we now derive in a sequence of cuts. In a sequence of many cuts, cut (A1) with for all , to get
[TABLE]
Cut with (A9) on , then with (A11) on , and then with (A12) on to get Cut (A2) with this and then with for all in sequence to get as desired.
In total, the refutation uses
[TABLE]
many cuts: the first term counts the cuts in the derivation of , the second term counts the cuts to get the empty clause from , and each term in the big sum counts the cuts in the derivation of for . The length of the refutation is bounded by the number of cuts plus the weakenings to get the ’s plus the number of clauses of . But, in fact, the clauses (A7) and (A8) are not used by the given refutation, and has at most many other clauses. ∎
Remark 12**.**
In the proof of the upper bound Lemma 11, the built-in linear order in the definition of plays a crucial role. This refers to the side conditions in clauses (A13) and (A14) of the definition of RREF in Table LABEL:fig:rrefclauses of Appendix A. Indeed, it is not hard see that if the linear order were not built-in but interpreted, through new propositional variables and its corresponding clause axioms, then the resulting version of would be exponentially hard for resolution independently of the satisfiability or unsatisfiability of . This follows from an “infinite model argument” similar to the proof of the main theorem in [19].
6 Proofs of the hardness results
In this section we derive Theorems 1 and 2 stated in the Introduction.
Proof of Theorem 2.
It suffices to define on 3-CNF formulas with a sufficiently large number of variables . Note for the number of clauses of . We set
[TABLE]
Note has size between and for some constant . Thus, (a) follows from the first statement of Lemma 11 for some constant , and (b) follows from Lemma 10 for and some constant (note that for sufficiently large ). ∎
Our main result, Theorem 1, is implied by the more general statement below. We say that Resolution is automatizable in time if there is an algorithm that, given an unsatisfiable CNF formula , computes some Resolution refutation of in time . Recall that a function is time-constructible if there is an algorithm that given (the string of many 1’s) computes in time . We say that is subexponential if .
Theorem 13**.**
Let be time-constructible, non-decreasing and subexponential. If Resolution is automatizable in time , then there are polynomials and and an algorithm that, given a 3-CNF formula with variables, decides in time whether is satisfiable.
Proof.
Assume that Resolution is automatizable in time and choose , and from Theorem 2. Let be as in the proof given above, so has size at most for every 3-CNF formula with a sufficiently large number of variables.
Consider the following algorithm. Given a 3-CNF formula with variables, compute the formula and run the automating algorithm for up to steps. If the algorithm returns a Resolution refutation within the allotted time, then output ‘satisfiable’. Else output ‘unsatisfiable’.
It is clear that the algorithm runs in time for some polynomial ; here we use that is time-constructible. It suffices to show that it is correct on 3-CNF formulas with a sufficiently large number of variables . If is satisfiable, then by (a) of Theorem 2, has a Resolution refutation of length at most , so the automating algorithm computes a refutation within the alloted time and we answer ‘satisfiable’; here we use that is non-decreasing. If is unsatisfiable, then by (b) of Theorem 2, no Resolution refutation of has length at most , so the automating algorithm cannot compute one within the allotted time and we answer ‘unsatisfiable’; here we use that is subexponential. ∎
7 Concluding remarks
This final section contains the observation that, by a padding argument, the constants and in Theorem 2 can be chosen arbitrarily close to and , respectively, and finishes with some questions.
Proof of Theorem 2 for and .
Given we define for a 3-CNF and a natural and verify (a) and (b) for assuming that and are sufficiently large; again, denotes the number of variables of . The meaning of “sufficiently large” for will depend only on . Write and let be obtained from by deleting the clauses of type (A7) and (A8). As has been noted in the proof, Lemma 11 holds true for instead . Since has at most clauses, the size of satisfies for some constant , a number independent of and . By Lemma 11, for some constant ; but this is at most if . By Lemma 10, , but this is more than if . ∎
The reduction above falls short to rule out weak automatizability of Resolution. For this we would need that when is unsatisfiable, i.e., that is satisfiable, but this is unlikely to hold for a polynomial time as it would put 3-SAT in co-NP. We refer to [4] for a proof of equivalence of the different characterizations of weak automatizability used here and in the Introduction. The main problem left open by the current work is to find more convincing evidence that Resolution is not weakly automatizable.
On the more technical side, we would like to know whether the formulas are hard for Resolution where ranges over unsatisfiable CNF formulas with variables and is some fixed polynomial. We conjecture that this is the case but we only succeeded in establishing a width lower bound333Recently, M. Garlík confirmed the conjecture [22].. Of course, one can define analogous formulas for any proof system . For all we know it could be that such formulas are hard for strong proof systems like Frege or Extended Frege. Of course, it would be a major breakthrough to prove this, even under some plausible computational hardness hypothesis. We refer to [31, Chapter 27] for a discussion.
Appendix A Formulas REF and RREF
In this appendix we include the detailed lists of clauses of the formulas and . Recall, we use bars to denote the negation of the variables, e.g., denotes the negation of .
In the clauses of , clauses (A1)-(A8) say , , and are functions with the appropriate domains and ranges, (A9)-(A10) express (R1), (A11)-(A12) express (R2), (A13)-(A14) express (R3), (A15)-(A16) express (R4), (A17)-(A18) express (R5), (A19) expresses (R6), (A20) expresses (R7), and (A21) expresses (R8).
The clauses of are the same as for but we add to each clause the literals with mentioned by the clause. More precisely, is added to the clauses (A1)-(A14) and (A19) and (A20), both and are added to the clauses (A15)-(A18), and is added to clause (A21). Further, we add three additional types of clauses numbered (A22)-(A24).
Acknowledgments
Both authors were partially funded by European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme, grant agreement ERC-2014-CoG 648276 (AUTAR). First author partially funded by MICCIN grant TIN2016-76573-C2-1P (TASSAT3). We are grateful to Ilario Bonacina and Michal Garlik for their very useful comments on an earlier draft of this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Alekhnovich and A. A. Razborov. Resolution is not automatizable unless W[P] is tractable. SIAM Journal on Computing 38 (4): 1347-1363, 2008.
- 2[2] M. Alekhnovich, S. R. Buss, S. Moran and T. Pitassi. Minimum propositional proof length is NP-hard to linearly approximate. The Journal of Symbolic Logic 66: 171-191, 2001.
- 3[3] A. Atserias. The proof-search problem between bounded-width Resolution and bounded-degree semi-algebraic proofs. 16th International Conference on Theory and Applications of Satisfiability Testing (SAT’13), LNCS 7962, Springer, pp. 1-17, 2013.
- 4[4] A. Atserias and M. L. Bonet. On the automatizability of Resolution and related propositional proof systems. Information and Computation 189 (2): 182-201, 2004.
- 5[5] A. Atserias and E. Maneva. Mean-payoff games and propositional proofs. Information and Computation 209 (4): 664-691, 2011.
- 6[6] A. Atserias and M. Müller. Partially definable forcing and bounded arithmetic. Archive for Mathematical Logic 54 (1): 1-33, 2015.
- 7[7] A. Atserias, M. Lauria, and J. Nordström. Narrow proofs may be maximally long. ACM Transactions on Computational Logic 17 (3), 19:1-19:30, 2016.
- 8[8] P. Beame and T. Pitassi. 1996. Simplified and improved resolution lower bounds. In 37th Annual Symposium on Foundations of Computer Science (FOCS’96), IEEE Computer Society, pp. 274-282. 1996.
