A Model of Random Industrial SAT
Dina Barak-Pelleg, Daniel Berend, J.C. Saunders

TL;DR
This paper introduces a new model of industrial SAT that incorporates community structure among variables, and studies how this affects the satisfiability threshold, finding it tends to be lower or even vanish compared to standard random SAT.
Contribution
It proposes a community-structured random SAT model and analyzes its satisfiability threshold, revealing significant differences from traditional random SAT models.
Findings
Threshold in the new model is smaller than in random SAT.
Under certain conditions, the threshold vanishes.
Community structure influences satisfiability properties.
Abstract
One of the most studied models of SAT is random SAT. In this model, instances are composed from clauses chosen uniformly randomly and independently of each other. This model may be unsatisfactory in that it fails to describe various features of SAT instances, arising in real-world applications. Various modifications have been suggested to define models of industrial SAT. Here, we focus mainly on the aspect of community structure. Namely, here the set of variables consists of a number of disjoint communities, and clauses tend to consist of variables from the same community. Thus, we suggest a model of random industrial SAT, in which the central generalization with respect to random SAT is the additional community structure. There has been a lot of work on the satisfiability threshold of random -SAT, starting with the calculation of the threshold of -SAT, up to the recent result…
| Parameters | Result | ||
| SAT w.h.p. | |||
| UNSAT w.h.p. | |||
| UNSAT w.h.p. | |||
| UNSAT w.h.p. | |||
| UNSAT w.h.p. | |||
| UNSAT w.h.p | UNSAT w.h.p. | |
| UNSAT w.h.p. | ||
| SAT w.h.p. | UNSAT w.h.p | |
| SAT w.h.p |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsConstraint Satisfaction and Optimization · Rough Sets and Fuzzy Logic · Advanced Algebra and Logic
A Model of Random Industrial SAT
D. Barak-Pelleg
Department of Mathematics, Ben-Gurion University, Beer Sheva 84105, Israel. E-mail: [email protected]
D. Berend
Departments of Mathematics and Computer Science, Ben-Gurion University, Beer Sheva 84105, Israel. E-mail: [email protected] supported in part by the Milken Families Foundation Chair in Mathematics.
J.C. Saunders
Department of Mathematics, Ben-Gurion University, Beer Sheva 84105, Israel. E-mail: [email protected] supported by an Azrieli Fellowship from the Azrieli Foundation.Current address: Department of Mathematics & Statistics, University of Calgary, 2500 University Drive NW, Calgary, Alberta, Canada, T2N 1N4.
Abstract
One of the most studied models of SAT is random SAT. In this model, instances are composed from clauses chosen uniformly randomly and independently of each other. This model may be unsatisfactory in that it fails to describe various features of SAT instances, arising in real-world applications. Various modifications have been suggested to define models of industrial SAT. Here, we focus mainly on the aspect of community structure. Namely, here the set of variables consists of a number of disjoint communities, and clauses tend to consist of variables from the same community. Thus, we suggest a model of random industrial SAT, in which the central generalization with respect to random SAT is the additional community structure.
There has been a lot of work on the satisfiability threshold of random -SAT, starting with the calculation of the threshold of -SAT, up to the recent result that the threshold exists for sufficiently large .
In this paper, we endeavor to study the satisfiability threshold for the proposed model of random industrial SAT. Our main result is that the threshold in this model tends to be smaller than its counterpart for random SAT. Moreover, under some conditions, this threshold even vanishes.
1 Introduction
For both historical and practical reasons, the Boolean satisfiability problem (SAT) is one of the most important problems in theoretical computer science. It was the first problem proven to be NP-complete [11]. Since its introduction, there has been growing interest in the problem, and many aspects of the problem have been researched.
In this problem, one is required to determine whether a certain Boolean formula is satisfiable. An instance of the problem consists of a Boolean formula in several variables . The formula is usually given in conjunctive normal form (CNF). The basic building block of the formula is a literal, which is either a variable or its negation . A clause is a disjunction of the form of several distinct literals. Thus, altogether, the formula looks like , where each is a clause, say . Given a formula, one may assign a TRUE/FALSE value to each of the variables . The formula is satisfiable, or SAT, if there exists an assignment under which the formula is TRUE, and is unsatisfiable, or UNSAT, otherwise.
The -satisfiability (-SAT) problem is a special case of SAT, in which each clause is a disjunction of up to literals. Some authors restrict -SAT to instances with exactly literals in each clause, which terminology we will follow here. Given and , let denote the set of all possible clauses of length over Boolean variables. A random k-SAT *instance * with clauses is a uniformly random element of . Namely, it consists of clauses, selected uniformly randomly and independently from . Thus, clause repetitions are allowed, and two instances, differing in the order of the clauses only, are considered as distinct.
The ratio is the density and denoted by . This parameter turns out to be very important. If is sufficiently small, then a large random instance with density is SAT with high probability, whereas if it is sufficiently large then a large random instance is UNSAT with high probability. Despite its loose name, the notion of “with high probability” is well defined. Let be a sequence of events. The event occurs with high probability (w.h.p.) if . In our case, we take larger and larger random instances with some fixed density, and inquire whether they are SAT or UNSAT. For , denote [1]:
[TABLE]
For , it was proved long ago [10, 16, 22] that . The Satisfiability Threshold Conjecture claims that, in fact, for all [10]. This conjectured common value is the satisfiability threshold. It has been a subject of interest among researchers, theoretically and empirically, to prove the conjecture for and find the threshold. Recently, the conjecture has been proved for large enough [13].
As part of this research, lower and upper bounds were obtained on and for . In [17] it was proven that . This has been improved in [25] to . From the other side, a sequence of successive improvements led finally to the bound [12]. Thus, with the satisfiability conjecture settled in [13] for large , it follows that for such . For small values of , more specific results were obtained. For , the best bounds seem to be [24, 23], and [14]. Experiments and other results of heuristics, based on statistical physics considerations, indicate that [29, 30], , , , [29].
Much more is known about -SAT. First, unlike -SAT for , which is an NP-complete problem, -SAT instances may be solved by a linear time algorithm [10, 22]. Also, there is quite precise information about -SAT for density very close to the threshold [9, 38].
It has been argued that instances of random -SAT do not in fact represent real-world, or industrial, instances [33, 34, 28]. One of the major differences between industrial and random SAT instances is that the set of variables in industrial instances often consists of a disjoint union of subsets, referred to as communities; clauses tend to comprise variables from the same community, with only a minority of clauses containing variables from distinct communities [7, 32]. There are several additional variations [4, 6]. For example, the variables may be selected non-uniformly (say, according to a power-law distribution [5, 20]), and/or the clauses may be of non-constant length.
In this paper we work with a (generalization of a) model introduced by [18] . Our model is similar to the random model, except for the partition of the variables into communities. These communities are of the same size. There are several clause types (defined precisely in the next section), differing in the number of variables from the same or distinct communities in each clause.
Our focus is on the satisfiability threshold in this model. The question has been studied in [18], mostly experimentally, for the model suggested there. We show that the findings in that paper, whereby the threshold tends to be smaller when there are many single-community clauses (i.e., clauses consisting of variables from the same random community) remain true in the general model. In fact, if the communities are small, the threshold may even be [math].
We present our model in Section 2. The main results are stated in Section 3, and the proofs follow in Section 4. In Section 5 we present some simulation results.
2 A Model of Random Industrial SAT
In industrial SAT, the strength of the community structure of an instance is usually measured by its modularity [7, 19, 35]. Roughly speaking, given a graph, its modularity gives an indication of the tendency of the vertices to be connected to other vertices, which are similar to them in some way. In our case, an instance defines the following undirected graph. The set of nodes is the set of variables . There is an edge for if there exists a clause in the instance, containing both variables (or its negation) and (or its negation). More precisely, we view this object as a multi-graph; if both and (or their negations) appear in several clauses, there are several edges connecting them. Given an instance, high modularity indicates that there exists a partition of the set of variables into subsets, such that a large portion of the edges connect vertices of the same subset, compared to a random graph with the same number of vertices and same degrees [31, 35] .
As in the regular model, we have Boolean variables and clauses in an instance. Each clause is chosen independently of the others. Each variable in each clause is negated with a probability of , independently of the other variables. The model differs from the regular model in several aspects: There is a community structure on the set of variables, and we also do not necessarily assume all clauses to be of the same length. Specifically, the set of variables is partitioned into disjoint (sets of variables referred to as) communities . For simplicity, we assume all communities to be of the same size , so that . Without loss of generality, we will assume that , . As grows, so do usually both and (although at times one of them may remain fixed), and we will write and when we want to relate to their dependence on . For an -tuple with non-increasing, positive integer entries, denote by the set of all clauses of length , formed of variables from some community , from another community , from some -th community , where the indices are mutually distinct. We will refer to such a clause as a clause of type . We will always implicitly assume that for each , so that we can indeed choose the required number of variables from the various communities. Similarly, we implicitly assume that . Let be the uniform measure on .
Example 2.1**.**
(a) Let , and . The clauses
[TABLE]
and
[TABLE]
are of types and (single-community clause), respectively. (In general, the type of single-community clauses of length will be written as .) The clauses above belong to the spaces and , respectively.
(b) The space consists of clauses, and consists of clauses.
(c) Under the measure , each clause in is chosen with probability ; under , each clause in is chosen with probability .
The random instances we will be dealing with are of the following structure. There is some number of clause types . Each , is a vector . These vectors are mutually distinct. Each clause in the instance is of one of these types. The probability of each clause to be of type is , where , are arbitrary fixed real numbers, with . More formally, we select independently clauses from the space according to the measure . Using similar notations to [18], denote by the probability space of instances. Namely, the sample space of is the -fold Cartesian product of the space corresponding to the selection of a single clause, endowed with the product measure . (For more on the notions of a product of measure spaces and of the product measure, see, for example, [2, Sec. 2.5].) For concreteness, in Algorithm 1 we present the exact mechanism for selecting an instance of .
Note that, when employing Algorithm 1, we care about the order of choices, so that each clause may be obtained in several ways. This is easier to implement and has no bearing on the probability of obtaining each possible clause.
Thus, the regular model of random -SAT is, with the notations above, . Instances in the model presented in [18] include clauses of length of two types: () single-community clauses – all variables belong to the same community ( possible choices), and () the variables belong to distinct communities ( possible choices). For some , each clause is of type () with probability and of type () with probability . With the above notations, their probability space is
[TABLE]
for some .
Example 2.2**.**
With and as in Example 2.1, and , the instance
[TABLE]
is an instance in
[TABLE]
The first clause is of type as all three variables belong to the same community , while the second clause is of type , as the variables belong to three distinct communities: , and , respectively.
As our interest in this paper is in instances constructed as above, from this point on we will use the term “community-structured" instead of the more general “industrial".
3 The Main Results
As explained above, the clauses in an community-structured instance tend to include variables from the same community. In this paper, moreover, we usually deal with the case where one (or more) of the clause types is a single-community type, namely for some and . In some results, we will further restrict ourselves to the case , where the only clause type is a single-community type (equivalently, for some ).
In [18] it was observed empirically that, when the modularity of the variable incidence graph of the instance increases, the threshold decreases. Now, the modularity in our case is larger when more clauses consist of variables from the same community and when the communities are small. Our first result is quite straightforward, but it already hints that instances in the model suggested in Section 2 tend to be no more satisfiable than random -SAT instances. Note that the first part of the proposition is one of the initial results for random SAT [17] .
Proposition 3.1**.**
Let be a random instance in .
(a) Suppose that for each , , the clause length is at most . If , then is UNSAT w.h.p.
(b) Let and for some .
(i) If , then is UNSAT w.h.p.
(ii) If and , then is SAT w.h.p.
Our next result points out a significant difference between random instances and community-structured ones. One might expect the threshold to be different for community-structured instances, but it turns out that this difference may be not just quantitative. The following result shows that, surprisingly, under certain conditions the satisfiability threshold is [math]. To this end, we will consider as some function of , not necessarily , and write instead of .
For real functions and , we write if , and if . We also write if for some .
Theorem 3.2**.**
Let be a random instance in , where for some .
(a) Let .
(i) If (so that ) and , then is SAT w.h.p.
(ii) If , then is SAT with probability bounded away from . If, moreover, , then is SAT with probability bounded away from both [math] and .
(iii) If , then is UNSAT w.h.p.
(b) If and , then is UNSAT w.h.p.
(c) If and for some , then is UNSAT w.h.p.
(d) Let and . Then there exists some such that, if with , then is UNSAT w.h.p.
Remark 3.3**.**
(a) The in part (d) is effective. Namely, as will follow from the proof, one can present such an explicitly (in terms of the implicit constant in the equality ).
(b) Still in case (d), one can deal with the general case of arbitrary as long as the weight of in , namely the probability that a clause is of type , is sufficiently large.
In Theorem 3.2 there are four types of results for the asymptotic satisfiability of a random community-structured instance with variables, clauses, communities of size , and probability measure . Namely, either the probability of satisfiability (i) tends to [math] as , or (ii) it tends to , or (iii) it is bounded away from , or (iv) it is bounded away from both [math] and . These results are summarized in Table 1. In general, we assume that for some and . In the third column we place a ‘’ or a ‘’, depending on whether is required to be or is arbitrary, respectively. The notation indicates a probability bounded away from , and the notation indicates a probability bounded away both from [math] and .
The proof of Theorem 3.2 will use the following lemma.
Lemma 3.4**.**
Consider the spaces and , where . If a random instance in is UNSAT with probability bounded away from [math], then a random instance in is UNSAT w.h.p.
In the proof of Theorem 3.2 (and that of Theorem 3.6), we use some results regarding the classical “balls and bins” problem. In this problem, there are balls and bins. Each ball is placed uniformly randomly in one of the bins, independently of the other balls. One quantity of interest is the maximum load, which is the maximum number of balls in any bin. There are several papers studying the size of the maximum load, as well as generalizations of this problem. It seems that [37] contains all previous results. Our next result seem not to be covered by previous results regarding the balls and bins problem. It will be employed in the proof of Theorem 3.2, and is of independent interest.
Given a sequence of random variables and a probability law , we let denote the fact that converges to in distribution as . Denote by the Poisson distribution with parameter .
Theorem 3.5**.**
Consider the balls and bins problem with bins and balls, where . Let be an arbitrarily fixed integer.
(a) If , then the maximum load is at most w.h.p.
(b) If , then the maximum load is at least w.h.p.
(c) If , then the maximum load is either or w.h.p. Moreover, suppose
[TABLE]
and let be the number of bins that contain exactly balls. Then .
Theorem 3.5 will be proven in Appendix A.
As noted earlier, random -SAT is much better understood than random -SAT for general . This enables us to obtain a stronger result than Theorem 3.2 in the case .
Theorem 3.6**.**
Let be a random instance in .
(a) There exists an such that, if and , then is UNSAT w.h.p.
(b) For :
(i) If , where is as in , then is SAT with probability bounded away from both [math] and .
(ii) If then is UNSAT w.h.p.
(c) For with :
(i) If then is SAT w.h.p.
(ii) If then is UNSAT w.h.p.
(d) For :
(i) If then is SAT w.h.p.
(ii) If then is SAT with probability bounded away from both [math] and .
Remark 3.7**.**
As in Remark 3.3.(b), one can deal with the more general case of arbitrary , as long as one of the clause types is of the form and is of sufficiently large weight.
Similarly to Table 1, we summarize the results of Theorem 3.6 in Table 2. Here, we always assume , and . The notations are as in Table 1.
As we have seen, when clauses tend to be formed of variables in the same community, the instance tends to become unsatisfiable. One may wonder what happens in the opposite case, namely when variables tend to belong to distinct communities. Intuitively, this constraint should usually make little difference, as anyway few clauses may be expected to contain variables from the same community. However, if there are very few communities, this constraint is more significant. Specifically, consider the extreme case of . In this case, we disallow about half of the possible clauses of the classical model . Does it affect the satisfiability threshold? Namely, if when moving from the classical case to a case with most clauses from the same community, we tend to make the instance unsatisfiable, will the constraint of having in each clause variables from distinct communities tend to make it “more satisfiable"? The following theorem shows that it makes a very small difference if at all.
Theorem 3.8**.**
Let and arbitrary and fixed. The satisfiability threshold in the model
[TABLE]
is .
One may still ask whether the regular random model and the model display some difference in behaviour near the threshold, namely for . More precisely, recall that, by [9], for in some range of size around , the satisfiability probability for the random model is bounded away from both [math] and . (See (7) below for a more accurate formulation.) Do the two models behave in the same way for for fixed ?
We studied this question by a large simulation. We detail the experiment in Section . The results seem to indicate strongly that the two models behave in the same way also in the window .
4 Proofs
Proof of Proposition 3.1:
(a)
We follow the proof in the random model [17]. Fix a truth assignment and consider . Each clause has at most literals. The variables are negated with probability independently of each other, and hence each clause is satisfied with probability of at most , independently of the other clauses. The expected number of satisfying truth assignments is therefore at most . As , we have
[TABLE]
(Note that we have not used in this part the specific mechanism by which clauses are selected. The variables in each clause may be selected arbitrarily. As long as all clauses are of length at most , and the sign of each variable in each clause is selected uniformly randomly, and independently of all other variables in this clause and all the others, the conclusion holds.) Thus, is UNSAT w.h.p.
(b)
A random instance in decomposes into sub-instances , , where each is formed of those clauses consisting of variables solely from . Obviously, is SAT if and only if all -s are such. For , let if is satisfiable, and otherwise. The variable indicates whether is satisfiable. Let denote the number of clauses in . Since each of the clauses consists of variables from with probability , independently of all other clauses, is binomially distributed with parameters and :
[TABLE]
(i)
Suppose first that . Let denote the density of the sub-instance , . There exists an with , and therefore . It follows that is UNSAT w.h.p., and hence so is .
The case follows in particular from Theorem 3.2.(a).(iii) (to be proved below).
(ii)
In this case . Without loss of generality assume is fixed. For , let
[TABLE]
Let be an arbitrary fixed number, strictly between and . Denoting by the complement of an event , we have:
[TABLE]
By the weak law of large numbers for the binomial random variables ,
[TABLE]
and therefore
[TABLE]
Hence
[TABLE]
Now consider the second factor on the right-hand side of (3). Clearly, the more clauses any contains, the less likely it is to be satisfiable, and therefore
[TABLE]
As , each of the sub-instances is SAT w.h.p., so that each of the factors in the product on the right-hand side of (4) converges to 1 as . Since, by our assumption, is fixed, so does the whole product. Hence is SAT w.h.p.
∎
As mentioned in Section 3, the proofs of Theorem 3.2 and Theorem 3.6 make use of some results concerning the balls and bins problem. Let be the maximum load for balls and bins. By [37], for any ,
[TABLE]
w.h.p. for an appropriate constant .
Remark 4.1**.**
The constant , in the second part of (5), is the unique solution of the equation
[TABLE]
in (see [37, Lemma 3]). A routine calculation shows that , where the function is the unique non-negative function defined implicitly by the equation
[TABLE]
The function has been studied in [26, pp. 101–102], and in particular expressed as a power series in near [math].
The fact that is the reason that the threshold in Theorem 3.2.(d) is strictly less than . One can easily bound from below. In fact, write . Then
[TABLE]
and hence .
Proof of Theorem 3.2: We follow the notations used in the proof of Proposition 3.1. Recall that is the sub-instance formed of those clauses in consisting of variables solely from , and is the number of clauses in , . Denote
Note that, while we have not assumed that in parts (a).(iii), (b), and (c) of the theorem, we may make this assumption without loss of generality in these parts as well. In fact, suppose that any of these three parts has been proven for the case , and consider the general case. If the probability of in is , then w.h.p. there will be at least clauses of type . To see this, denote by the sub-instance of obtained by taking the clauses of type and by the number of clauses in . Clearly, is distributed binomially with parameters and . Employing Chernoff’s bound we obtain a lower bound of on w.h.p. It follows that has the same lower bound assumed on (namely, it is in part (a).(iii), it is in part (b), and it is in part (c)). As we have assumed the correctness of these parts for , the instance is UNSAT w.h.p., and hence certainly so is the original instance, which contains it. Thus, we may indeed assume in all parts that .
Since , each clause has all its literals from the same community. Hence, the selection of a clause corresponds to the selection of a community. Consider clauses as balls, and communities as bins. The process of selecting the clauses, as far as the community to which the variables in each clause belong, is analogous to that of placing balls in bins uniformly at random. The idea of the proof in parts (b)-(d) will be to prove that w.h.p. we have . This will imply that there is at least one sub-instance with density larger than . Thus, already is UNSAT w.h.p., and consequently so is .
(a)
Without loss of generality, assume that is fixed.
(i)
By Theorem 3.5.(a), there is no sub-instance with more than clauses w.h.p. Since instances with less than clauses are certainly satisfiable, all -s are SAT, and hence so is .
(ii)
Here, we may assume that for some constant . By Theorem 3.5.(c), the probability that there is an , , with at least clauses, is bounded away from [math]. Assume, say, that . Then, with probability at least
[TABLE]
all distinct clauses consisting of the variables have been drawn. As the instance is UNSAT if it contains all these clauses, the probability for our instance to be UNSAT is bounded away from [math]. Now, assume that , for some . Now, by Theorem 3.5.(c) there is no sub-instance with more than clauses with probability bounded away from [math]. Thus, similarly to part (i), is SAT with probability bounded away from [math].
(iii)
Follows from the previous part and Lemma 3.4.
(b)
In view of part (a).(iii), we may assume . We may also assume that for some . Clearly, . On the other hand,
[TABLE]
Thus, by (5), w.h.p., the maximum load is at least
[TABLE]
Now, there are variables in each community. By (6), w.h.p., the density of the sub-instance with the maximal number of clauses is at least
[TABLE]
Hence, this is UNSAT w.h.p., and therefore so is .
(c)
By (5), w.h.p., the number of clauses in the sub-instance with the maximal number of clauses is at least
[TABLE]
For a large enough
[TABLE]
As , for large enough we have Hence, for large enough we have
[TABLE]
This implies that the density of the sub-instance with the maximal number of clauses is at least
[TABLE]
and thus UNSAT w.h.p. Consequently, so is .
(d)
In view of the previous part, we may assume that for some . Choose such that
[TABLE]
where is as in (5). Let , and put . Thus, and . Let . We have
[TABLE]
By (5), the size of the largest sub-instance is w.h.p. Hence, w.h.p. the density of this sub-instance is
[TABLE]
Letting , we get our claim. ∎
Proof of Lemma 3.4: Denote the random instance in by . Denote the instance obtained from the first clauses of by , the instance obtained from the next clauses of by , , the instance obtained from the last clauses of by (with ). According to our assumption, there exists an such that
[TABLE]
Now, the events , are independent, and we clearly have
[TABLE]
Since :
[TABLE]
∎
In the proof of Theorem 3.6 we will use the following result from [9]. There exist some and such that the satisfiability probability of a random -SAT instance with clauses is
[TABLE]
Actually, in the sequel, we will encounter only the first two cases. Note that in the case with , we have
[TABLE]
for some constants .
We will also use an additional result regarding the balls and bins problem. Let be the maximum load for balls and bins. By [37], w.h.p.
[TABLE]
Given a sequence of random variables and a probability law we write if the sequence converges to in distribution.
Proof of Theorem 3.6: We follow the notations used in the proof of Theorem 3.2. Also, for , let
[TABLE]
and let and be analogously understood. More generally, for , let denote the set of -tuples with at least entries , , for which . (Thus,
Note that the set , defined in (2), may now be written in the form
[TABLE]
We use similar notations, for example , and , analogously.
(a) Let be sufficiently small positive numbers, to be determined later. Let be as in (7). We have
[TABLE]
Consider the first term on the right-hand side of (10). The event implies that , , for some . We note that, conditioned on the event , the events , , are independent. Also, for each with we have
[TABLE]
Thus
[TABLE]
In view of Theorem 3.2.(a).(iii), we may assume that . By (8), for some
[TABLE]
As
[TABLE]
[TABLE]
Now we claim that the event in the second term on the right-hand side of (10) is empty. In fact, the event means that all sub-instances are of density less than , and the event means that most of them are of density at most Since the overall density is , the two events do not meet for sufficiently small . Thus
[TABLE]
We turn to the last term on the right-hand side of (10). The condition implies that there is at least one such that the density of is at least . Since the threshold of -SAT is , this is UNSAT w.h.p., and in particular is such. Hence:
[TABLE]
By (10), (13), (14) and (15), is UNSAT w.h.p.
(b) In this part we employ the approach of part (a) with minor changes. We may assume for some .
(i) Consider (10). In the first term on the right-hand side, as for some , by (12) we have
[TABLE]
(14) and (15) still hold in this case. Thus, by (10), (14), (15) and (16),
[TABLE]
In the other direction, let be strictly between and . Similarly to (10),
[TABLE]
First, consider the second factor on the right-hand side of (17). Given that has occurred, for some the event has occurred. Similarly to (11),
[TABLE]
By (8), for some
[TABLE]
Now consider the first factor on the right-hand side of (17). By (9), w.h.p. the number of clauses in the sub-instance with the maximal number of clauses is bounded above by
[TABLE]
Thus the density of all -s is bounded by
[TABLE]
namely
[TABLE]
[TABLE]
Therefore, is SAT with probability bounded away from both [math] and .
(ii) Similarly to (10), and with to be determined later,
[TABLE]
Consider the first addend on the right-hand side of (19). Similarly to (11),
[TABLE]
By (7), for some and
[TABLE]
Consider the second addend on the right-hand side of (19). Define the variables , , as follows: if the -th clause consists of variables from the first community, and otherwise. Thus, . The variables are independent, for and
[TABLE]
Thus, by a version of the Central Limit Theorem [27, Corollary 2.7.1]
[TABLE]
Clearly, . Thus, for large we have
[TABLE]
(We mention in passing that, in fact, we do not need the Central Limit Theorem for our purpose. By [21, Theorem 1], as and
[TABLE]
This inequality is weaker than (21), but suffices for the proof.)
Define the variables
[TABLE]
The -s are -distributed, where . Let . By (21)
[TABLE]
Consider the proportion of sub-instances with at least clauses. We want to find a such that By [15, Lemma 2], the variables are negatively correlated, and hence
[TABLE]
By the one-sided Chebyshev inequality for any
[TABLE]
Thus
[TABLE]
Therefore for w.h.p. . Thus
[TABLE]
(c)
(i) Let . Similarly to (10)
[TABLE]
Consider the first term on the right-hand side of (23). Similarly to the proofs of the previous parts, given that the event has occurred, the density of each sub-instance is less than , and thus,
[TABLE]
Employing the union bound
[TABLE]
By (8), as , for some
[TABLE]
[TABLE]
By (26), (18) and (23), is SAT w.h.p
(ii) Start from (19). Consider the first term on the right-hand side of (19). As , similarly to (20) we have
[TABLE]
By (22), for sufficiently small , the second term on the right-hand side of (19) will vanish.Thus, is UNSAT w.h.p.
(d)
(i) In part (c).(i) we only used the fact that , so that the proof there applies here as well.
(ii) In this case we may assume that is fixed. By (9), the density of the sub-instance with the maximal number of clauses is bounded above by
[TABLE]
Thus, denoting , we have
[TABLE]
By (17),
[TABLE]
By (7), and similarly to (10), for some ,
[TABLE]
Thus, by (27)-(28), is SAT with probability bounded away from [math].
In the other direction, there is at least one sub-instance with density at least . Without loss of generality assume that the density of the first sub-instance is at least and thus, for the same as above
[TABLE]
Therefore,
[TABLE]
Thus, is SAT with probability bounded away from both [math] and .
∎
The proof of Theorem 3.8 follows Chvátal and Reed [10]. The case follows from Proposition 3.1. We will thus assume that .
We first recall two definitions and their relevance to the satisfiability/unsatisfiability of an instance.
Definition 4.2**.**
[10]** A bicycle is a formula that consists of at least two distinct variables and clauses with the following structure: there are literals such that each is either or , we have for all , , and where .
Chvátal and Reed [10] proved that every unsatisfiable formula contains a bicycle.
Definition 4.3**.**
[10]** A snake is a sequence of distinct literals such that no is the complement of another.
Chvátal and Reed [10] proved that, for a snake consisting of the literals , the formula , consisting of the clauses for all with , is unsatisfiable.
Proof of Theorem 3.8: Suppose that
[TABLE]
We have to show that for our formula is satisfiable w.h.p., while for it is unsatisfiable w.h.p.
First suppose that . Let be the probability that our formula contains a bicycle. We will derive an upper bound for . To derive this upper bound, we will add up the probabilities of our formula containing each specific bicycle. Thus, first take some specific bicycle, consisting of clauses as in Definition 4.2 for some . Also, suppose that exactly out of the clauses consist of two variables from the same community. There are at most choices as to which of the clauses will make up the clauses in our formula. The probability of a clause in the formula to be some specific clause, with both variables in the same community, in our bicycle is
[TABLE]
whereas if the variables are in different communities then this probability is
[TABLE]
Then the probability that our formula will contain this specific bicycle is bounded above by
[TABLE]
Now we count the number of possible bicycles. Suppose we restrict ourselves to bicycles such that exactly clauses out of as defined above each consists of two variables from the same community. There are ways to choose these clauses. Then if we pick the literals for the bicycle one at a time, we have choices for the first literal since we have Boolean variables in total. If is supposed to contain both variables from the same community, then there are choices for the second literal. On the other hand, if is supposed to contain variables from different communities, then there are choices for the second literal. Continuing in this way, we see that there are less than choices for the literals in the bicycle. Also, there are at most choices for and . Hence, assuming , we have
[TABLE]
By a geometric series argument, the sum above is finite, and so . Thus, the satisfiability threshold is at least .
Now suppose . We will also assume that . (If , the proof is actually simpler.) For each , choose a in such a way that
[TABLE]
Let . We will show that our formula contains a formula of a snake consisting of literals w.h.p. Thus, our formula will be unsatisfiable w.h.p. We use the second moment method. Let , where if our formula contains each clause of exactly once, and otherwise. We will prove that
[TABLE]
from which the desired result may be deduced using Chebyshev’s inequality. Consider an arbitrary snake . Suppose that contains exactly clauses each consisting of a pair of variables from different communities and exactly clauses each consisting of variables from the same community. We have , where
[TABLE]
Take two snakes and , where contains exactly clauses with variables from different communities and exactly clauses with variables from the same community, and contains exactly clauses with variables from different communities and exactly clauses with variables from the same community. Also, suppose and share precisely clauses with variables in different communities and precisely clauses with variables from the same community. Then . Since , we have
[TABLE]
uniformly in both cases if we assume that where .
Now let us count the snakes such that contains exactly clauses with variables from different communities and exactly clauses with variables from the same community. We denote the set of all such snakes as . First, we may view as a directed graph with vertices (where each is the variable such that is or ) and edges , , with . This directed graph consists of two direct cycle graphs, each consisting of vertices and having exactly one vertex in common (the vertex ). Each edge corresponds to a clause in . Consider the edges corresponding to the clauses with variables in different communities. Let and be the number of such edges in each of the two cycle graphs that make up the whole graph. We can see that . For for the cycle with the edges, there will be ways to choose these edges. These edges will then partition the set of vertices into groups, where the variables corresponding to the vertices in a group will belong to the same community. Thus, the number of ways of choosing the community of each of the variables corresponding to the vertices in this cycle graph is the chromatic number of the cycle graph consisting of vertices in colours or . After choosing all of these communities for each cycle graph, we are left with choosing the variables from these communities, and there will be at least choices per vertex after making the choice for the variable. Putting it altogether, the number of such snakes will be bounded below by
[TABLE]
and bounded above by
[TABLE]
By (29), the latter is asymptotic to the actual number of such snakes as . By (31), we thus have:
[TABLE]
By (29), we have
[TABLE]
uniformly in the range . In particular, if and have no clauses in common, then . Thus, to prove (30), our main concern will be when and have clauses in common. To deal with this case, we will derive an upper bound for
[TABLE]
(where denotes the set of common clauses of and ) for each . First consider how we can construct two snakes and such that and have clauses in common and account for its contribution to the above sum. Viewing and as graphs as above, we let be their intersection, with isolated vertices removed. Suppose that contains edges and vertices. To construct the possible snakes and , we create a procedure with five steps:
-
Choose terms of for membership in .
-
Assign variables to these terms.
-
Choose which positions in the snake will be filled with terms in .
-
Assign variables to the positions in picked out in step 2).
-
Assign variables to all other positions in and .
For 1), we can select our terms of as follows. We first decide if the edge is in or not, and then, for each , we place a marker at if exactly one of and is in . The total number of markers will be between and , and so the total number of choices for the terms is at most . Thus, the total number of choices for 3) will also be at most . Also, we have at most choices for step 4), where is the number of components in .
For step 2), if we impose the restriction that edges among the vertices correspond to the clauses with variables in different communities, then the number of ways to assign such variables is . As well, for step 5), if we impose the restrictions that, of the remaining clauses in , there are exactly with variables in different communities, and that of the remaining clauses in there are exactly clauses with variables in different communities, then the number of ways to assign such variables is bounded above by .
First suppose that . Then none of the components of may contain loops, so that . Putting it all together, weighing all of the possible pairs of snakes and , appropriately using (29), we obtain
[TABLE]
for sufficiently large . Thus by (32) we have for sufficiently large n
[TABLE]
Now suppose that . We have two possibilities for the components of . Either none of them contains loops or exactly one of them contains a loop and the number of loops in this component is exactly or , where the possible loops are and . In either case we have . Thus,
[TABLE]
By (32), for sufficiently large
[TABLE]
Thus
[TABLE]
from which we can deduce (30).
∎
5 Empirical results
To test the question posed after Theorem 3.8, we have conducted the following experiment. We have taken , and with . (This non-symmetric range was due to preliminary simulations, that showed that the interesting window is actually centered somewhat above . For each such , we generated random instances from and (which is just the random model), tested each instance using the SAT solver SAT4J, described in [8], and calculated the percentage of satisfiable instances in each group. To complete the picture, we did the same for the model .
The results are presented in Table 3. The first two models show remarkably similar results. Unsurprisingly, the third model leads to lower satisfiability probabilities.
6 Conclusions
We have dealt with the satisfiability threshold of a particular model of SAT. This model highlights one of the features in which so-called community-structured SAT instances differ from classical SAT instances. Namely, the set of variables decompose into several disjoint subsets-communities. The significance of these communities stems from the fact that clauses tend to contain variables from the same community. We have shown, roughly speaking, that the satisfiability threshold of such instances tends to be lower than for regular instances. Moreover, if the communities are very small, the threshold may even vanish.
The paper leaves a lot to study for industrial SAT instances. To begin with, there are other features considered in the literature as being characteristic of industrial instances. For example, in the scale-free structure, the variables are selected by some heavy-tailed distribution. Moreover, even regarding the issue of communities, there is more to be done. We assumed here that all communities are of the same size. Obviously, there is no justification for this assumption beyond the fact that it simplifies significantly the analysis of the model. What can be said about the threshold if there are both small and large communities? Even prior to that, what would be reasonable to assume regarding the probability of a variable to be selected from each of the communities?
Acknowledgments
We would like to express our gratitude to David Wilson for helpful information regarding the topic of this paper, and to the two anonymous referees for their comments on the first version of the paper.
Appendix A Proof of Theorem 3.5
In the proof of Theorem 3.5 we shall use the following lemma, which is analogous to Lemma 3.4.
Lemma A.1**.**
Consider the balls and bins problem with bins and balls, and also with bins and balls, where . If the maximum load for balls is at least with probability bounded away from [math], then the maximum load for is at least w.h.p.
Proof: Assume we part the balls into disjoint batches of balls each. Suppose we toss the balls in each batch into the bins separately, and check the maximum load for each batch. Let be the maximum load for batch , . According to our assumption, there exists an such that
[TABLE]
Let be the maximum load in the case we place all the balls into the bins. The events , are independent, and we clearly have
[TABLE]
Hence:
[TABLE]
∎
The next proof will make use of the notion of negative association of random variables [15]: Denote for . Random variables are negatively associated if for every two index sets , with ,
[TABLE]
for every two functions and , which are both non-decreasing or both non-increasing.
In the proof of Theorem 3.5, we will make use of the following result, concerning the balls and bins problem. Let denote the number of balls placed in the -th bin, . Let be non-decreasing functions, . By [15, Lemma 2], the variables are negatively associated, and in particular the -s are negatively correlated.
Proof of Theorem 3.5: Let be as above. We clearly have
[TABLE]
Define the variables
[TABLE]
The -s are -distributed, where . Let .
(a) We use the first moment method. Obviously:
[TABLE]
Let us index the balls from to , and let if the -th ball entered the first bin and otherwise, . Thus, the -s are -distributed. Let denote the set of subsets of size of . By the union bound and symmetry:
[TABLE]
Since ,
[TABLE]
Thus, w.h.p. the maximum load does not exceed .
(b) We employ the second moment method. First, if , then there must be at least one bin with at least balls in it. Thus we may assume that . We have
[TABLE]
For sufficiently large we have , and therefore
[TABLE]
Thus we have
[TABLE]
By [15], the variables are negatively associated. Since each is a non-decreasing function of , this yields for . Hence:
[TABLE]
As , the Paley–Zygmund inequality [36] yields
[TABLE]
By (33), we have . Also, by (33) and (34), we have
[TABLE]
and so . Thus, w.h.p. the maximum load is at least .
(c) The first statement follows from parts (a) and (b), applied with and , respectively, instead of . For the convergence part, suppose (1) holds. Observe that there are possible ways to distribute the balls into the bins. Obviously, . Let . We will prove that
[TABLE]
Specify bins, say out of the bins. The number of balls in bins , and all of the other bins combined forms a multinomial distribution. It follows that
[TABLE]
As , we thus have
[TABLE]
From (1), we have
[TABLE]
and so (35) holds. The desired result follows from Brun’s sieve, which is stated in Theorem of [3].
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Achlioptas and Peres, [2004] Dimitris Achlioptas and Yuval Peres, “The threshold for random k 𝑘 k -SAT is 2 k ln 2 − O ( k ) superscript 2 𝑘 2 𝑂 𝑘 2^{k}\ln 2-O(k) ”, Journal of the American Mathematical Society 17.4 (2004), 947–973.
- 2Adams and Guillemin, [1996] Malcolm Adams, and Victor Guillemin, “Measure theory and probability”, Corrected reprint of the 1986 original, Birkhäuser Boston, Inc., Boston, MA, (1996).
- 3Alon and Lubetzky, [2009] Noga Alon and Eyal Lubetzky, “Poisson approximation for non-backtracking random walks”, Israel Journal of Mathematics 174.1 (2009), 227–252.
- 4Ansótegui et al., [2015] Carlos Ansótegui, Maria L. Bonet, Jesús Giráldez-Cru, and Jordi Levy, “On the Classification of Industrial SAT Families”, Artificial Intelligence Research and Development - Proc. of the 18th International Conference of the Catalan Association for Artificial Intelligence , Valencia, Catalonia, Spain, October 21–23 (2015), 163–172.
- 5[5] Carlos Ansótegui, María L. Bonet, and Jordi Levy, “Towards industrial-like random SAT instances”, Proc. of the 21st International Joint Conference on Artificial Intelligence, IJCAI 2009 (2009).
- 6[6] Carlos Ansótegui, María L. Bonet, and Jordi Levy, “On the Structure of Industrial SAT Instances”, Principles and Practice of Constraint Programming - CP 2009 , Lecture Notes in Comput. Sci., 5732, I. P. Gent, Ed., Springer Berlin Heidelberg (2009), 127–141.
- 7Ansótegui et al., [2012] Carlos Ansótegui, Jesús Giráldez-Cru, and Jordi Levy, “The community structure of SAT formulas”, Proc. of Theory and Applications of Satisfiability Testing–SAT 2012: 15th International Conference , Trento, Italy, June 17–20, 2012, A. Cimatti, R. Sebastiani, Eds., 410–423.
- 8[8] Daniel Le Berre and Anne Parrain, “The Sat 4j library, release 2.2, system description”, Journal on Satisfiability, Boolean Modeling and Computation 7 (2010), 59–64.
