The Power of the Combined Basic LP and Affine Relaxation for Promise   CSPs

Joshua Brakensiek; Venkatesan Guruswami; Marcin Wrochna; and Stanislav; \v{Z}ivn\'y

arXiv:1907.04383·cs.DS·December 3, 2020

The Power of the Combined Basic LP and Affine Relaxation for Promise CSPs

Joshua Brakensiek, Venkatesan Guruswami, Marcin Wrochna, and Stanislav, \v{Z}ivn\'y

PDF

Open Access

TL;DR

This paper introduces a polynomial-time algorithm for promise CSPs that admit infinitely many symmetric polymorphisms, extending previous work and unifying solutions for Boolean CSPs by leveraging symmetry properties.

Contribution

The authors develop a new algorithm that solves promise CSPs with symmetric polymorphisms, generalizing prior results and providing a complete characterization of its applicability.

Findings

01

The algorithm solves all promise CSPs with infinitely many symmetric polymorphisms.

02

It extends to block-symmetric polymorphisms, broadening its scope.

03

Block symmetric polymorphisms are both sufficient and necessary for the algorithm's success.

Abstract

In the field of constraint satisfaction problems (CSP), promise CSPs are an exciting new direction of study. In a promise CSP, each constraint comes in two forms: "strict" and "weak," and in the associated decision problem one must distinguish between being able to satisfy all the strict constraints versus not being able to satisfy all the weak constraints. The most commonly cited example of a promise CSP is the approximate graph coloring problem--which has recently seen exciting progress [BKO19, WZ20] benefiting from a systematic algebraic approach to promise CSPs based on "polymorphisms," operations that map tuples in the strict form of each constraint to tuples in the corresponding weak form. In this work, we present a simple algorithm which in polynomial time solves the decision problem for all promise CSPs that admit infinitely many symmetric polymorphisms, which are invariant…

Equations65

w_{i} (a)

w_{i} (a)

p_{j} (y)

a \in A \sum w_{i} (a)

y \in R_{j}^{A} \sum p_{j} (y)

y \in R_{j}^{A} y ∣_{i} = a \sum p_{j} (y)

a \in A \sum r_{i} (a)

a \in A \sum r_{i} (a)

y \in R_{j}^{A} \sum q_{j} (y)

y \in R_{j}^{A} y ∣_{i} = a \sum q_{j} (y)

w_{i} (a) = 0

w_{i} (a) = 0

p_{j} (y) = 0

W_{i} (a) := u ℓ w_{i} (a) + v r_{i} (a) .

W_{i} (a) := u ℓ w_{i} (a) + v r_{i} (a) .

a \in A \sum W_{i} (a) = a \in A \sum u ℓ w_{i} (a) + v r_{i} (a) = u ℓ + v = L .

a \in A \sum W_{i} (a) = a \in A \sum u ℓ w_{i} (a) + v r_{i} (a) = u ℓ + v = L .

W_{i} (a) \geq u ℓ (1/ ℓ) + v (- M) \geq M ℓ - ℓ M = 0.

W_{i} (a) \geq u ℓ (1/ ℓ) + v (- M) \geq M ℓ - ℓ M = 0.

X_{i} := f (\dots, W_{i} (a) times \forall a \in A a, \dots, a, \dots)

X_{i} := f (\dots, W_{i} (a) times \forall a \in A a, \dots, a, \dots)

P_{j} (y) := u ℓ p_{j} (y) + v q_{j} (y) .

P_{j} (y) := u ℓ p_{j} (y) + v q_{j} (y) .

y \in R_{j}^{A} \sum P_{j} (y) = u ℓ y \in R_{j}^{A} \sum p_{j} (y) + v y \in R_{j}^{A} \sum q_{j} (y) = L .

y \in R_{j}^{A} \sum P_{j} (y) = u ℓ y \in R_{j}^{A} \sum p_{j} (y) + v y \in R_{j}^{A} \sum q_{j} (y) = L .

P_{j} (y) \geq u ℓ (1/ ℓ) + v (- M) \geq M ℓ - ℓ M = 0.

P_{j} (y) \geq u ℓ (1/ ℓ) + v (- M) \geq M ℓ - ℓ M = 0.

W_{i} (a)

W_{i} (a)

= u ℓ y \in R_{j}^{A} y ∣_{i} = a \sum p_{i} (y) + v y \in R_{j}^{A} y ∣_{i} = a \sum q_{i} (y)

= y \in R_{j}^{A} y ∣_{i} = a \sum P_{j} (y)

A T (x_{1}, \dots, x_{L}) = 1 [x_{1} - x_{2} + x_{3} - \dots \pm x_{L} \geq 1] .

A T (x_{1}, \dots, x_{L}) = 1 [x_{1} - x_{2} + x_{3} - \dots \pm x_{L} \geq 1] .

W_{b, i} (a) := u_{b} ℓ w_{i} (a) + v_{b} r_{i} (a) .

W_{b, i} (a) := u_{b} ℓ w_{i} (a) + v_{b} r_{i} (a) .

\sum_{a\in A}W_{b,i}(a)=\sum_{a\in A}\big{(}u_{b}\ell w_{i}(a)+v_{b}r_{i}(a)\big{)}=u_{b}\ell+v_{b}=L_{b}.

\sum_{a\in A}W_{b,i}(a)=\sum_{a\in A}\big{(}u_{b}\ell w_{i}(a)+v_{b}r_{i}(a)\big{)}=u_{b}\ell+v_{b}=L_{b}.

X_{i} := f (L_{1} total \dots, W_{1, i} (a) times a, \dots, a, \dots, \dots, L_{κ} total \dots, W_{κ, i} (a) times a, \dots, a, \dots)

X_{i} := f (L_{1} total \dots, W_{1, i} (a) times a, \dots, a, \dots, \dots, L_{κ} total \dots, W_{κ, i} (a) times a, \dots, a, \dots)

P_{b, j} (y) := u_{b} ℓ p_{j} (y) + v_{b} q_{j} (y) .

P_{b, j} (y) := u_{b} ℓ p_{j} (y) + v_{b} q_{j} (y) .

y \in R_{j}^{A} \sum P_{b, j} (y) = u_{b} ℓ y \in R_{j}^{A} \sum p_{j} (y) + v_{b} y \in R_{j}^{A} \sum q_{j} (y) = L_{b} .

y \in R_{j}^{A} \sum P_{b, j} (y) = u_{b} ℓ y \in R_{j}^{A} \sum p_{j} (y) + v_{b} y \in R_{j}^{A} \sum q_{j} (y) = L_{b} .

W_{b, i} (a)

W_{b, i} (a)

= y \in R_{j}^{A} y ∣_{i} = a \sum P_{b, j} (y)

g (x_{1}, \dots, x_{L^{'}}) := f (x_{π (1)}, \dots, x_{π (L)}) .

g (x_{1}, \dots, x_{L^{'}}) := f (x_{π (1)}, \dots, x_{π (L)}) .

Q_{co n v}^{(L)} = {w : [L] \to Q_{\geq 0} ∣ \sum_{i \in [L]} w (i) = 1},

Q_{co n v}^{(L)} = {w : [L] \to Q_{\geq 0} ∣ \sum_{i \in [L]} w (i) = 1},

w_{/ π} (i) := w (π^{- 1} (i)) = \sum_{j \in π^{- 1} (i)} w (j) for i \in [L^{'}] .

w_{/ π} (i) := w (π^{- 1} (i)) = \sum_{j \in π^{- 1} (i)} w (j) for i \in [L^{'}] .

\displaystyle\mathcal{M}_{\mathrm{BLP+Aff}}^{(L)}:=\{(w,r)\mid\

\displaystyle\mathcal{M}_{\mathrm{BLP+Aff}}^{(L)}:=\{(w,r)\mid\

r : [L] \to Z,

\forall_{i \in [L]} w (i) = 0 ⟹ r (i) = 0

w^{'} (i)

w^{'} (i)

r^{'} (i)

X_{i} := f (\dots, W_{i} (a) times a, \dots, a, \dots)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsService-Oriented Architecture and Web Services · Business Process Modeling and Analysis · Scheduling and Optimization Algorithms

Full text

The Power of the Combined Basic LP and Affine

Relaxation for Promise CSPs††thanks: An extended abstract of part of this work (by the first two authors) appeared in the Proceedings of the 31st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’20) [BG20].

Joshua Brakensiek Stanford University, Stanford, CA 94305, USA. Email: [email protected]. Research supported in part by an REU supplement to NSF CCF-1526092 and a NSF Graduate Research Fellowship.

Venkatesan Guruswami Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213. Email: [email protected]. Research supported in part by NSF grants CCF-1814603 and CCF-1908125.

Marcin Wrochna University of Oxford, UK. Email: [email protected]. Research supported by funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 714532).

Stanislav Živný University of Oxford, UK. Email: [email protected]. Research supported by a Royal Society University Research Fellowship and by funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 714532).

Abstract

In the field of constraint satisfaction problems (CSP), promise CSPs are an exciting new direction of study. In a promise CSP, each constraint comes in two forms: “strict” and “weak,” and in the associated decision problem one must distinguish between being able to satisfy all the strict constraints versus not being able to satisfy all the weak constraints. The most commonly cited example of a promise CSP is the approximate graph coloring problem—which has recently seen exciting progress [BKO19, WŽ20] benefiting from a systematic algebraic approach to promise CSPs based on “polymorphisms,” operations that map tuples in the strict form of each constraint to tuples in the corresponding weak form.

In this work, we present a simple algorithm which in polynomial time solves the decision problem for all promise CSPs that admit infinitely many symmetric polymorphisms, which are invariant under arbitrary coordinate permutations. This generalizes previous work of the first two authors [BG19]. We also extend this algorithm to a more general class of block-symmetric polymorphisms. As a corollary, this single algorithm solves all polynomial-time tractable Boolean CSPs simultaneously. These results give a new perspective on Schaefer’s classic dichotomy theorem and shed further light on how symmetries of polymorphisms enable algorithms. Finally, we show that block symmetric polymorphisms are not only sufficient but also necessary for this algorithm to work, thus establishing its precise power.

1 Introduction

A central challenge in the theory of algorithms is to understand the mathematical structure (or lack thereof) that governs the efficient tractability (or intractability) of a computational problem. For the class of constraint satisfaction problems (CSP), a rich algebraic theory culminating in the recent resolution of the Feder-Vardi dichotomy conjecture [FV98] in [Bul17, Zhu17] has established a striking link between problem structure and its tractability. In particular, a CSP is efficiently solvable if and only if its defining relations admit an “interesting” polymorphism. Informally, a polymorphism is a function whose component-wise action preserves membership in the relations defining the CSP, and “interesting” means that the function obeys some non-trivial identities. As an example, for the (efficiently solvable) CSP corresponding to linear equations over a ring $R$ , the $3$ -ary function $f(x,y,z)=x-y+z$ is a polymorphism (capturing the fact that if $v_{1},v_{2},v_{3}$ are solutions to a linear system, then so is $v_{1}-v_{2}+v_{3}$ ), and it obeys the so-called Mal’tsev identity $f(x,y,y)=f(y,y,x)=x$ for all $x,y\in R$ . Indeed, generalizing Gaussian elimination, any CSP with such a Mal’tsev polymorphism is efficiently tractable [Bul02, BD06].

Recently, an exciting new direction of study has emerged in the rich backdrop of the complexity dichotomy for CSPs. This concerns a vast generalization of the CSP framework to the class of promise constraint satisfaction problems (PCSP). In a promise CSP, each constraint comes in two forms: “strict” and “weak.” Given an instance of a PCSP, one must distinguish between being able to satisfy all the strict constraints versus not being able to satisfy all the weak constraints. (This is the decision version; in the search version, given an instance with a promised assignment satisfying the strong form of the constraints, one seeks an assignment satisfying the weak form of the constraints.) A prime example of a PCSP is the approximate graph coloring problem, where one seeks to color a graph using more colors than its chromatic number.

The formal study of promise CSPs originated in [AGH17] who classified the complexity of a PCSP called $(2+\epsilon)$ -SAT. They further defined an extension of polymorphisms to the promise setting and postulated that the structure of those polymorphisms might govern the complexity of a PCSP. (This extension of polymorphisms to the promise setting is quite natural, requiring that the operation map tuples obeying the strict form of a constraint to a tuple satisfying its weak form.) Building on the impetus of [AGH17], Brakensiek and Guruswami systematically studied PCSPs under the polymorphic lens and established promising links to the universal-algebraic framework developed for CSPs [BG18, BG19]. It emerged from these works that a rich enough family of polymorphisms leads to efficient algorithms, whereas severely limited polymorphisms are a prescription for hardness. However, unlike for CSPs, there is no sharp transition between these cases — the significant difficulty being that, unlike for CSPs, polymorphisms for PCSPs are not closed under composition and lack the rich algebraic structure of a clone (c.f., [BKW17]). This nascent algebraic theory for PCSPs was lifted to a more abstract level in [BKO19, BBKO19] and also led to concrete breakthroughs in approximate graph coloring/homomorphisms [BKO19, KO19, WŽ20]. In particular, while previous works [BG18, BG19] focused on the actual form of the polymorphisms, the results of [BKO19] reveal that it is not the polymorphisms themselves, but rather solely the identities they satisfy, that capture the complexity of the associated PCSP, extending a similar phenomenon known earlier for CSPs [BOP18].

This work concerns the theme of designing algorithms for PCSP based on a rich enough family of polymorphisms. Our main result is that the decision version of an arbitrary PCSP admitting an infinite family of symmetric polymorphisms — i.e., polymorphisms which are invariant under any permutation of inputs — is tractable (see Theorem 2). Our result also extends to the case of block-symmetric polymorphisms (see Theorem 3). That is, the coordinates can be partitioned into “blocks” such that the function is invariant under permutations within each block. Notably, in the block-symmetric case the algorithm is identical–only the analysis changes. Furthermore, the number of blocks is irrelevant, the only assumption we need is that the minimum block size can be made arbitrarily large. Our final result (Theorem 4) shows that block-symmetry is not only sufficient but also necessary for our algorithm to work. In fact, Theorem 4 also establishes that without loss of generality one can assume that there are only two blocks of symmetric coordinates.

Further our algorithm is very simple — it checks if the canonical linear programming (LP) relaxation of the PCSP is feasible, and if so, it further checks if a slight adaptation of a canonical affine relaxation is feasible. The algorithm outputs satisfiable if both these relaxations are feasible. The polymorphisms are not used in the algorithm itself and only enter the analysis. The analysis is short but subtle — if we had symmetric polymorphisms of all arities then it is known that the basic LP relaxation itself correctly decides satisfiability, as one can round the fractional solution to a satisfying assignment using the polymorphism after clearing denominators of the fractional solution [KOT*+*12, BKW17]. If polymorphisms only exist of certain arities (e.g., all odd majorities), then the LP alone doesn’t suffice (e.g., [KOT*+*12]). We solve a linear system over the integers corresponding to the affine relaxation which lets us adjust the LP solution to match the arity at which a polymorphism exists. As a subtle twist, the affine relaxation is not of the original PCSP, but rather a refinement of the CSP which results from throwing out assignments to constraints which were ruled out by the basic LP.

It should be pointed out that we only solve the decision version of the PCSP, and not the search version. Unlike CSPs, for promise CSP there is no known reduction from search to decision, even for special cases like approximate graph coloring. Our work might be indicative of the subtle relationship between the search and decision problems for promise CSPs.

We now compare our result here with the previous work [BG19] where an algorithm was given to solve (the search version of) any PCSP that admits an infinite family of structured symmetric polymorphisms. Examples of such structured families include threshold and threshold-periodic polymorphisms. The value of a threshold polymorphism (for a Boolean PCSP) depends on whether the fraction of $1$ s in the input belongs in a finite number of intervals. (A basic example consists of Majority functions of odd arities, which are polymorphisms for 2-SAT.) A threshold-periodic polymorphism can have a periodic behavior depending on which interval the Hamming weight belongs to — for example it can be Majority for relative weights in $(1/3,2/3)$ and parity outside this interval. More generally, one can generalize to the non-Boolean case, as well as for the block-symmetric case, via regional polymorphisms whose value depends on the geometric region in which the vector of frequencies of the inputs to the polymorphisms lies. Due to this geometric interpretation, [BG19] assumes a fixed number of blocks (corresponding to a fixed dimension), whereas our new algorithm and analysis is independent of the number of blocks. The algorithm was a combination of solving the LP relaxation (albeit over a special ring like $\mathbb{Z}[\sqrt{2}]$ rather than the rationals) and the affine relaxation over a large enough finite ring. The analysis relied on the special structure of the polymorphisms (beyond their full symmetry). In contrast, our result here is more general, and only requires the polymorphism to be a symmetric function — its exact specifics or structure do not matter. It is encouraging that our methodology is consistent with the algebraic result in [BKO19] that the symmetries possessed by the polymorphisms capture the complexity of the PCSP.

Our result and methods have significance even for normal (non-promise) CSPs. For instance, we get a single unified algorithm to solve all non-trivial tractable cases of Boolean CSPs in Schaefer’s classic dichotomy theorem [Sch78], namely 2-SAT, Horn-SAT (or its dual), and Mod-2 Linear Equations. The two main techniques to solve CSPs are local propagation based algorithms (which work for the so-called bounded-width CSPs [BK14, KOT*+*12], etc.) and Gaussian elimination (which is a global algorithm that works for linear equations). The major difficulty in proving the full CSP dichotomy was tackling the complicated ways in which these two very different algorithms might need to be interlaced to solve a general CSP. It is our hope that this work serves as an impetus toward the potential discovery of a more modular CSP algorithm that incorporates together linear programming or its extensions (like Sherali-Adams, or semidefinite programming) and linear equation solving. In this light, it is encouraging that full symmetry of the polymorphisms, which is indeed a strong assumption, is not the limit of our techniques, which also extend to the block-symmetric case.

To put this work in further context, except for [BG19] as mentioned previously, nearly all works in the PCSP literature [AGH17, BG18, FKOS19] focus primarily on the structure of the relations. In particular, [BG18, FKOS19] characterized the complexity of all Boolean symmetric relations (rather than symmetric polymorphisms) which encompass many of the known tractable cases of Boolean PCSP. As classified by [FKOS19], all the relevant tractable polymorphisms are either symmetric functions or one special case of block-symmetric known as alternative threshold (and variants). Thus, in the context of PCSPs, the algorithm in this paper supersedes these previous works. See Section 4 for further discussion.

2 Notation

We let any finite set $A$ or $B$ denote a domain. A relation is a subset $R\subseteq A^{k}$ for any positive integer $k$ ; we denote $\operatorname{ar}(R):=k$ . We define a signature $\tau$ to be a set of symbols such that each $R\in\tau$ has a positive integer arity $\operatorname{ar}(R)$ .

A relational structure with signature $\tau$ , denoted by $\mathbf{A}:=\{R^{\mathbf{A}}\subseteq A^{\operatorname{ar}(R)}:R\in\tau\}$ , is an indexed set of relations over $A$ . A homomorphism between structures $\mathbf{A}=\{R^{\mathbf{A}}:R\in\tau\}$ and $\mathbf{B}=\{R^{\mathbf{B}}:R\in\tau\}$ with the same signature is a map $\sigma:A\to B$ such that $\sigma(R^{\mathbf{A}})\subseteq R^{\mathbf{B}}$ for all $R\in\tau$ (where $\sigma$ is applied to a tuple component-wise).

Two relational structures for which there exists a homomorphism from the first to the second is called a promise template and is denoted as $(\mathbf{A},\mathbf{B})$ .

2.1 PCSP: Decision and Search

Consider a promise template $(\mathbf{A},\mathbf{B})$ with signature $\tau$ . An instance $\mathbf{X}$ of the promise constaint satisfaction problem $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ consists of a set of variables $X:=\{x_{1},\ldots,x_{n}\}$ , and a set of constraints $c_{1},\ldots,c_{m}$ , where $c_{j}:=(R_{j},\bar{x}^{j})$ , where $R_{j}$ is a symbol in $\tau$ and $\bar{x}^{j}$ is a tuple of arity $\operatorname{ar}(R_{j})$ . We say that $\mathbf{X}$ is satisfiable in $\mathbf{A}$ if one can assign to every variable $x_{i}$ ( $i\in[n]$ ) a value $\sigma(x_{i})$ in the domain $A$ so that for every constraint $c_{j}=(R_{j},\bar{x}^{j})$ ( $j\in[m]$ ), the tuple $\sigma(\bar{x}^{j})$ (with $\sigma$ applied component-wise) is in $R_{j}^{\mathbf{A}}$ . Equivalently, $\mathbf{X}$ can be described as a relational structure with domain $X$ and relations $R^{\mathbf{X}}=\{\bar{x}\in X^{\operatorname{ar}(R)}\colon\exists_{j\in[m]}\ c_{j}=(R,\bar{x})\}$ ; a satisfying assignment is then the same as a homomorphism from $\mathbf{X}$ to $\mathbf{A}$ . If $\mathbf{X}$ is satisfiable in $\mathbf{A}$ , then it is satisfiable in $\mathbf{B}$ (because the satisfying assignment can be composed with the homomorphism from $\mathbf{A}$ to $\mathbf{B}$ ).

We let $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ denote the decision problem of distinguishing instances satisfiable in $\mathbf{A}$ from those unsatisfiable in $\mathbf{B}$ (with the promise that the input instance falls into one of these two disjoint cases). We let $\operatorname{PCSP-Search}(\mathbf{A},\mathbf{B})$ denote the search problem of finding an explicit homomorphism from $\mathbf{X}$ to $\mathbf{B}$ , with the promise that a homomorphism from $\mathbf{X}$ to $\mathbf{A}$ exists.

2.2 Polymorphisms

A polymorphism of $(\mathbf{A},\mathbf{B})$ of arity $L\in\mathbb{N}$ is a map $f:A^{L}\to B$ such that for all $R\in\tau$ , $R^{\mathbf{B}}\supseteq f(R^{\mathbf{A}},\ldots,R^{\mathbf{A}})$ where we define the latter to be $\{(f(x^{(1)}_{1},\ldots,x^{(L)}_{1}),\ldots,f(x^{(1)}_{\operatorname{ar}(R)},\ldots,x^{(L)}_{\operatorname{ar}(R)})):x^{(1)},\ldots,x^{(L)}\in R^{\mathbf{A}}\}.$ In other words, consider any $A^{L\times\operatorname{ar}(R)}$ matrix $M$ , where each row is a satisfying assignment in $R^{\mathbf{A}}$ . Let $y\in B^{\operatorname{ar}(R)}$ be the result of applying $f$ to each column of $M$ . Then, $y\in R^{B}$ . We let $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ denote the set of polymorphisms of $(\mathbf{A},\mathbf{B})$ (of all arities).

A map $f:A^{L}\to B$ is said to be symmetric if for all $\pi\in S_{L}$ (the symmetric group on $L$ elements), $f(x_{1},\ldots,x_{L})=f(x_{\pi(1)},\ldots,x_{\pi(L)})$ .

2.3 Basic LP and Affine Relaxation

As is well-studied in the CSP literature (e.g., [RS09, TŽ17]), we consider the canonical linear programming relaxation of a CSP instance, often referred to as the “Basic LP” or “BLP.” For our CSP instance $\mathbf{X}$ , we represent the assignment $X\to A$ of a variable by a (rational) probability distribution of weights $\{w_{i}(a)\}_{a\in A}$ summing to $1$ . We also have a probability distribution over the satisfying assignments to each constraint, which we denote as $p_{j}({y})$ , where $j\in[m]$ is the index of the constraint and ${y}\in R_{j}^{\mathbf{A}}$ is the potential assignment. Finally, the marginal distribution of a variable $x_{i}$ in any constraint has to equal $w_{i}$ . Explicitly, the linear constraints are as follows.

[TABLE]

Here $y|_{i}=a$ denotes that setting $\bar{x}^{j}={y}$ sets $x_{i}=a$ (that is, if $x_{i}$ is the $k$ -th variable of the tuple $\bar{x}^{j}$ , then $a=y_{k}$ ). We let $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ denote the rational polytope of solutions. By a theorem of [GLS93] (c.f., [BG19]), we can efficiently find a relative interior point in this polytope. In particular, at such a point, each coordinate is positive if and only if it is positive at some point in the polytope.111For our specialized LP, we do not need such a hammer. We can instead solve the LP repeatedly, each time maximizing a different variable as the objective function–a similar idea appears in [BG18]. Averaging the results would then yield a solution such that each variable is positive if and only if it is positive in some LP solution.

In addition to the Basic LP, we also consider the affine relaxation of a Promise CSP. In essence we solve the same linear system, but instead of enforcing each variable to be a nonnegative rational, we enforce that it is an integer (possibly negative). This can be solved in polynomial time via [KB79] (see also [BG19] for a more detailed discussion of this approach). We let $r_{i}(a)\in\mathbb{Z}$ replace $w_{i}(a)$ for all $a\in A$ and $q_{i}({y})\in\mathbb{Z}$ replace $p_{i}({y})\in\mathbb{Q}$ for all ${y}\in R_{j}^{\mathbf{A}}$ . Explicitly,

[TABLE]

We let $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ denote the integral lattice of solutions.

3 BLP+Affine Algorithm and Analysis for Symmetric Polymorphisms

In the BLP+Affine algorithm, given an instance $\mathbf{X}$ of $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ , we seek to throw out any assignment to a constraint for which the LP determines to have weight [math]. That is, given a relative interior point $(w,p)$ of $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ , we refine $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ to $\operatorname{Aff}_{\mathbb{Z}}^{\prime}(\mathbf{X},\mathbf{A})$ by requiring $r_{i}(a)$ to be zero whenever $w_{i}(a)$ is, and requiring $q_{i}(y)$ to be zero whenever $p_{i}(y)$ is (by adding equations or just removing those variables from equations defining $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ ).

The algorithm is presented in Figure 1. Note it does not depend on $\mathbf{B}$ ; it is only relevant for the correctness proof.

Definition 1.

We say the BLP+Affine algorithm correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ if it accepts any instance $\mathbf{X}$ satisfiable in $\mathbf{A}$ and rejects any instance unsatisfiable in $\mathbf{B}$ .

As stated in the introduction, both the algorithm and the proof are structured similarly to those of [KOT*+*12] and [BG19]. Like in those works, the weights of the LP solution and affine relaxation are used to construct a list of assignments which are plugged into the relevant polymorphism. The novel contribution here is that a single argument can cover any infinite symmetric family of polymorphisms.

Theorem 2.

Let $(\mathbf{A},\mathbf{B})$ be a promise template (over any finite domain) such that $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ has symmetric polymorphisms of arbitrarily large arities. Then, the BLP+Affine algorithm correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ .

Proof.

If an instance $\mathbf{X}$ is satisfiable in $\mathbf{A}$ , then the Basic LP relaxation has a solution. The refinement $\operatorname{Aff}_{\mathbb{Z}}^{\prime}(\mathbf{X},\mathbf{A})$ includes every possible assignment which is in the support of some LP solution, including integral solutions. Thus it is non-empty and therefore the algorithm accepts.

Conversely, suppose the algorithm accepts, meaning both $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ and $\operatorname{Aff}_{\mathbb{Z}}^{\prime}(\mathbf{X},\mathbf{A})$ have solutions $(w,p)$ over $\mathbb{Q}_{\geq 0}$ and $(r,q)$ over $\mathbb{Z}$ . The latter is a solution of $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ such that

[TABLE]

We claim $\mathbf{X}$ is satisfiable in $\mathbf{B}$ . Among all the coordinates in the LP solution–the $w$ ’s and $p$ ’s–let $\ell$ be the least common denominator of these rational numbers. Let $M$ be the maximum absolute value of any integer which appears in the affine solution (both the variable weights $r$ and the constraint weights $q$ ). Let $f:A^{L}\to B$ be a symmetric polymorphism of arity $L{}\geq M\ell^{2}.$ Now write $L=u\ell+v$ where $u\in\mathbb{Z}_{\geq 0}$ and $v\in\{0,\ldots,\ell-1\}$ . Note that $u\geq M\ell$ .

For each $i\in[n]$ and $a\in A$ , let

[TABLE]

This is an integer by choice of $\ell$ . For a fixed $i\in[n]$ , note that by Eq. (3) and (6)

[TABLE]

Also, for fixed $i\in[n]$ and $a\in A$ , either $w_{i}(a)=0$ , which implies that $r_{i}(a)=0$ by the refinement, so $W_{i}(a)=0$ . Otherwise, $w_{i}(a)\geq 1/\ell$ , so

[TABLE]

That is, $W_{i}(a)$ for $a\in A$ are non-negative integers which sum to $L$ . We claim that the assignment

[TABLE]

to $x_{i}$ defines a satisfying assignment of $\mathbf{X}$ in $\mathbf{B}$ . (Since $f$ is symmetric, only the quantity of each $a\in A$ in the input matters.) To verify it is indeed satisfying, consider a constraint in $(R_{j},\bar{x}^{j})$ (with $j\in[m]$ ) and assume without loss of generality it is on variables $\bar{x}^{j}=(x_{1},\ldots,x_{k})$ . We claim $(X_{1},\dots,X_{k})\in R_{j}^{\mathbf{B}}$ .

For every valid assignment $y\in R^{\mathbf{A}}_{j}$ to that constraint in $\mathbf{A}$ , define

[TABLE]

By similar logic as before, these are non-negative integers that sum to 1. Indeed, by Eqs. (4) and (7),

[TABLE]

Moreover, either $p_{j}(y)=q_{j}(y)=0$ , implying $P_{j}(y)=0$ , or

[TABLE]

Further note that by Eqs. (5) and (8),

[TABLE]

For each $j\in[m]$ consider a matrix $M(j)\in A^{L\times k}$ , where exactly $P_{j}(y)$ of the rows are equal to $y$ . For all $i\in[k]$ and $a\in A$ , the number of times that $a$ appears in column $i$ is precisely $W_{i}(a)$ by Eq. (9). Thus, $f$ applied to the columns is precisely $(X_{1},\ldots,X_{k})$ . Since $f$ is a polymorphism, this implies $(X_{1},\ldots,X_{k})\in R_{j}^{\mathbf{B}}$ . This concludes the proof that assigning the value $X_{i}$ to each variable $x_{i}$ (for $i\in[n]$ ) satisfies $\mathbf{X}$ in $\mathbf{B}$ and hence that the algorithm is correct. ∎

Remark. Another algorithm which works is to solve $\operatorname{LP}_{\mathbb{Z}[\sqrt{2}]}(\mathbf{X},\mathbf{A})$ (that is the constrained variables are over non-negative elements of the ring $\mathbb{Z}[\sqrt{2}]$ ) using the algorithm from [BG19], instead of $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ . In this case, Steps 2 and 3 can be omitted. To sketch why this works, it suffices to justify why solving $\operatorname{LP}_{\mathbb{Z}[\sqrt{2}]}(\mathbf{X},\mathbf{A})$ also solves $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ and $\operatorname{Aff}_{\mathbb{Z}}^{\prime}(\mathbf{X},\mathbf{A})$ . For each assigned value of the form $a+b\sqrt{2}$ in a relative interior solution to $\operatorname{LP}_{\mathbb{Z}[\sqrt{2}]}(\mathbf{X},\mathbf{A})$ , consider changing this variable to $a+b\eta$ , where $\eta$ is a sufficiently good rational approximation of $\sqrt{2}$ . Such an assignment is in the relative interior of $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ as any inequality non-trivially involving $\eta$ , in particular (1), (2), is not tight due to $\sqrt{2}$ being irrational. To see why $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ is also satisfied, replace each assigned value of $a+b\sqrt{2}$ with $a$ . By inspection, this assignment (when changing $w_{i}$ ’s to $r_{i}$ ’s and $p_{j}$ ’s to $q_{j}$ ’s) satisfies $\operatorname{Aff}_{\mathbb{Z}}(\mathbf{X},\mathbf{A})$ . It also satisfies $\operatorname{Aff}_{\mathbb{Z}}^{\prime}(\mathbf{X},\mathbf{A})$ because $a+b\sqrt{2}=0$ with $a$ and $b$ integral implies $a=0$ .

4 Extension of Analysis to Block Symmetric Polymorphisms

We say that a map $f:A^{L}\to B$ . is block-symmetric if there exists a partition of the coordinates of $f$ into blocks $B_{1}\cup\cdots\cup B_{k}=[L]$ such that $f$ is permutation-invariant within each coordinate block $B_{i}$ . We define the width of $f$ to be the minimum size of any block.222Note that a function $f$ might have different partitions into symmetric blocks; we define the width to be the maximum width over all such partitions. In particular, every $f:A^{L}\to B$ is block-symmetric with width at least 1. Finding the exact width or an appropriate partition into blocks is non-trivial. However, we avoid computing or evaluating $f$ altogether by only considering decision problems; see Section 6 for a discussion of search problems. A natural example of a block symmetric polymorphism with nontrivial width is alternating threshold first studied in [BG18]

[TABLE]

In this case, the blocks are the odd and even coordinates. This polymorphism arises in the context of $\mathbf{A}$ corresponding to 1-in-3 SAT and $\mathbf{B}$ corresponding to NAE-SAT. Recent work shows that this PCSP, although tractable and simple to state, is not algebraically reducible (via so-called pp-constructions) to any tractable finite-domain CSP [BBKO19].

We now show an analogue of Theorem 2 for block-symmetric polymorphisms. Remarkably, the algorithm is identical to the one for ordinary symmetric polymorphisms and is independent of the number of blocks. In particular, it could be that the Promise CSP has finitely many polymorphisms for any particular number of blocks, yet has infinitely many block-symmetric polymorphisms of increasing width.

As discussed in [BG19, FKOS19], nearly all known tractable Boolean PCSPs have polymorphisms which are either symmetric (such as threshold functions) or block-symmetric (such as alternating threshold). Thus, except for those PCSPs which are “homomorphic relaxations”333A homomorphic relaxation of a $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ is another $\operatorname{PCSP}(\mathbf{C},\mathbf{D})$ such that $\mathbf{C}$ has a homomorphism to $\mathbf{A}$ and $\mathbf{B}$ to $\mathbf{D}$ . In this case, $\operatorname{PCSP}(\mathbf{C},\mathbf{D})$ trivially reduces to $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ . In general, if $(\mathbf{C},\mathbf{D})$ is a Boolean template that is a homomorphic relaxation of a tractable non-Boolean (P)CSP template, then this is the only algorithm we know for $\operatorname{PCSP}(\mathbf{C},\mathbf{D})$ . We leave as an open question finding an explicit Boolean PCSP which is a homomorphic relaxation of a non-Boolean CSP but not correctly solvable by our BLP+Affine algorithm. of a tractable (P)CSP (c.f., [BG19, BBKO19]), the algorithm presented here supersedes those works in the context of decision PCSP.

Theorem 3.

Let $(\mathbf{A},\mathbf{B})$ be a promise template (over any finite domain) such that $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ has block-symmetric polymorphisms of arbitrarily large width. Then, the BLP+Affine algorithm correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ .

Proof.

The proof proceeds much like that of Theorem 2. As before, we know that if $\mathbf{X}$ is satisfiable in $\mathbf{A}$ , then the algorithm rejects. We seek to show that if the algorithm accepts, then $\mathbf{X}$ is satisfiable in $\mathbf{B}$ .

Again, let $\ell$ be the least common denominator of all coordinates in the LP solution. Let $M$ be the maximum absolute value of any integer which appears in the affine solution. Let $f:A^{B_{1}\cup\cdots\cup B_{\kappa}}\to B$ be a block-symmetric polymorphism such that each block $B_{b}$ , with $b\in[\kappa]$ , has size at least $M\ell^{2}$ . Let $L_{b}=|B_{b}|$ . Similar to before, for all $b\in[\kappa]$ , write $L_{b}=u_{b}\ell+v_{b}$ where $u_{b}\in\mathbb{Z}_{\geq 0}$ and $v\in\{0,\ldots,\ell-1\}$ . Note that $u_{b}\geq M\ell$ .

We seek to show there exists a homomorphism from $\mathbf{X}$ to $\mathbf{B}$ . For each $b\in[\kappa]$ , $i\in[n]$ and $a\in A$ , let

[TABLE]

For a fixed $b\in[\kappa]$ and $i\in[n]$ , by similar logic to the proof of Theorem 2, we have that $W_{b,i}(a)$ are non-negative integers for all $a\in A$ and

[TABLE]

We now claim that the assignment

[TABLE]

to $x_{i}$ defines a satifying assigment of $\mathbf{X}$ in $\mathbf{B}$ . To verify this, consider a constraint in $(R_{j},\bar{x}^{j})$ (with $j\in[m]$ ) and assume without loss of generality it is on variables $\bar{x}^{j}=(x_{1},\ldots,x_{k})$ . We claim $(X_{1},\dots,X_{k})\in R_{j}^{\mathbf{B}}$ . For all $b\in[\kappa]$ and assignments $y\in R_{j}^{\mathbf{A}}$ define

[TABLE]

By similar logic as previously, $P_{b,j}(y)$ are non-negative integers and by Eqs. (4) and (7),

[TABLE]

Further note that by Eqs. (5) and (8) for $i\in[n],a\in A,$ and $j\in[m]$

[TABLE]

For each $j\in[m]$ consider a matrix $M(j)\in A^{L\times k}$ , where exactly $P_{b,j}(y)$ of the rows are equal to $y$ in the rows indexed by block $B_{b}$ . For all $i\in[k]$ and $a\in A$ , the number of times that $a$ appears in column $i$ and row-block $B_{b}$ is precisely $W_{b,i}(a)$ by Eq. (10). Thus, $f$ applied to the columns is precisely $(X_{1},\ldots,X_{k})$ . Since $f$ is a polymorphism, this implies $(X_{1},\ldots,X_{k})\in R_{j}^{\mathbf{B}}$ . This concludes the proof that the algorithm is correct. ∎

5 Characterizing the Algorithm’s Power

In this section, we characterize the power of the BLP+Affine algorithm from Figure 1 exactly. Recall, we denote the domains of relational structures $\mathbf{A},\mathbf{B},\mathbf{X}$ as $A,B,X$ .

Theorem 4.

Let $(\mathbf{A},\mathbf{B})$ be a promise template. The following are equivalent:

•

BLP+Affine algorithm correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ .

•

$\operatorname{Pol}(\mathbf{A},\mathbf{B})$ * has block-symmetric polymorphisms of arbitrarily high width.*

•

For every $L\in\mathbb{N}$ , $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ has a block-symmetric polymorphism of arity $2L+1$ with two symmetric blocks of variables of size $L$ and $L+1$ , respectively.

We need a few definitions and fundamental facts from [BKO19, BBKO19]. For an $L$ -ary function $f\colon A^{L}\to B$ and a function $\pi\colon[L]\to[L^{\prime}]$ , the minor of $f$ obtained from $\pi$ is the function $g\colon A^{L^{\prime}}\to B$ defined as

[TABLE]

We write $g=f_{/\pi}$ . Thus sets of polymorphisms $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ are equipped with an operation $(\cdot)_{/\pi}$ which maps $L$ -ary polymorphisms to ${L^{\prime}}$ -ary polymorphisms (for every $\pi\colon[L]\to[L^{\prime}]$ ). We consider such a structure more abstractly, allowing any objects to play the role of polymorphisms:

Definition 5.

A minion $\mathcal{M}$ consists of sets $\mathcal{M}^{(L)}$ for $L\in\mathbb{N}$ and functions $(\cdot)_{/\pi}\colon\mathcal{M}^{(L)}\to\mathcal{M}^{(L^{\prime})}$ for all functions $\pi\colon[L]\to[L^{\prime}]$ , such that compositions agree: $(f_{/\pi})_{/\tau}=f_{/\tau\circ\pi}$ for $\pi\colon[L]\to[L^{\prime}]$ , $\tau\colon[L^{\prime}]\to[L^{\prime\prime}]$ , and $f_{/\operatorname{id}}=f$ . We write $\mathcal{M}$ for the disjoint union of $\mathcal{M}^{(L)}$ , $L\in\mathbb{N}$ , and $\operatorname{ar}(f)=L$ for $f\in\mathcal{M}^{(L)}$ . A minion homomorphism $\xi\colon\mathcal{M}\to\mathcal{N}$ is a function which preserves arity and minors: $\operatorname{ar}(\xi(f))=\operatorname{ar}(f)$ and $\xi(f_{/\pi})=\xi(f)_{/\pi}$ for all functions $\pi\colon[L]\to[L^{\prime}]$ .

Note that the objects in a minion do not have to be functions, and the set $\mathcal{M}^{(L)}$ does not have to be finite, though this is true for minions $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ with finite $\mathbf{A},\mathbf{B}$ . Similarly the operations $(\cdot)_{/\pi}$ are not necessarily defined by Eq. (11), though this will always be the case when elements of a minion $f\in\mathcal{M}^{(L)}$ are $L$ -ary function. As an important example, consider the minion $\mathcal{Q}_{conv}$ of convex combination functions, i.e. functions $\mathbb{Q}^{L}\to\mathbb{Q}$ of the form $w_{1}x_{1}+\dots+w_{L}x_{L}$ for $\sum_{1}^{L}w_{i}=1$ , $w_{i}\in\mathbb{Q}_{\geq 0}$ , with $(\cdot)_{/\pi}$ defined by Eq. (11). We can describe the same minion more concisely by identifying a convex $L$ -ary function with its $L$ -tuple of coefficients $(w_{1},\dots,w_{L})$ . That is, the “ $L$ -ary objects” of the minion $\mathcal{Q}_{conv}$ can be equivalently defined as distributions on $[L]$ :

[TABLE]

and for $\pi\colon[L]\to[L^{\prime}]$ and $w\in\mathcal{Q}_{conv}^{(L)}$ one can define $w_{/\pi}$ as

[TABLE]

This minion characterizes the power of the basic linear programming relaxation in the sense that BLP correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ (i.e. feasibility of $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ implies $\mathbf{X}$ is satisfiable in $\mathbf{B}$ for all instances $\mathbf{X}$ ) if and only if $\mathcal{Q}_{conv}$ admits a minion homomorphism to $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ . This was shown by Barto et al. [BBKO19, Theorem 7.9]. Our proof straightforwardly extends this part of the argument.

We first define the minion that plays the role of $\mathcal{Q}_{conv}$ for the BLP+Affine relaxation. It assigns two coefficients to every coordinate $i\in[L]$ .

Definition 6.

The minion $\mathcal{M}_{\mathrm{BLP+Aff}}$ is defined as follows: for $L\in\mathbb{N}$ , its “ $L$ -ary objects” are

[TABLE]

Equivalently, these could be seen as a function from $[L]$ to $\{(a,b)\in\mathbb{Q}_{\geq 0}\times\mathbb{Z}:a=0\implies b=0\}.$

For $\pi\colon[L]\to[L^{\prime}]$ and $(w,r)\in\mathcal{M}_{\mathrm{BLP+Aff}}^{(L)}$ , we define the minor $(w,r)_{/\pi}$ as $(w^{\prime},r^{\prime})$ , where

[TABLE]

It is easy to check this indeed defines a minion (the $w(i)=0\implies r(i)=0$ condition is preserved when taking a minor and composition of minors works as expected). One could also think of a pair $(w,r)\in\mathcal{M}_{\mathrm{BLP+Aff}}^{(L)}$ as an $L$ -ary function on $\mathbb{Q}^{2}$ , $f(\binom{x_{1}}{y_{1}},\dots,\binom{x_{n}}{y_{n}})=\binom{\sum w(i)x_{i}}{\sum r(i)x_{i}}$ .

The minion $\mathcal{M}_{\mathrm{BLP+Aff}}$ characterizes the BLP+Affine relaxation as follows.

Lemma 7.

Let $(\mathbf{A},\mathbf{B})$ be a promise template. The following are equivalent:

•

BLP+Affine correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ (Definition 1).

•

$\mathcal{M}_{\mathrm{BLP+Aff}}$ * admits a minion homomorphism to $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ .*

As the proof of this lemma directly extends the arguments by Barto et al. [BBKO19], we refer the reader to Appendix A for an exposition of it.

We now reinterpret this last condition in terms of concrete polymorphisms. One direction is simple:

Lemma 8.

Suppose $\mathcal{M}_{\mathrm{BLP+Aff}}$ has a minion homomorphism to some minion $\mathcal{N}=\operatorname{Pol}(\mathbf{A},\mathbf{B})$ . Then for every $L\in\mathbb{N}$ , $\mathcal{N}$ contains a block-symmetric polymorphism of arity $2L+1$ with two blocks of size $L$ and $L+1$ .

Proof.

Given $L\in\mathbb{N}$ , consider the following object $(w,r)\in\mathcal{M}_{\mathrm{BLP+Aff}}^{(2L+1)}$ : take $w(i):=\frac{1}{2L+1}$ and $r(i):=(-1)^{i+1}$ for $i=1,\dots,2L+1$ . For every permutation $\pi\colon[2L+1]\to[2L+1]$ which maps odd coordinates to odd coordinates (and even to even), $(w,r)_{/\pi}=(w,r)$ . Thus the image of $(w,r)$ in $\mathcal{N}$ has the same property, i.e. it has arity $2L+1$ and it is symmetric on odd coordinates as well as on even coordinates. ∎

We remark the above lemma in fact applies to any minion $\mathcal{N}$ , not only those of the form $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ ; one can define $f\in\mathcal{N}^{(L)}$ to be block-symmetric with blocks $B_{1}\cup\cdots\cup B_{k}=[L]$ if $f_{/\pi}=f$ holds for all permutations $\pi$ of $[L]$ that preserve the blocks; the proof then applies without change.

The idea for the other direction is essentially the same as in the proof of Theorem 2 and 3. We apply it to construct a minion homomorphism from every finite subset of $\mathcal{M}_{\mathrm{BLP+Aff}}$ and use a compactness argument.

Lemma 9.

Suppose the minion $\mathcal{N}=\operatorname{Pol}(\mathbf{A},\mathbf{B})$ (for $\mathbf{A},\mathbf{B}$ finite) contains block-symmetric polymorphisms of arbitrarily high width. Then $\mathcal{M}_{\mathrm{BLP+Aff}}$ admits a minion homomorphism to $\mathcal{N}$ .

Proof.

To avoid cumbersome notation we present the proof only for the case of one block, i.e. we assume that $\mathcal{N}$ contains symmetric polymorphisms of arbitrarily high arity. This extends to more blocks just as Theorem 3 extends Theorem 2.

We define finite subsets of $\mathcal{M}_{\mathrm{BLP+Aff}}$ as follows. For $L,\ell,M\in\mathbb{N}$ , let $\mathcal{M}^{(L)}_{\ell,M}$ be the subset of those $(w,r)\in\mathcal{M}_{\mathrm{BLP+Aff}}^{(L)}$ such that $\ell w(i)\in\mathbb{Z}$ for $i\in[L]$ and $\sum_{i}|r(i)|\leq M$ . Observe that $\mathcal{M}^{(L)}_{\ell,M}$ is a finite set (since the numbers $\ell w(i)$ are $L$ non-negative integers summing to $\ell$ and the numbers $r(i)$ are $L$ integers between $-M$ and $M$ ). Denote $\mathcal{M}_{\ell,M}:=\bigcup_{L\in\mathbb{N}}\mathcal{M}^{(L)}_{\ell,M}$ .

For fixed $\ell,M$ , we define a minion homomorphism from $\mathcal{M}_{\ell,M}$ to $\mathcal{N}$ as follows. Let $f\in\mathcal{N}$ be a function of some arity $L^{*}\geq M\ell^{2}$ . Let $u,v\in\mathbb{N}$ be numbers such that $L^{*}=u\ell+v$ , $v\in\{0,\dots,\ell-1\}$ . Then $u\geq M\ell$ .

Take $L\in\mathbb{N}$ and $(w,r)\in\mathcal{M}^{(L)}_{\ell,M}$ . For $i\in[L]$ , the number $W_{i}:=u\ell w(i)+vr(i)$ is a non-negative integer. Since $\sum_{i}W_{i}=u\ell+v=L^{*}$ , we can map $(w,r)$ to the $L$ -ary minor $g:=f(x_{1},x_{1},x_{1},\dots,x_{L},x_{L})$ of the $L^{*}$ -ary function $f$ where $x_{i}$ is repeated $W_{i}$ times, for $i\in[L]$ . We claim that this map is a minion homomorphism from $\mathcal{M}_{\ell,M}$ to $\mathcal{N}$ (in fact to the subminion of minors of $f$ ). Indeed, for $\pi\colon[L]\to[L^{\prime}]$ , consider the minor $g_{/\pi}$ of $g$ identifying $x_{j}$ for $j\in\pi^{-1}(i)$ into a single variable $z_{i}$ (for $i\in[L^{\prime}]$ ). We have that $g_{/\pi}$ is also a minor of $f$ where $z_{i}$ is repeated $\sum_{j\in\pi^{-1}(i)}W_{j}$ times. That is, $z_{i}$ is repeated $u\ell w(\pi^{-1}(i))+vr(\pi^{-1}(i))$ times. By symmetry of $f$ the ordering does not matter, thus $g_{/\pi}$ (the minor of the image of $f$ ) is the same as the image of the minor $f_{/\pi}$ .

We conclude with a compactness argument similar to that of Remark 7.13 in [BBKO19]. For $k\in\mathbb{N}$ , let $\mathcal{M}_{k}:=\bigcup_{L\leq k}\mathcal{M}^{(L)}_{k!,k}$ . Then $\mathcal{M}_{k}$ is finite, $\mathcal{M}_{k}\subseteq\mathcal{M}_{k+1}$ (because $k!\cdot w(i)\in\mathbb{Z}$ implies $(k+1)!\cdot w(i)\in\mathbb{Z}$ ) and $\bigcup_{k\in\mathbb{N}}\mathcal{M}_{k}=\mathcal{M}_{\mathrm{BLP+Aff}}$ . Consider the possible minion homomorphisms from $\mathcal{M}_{k}$ to $\mathcal{N}$ , or more precisely, restrictions of homomorphisms obtained above to $\mathcal{M}_{k}$ (since $\mathcal{M}_{k}$ itself is technically not a minion). There are only finitely many possible such restrictions $\mathcal{M}_{k}\to\mathcal{N}$ , because $\mathcal{M}_{k}$ is finite, the arities of images in $\mathcal{N}$ are bounded, and hence the number of possible images in $\mathcal{N}$ is also finite. Consider an infinite tree with restrictions from any $\mathcal{M}_{k}$ to $\mathcal{N}$ as nodes, the trivial map from $\mathcal{M}_{0}=\emptyset$ being the root, and the parent of a function $\mathcal{M}_{k+1}\to\mathcal{N}$ being its restriction to $\mathcal{M}_{k}$ . This is an infinite tree (because for each $k$ we have some minion homomorphism from a superset of $\mathcal{M}_{k}$ to $\mathcal{N}$ ) that is connected (because everyone is connected through its ancestors to the root) and finitely branching (because there are only finitely many restrictions $\mathcal{M}_{k}\to\mathcal{N}$ , for any fixed $k$ ). Therefore, by Kőnig’s lemma, the tree contains an infinite path $\zeta_{k}\colon\mathcal{M}_{k}\to\mathcal{N}$ of homomorphisms that are restrictions of each other. Their union is then a homomorphism from $\bigcup_{k\in\mathbb{N}}\mathcal{M}_{k}=\mathcal{M}_{\mathrm{BLP+Aff}}$ to $\mathcal{N}$ . ∎

(We remark the above proof in fact applies to any minion $\mathcal{N}$ , assuming $\mathcal{N}^{(L)}$ is finite for every $L$ .) Lemmas 7, 8, and 9 conclude the proof of Theorem 4.

6 Concluding Thoughts

We conclude with a few natural directions of future inquiry raised by this work.

Inspecting the proofs of Theorems 2 and 3, in order to yield a search algorithm (and not just a decision algorithm), it would suffice to compute:

[TABLE]

for some block-symmetric polymorphism $f$ and a fixed partition into blocks of size at least $L$ , for an integer $L$ which depends polynomially on the least common denominator of rational numbers in the LP solution and the maximum absolute value of integers in the affine solution. In previous work [BG19], Brakensiek and Guruswami circumvented this problem by assuming that $f$ has special structure (such as being a threshold function, etc.). Even then, we often only assumed that you had oracle access to the structure of $f$ . Thus, except for some simple cases studied in the paper, truly polynomial-time search algorithms remain elusive. Perhaps one could hope for a search algorithm like the decision algorithm presented in this paper which is oblivious to the underlying polymorphisms (as long as they are symmetric/block-symmetric).

Question. Is there an “oblivious” polynomial-time algorithm for the search version of Promise CSPs with infinitely many symmetric polymorphisms?

We note that an oblivious polynomial-time algorithm is also not known for the search version of Promise CSPs with symmetric polymorphisms of all arities (which capture the power of BLP [BBKO19, Theorem 7.9]) and for the search version of Promise CSPs with alternating polymorphisms of all odd arities (which capture the power of the affine relaxation [BBKO19, Theorem 7.19]).

Otherwise, one could hope to prove a “structure theorem” that every Promise CSP with infinitely many symmetric polymorphisms also has an infinite threshold-periodic family. As [BG19] shows, such polymorphisms can get exceedingly complicated, suggesting that such a characterization may only be possible in the Boolean case.

Question. Does every Boolean PCSP with infinitely many symmetric polymorphisms have an infinite threshold-periodic family?

Even without a structure theorem, one could perhaps hope to compute the pertinent values of $f$ “on the fly,” but this seems difficult in our current formulation as the arity of $f$ could be exponentially large in the input size!

While Theorem 4 characterizes the power of the BLP+Affine algorithm, it is still worthwhile to ask how this compares to other classes of templates, in particular those studied for non-promise CSPs. The following example of a simple template not solved by the BLP+Affine relaxation was communicated to us by Jakub Opršal.

Example 10.

Let $\mathbf{A}$ be the disjoint union of a directed 2-cycle $\{0,1\}$ and a directed 3-cycle $\{0^{\prime},1^{\prime},2^{\prime}\}$ . Then $\mathbf{A}$ is tractable template (i.e. $\operatorname{PCSP}(\mathbf{A},\mathbf{A})$ is solvable in polynomial time, in fact $\operatorname{Pol}(\mathbf{A},\mathbf{A})$ has cyclic polymorphisms of every prime arity $p>3$ ) but has no non-trivial block-symmetric polymorphisms.

Proof.

To see it admits no block-symmetric polymorphisms $f$ of width greater than one, observe that every such width can be represented as $2n+3n^{\prime}$ for some $n,n^{\prime}\in\mathbb{N}$ , hence every block can be filled with $n$ copies of values $0,1$ and $n^{\prime}$ copies of $0^{\prime},1^{\prime},2^{\prime}$ , giving some input $\bar{v}$ to $f$ . But $f$ should give the same output on the input $\bar{v}^{\oplus 1}$ consisting of $n$ copies of $1,0$ and $n^{\prime}$ copies of $1^{\prime},2^{\prime},0^{\prime}$ . Since $(v_{i},v^{\oplus 1}_{i})$ is an arc of $\mathbf{A}$ for every $i$ and since $f$ is a polymorphism, $(f(\bar{v}),f(\bar{v}^{\oplus 1}))$ would be a loop in $\mathbf{A}$ , a contradiction.

We now observe that $\operatorname{PCSP}(\mathbf{A},\mathbf{A})$ has a straightforward polynomial time algorithm. For each connected component of constraints, the variables must map to either $\{0,1\}$ or $\{0^{\prime},1^{\prime},2^{\prime}\}$ . The first case is equivalent to testing if the graph of constraints is bipartite. The latter can be done by a breath-first search which checks that all directed cycles have length a multiple of $3$ . ∎

Thus the condition of having block-symmetric polymorphisms of high width is not preserved under disjoint union, even though tractability is. We also know that since $\operatorname{Pol}(\mathbf{A},\mathbf{A})$ has a majority polymorphism (simply let $f(x,y,z)$ output $x$ if $x=y$ and $z$ otherwise), $\operatorname{PCSP}(\mathbf{A},\mathbf{A})$ can be solved in polynomial time via the $(2,3)$ -consistency algorithm, 3-rounds of Sherali-Adams, or the canonical SDP relaxation (see also [BK14, TŽ17, BKW17]). Informally, these relaxations ensure that there are locally consistent assignments to every (constant-sized) subset of variables. This consistency is quite powerful. For instance, 2-SAT can be solved by the BLP+Affine relaxation or 3 rounds of Sherali-Adams, but not the BLP by itself. This suggests the tantalising possibility that an analogous hierarchy could provide a uniform algorithm for all tractable non-promise CSPs.

Question. Which (decision) promise CSPs can be solved via constantly many rounds of the Sherali-Adams hierarchy for the BLP+Affine relaxation? Does this capture all tractable non-promise CSPs?

Acknowledgments

We thank Libor Barto, Andrei Krokhin, and Jakub Opršal for useful comments and encouragement. We also thank anonymous reviewers for many helpful comments.

Appendix A From Relaxations to Minion Homomorphisms

In this appendix, we recall the definition of the minion $\mathcal{Q}_{conv}$ and prove Lemma 7 from Section 5. We do this by explaining how free structures relate BLP and Affine relaxations to minions. We carry over the notation from Section 5.

Definition 11.

The minion $\mathcal{Q}_{conv}$ is defined as follows: for $L\in\mathbb{N}$ , the “ $L$ -ary object” of the minion are

[TABLE]

for $\pi\colon[L]\to[L^{\prime}]$ and $w\in\mathcal{Q}_{conv}^{(L)}$ , we define the minor $w_{/\pi}$ of $w$ as

[TABLE]

Let us describe how $\mathcal{Q}_{conv}$ characterizes the power of the basic linear programming relaxation; the case of BLP+Affine will be entirely analogous. Recall that for an instance $\mathbf{X}$ of $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ , a solution to the BLP relaxation assigns to each variable $i\in X$ a distribution $w_{i}\colon A\to\mathbb{Q}_{\geq 0}$ with $\sum_{a\in A}w_{i}(a)=1$ . It also assigns to each constraint $j$ of $\mathbf{X}$ a distribution over satisfying assignments $p_{j}\colon R^{A}\to\mathbb{Q}_{\geq 0}$ with sum 1. Finally, the relaxation requires that for a variable $i$ in a constraint $j$ of $\mathbf{X}$ , the assignment of $a\in A$ to $i$ has value $w_{i}(a)=\sum_{y}p_{j}(y)$ , where the sum runs over all satisfying assignments $y\in R^{A}$ of the constraint where the variable $i$ takes value $a$ .

In other words, $w_{i}(a)=p_{j}(\pi^{-1}(a))$ , where $\pi=\pi_{j\to i}\colon R^{A}\to A$ maps a satisfying assignment $y$ to the value of variable $i$ in constraint $j$ . That is, $w_{i}$ , as an object of $\mathcal{Q}_{conv}^{|A|}$ , is required to be the minor of $p_{j}\in\mathcal{Q}_{conv}^{|R^{A}|}$ obtained from $\pi$ . Thus the BLP relaxation of $\mathbf{X}$ is satisfiable if and only if one can assign some $w_{i}\in\mathcal{Q}_{conv}^{|A|}$ to each variable $i\in X$ so that the following holds for every constraint $j$ of $\mathbf{X}$ : there is a $p_{j}\in\mathcal{Q}_{conv}^{|R^{A}|}$ such that for all variables $i$ in $j$ , $w_{i}={p_{j}}{/\pi_{j\to i}}$ . This can be phrased as the existence of a homomorphism from $\mathbf{X}$ to the free structure $\mathbb{F}_{\mathcal{Q}_{conv}}$ , defined as follows.

Definition 12.

For a relational structure $\mathbf{A}$ and a minion $\mathcal{M}$ , the free structure $\mathbb{F}_{\mathcal{M}}(\mathbf{A})$ is a template with domain $\mathcal{M}^{|A|}$ (potentially infinite) and with the same signature as $\mathbf{A}$ . For each relation $R^{A}$ of arity $k$ in $\mathbf{A}$ , there is a relation $R^{\mathbb{F}}$ of the same arity in $\mathbb{F}_{\mathcal{M}}(\mathbf{A})$ defined as follows: $w_{1},\dots,w_{k}\in\mathcal{M}^{(|A|)}$ are in the relation $R^{\mathbb{F}}$ if there is some $p\in\mathcal{M}^{(|R^{A}|)}$ such that for each $i\in[k]$ , $w_{i}=p_{/\pi_{i}}$ . Here $\pi_{i}\colon R^{A}\to A$ maps $y\in R^{A}\subseteq A^{k}$ to its $i$ -th coordinate.

The above discussion shows that:

Observation 13.

The BLP relaxation of $(\mathbf{X},\mathbf{A})$ has a solution if and only if $\mathbf{X}$ is satisfiable in $\mathbb{F}_{\mathcal{Q}_{conv}}(\mathbf{A})$ .

Just as in Definition 1, we say that “BLP correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ ” if for every instance $\mathbf{X}$ , feasibility of the $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ implies satisfiability of $\mathbf{X}$ in $\mathbf{B}$ . (Note the other direction is always trivially true: if $\mathbf{X}$ is satisfiable in $\mathbf{A}$ , then the relaxation $\operatorname{LP}_{\mathbb{Q}}(\mathbf{X},\mathbf{A})$ has a solution). Let us write $\mathbf{X}\to\mathbf{A}$ if there exists a homomorphism from $\mathbf{X}$ to $\mathbf{A}$ (i.e. a satisfying assignment); we can now restate the definition.

Observation 14.

Let $(\mathbf{A},\mathbf{B})$ be a promise template. The following are equivalent:

•

BLP correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ ;

•

for every instance $\mathbf{X}$ , $\mathbf{X}\to\mathbb{F}_{\mathcal{Q}_{conv}}(\mathbf{A})$ implies $\mathbf{X}\to\mathbf{B}$ .

Entirely analogously, we can restate what it means for BLP+Affine to solve a PCSP (Definition 1), by using the minion $\mathcal{M}_{\mathrm{BLP+Aff}}$ (Definition 6).

Observation 15.

Let $(\mathbf{A},\mathbf{B})$ be a promise template. The following are equivalent:

•

BLP+Affine correctly solves $\operatorname{PCSP-Decision}(\mathbf{A},\mathbf{B})$ ;

•

for every instance $\mathbf{X}$ , $\mathbf{X}\to\mathbb{F}_{\mathcal{M}_{\mathrm{BLP+Aff}}}(\mathbf{A})$ implies $\mathbf{X}\to\mathbf{B}$ .

The resulting condition can be simplified by a standard compactness argument. That is, we use the following straightforward generalization of the de Bruijn–Erdős Theorem (see e.g. [Die16, Theorem 8.1.3] for a discussion and short proofs, [RTW17] for general relational structures).

Lemma 16 (Compactness for structures).

Let $\mathbf{F},\mathbf{B}$ be relational structures with $F$ infinite and $B$ finite. If every finite induced substructure of $\mathbf{F}$ admits a homomorphism to $\mathbf{B}$ , then so does $\mathbf{F}$ .

That is, for a promise template $(\mathbf{A},\mathbf{B})$ and any minion $\mathcal{M}$ , the following are equivalent:

•

for every instance $\mathbf{X}$ , $\mathbf{X}\to\mathbb{F}_{\mathcal{M}}(\mathbf{A})$ implies $\mathbf{X}\to\mathbf{B}$ ;

•

$\mathbb{F}_{\mathcal{M}}(\mathbf{A})\to\mathbf{B}$ .

A fundamental property of free structures is that the latter condition is equivalent to the existence of a minion homomorphism, as proved by Barto et al. [BBKO19, Lemma 4.4].

Lemma 17 ([BBKO19]).

Let $(\mathbf{A},\mathbf{B})$ be a promise template and let $\mathcal{M}$ be any minion. The following are equivalent:

•

$\mathbb{F}_{\mathcal{M}}(\mathbf{A})\to\mathbf{B}$ ;

•

there exists a minion homomorphism from $\mathcal{M}$ to $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ .

Altogether, this shows that BLP+Affine solves $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ if and only if $\mathcal{M}_{\mathrm{BLP+Aff}}$ admits a minion homomorphism to $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ . This concludes the proof of Lemma 7 in Section 5.

We remark that Barto et al. [BBKO19, Theorem 7.9] used the same argument to characterize the power of BLP for PCSPs.

Theorem 18 ([BBKO19]).

Let $(\mathbf{A},\mathbf{B})$ be a promise template. The following are equivalent:

•

BLP solves $\operatorname{PCSP}(\mathbf{A},\mathbf{B})$ (as in Definition 1),

•

$\forall_{\mathbf{X}}\ \ \mathbf{X}\to\mathbb{F}_{\mathcal{Q}_{conv}}(\mathbf{A})\implies\mathbf{X}\to\mathbf{B}$ ,

•

$\mathbb{F}_{\mathcal{Q}_{conv}}(\mathbf{A})\to\mathbf{B}$ ,

•

$\mathcal{Q}_{conv}$ * admits a minion homomorphism to $\operatorname{Pol}(\mathbf{A},\mathbf{B})$ ,*

•

$\operatorname{Pol}(\mathbf{A},\mathbf{B})$ * contains symmetric polymorphisms of every arity.*

Our argument thus only differs in the equivalence of the last two bullets, an analogue of which is proved in Section 5. Finally, let us note that in [BBKO19, Theorem 7.19], the power of the Affine relaxation alone was similarly characterized by the minion $\mathcal{Z}_{\mathrm{aff}}$ , defined analogously to $\mathcal{Q}_{conv}$ , except with integer coefficients (not necessarily non-negative): the $L$ -ary objects are $r\colon[L]\to\mathbb{Z}$ such that $\sum_{i\in[L]}r(i)=1$ .

Bibliography25

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AGH 17] Per Austrin, Venkatesan Guruswami, and Johan Håstad. (2+( ϵ italic-ϵ \epsilon ))-Sat Is NP-hard. SIAM J. Comput. , 46(5):1554–1573, 2017.
2[BBKO 19] Libor Barto, Jakub Bulín, Andrei Krokhin, and Jakub Opršal. Algebraic approach to promise constraint satisfaction. ar Xiv:1811.00970 [cs, math] , 2019.
3[BD 06] Andrei Bulatov and Víctor Dalmau. A Simple Algorithm for Mal’tsev Constraints. SIAM J. Comput. , 36(1):16–27, July 2006.
4[BG 18] Joshua Brakensiek and Venkatesan Guruswami. Promise Constraint Satisfaction: Structure Theory and a Symmetric Boolean Dichotomy. In Artur Czumaj, editor, Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2018, New Orleans, LA, USA, January 7-10, 2018 , pages 1782–1801. SIAM, 2018. Full version available as ECCC TR 16-183.
5[BG 19] Joshua Brakensiek and Venkatesan Guruswami. An Algorithmic Blend of L Ps and Ring Equations for Promise CS Ps. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms , SODA ’19, pages 436–455, Philadelphia, PA, USA, 2019. Society for Industrial and Applied Mathematics.
6[BG 20] Joshua Brakensiek and Venkatesan Guruswami. Symmetric polymorphisms and efficient decidability of PCS Ps. In Proceedings of the 31st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’20) , pages 297–304. SIAM, 2020. ar Xiv:1907.04383.
7[BK 14] Libor Barto and Marcin Kozik. Constraint Satisfaction Problems Solvable by Local Consistency Methods. Journal of the ACM , 61(1):3:1–3:19, January 2014.
8[BKO 19] Jakub Bulín, Andrei Krokhin, and Jakub Opršal. Algebraic Approach to Promise Constraint Satisfaction. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing , STOC 2019, pages 602–613, New York, NY, USA, 2019. ACM.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The Power of the Combined Basic LP and Affine

Abstract

1 Introduction

2 Notation

2.1 PCSP: Decision and Search

2.2 Polymorphisms

2.3 Basic LP and Affine Relaxation

3 BLP+Affine Algorithm and Analysis for Symmetric Polymorphisms

Definition 1**.**

Theorem 2**.**

Proof.

4 Extension of Analysis to Block Symmetric Polymorphisms

Theorem 3**.**

Proof.

5 Characterizing the Algorithm’s Power

Theorem 4**.**

Definition 5**.**

Definition 6**.**

Lemma 7**.**

Lemma 8**.**

Proof.

Lemma 9**.**

Proof.

6 Concluding Thoughts

Example 10**.**

Proof.

Acknowledgments

Appendix A From Relaxations to Minion Homomorphisms

Definition 11**.**

Definition 12**.**

Observation 13**.**

Observation 14**.**

Observation 15**.**

Lemma 16** (Compactness for structures).**

Lemma 17** ([BBKO19]).**

Theorem 18** ([BBKO19]).**

Definition 1.

Theorem 2.

Theorem 3.

Theorem 4.

Definition 5.

Definition 6.

Lemma 7.

Lemma 8.

Lemma 9.

Example 10.

Definition 11.

Definition 12.

Observation 13.

Observation 14.

Observation 15.

Lemma 16 (Compactness for structures).

Lemma 17 ([BBKO19]).

Theorem 18 ([BBKO19]).