This paper classifies all P-time computable six-vertex models on planar graphs, discovering new models beyond existing algorithms and establishing a comprehensive complexity landscape with explicit criteria.
Contribution
It introduces a complete classification of six-vertex models on planar graphs, discovering new P-time cases and providing explicit criteria for complexity.
Findings
01
Identified all P-time computable six-vertex models on planar graphs.
02
Discovered new models beyond Kasteleyn's algorithm.
03
Established a comprehensive complexity classification with explicit criteria.
Abstract
We discover new P-time computable six-vertex models on planar graphs beyond Kasteleyn's algorithm for counting planar perfect matchings. We further prove that there are no more: Together, they exhaust all P-time computable six-vertex models on planar graphs, assuming #P is not P. This leads to the following exact complexity classification: For every parameter setting in C for the six-vertex model, the partition function is either (1) computable in P-time for every graph, or (2) #P-hard for general graphs but computable in P-time for planar graphs, or (3) #P-hard even for planar graphs. The classification has an explicit criterion. The new P-time cases in (2) provably cannot be subsumed by Kasteleyn's algorithm. They are obtained by a non-local connection to #CSP, defined in terms of a "loop space". This is the first substantive advance toward a planar Holant classification…
Tables5
Table 1. Table 4.1: The values of f 𝑓 f and its rotated copies
at intersection vertices
entry-vertices
exit-vertices
Table 2. Table 4.2: The values of g i , j subscript 𝑔 𝑖 𝑗 g_{i,j}
Table 3. Table 4.3: The values of f 𝑓 f and its rotated forms
at self-intersection vertices
Table 4. Table 4.4: The values of gadget f ⊠ subscript 𝑓 ⊠ f_{\boxtimes} when a = x = 1 𝑎 𝑥 1 a=x=1 and b = y 𝑏 𝑦 b=y
Square
Table 5. Table 4.5: The values of gadget f ⊠ subscript 𝑓 ⊠ f_{\boxtimes} when a = − x = 1 𝑎 𝑥 1 a=-x=1 and b = y 𝑏 𝑦 b=y
detMOut(Dλf)=detMIn(Dλf), and detMOut(M(=2)f)=detMIn(M(=2)f).
detMOut(Dλf)=detMIn(Dλf), and detMOut(M(=2)f)=detMIn(M(=2)f).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Markov Chains and Monte Carlo Methods · Computability, Logic, AI Algorithms
Full text
New Planar P-time Computable Six-Vertex Models and a Complete Complexity Classification
Jin-Yi Cai
[email protected]
Department of Computer Sciences, University of Wisconsin-Madison. Supported by NSF CCF-1714275.
Zhiguo Fu
[email protected]
School of Information Science and Technology, Northeast Normal University. Supported by NSFC-61872076,
Natural Science Foundation of Jilin Province 20200201161JC and Fundamental Research Funds for Central Universities.
We discover new P-time computable six-vertex models
on planar graphs beyond
Kasteleyn’s algorithm
for counting planar perfect matchings.111This is also known as the FKT algorithm.
Fisher and Temperley [46], and Kasteleyn [35] independently discovered
this algorithm on the grid graph. Subsequently Kasteleyn [34]
generalized this to all
planar graphs. We further prove that there are no more: Together, they exhaust all P-time computable six-vertex models on planar graphs, assuming
#P is not P.
This leads to the following exact
complexity
classification:
For every parameter setting in C for the six-vertex model,
the partition function is
either (1) computable in
P-time for every graph, or
(2) #P-hard for general graphs but computable in
P-time for
planar graphs, or
(3) #P-hard even for planar graphs.
The classification has an explicit criterion.
The new P-time cases
in (2) provably cannot be subsumed by Kasteleyn’s algorithm.
They are obtained by a non-local connection to #CSP,
defined in terms of
a “loop space”.
This is the first substantive advance toward a planar Holant classification
with not necessarily symmetric constraints.
We introduce Möbius transformation on C as a powerful
new tool in hardness proofs for counting problems.
1 Introduction
Partition functions are Sum-of-Product computations.
In physics, one considers a set of particles connected by some
bonds. Then physical laws impose various local constraints,
with suitable weights for valid local configurations.
If a global configuration σ satisfies
all local constraints, the product of local weights
is the weight of σ, and the sum over all such
σ is the value of the partition function.
The partition function encodes much information about a physical system.
By definition, a partition function is an exponential sized sum.
But in some cases, clever algorithms exist that can compute
it in P-time.
Well-known examples of partition functions from physics
include the Ising model, Potts model, hardcore gas and
the six-vertex model [6, 5].
Most of these are spin systems [33, 29, 28, 39]. If particles take
+/− spins, each can be modeled by a Boolean variable,
and local constraints are expressed by edge (binary) constraint functions.
These are nicely modeled by the #CSP framework [8, 25, 9, 12, 11].
Some physical systems are more naturally described as orientation problems,
and these can be modeled by Holant problems [20], of which
#CSP is a special case.
In this paper we study the six-vertex model,
which consists of orientation problems.
The six-vertex model has a long history in
physics.
Pauling in 1935 introduced the six-vertex model
to account for the residual entropy
of water ice [44].
Consider a large number of oxygen and hydrogen atoms
in a 1 to 2 ratio.
Each oxygen atom (O) is connected by a bond to four other neighboring oxygen
atoms (O), and each bond is occupied by one hydrogen atom (H).
Physical constraint requires that each (H) is closer to exactly one
of the two neighboring (O).
Pauling argued [44] that, furthermore, the allowed configurations
are such that at each oxygen (O) site, exactly two hydrogen (H)
are closer to it, and the other two are farther away.
This can be naturally represented by a 4-regular graph.
The constraint on the placement of hydrogen atoms (H) can be represented by
an orientation of the edges of the graph, such that
at every vertex (O), the in-degree and out-degree are both 2.
In other words, this is an Eulerian orientation [42, 16].
Since there are (24)=6 local valid configurations,
this is called the six-vertex model.
In addition to water ice,
potassium dihydrogen phosphate KH2PO4 (KDP) also
satisfies this model.
The valid local configurations of the six-vertex model
are illustrated in Figure 1.1.
The energy E of the system is determined by
six parameters ϵ1,ϵ2,…,ϵ6
associated with each type of local configuration.
If there are ni sites in local configurations of type i,
then
E=n1ϵ1+n2ϵ2+…+n6ϵ6.
Then the partition function is ZSix=∑e−E/kBT,
where the sum is over all valid configurations, kB
is Boltzmann’s constant, and T is the system’s temperature.
This is a sum-of-product
computation where the sum is over all Eulerian orientations
of the graph, and the product is over all vertices where each
contributes a factor ci=cϵi
if it is in configuration i
(1≤i≤6) for some constant c.
Some choices of the parameters
are well-studied.
For modeling ice (ϵ1=…=ϵ6=0)
on the square N×N lattice graph,
Lieb [40] famously showed that,
the value of the “partition function per vertex”
W=Z1/N2 approaches (34)3/2≈1.5396007… (Lieb’s square ice constant).
This matched experimental
data 1.540±0.001 so well that it is considered a triumph.
Other well-known six-vertex models include:
the KDP model of a ferroelectric
(ϵ1=ϵ2=0, and
ϵ3=ϵ4=ϵ5=ϵ6>0),
the Rys F model of an antiferroelectric
(ϵ1=ϵ2=ϵ3=ϵ4>0,
and ϵ5=ϵ6=0).
Historically these are widely considered among
the most significant applications ever made of statistical mechanics
to real substances.
In classical statistical mechanics the parameters are
real numbers. However, it’s meaningful to consider parameters over complex values.
In quantum theory the parameters are generally complex valued.
Even in classical theory, for example, Baxter generalized the parameters to complex values to develop the “commuting transfer matrix” for
tackling the six-vertex model [5].
Some other models can be transformed to a six-vertex model with
complex weights. There are books
with sections (e.g., see section 2.5.2 of [30]) that are dedicated to this, for example, the Hamiltonian of a one dimensional
spin chain is simply an extension of the Hamiltonian of a six-vertex model with complex Boltzmann
weights.
The six-vertex model has broad connections to combinatorics. The resolution of the
famous Alternating
Sign Matrix conjecture is one example [36, 43, 51, 37, 7].
Also, the Tutte polynomial on a planar graph
at
the point
(3,3) is precisely 1/2 of ZSix on its medial graph which is also a planar graph
with a specific weight assignment [38].
Although Pauling most likely did not think of it in such terms,
the six-vertex model
can be expressed
perfectly as a family of Holant problems with 6 parameters, expressed by signatures of arity 4.
Previously, without being able to account for the planar
restriction, it has been proved [18] that there is a complexity
dichotomy where
the problem on general graphs
is either in P or #P-hard. However, the more interesting
problem is what happens on planar structures where physicists
had discovered some remarkable algorithms, such as
the FKT algorithm [46, 35, 34].
Due to the presence of nontrivial algorithms, a
complete complexity classification in the planar case
is more difficult to achieve.
Not only are reductions to FKT expected
to give planar P-time computable cases that are
#P-hard in general, but also a more substantial obstacle awaits us.
It turns out that there is another planar P-time computable case
that had not been discovered for the six-vertex model in all these decades,
till now. (Since our algorithm and its proof that it runs in P-time is valid for all planar graphs,
this certainly also applies to the grid case, which is traditionally the main concern for
physicists.)
The main theorem in this paper is
a complexity trichotomy for the
six-vertex model: According to the
6 parameters from C,
the partition function ZSix is
either (1) computable in P-time,
or (2) #P-hard on general graphs but computable in P-time on
planar graphs, or (3) remains #P-hard on planar graphs.
The classification has an explicit criterion.
The planar tractable class (2) includes
those that depend on FKT,
and a previously unknown family.
Functions that are expressible as matchgates
(denoted by M) or those that are transformable
to matchgates
(denoted by M) do constitute a family
of ZSix in class (2).
This follows from the FKT and Valiant’s holographic algorithms [50].222It was known [27, 26] that on the grid graph the
parameter settings that satisfy cz=ax+by
(using notations in Section 2) is P-time computable;
in our theory this is in M, and the proof
is: It follows by Matchgate Identities [10].
However, beyond these,
we discover an additional family of
P-time computable ZSix on planar graphs.
The P-time tractability is via a
non-local reduction
to P-time computable #CSP,
where the variables in #CSP correspond to carefully defined circuits in G.
The fact that this
#CSP problem is
in P depends crucially on the global topological constraint
imposed by the planarity of G (but the #CSP instances that this produces is not planar in general.)
The new tractable class provably cannot be subsumed by FKT (even with a holographic transformation).
After carving out this last tractable family, we
prove that everything else is #P-hard, even for the planar case.
A powerful tool in hardness proofs is interpolation [47].
Typically an interpolation proof can succeed
when certain quantities (such as ratios of eigenvalues)
are not roots of unity, lest the iteration repeat after
a bounded number of steps.
A sufficient condition is that these quantities
have complex norm not equal to 1.
However, for some constraint functions,
we can show that these constructions only produce
such quantities of norm equal to 1.
To overcome this difficulty we introduce a new technique in hardness proofs:
Möbius transformations.333Möbius transformations were previously used in the design of quantum algorithms for approximating the Potts model [1]. Here we use Möbius transformations in a different way, which is for hardness proofs. These Möbius transformations are maps
on C; they are
unrelated to Möbius inversions
for partial orders, e.g., as used
in [24].
We explore properties of
Möbius transformations
that map unit circle to
unit circle on C,
and obtain a suitable Möbius transformation that generates
an infinite group. This allows our interpolation proof to succeed.
The classification of the six-vertex model is not only interesting
in its own right, more importantly, it serves as a basic building block in the
classification program for Holant problems on asymmetric signatures.
For Holant problems over general graphs, complexity dichotomies [21, 23, 3, 4, 17] were proved when certain signatures of odd arity (e.g. unary signatures) are present.
When it comes to signature sets of only even arities, the situation is more difficult.
The six-vertex model is precisely an inductive base
case for Holant problems with asymmetric signatures of even arity.
Very recently a full dichotomy
for real-valued Holant problems on asymmetric signatures was achieved [45]. This theorem is for
general graphs without addressing planar tractability.
The very first step to achieve this dichotomy
[45]
is
the
dichotomy of the six-vertex model without planarity [18]. After that, complexity dichotomies were proved for eight-vertex models [13], and for counting weighed Eulerian orientation problems with a reversal symmetry condition [16].
Lin and Wang proved a dichotomy
for nonnegative valued Holant [41].
The dichotomy [45] is built on top of all that;
see Figure 1.2.
However, from the very beginning [50] the additional planar tractability afforded by the likes of the FKT algorithm is at the very heart of
Holant problems. One can say this is the raison d’être
of holographic algorithms. This classification is already done for symmetric signatures [15].
We hope that the present work
will serve as the beginning step
toward achieving a classification of Holant problems including planar
tractability, without the symmetry assumption.
2 Preliminaries and Notations
In this paper, i denotes −1, a square root of −1.
2.1 Definitions and Notations
A constraint function f, or a signature, of arity k
is a map {0,1}k→C.
Fix a set F of constraint functions. A signature grid
Ω=(G,π)
is a tuple, where G=(V,E)
is a graph, π labels each v∈V with a function
fv∈F of arity deg(v),
and the incident edges
E(v) at v with input variables of fv.
We consider all 0-1 edge assignments σ,
each gives an evaluation
v∈V∏fv(σ∣E(v)), where σ∣E(v)
denotes the restriction of σ to E(v). The counting problem on the instance Ω is to compute
[TABLE]
The Holant problem parameterized by the set F is denoted by Holant(F).
If F={f} is a single set, for simplicity, we write {f} as f directly, and also we write {f,g} as f,g.
When G is a planar graph, the corresponding signature grid is called a planar signature grid.
We use Holant(F∣G) to denote the Holant problem over signature grids with a bipartite graph H=(U,V,E),
where each vertex in U or V is assigned a signature in F or G
respectively.
We list the values of a signature f:{0,1}k→C
as a vector of dimension 2k
in lexicographic
order.
Signatures in F are considered as row vectors (or covariant tensors);
signatures in G are considered as column vectors (or contravariant tensors).
Similarly,
Pl-Holant(F∣G) denotes the Holant problem over signature grids with a planar bipartite graph.
A signature f of arity 4 has the signature matrix
M(f)=Mx1x2,x4x3(f)=[f0000f0100f1000f1100f0010f0110f1010f1110f0001f0101f1001f1101f0011f0111f1011f1111].
Notice the order reversal x4x3; this is for the convenience of
composing these signatures in a planar fashion.
If (i,j,k,ℓ) is a permutation of (1,2,3,4),
then the 4×4 matrix Mxixj,xℓxk(f) lists the 16 values
with row index xixj∈{0,1}2
and column index
xℓxk∈{0,1}2 in lexicographic order.
The planar six-vertex model is Pl-Holant(=2∣f),
where M(f)=[000x0bz00cy0a000].
The outer matrix of M(f)
is the submatrix
[M(f)1,1M(f)4,1M(f)1,4M(f)4,4]=[0xa0], and is
denoted by MOut(f).
The inner matrix of M(f) is
[M(f)2,2M(f)3,2M(f)2,3M(f)3,3]=[bzcy], and is
denoted by MIn(f).
A binary signature g has the signature matrix M(g)=Mx1,x2(g)=[g00g10g01g11].
Switching the order, Mx2,x1(g)=[g00g01g10g11].
We use (=2) to denote binary Disequality signature (0,1,1,0)T.
It has the signature matrix [0110]. Let
N=[0110]⊗[0110]=[0001001001001000].
Note that N is the double Disequality (x1=x4)∧(x2=x3),
which is the function of connecting two pairs of edges by (=2).
A function is symmetric if its value depends only
on the Hamming weight of its input.
A symmetric function f on k Boolean variables can
be expressed as
[f0,f1,…,fk],
where fw is the value of f on inputs of Hamming weight w.
For example, (=k) is the Equality signature [1,0,…,0,1]
(with k−1 many 0’s) of arity k.
The support of a signature f is the set of inputs on which f is nonzero.
Counting constraint satisfaction problems (#CSP)
can be defined as a special case of Holant problems.
An instance of #CSP(F) is presented
as a bipartite graph.
There is one node for each variable and for each occurrence
of constraint functions respectively.
Connect a constraint node to a variable node if the
variable appears in that occurrence
of constraint, with a labeling on the edges
for the order of these variables.
This bipartite graph is also known as the constraint graph.
If we attach each variable node with an Equality function,
and consider every edge as a variable, then
the #CSP is just the Holant problem on this bipartite graph.
Thus
#CSP(F)≡THolant(EQ∣F),
where EQ={=1,=2,=3,…} is the set of Equality signatures of all arities.
By restricting to planar constraint graphs,
we have the planar #CSP framework,
which we denote by Pl-#CSP.
The construction above also shows that Pl-#CSP(F)≡TPl-Holant(EQ∣F).
2.2 Gadget Construction
One basic tool used throughout the paper is gadget construction.
An F-gate is similar to a signature grid (G,π) for Holant(F) except that G=(V,E,D) is a graph with internal edges E and dangling edges D.
The dangling edges D define input variables for the F-gate.
We denote the regular edges in E by 1,2,…,m and the dangling edges in D by m+1,…,m+n.
Then the F-gate defines a function f
[TABLE]
where (y1,…,yn)∈{0,1}n is an assignment on the dangling edges, σ^ is the extension of σ on E by the assignment (y1,…,ym), and fv is the signature assigned at each vertex v∈V. (See Figure 2.1 for an example.)
This function f is called the signature of the F-gate.
We say a signature f is realizable from a signature set F by gadget construction
if f is the signature of an
F-gate.
An F-gate is planar if the underlying graph G is a planar graph,
and the dangling edges,
ordered counterclockwise corresponding to the order of the input variables,
are in the outer face in a planar embedding.
A planar F-gate can be used in a planar signature grid as if it is just a single vertex with the particular signature.
If f is realizable by a planar F-gate,
then we can freely add f into F while preserving the complexity of the planar Holant problem, i.e., Pl-Holant(F,f)≡TPl-Holant(F).
The reduction from the right to the left is trivial. The reduction in the other direction is also simple.
Given an instance of Pl-Holant(F,f),
by replacing every occurrence of f with the F-gate,
we get an instance of Pl-Holant(F).
In this paper, we focus on planar graphs, and we assume the edges incident to a vertex are ordered counterclockwise. When connecting two signatures, we need to keep the counterclockwise order of the edges incident to each vertex.
Given a signature f with signature matrix Mx1x2,x4x3(f),
we can rotate it to obtain, for any cyclic permutations
(i,j,k,ℓ) of (1,2,3,4),
the signature f′ with signature matrix Mx1x2,x4x3(f′)=Mxixj,xℓxk(f).
There are four cyclic permutations of (1,2,3,4),
so correspondingly, a signature f has four rotated
forms, with 4×4 signature matrices
Mx1x2,x4x3(f)=[000x0bz00cy0a000],
Mx2x3,x1x4(f)=[000b0ac00zx0y000],
Mx3x4,x2x1(f)=[000a0yz00cb0x000],
and Mx4x1,x3x2(f)=[000y0xc00za0b000].
These are denoted as
f, f2π, fπ and f23π,
respectively. Thus Mx1x2,x4x3(f2π)=Mx2x3,x1x4(f), etc.
Without other specification, M(f) denotes Mx1x2,x4x3(f).
Once we get one form, all four rotation forms can be freely used.
In the proof, after one construction,
we may use this property to get a similar construction and conclude
by quoting this rotational symmetry.
The movement of signature entries under a rotation is illustrated
in Figure 2.2 (Figure 2 in [15]).
Note that no matter in which signature matrix, the pair (c,z) (and only
(c,z)) is always in the inner matrix. We call (c,z) the inner pair, and (a,x), (b,y)
the outer pairs.
There are three common gadgets we will use in this paper.
The first gadget construction is as follows.
Suppose f1 and f2 have signature matrices
Mxixj,xℓxk(f1) and Mxsxt,xvxu(f2), where (i,j,k,ℓ) and (s,t,u,v) are permutations of (1,2,3,4).
By connecting xℓ with xs, xk with xt,
both using Disequality (=2), we get a signature
of arity 4 with the signature matrix
Mxixj,xℓxk(f1)NMxsxt,xvxu(f2)
by matrix product
with row index xixj and column index xvxu (See Figure 2.3).
A binary signature g has the signature vector g(x1,x2)=(g00,g01,g10,g11)T,
and also g(x2,x1)=(g00,g10,g01,g11)T.
Without other specification, g denotes g(x1,x2). Let f be a signature of arity 4 with the signature matrix
Mxixj,xℓxk(f) and (s,t) be a permutation of (1,2).
The second gadget construction is as follows.
By connecting xℓ with xs and xk with xt, both using Disequality (=2),
we get a binary signature with the signature matrix Mxixj,xkxℓNg(xs,xt) as a matrix product with index xixj (See Figure 2.4).
If g00=g11,
then N(g00,g01,g10,g11)T=(g11,g10,g01,g00)T=(g00,g10,g01,g11)T, and similarly, N(g00,g10,g01,g11)T=(g00,g01,g10,g11)T.
Therefore, Mxixj,xℓxkNg(xs,xt)=Mxixj,xℓxkg(xt,xs),
which means that
connecting variables xℓ, xk of f with, respectively,
variables xs, xt of g
using N is equivalent to connecting them directly without N.
Hence, in the setting
Pl-Holant(=2∣f,g) we can form
Mxixj,xℓxk(f)g(xt,xs),
which is technically Mxixj,xℓxkNg(xs,xt),
provided that g00=g11.
Note that for a binary signature g,
we can rotate it by 180∘ without violating
planarity, and so both g(xs,xt) and g(xt,xs) can be freely used once we get one of them.
A signature f of arity 4 also has the 2×8 signature matrix
[TABLE]
Suppose the signature matrix of g is Mxs,xt(g) and the signature matrix of f is Mxi,xjxℓxk(f).
Our third gadget construction is as follows.
By connecting xt with xi using Disequality (=2),
we get a signature h of arity 4 with the signature matrix
Mxs,xt(g)M(=2)Mxi,xjxℓxk(f)
by matrix product with row index xs and column index xjxℓxk (See Figure 2.5).
We may change this form to a signature matrix with row index xsxj and column index xℓxk.
In particular, if My1,y2(g)=[0t10], then connecting y2 with x1 via (=2)
gives
[TABLE]
If we rename the variable y1
by x1, then Mx1x2,x4x3(h)=[f0000f0100tf1000tf1100f0010f0110tf1010tf1110f0001f0101tf1001tf1101f0011f0111tf1011tf1111].
That is, the new signature has the matrix
obtained from multiplying t to the last
two rows of Mx1x2,x4x3(f)
corresponding to x1=1.
Similarly we can modify the last two columns of Mx1x2,x4x3(f).
Given g=(0,1,t,0)T,
we call the modification from Mx1x2,x4x3(f) to
[TABLE]
the operation of t scaling on x1=1.
Similarly we call the modification from Mx1x2,x4x3(f) to
[TABLE]
the operation of t scaling on x4=1.
For any scalar c=0
and any set of signatures F,
we have
Holant(F∪{f})≡THolant(F∪{cf}),
and Pl-Holant(F∪{f})≡TPl-Holant(F∪{cf}).
Thus a scalar c=0 does not change the complexity of a Holant problem.
Hence we can normalize any particular nonzero signature entry to be 1.
2.3 Holographic Transformation
To introduce the idea of holographic transformation,
it is convenient to consider bipartite graphs.
For a general graph,
we can always transform it into a bipartite graph while preserving the Holant value,
as follows.
For each edge in the graph,
we replace it by a path of length two.
(This operation is called the 2-stretch of the graph and yields the edge-vertex incidence graph.)
Each new vertex is assigned the binary Equality signature =2. Thus, we have Holant(=2∣F)≡THolant(F).
For an invertible 2-by-2 matrix T∈GL2(C)
and a signature f of arity n, written as
a column vector (contravariant tensor) f∈C2n, we denote by
T−1f=(T−1)⊗nf the transformed signature.
For a signature set F,
define T−1F={T−1f∣f∈F} the set of
transformed signatures.
For signatures written as
row vectors (covariant tensors) we define
fT and FT similarly.
Whenever we write T−1f or T−1F,
we view the signatures as column vectors;
similarly for fT or FT as row vectors.
In the special case of the Hadamard matrix
H2=21[111−1],
we also define F=H2F.
Note that H2 is orthogonal.
Since constant factors are immaterial, for convenience we sometime
drop the factor 21 when using H2.
Let T∈GL2(C).
The holographic transformation defined by T is the following operation:
given a signature grid Ω=(H,π) of Holant(F∣G),
for the same bipartite graph H,
we get a new signature grid Ω′=(H,π′) of Holant(FT∣T−1G) by replacing each signature in
F or G with the corresponding signature in FT or T−1G. Valiant’s Holant Theorem [50] states that the instances Ω and Ω′ have the same Holant value.
This result also holds for planar instances.
For every T∈GL2(C),
Pl-Holant(F∣G)≡TPl-Holant(FT∣T−1G).
Definition 2.2**.**
We say a signature set F is C-transformable
if there exists a T∈GL2(C) such that
(0,1,1,0)T⊗2∈C and T−1F⊆C.
This definition is important because if Pl-Holant(C) is tractable,
then Pl-Holant(=2∣F) is tractable for any C-transformable set F.
2.4 Polynomial Interpolation
Polynomial interpolation is a powerful technique to prove #P-hardness for counting problems. We use polynomial interpolation
to prove the following lemmas.
Lemma 2.3**.**
Let f be a 4-ary signature with the signature matrix
M(f)=[00010b0000b01000], where b=0 is not a root of unity.
Let χ1 be a 4-ary signature with the signature matrix
M(χ1)=[0001010000101000].
Then for any signature set F containing f, we have
[TABLE]
**Proof. **
We construct a series of gadgets f2s+1 by a chain of 2s+1 many copies of f linked by the double Disequality N (See Figure 2.6). Clearly f2s+1 has the following signature matrix
[TABLE]
The matrix M(f2s+1) has a good form for polynomial interpolation.
Suppose χ1 appears m times in an instance Ω of Pl-Holant(=2∣F∪{χ1}).
We replace each appearance of χ1 by a copy of
the gadget f2s+1 to get an instance Ω2s+1 of Pl-Holant(=2∣F∪{f2s+1}), which is also an instance of Pl-Holant(=2∣F).
We divide Ω2s+1 into two parts.
One part consists of m signatures f2s+1 and its signature is represented by (M(f2s+1))⊗m.
Here we rewrite (M(f2s+1))⊗m as a column vector.
The other part is the rest of Ω2s+1 and its signature is represented by A which is a tensor expressed as a row vector.
Then, the Holant value of Ω2s+1 is the dot product ⟨A,(M(f2s+1))⊗m⟩, which is a summation over 4m bits.
That is, a sum over all 0,1 values for the 4m edges connecting the two parts.
We can stratify all 0,1 assignments of these 4m bits having a nonzero evaluation of a term in Pl-HolantΩ2s+1 into the following categories:
•
There are i many copies of f2s+1 receiving inputs 0011 or 1100;
•
There are j many copies of f2s+1 receiving inputs 0110 or 1001;
where i+j=m.
For any assignment in the category with parameter (i,j), the evaluation of
(M(f2s+1))⊗m is clearly b(2s+1)j.
Let aij be the summation of values of the part A over all assignments in the category (i,j). Note that aij is independent from the value of s since we view the gadget f2s+1 as a block. Since i+j=m, we can denote aij by aj. Then, we rewrite the dot product summation and get
[TABLE]
Under this stratification, the Holant value of Pl-Holant(Ω,=2∣F∪{χ1}) can be represented as
[TABLE]
Since b=0 is not a root of unity, the linear equation system has a nonsingular Vandermonde matrix
[TABLE]
By oracle querying the values of Pl-HolantΩ2s+1,
we can solve the coefficients aj in polynomial time and obtain the value of p(x)=0⩽j⩽m∑ajxj for any x. Let x=1, we get Pl-HolantΩ. Therefore, we have Pl-Holant(=2∣F∪{χ1})⩽TPl-Holant(=2∣F).∎
Corollary 2.4**.**
Let f be a 4-ary signature with the signature matrix
M(f)=[000−10b0000b01000], where b=0 is not a root of unity.
Let χ2 be a 4-ary signature with the signature matrix
M(χ2)=[000−1010000101000].
Then for any signature set F containing f, we have
[TABLE]
Proof.
We still construct a series of gadgets f2s+1 by a chain of odd
many copies of f linked by the double Disequality N. Clearly f2s+1 has the following signature matrix
[TABLE]
Suppose χ2 appears m times in an instance Ω of Pl-Holant(=2∣f∪χ2).
We replace each appearance of χ2 by a copy of
the gadget f2s+1 to get an instance Ω2s+1 of Pl-Holant(=2∣F∪{f2s+1}).
In the same way as in the proof of Lemma 2.3, we divide Ω2s+1 into two parts. One part is represented by (M(f2s+1))⊗m and the other part is represented by A.
Then, the Holant value of Ω2s+1 is the dot product ⟨A,(M(f2s+1))⊗m⟩.
We can stratify all 0,1 assignments of these 4m bits having a nonzero evaluation of a term in Pl-HolantΩ2s+1 into the following categories:
•
There are i many copies of f2s+1 receiving inputs 0011;
•
There are j many copies of f2s+1 receiving inputs 0110 or 1001;
•
There are k many copies of f2s+1 receiving inputs 1100;
where i+j+k=m.
For any assignment in those categories with parameters (i,j,k) where k≡0(mod2), the evaluation of
(M(f2s+1))⊗m is clearly (−1)kb(2s+1)j=b(2s+1)j. And for any assignment in those categories with parameters (i,j,k) where k≡1(mod2), the evaluation of
(M(f2s+1))⊗m is clearly (−1)kb(2s+1)j=−b(2s+1)j.
Since i+j+k=m, the index i is determined by j and k. Let aj0 be the summation of values of the part A over all assignments in those categories (i,j,k) where k≡0(mod2), and aj1 be the summation of values of the part A over all assignments in those categories (i,j,k) where k≡1(mod2). Note that aj0 and aj1 are independent from the value of s.
Let aj=aj0−aj1.
Then, we rewrite the dot product summation and get
[TABLE]
Under this stratification, the Holant value of Pl-Holant(Ω;=2∣f∪χ2) can be represented as
[TABLE]
Since b=0 is not a root of unity, the Vandermonde coefficient matrix
has full rank. Hence we can solve for all the values
aj in polynomial time and obtain the value 0⩽j⩽m∑aj,
and so we get Pl-HolantΩ. Therefore, we have Pl-Holant(=2∣F∪{χ2})⩽TPl-Holant(=2∣F).∎
Lemma 2.5**.**
Let g=(0,1,t,0)T be a binary signature, where t=0 is not a root of unity. Then for any binary signature g′ of the form (0,1,t′,0)T and any signature set F containing g, we have
[TABLE]
Inductively, for any finite signature set B consisting of binary signatures of the form (0,1,t′,0)T and any signature set F containing g, we have
[TABLE]
**Proof. **
Note that M(g)=[0t10]. Connecting the variable x2 of a copy of g with the variable x1 of another copy of g using (=2), we get a signature g2 with the signature matrix
[TABLE]
That is, g2=(0,1,t2,0)T.
Recursively, we can construct gs=(0,1,ts,0)T for s≥1.
Here, g1 denotes g.
Given an instance Ω′ of Pl-Holant(=2∣F∪{g′}),
in the same way as in the proof of Lemma 2.3, we can replace each appearance of g′ by gs and get an instance Ωs of Pl-Holant(=2∣F).
Similarly, the Holant value of Ωs can be represented as
[TABLE]
while the Holant value of Ω′ can be represented as
[TABLE]
Since t=0 is not a root of unity, all ts are distinct,
and so
the Vandermonde coefficient matrix
has full rank. Hence, we can solve
for all aj, and then
compute 0⩽j⩽m∑aj(t′)j.
So we get Pl-HolantΩ′. Therefore, we have Pl-Holant(=2∣F∪{g′})⩽TPl-Holant(=2∣F). The second part of this lemma follows directly by the first part. ∎
Remark:
Note that the reason why the interpolation can succeed is that we can construct polynomially many binary signatures gs of the form (0,1,ts,0)T, where all ts are distinct such that the Vandermonde coefficient matrix has full rank. According to this, we have the following corollary.
Corollary 2.6**.**
Given a signature set F,
if we can use F to construct polynomially many distinct binary signatures gs=(0,1,ts,0)T, then for any finite signature set B consisting of binary signatures of the form (0,1,t′,0)T, we have
[TABLE]
In Lemma 6.4, we will show how to construct polynomially many distinct binary signatures gs=(0,1,ts,0)T using Möbius transformations [2].
A Möbius transformation of the extended complex plane C=C∪{∞},
the complex plane plus a point at infinity, is a rational function of the form z↦cz+daz+b of
a complex variable z,
where the coefficients a,b,c,d are complex numbers satisfying det[acbd]=ad−bc=0.
It is a bijective conformal map. In particular, a Möbius transformation mapping the unit circle S1={z∣∣z∣=1} to itself is of the form
φ(z)=eiθ1+αˉz(z+α) denoted by M(α,eiθ),
where ∣α∣=1, or φ(z)=eiθ/z.
When ∣α∣<1, it maps the interior of S1 to the interior, while when ∣α∣>1, it maps the interior of S1 to the exterior. A Möbius transformation is completely determined by its values on any 3 distinct points
of C.
An interpolation proof based on a lattice structure will be given in Lemma 6.1, where the following lemma is used.
Lemma 2.7**.**
[18*]**
Suppose α,β∈C−{0}, and
the lattice L={(j,k)∈Z2∣αjβk=1}
has the form L={(ns,nt)∣n∈Z}, where s,t∈Z
and (s,t)=(0,0). Let ϕ and ψ be any numbers
satisfying ϕsψt=1.
If we are given the values Nℓ=∑j,k≥0,\leavevmodej+k≤m(αjβk)ℓxj,k
for ℓ=1,2,…(2m+2),
then we can compute ∑j,k≥0,\leavevmodej+k≤mϕjψkxj,k
in polynomial time.
*
2.5 Tractable Signature Sets
We give some sets of signatures that are known
to define tractable counting problems.
These are called tractable signatures.
There are three families: affine signatures,
product-type signatures,
and
matchgate signatures [14].
Affine Signatures
Definition 2.8**.**
For a signature f of arity n,
the support of f is
[TABLE]
Definition 2.9**.**
A signature f(x1,…,xn) of arity n
is affine if it has the form
[TABLE]
where λ∈C,
X=(x1,x2,…,xn,1),
A is a matrix over Z2,
Q(x1,x2,…,xn)∈Z4[x1,x2,…,xn]
is a quadratic (total degree at most 2) multilinear polynomial
with the additional requirement that the coefficients of all
cross terms are even, i.e., Q has the form
[TABLE]
and χ is a 0-1 indicator function
such that χAX=0 is 1 iff AX=0.
We use A to denote the set of all affine signatures.
The following two lemmas follow directly from the definition.
Lemma 2.10**.**
Let g be a binary signature with support of size 4. Then, g∈A iff g has the signature matrix
M(g)=λ[iaicibid], for some nonzero λ∈C,
a,b,c,d∈N and
a+b+c+d≡0(mod2).
Lemma 2.11**.**
Let h be a unary signature with support of size 2. Then, h∈A iff h has the form
M(h)=λ[iaib], for some nonzero λ∈C,
and a,b∈N.
Product-Type Signatures
Definition 2.12**.**
A signature on a set of variables X
is of product type if it can be expressed as a
product of unary functions,
binary equality functions ([1,0,1]),
and binary disequality functions ([0,1,0]), each on one or two
variables of X.
We use P to denote the set of product-type functions.
Note that the variables of the functions in the product
need not be distinct. E.g., let
f(x,y,z)
be listed as
[f000f100f001f101f010f110f011f111]=[00a00b00].
f is the product of (=2)(x,y), (=2)(x,z)
and [a,b](x).
Let g be
[g00g10g01g11]=[0dc0], and
h(x,y,z,w)=f(x,y,z)g(z,w), sharing a variable z.
Then f,g,h∈P.
The following theorem is known [22],
since (=2)∈A∩P.
Theorem 2.13**.**
Let F be any set of complex-valued signatures in Boolean variables. If F⊆A
or F⊆P,
then Holant(=2∣F) is tractable.
Matchgate Signatures
Matchgates were introduced by Valiant [49, 48] to give polynomial-time algorithms for a collection of counting problems over planar graphs.
As the name suggests,
problems expressible by matchgates can be reduced to computing a weighted sum of perfect matchings.
The latter problem is tractable over planar graphs by Kasteleyn’s algorithm [34],
a.k.a. the FKT algorithm [46, 35].
These counting problems are naturally expressed in the Holant framework using matchgate signatures.
We use M to denote the set of all matchgate signatures;
thus Pl-Holant(M) is tractable, as well as
Pl-Holant(=2∣M).
The parity of a signature is even (resp. odd) if its support is on entries of even (resp. odd) Hamming weight.
We say a signature satisfies the even (resp. odd) Parity Condition
if all entries of odd (resp. even) weight are zero. For signatures of arity
at most 4, the matchgate signatures are characterized by the following lemma.
If f has arity ⩽3, then f∈M iff f satisfies
the Parity Condition.
If f has arity 4 and f satisfies the even Parity Condition, i.e.,
[TABLE]
then
f∈M iff
[TABLE]
By this matchgate identity, we have the following corollary.
Corollary 2.15**.**
Given a signature f of arity 4, two 2-by-2 matrices Dλ=[100λ](λ=0) and M(=2)=[0110], if f∈M, then Dλf and M(=2)f∈M.
Proof. Since f∈M, by Lemma 2.14 we know f satisfies the Parity Condition. We only consider that f satisfies even parity. The proof for f satisfying odd parity is similar and we omitted it here. Suppose f has the signature matrix
M(f)=[d00x0bz00cy0a00w]. Then, we have M(Dλf)=[d00λ2x0λ2bλ2z00λ2cλ2y0λ2a00λ4w] and M(M(=2)f)=[w00a0yc00zb0x00d].
Clearly, Dλf and M(=2)f also satisfy even parity.
Moreover, we have
[TABLE]
and
[TABLE]
Since
detMOut(f)=detMIn(f), we have
[TABLE]
That is, Dλf and M(=2)f∈M. ∎
Holographic transformations extend the reach of the FKT algorithm further,
as stated below.
By Definition 2.2, a signature set F is M-transformable
if there exists a T∈GL2(C) such that
(0,1,1,0)T⊗2∈M and T−1F⊆M.
Theorem 2.16**.**
Let F be any set of complex-valued signatures in Boolean variables.
If F is M-transformable,
then Pl-Holant(=2∣F) is tractable.
We will show that for the six-vertex model, M-transformable signatures are exactly characterized by M and M.
Recall the signature class M=H2M, where H2=21[111−1].
We first give the following simple lemma.
Lemma 2.17**.**
For any signature f with the signature matrix M(f)=[000x0bz00cy0a000], and a 2-by-2 matrix Dλ=[100λ], where λ=0, we have f∈M iff (Dλ)⊗4f∈M.
Proof. Note that M((Dλ)⊗4f)=[000λ2x0λ2bλ2z00λ2cλ2y0λ2a000]=λ2M(f). That is, (Dλ)⊗4f=λ2f. Thus, f∈M is equivalent to (Dλ)⊗4f=λ2f∈M. ∎
Lemma 2.18**.**
A signature f with the signature matrix M(f)=[000x0bz00cy0a000] is M-transformable iff f∈M.
Proof.
The reverse direction is obvious, since (=2)I⊗2∈M, and
(=2)H2⊗2∈M.
Suppose f is M-transformable. By definition, there is T∈GL2(C) such that
If λν=μξ=0, since T∈GL2(C), we have μ=ν=0 while λ,ξ=0, or λ=ξ=0 while μ,ν=0. That is,
T=[λ00ξ](λ,ξ=0), or T=[0νμ0](μ,ν=0). By normalization, we may assume λ=1 or μ=1. That is,
[TABLE]
For any α=0, we use Dα to denote the matrix [100α] and we know Dα−1=D1/α.
Then, T=Dξ or T=DνM(=2).
By Corollary 2.15, we know f∈M given T−1f∈M.
Otherwise, λξ+μν=0. Since T∈GL2(C), we know detT=λξ−μν=0. Thus, λξμν=0. By normalization, we may assume λ=1 and hence, ξ=−μν. That is
[TABLE]
Hence, we have T−1=21D1/μH2D1/ν and we know D1/μH2D1/νf∈M.
We have DμM=M.
Hence H2D1/νf∈M. Thus, we have
For signatures of special forms, we give the following three characterizations of M. They follow directly from the definition.
Lemma 2.19**.**
A binary
signature g with the signature matrix M(g)=[g00g10g01g11] is in M iff g00=ϵg11 and g01=ϵg10, where ϵ=±1.
Lemma 2.20**.**
A signature f with the signature matrix M(f)=[00000bz00cy00000] is in M iff b=ϵy and c=ϵz, where ϵ=±1.
Lemma 2.21**.**
If f has the signature matrix M(f)=[000x0b0000y0a000], where abxy=0, then f∈/M.
2.6 Known Dichotomies and Hardness Results
Definition 2.22**.**
A 4-ary signature is non-singular redundant iff in one of its four 4×4 signature matrices,
the middle two rows are identical and the
middle two columns are identical, and the determinant
If f is a non-singular redundant signature,
then Pl-Holant(=2∣f) is #P-hard.*
Theorem 2.24**.**
[38]**
Let G be a connected plane graph and EO(H) be the set of all Eulerian orientations
of the medial graph H=H(G) which is a 4-regular planar graph. Then
[TABLE]
where T is the Tutte polynomial,
and β(O) is the number of saddle vertices in the orientation O, i.e., vertices in
which the edges are oriented “in, out, in, out” in cyclic order.
Remark:
Note that ∑O∈EO(H)2β(O)
can be expressed as Pl-Holant(=2∣f) on H, where f
has the signature matrix
M(f)=[0001012002101000].
Therefore, Pl-Holant(=2∣f) is #P-hard.
Theorem 2.25**.**
[14]**
Let F be any set of complex-valued signatures in Boolean variables.
Then Pl-#CSP(F) is #P-hard unless
F⊆A,
F⊆P, or
F⊆M,
in which case the problem is computable in polynomial time. If F⊆A
or F⊆P, then #CSP(F) is computable in polynomial time without planarity; otherwise #CSP(F) is #P-hard.
Theorem 2.26**.**
[18]**
Let f be a 4-ary signature with the signature matrix
M(f)=[000x0bz00cy0a000], then
Holant(=2∣f) is #P-hard except for the following cases:
•
f∈P;
•
f∈A;
•
there is a zero in each pair (a,x),(b,y),(c,z);
in which cases Holant(=2∣f) is computable in polynomial time.
3 Main Theorem, Proof Outline and Sample Problems
Theorem 3.1**.**
Let f be a signature with the signature matrix
M(f)=[000x0bz00cy0a000], where a,b,c,x,y,z∈C.
Then Pl-Holant(=2∣f) is
polynomial time
computable in the following cases, and #P-hard otherwise:
f∈P* or A;*
2. 2.
There is a zero in each pair (a,x), (b,y), (c,z);
3. 3.
f∈M* or M;*
4. 4.
c=z=0* and*
(i).
(ax)2=(by)2, or
(ii).
x=aiα,b=aiβ, and y=aiγ,
where α,β,γ∈N, and β≡γ(mod2);
If f satisfies condition 1 or 2, then Holant(=2∣f) is computable in polynomial time without the planarity
restriction; otherwise (the non-planar) Holant(=2∣f) is #P-hard.
Let N be the number of zeros
among a,b,c,x,y,z. The following division of all cases
into Cases I, II, III and IV may not appear to be the most obvious, but it
is done to simplify the organization of the proof.
We define:
Case I:
There is exactly one zero in each pair.
Case II:
There is a zero pair.
Case III:
N=2 and having no zero pair,
or N=1 and the zero is in an outer pair.
Case IV:
N=1 and the
zero is in an inner pair, or N=0.
Cases I, II, III and IV are clearly disjoint.
To see that they cover all cases,
note that
if N⩾3, then
either there is a zero pair (in Case II), or
N=3 and each pair has exactly one zero (in Case I).
If N=2, then either it has a zero pair (in Case II), or
it has no zero pair (in Case III).
If N=1, then either the single zero is in an outer pair
(in Case III), or the single zero is in an inner pair (Case IV).
If N=0 it is in Case IV.
Also note that if N=2 and it has no zero pair,
then the two zeros are in different pairs,
which implies that there is a zero in an outer pair.
So in Case III, there is a zero in an outer pair regardless
N=1 or N=2. In Case III an outer pair
has exactly one zero, and the other two pairs together have at most one
zero.
In Case II, depending on whether the zero pair is
inner or outer we have two different connections to #CSP.
A previously established connection to
#CSP (see [18]) can be adapted in the planar setting to handle
the case with a zero outer pair.
This connection is a local transformation, and we
observe that it preserves planarity.
A significantly more involved non-local connection to #CSP is
discovered in this paper when the inner pair is zero (and no outer pair
is zero).
We show that by the support structure of the signature
we can define a set of
circuits, which
forms a partition of the edge set.
There are exactly two
valid configurations along each such circuit,
corresponding
to its two cyclic orientations.
These circuits may intersect in complicated ways, including
self-intersections. But we can define a #CSP problem,
where
the variables are these circuits, and their edge functions
exactly account for the intersections.
We show that
Pl-Holant(=2∣f) is equivalent to these
#CSP problems, which are
non-planar in general.
However, crucially, because
Pl-Holant(=2∣f) is planar, every two
such circuits must intersect an even number of times.
Due to the planarity of
Pl-Holant(=2∣f) we can exactly carve out
a new class of tractable problems via this non-local #CSP connection,
by the kind of constraint functions they produce in the #CSP problems.
For the proof of #P-hardness in this paper, one particularly difficult case is
in Lemma 6.4.
This is where we introduce Möbius transformations
to prove dichotomy theorems for counting problems.
In this case, all constructible binary
signatures correspond to points
on the unit circle S1, and any iteration of the construction
amounts to mapping this point by a Möbius transformation
which preserves S1.
The following is an outline on how Case I to Case IV are handled.
@slowromancapi@.
There is exactly one zero in each pair. In this case, Holant(=2∣f) is tractable, proved in
[18].
2. @slowromancapii@.
There is a zero pair:
An outer pair (a,x) or (b,y) is a zero pair. We prove that
Pl-Holant(=2∣f) is tractable if
f∈P,A,M or M,
and is #P-hard otherwise.
In this Case @[email protected], we can rotate the signature f such that the matrix MOut(f) is the zero matrix. Let M(fIn)=MIn(f)[0110].
We reduce Pl-#CSP(fIn)
to Pl-Holant(=2∣f) via a local replacement (Lemma 4.2).
We apply the dichotomy of Pl-#CSP to get #P-hardness (Theorem 4.3).
Tractability of Pl-Holant(=2∣f) follows from known tractable signatures.
2. 2.
The inner pair (c,z) is a zero pair and no outer pair is a zero pair.
We prove that
Pl-Holant(=2∣f) is #P-hard unless f
satisfies condition 4, in which case it is tractable.
This is the non-local reduction described above.
The tractable condition 4 is previously unknown.
(Curiously, in Case @[email protected], condition 4 subsumes
f∈M.)
3. @slowromancapiii@.
There are exactly two zeros and they are in different pairs;
2. 2.
There is exactly one zero and it is in an outer pair.
We prove that
Pl-Holant(=2∣f) is #P-hard unless f∈M, in which case it is tractable.
In Case III, there exists an outer pair which contains a single zero.
By connecting two copies of the
signature f, we can construct a 4-ary signature f1 such that one outer pair is a zero pair. When f∈/M, we can realize a signature M(g)=[0000001001000000] by interpolation using f1 (Lemma 5.1).
This g can help us “extract” the inner matrix of M(f).
By connecting f and g, we can construct a signature that belongs to Case @slowromancapii@.
We then prove #P-hardness using the result of Case @slowromancapii@ (Theorem 5.2).
4. @slowromancapiv@.
There is exactly one zero and it is in the inner pair;
2. 2.
All values in {a,x,b,y,c,z} are nonzero.
We prove that
Pl-Holant(=2∣f) is #P-hard unless f∈M, in which case it is tractable.
Assume f∈M.
The main idea is to use Möbius transformations.
However, there are some settings where we cannot do so, either because we don’t
have the initial signature to start the process, or the matrix that
would define the Möbius transformation is singular.
So we first treat the following two special cases.
•
If a=ϵx, b=ϵy and c=ϵz, where ϵ=±1, by interpolation based on a
lattice structure, either we can realize a non-singular redundant signature or
reduce from the evaluation of the Tutte polynomial at (3,3),
both of
which are #P-hard (Lemma 6.1).
•
If
det[bzcy]=0
or
det[aczx]=0,
then either we can realize a non-singular redundant signature or a signature that is #P-hard by Lemma 6.1 (Lemma 6.2).
If f does not belong to the above two cases, we
want to realize binary signatures of the form (0,1,t,0)T,
for arbitrary values of t. If this can be done,
by carefully choosing the values of t,
we can construct a signature that belongs to Case @slowromancapiii@
and it is #P-hard when f∈/M (Lemma 6.3).
We realize binary signatures by connecting f with (=2).
This corresponds naturally to a Möbius transformation.
By discussing the following different forms of binary signatures we get,
we can either realize arbitrary (0,1,t,0)T or a signature belonging
to Case @[email protected] that does not satisfy condition 4, therefore
is #P-hard (Theorem 6.8).
•
If we can get a signature of the form g=(0,1,t,0)T where t=0 is not a root of unity, then by connecting a chain of g, we can
get polynomially many distinct binary signatures gi=(0,1,ti,0)T.
Then, by interpolation, we can realize
arbitrary binary signatures of the form (0,1,t′,0)T.
•
Suppose we can get a signature of the form (0,1,t,0)T, where t=0 is an n-th primitive root of unity (n⩾5). Now, we only have n many distinct signatures gi=(0,1,ti,0)T.
But we can relate f to two Möbius transformations
due to
det[bzcy]=0
and
det[aczx]=0.
For each Möbius transformation φ, we can realize the signatures g=(0,1,φ(ti),0)T.
If ∣φ(ti)∣=0,1 or ∞ for some i,
then this is treated above, as this φ(ti) is
nonzero and not a root of unity.
Otherwise, since φ is a bijection on the extended complex plane
C, it can map at most two points of S1 to
[math] or ∞.
Hence, ∣φ(ti)∣=1 for at least three distinct ti.
But a Möbius transformation is determined by any three distinct points.
This implies that φ maps S1 to itself.
Such mappings φ have a known special form
eiθ1+αˉzz+α
(or eiθ/z, but the latter form actually
cannot occur in our context.)
By exploiting its property we can construct a signature f′ such that
its corresponding Möbius transformation φ′
defines an infinite group. This implies
that φ′k(t) are all distinct. Then, we can get polynomially
many distinct binary signatures (0,1,φ′k(t),0), and
realize arbitrary binary signatures of the form (0,1,t′,0)T (Lemma 6.4).
•
Suppose we can get a signature of the form (0,1,t,0)T where t=0 is an n-th primitive root of unity (n=3,4). Then we can either relate it to two Möbius transformations mapping the unit circle to itself, or
realize a double pinning (0,1,0,0)T=(1,0)T⊗(0,1)T (Corollary 6.5).
•
Suppose we can get a signature of the form (0,1,0,0)T.
By connecting f with it, we can get new signatures of the form (0,1,t,0)T. Similarly, by analyzing the value of t,
we can either realize arbitrary binary signatures of the form (0,1,s,0)T, or realize a signature that belongs to Case @[email protected], which is #P-hard (Lemma 6.6).
•
Suppose we can only get signatures of the form (0,1,±1,0). That implies a=ϵx, b=ϵy and c=ϵz, where ϵ=±1. This has been treated before.
As Case I has already been proved tractable in [18], we only deal
with Cases II, III and IV, and they
are each dealt with in the next three sections.
Before we start the proof, we first illustrate the scope of Theorem 3.1 by several concrete problems.
Problem\leavevmode1: #EO on 4-Regular Planar Graphs.
Input: A 4-regular planar graph G.
Output: The number of Eulerian orientations of G,
i.e., the number of orientations of G such that at every vertex
the in-degree and out-degree are equal.
This problem can be expressed as Pl-Holant(=2∣f), where f
has the signature matrix
M(f)=[0001011001101000].
Huang and Lu proved this problem is #P-complete [32].
Theorem 3.1 confirms this.
Problem\leavevmode2: Pl-T(G;3,3).
Input: A planar graph G.
Output: The value of the Tutte polynomial T(G;x,y) at (3,3).
Let Gm be the medial graph of G, then Gm is a 4-regular planar graph. By Theorem 2.24, we have
[TABLE]
where β(O) is the number of saddle vertices in the orientation O.
Note that ∑O∈EO(Gm)2β(O)
can be expressed as Pl-Holant(=2∣f), where f
has the signature matrix
M(f)=[0001012002101000].
Theorem 3.1 confirms that this problem is #P-hard.
Compared to the six-vertex model over general graphs,
the planar version has new tractable problems
due to the FKT algorithm under holographic transformations.
This tractable class can give highly nontrivial problems.
For example, we consider the following problem.
Problem\leavevmode3: SmallPell
Input: A planar 4-regular graph G and a 4-ary signature f, where f has the signature matrix
[TABLE]
Output: The evaluation of Pl-Holant(f) on G.
By the holographic transformation Z=21[1i1−i], we have
[TABLE]
where
[TABLE]
Since (32188120829134849,1819380158564160) is a solution of Pell’s equation
x2−313y2=1, we have f∈M
by Matchgate Identities [48].
By Theorem 3.1, Pl-Holant(f) can be computed in polynomial time.
In addition to matchgates and matchgates-transformable
signatures,
Theorem 3.1 gives a new class of
tractable problems on planar graphs.
They are provably not contained in any
previously known tractable classes.
For example, we consider the following problem.
Problem\leavevmode4: Pl-Holant(=2∣f), where f has the signature matrix M(f)=[00010i0000i01000].
Input: An instance of Pl-Holant(=2∣f).
Output: The evaluation of this instance.
By Theorem 3.1 (condition 4 (ii)),
Pl-Holant(=2∣f) can be computed in polynomial time.
Note that Holant(=2∣f) is #P-hard without the planar restriction.
It can be shown that f is neither in
M nor M-transformable.
By Lemma 2.14 we know f∈M, and by Lemma 2.21 we know f∈M. By Lemma 2.18, this implies f is neither in
M nor M-transformable.
Therefore,
the tractability is not derivable from the Kasteleyn’s algorithm
or a holographic transformation to it. Hence, condition 4 of
Theorem 3.1 defines a new component
of planar tractability complementing the Kasteleyn’s algorithm.
Furthermore, it is an essential component because
with it the picture is complete.
4 Case @slowromancapii@: One Zero Pair
If an outer pair is a zero pair, by rotational symmetry, we may assume (a,x) is a zero pair.
Definition 4.1**.**
Given a 4-ary signature f with the signature matrix
[TABLE]
we denote by fIn the binary signature with
M(fIn)=MIn(f)[0110]=[cybz].
Given a set F consisting of signatures of the form (4.1), we define FIn={fIn∣f∈F}.
Proof.
We adapt a proof
from [18], making sure that
the reduction preserves planarity.
This need to preserve planarity necessitates the twist introduced
in the definition of fIn and FIn.
We prove this reduction
in two steps.
In each step, we begin with a signature grid and end with a new signature grid such that the Holant values of both signature grids are the same.
For step one, let G=(U,V,E) be a planar bipartite graph
representing an instance of
[TABLE]
where each u∈U is a variable,
and each v∈V has degree two and is labeled by some fIn∈FIn.
We define a cyclic order of the edges incident to each vertex
u∈U, and split u into k=deg(u) vertices.
Then we connect the k
edges originally incident to u to these k new vertices so that each vertex is incident to exactly one edge. We also connect these k new
vertices in a cycle according to the cyclic order
(see Figure 4.1b).
Thus, in effect we have replaced u by a cycle of length k=deg(u).
(If k=1 then there is a self-loop. If k=2 then the cycle consists of
two parallel edges.)
Each of k vertices has degree 3, and we label them by (=3).
This defines a signature grid for a planar holant problem,
since the construction preserves planarity.
Also clearly this does not change the value of the partition function. The resulting graph has the following properties: (1) every vertex has either degree 2 or degree 3; (2) each degree 2 vertex is connected
to degree 3 vertices;
(3) each degree 3 vertex is connected to exactly one degree 2 vertex.
Now step two. For every v∈V, v has degree 2 and
is labeled by some fIn∈FIn.
We contract the two edges incident to v
to produce a new vertex v′.
The resulting graph G′=(V′,E′) is 4-regular and planar.
We put a node on every edge of G′ (these are all edges of the
cycles created in step one) and label it by (=2)
(see Figure 4.1c).
Next, we assign a copy of the corresponding f to every v′∈V′.
The input variables x1,x2,x3,x4 are carefully
assigned at each copy of f (as illustrated in Figure 4.2) such that there are exactly two configurations
to each original cycle, which correspond to cyclic orientations,
due to the (=2) on it and the
support set of f. These cyclic orientations correspond to the {0,1} assignments at
the original variable u∈U.
Under this one-to-one correspondence, the value of fIn is
perfectly mirrored by the value of f.
Therefore, we have
Pl-#CSP(FIn)⩽TPl-Holant(=2∣F).
There is also the possibility that the binary constraint
fIn is applied to a single variable, say w,
resulting in a unary constraint that takes value fIn(0,0)=c if w=0 and fIn(1,1)=z if w=1. To reflect that, we simply introduce a self-loop
on the cycle representing the variable w for every such occurrence,
as illustrated in Figure 4.3.
It is clear that the values c and z are perfectly mirrored by
the values that the local copy f takes under the two orientations
for the cycle corresponding to w=0 and 1.
∎
Theorem 4.3**.**
*Let f be a 4-ary signature of the form (4.1).
Then Pl-Holant(=2∣f)
is *#P-hard unless
f∈P, f∈A, or f∈M,
in which cases the problem is tractable.
Proof. Tractability follows from Theorems 2.13 and 2.16.
For any f of the form (4.1),
note that the support of f is contained in (x1=x2)∧(x3=x4).
We have
[TABLE]
where χ is the 0-1 indicator function.
Thus, fIn∈P or A is equivalent to f∈P or A.
In addition, by Lemmas 2.19 and 2.20, fIn∈M is equivalent to f∈M.
Therefore, if f∈/P,A or M, then fIn∈/P,A or M.
By Theorem 2.25, Pl-#CSP(fIn) is #P-hard, and then by Lemma 4.2, Pl-Holant(=2∣f) is #P-hard. ∎
Remark:
One may observe that if f∈M, then Pl-Holant(=2∣f) is also tractable as f and (=2) are both realized by matchgates. However, Theorem 4.3 already accounted for this case because for signature f of the form (4.1), f∈M implies f∈P.
Now, we consider the case that the inner pair is a zero pair and no outer pair is a zero pair.
Note that a signature in the form (4.2) has support
contained in (x1=x3)∧(x2=x4).
Definition 4.4**.**
Given a 4-ary signature f with the signature matrix
[TABLE]
where (a,x)=(0,0) and (b,y)=(0,0),
let Gf denote the set of all binary signatures gf of the form
[TABLE]
satisfying k=ℓ,
where k=∑i=14ki,ℓ=∑i=14ℓi
and k1,k2,k3,k4,ℓ1,ℓ2,ℓ3,ℓ4∈N.
Let Hf denote the set of all unary signatures hf of the form
[TABLE]
where m1,m2,m3,m4∈N.
Let k=k1=ℓ1=ℓ=1, we get a specific signature g1f∈Gf, with M(g1f)=[a2bybyx2]. Let k=k1=ℓ3=ℓ=1, we get
another specific signature g2f∈Gf, with
M(g2f)=[axy2b2ax].
Remark:
For any i,j∈{1,2,3,4}, let k=ki=ℓj=ℓ=1, we can get 16 signatures in Gf that have similar signature matrices to M(g1f) and M(g2f). For example,
Choosing k=k3=ℓ1=ℓ=1,
we get g2f′(x1,x2)
with the signature matrix
M(g2f′)=[axb2y2ax].
Indeed g2f′(x1,x2)=g2f(x2,x1).
In fact, Gf is the closure by the
Hadamard product (entry-wise product) of these 16 basic signature matrices.
**Proof. ** We divide the proof into two parts: We show the reduction (4.3) in Part @slowromancapi@, and the reduction (4.4) in Part @slowromancapii@.
Part @slowromancapi@:
Suppose Ω=(G,π) is a given instance
of Pl-Holant(=2∣f),
where G=(U,V,E) is a plane bipartite graph. Every vertex v∈V
has degree 4,
and we list its incident four edges in counterclockwise order.
Two edges both incident to a vertex v∈V
are called adjacent if they are adjacent in this cyclic order,
and non-adjacent otherwise.
Two edges in G are called 2-ary edge twins
if they are both incident to a vertex u∈U (of degree 2),
and 4-ary edge twins if they are non-adjacent
but both incident to a vertex v∈V (of degree 4).
Both 2-ary edge twins and 4-ary edge twins are called edge twins.
Each edge has a unique 2-ary edge twin at its endpoint in U of degree 2
and a unique 4-ary edge twin at its endpoint in V of degree 4.
The reflexive and transitive closure of the symmetric binary relation
edge twin forms a partition of E as an edge disjoint union
of circuits:
C1,C2,…,Ck.
Note that Ci may include repeated vertices called self-intersection vertices, but no repeated edges.
We arbitrarily pick an edge ei of Ci to be the leader edge
of Ci.
Given the leader edge ei=(u,v) of Ci, with u∈U and v∈V,
the direction from u to v
defines an orientation of the circuit Ci.
444This default orientation should not be confused with the
orientation in the proof of Lemma 4.2.
For any edge twins {e,e′}, this orientation defines one edge, say e′,
as the successor of the other if e′ comes right after e in the
orientation.
When we list the assignments of edges in a circuit,
we list successive values of successors, starting with the leader edge.
For any nonzero term in the sum
[TABLE]
the assignment of all edges σ:E→{0,1} can be
uniquely extended from its restriction on
leader edges σ′:{e1,e2,⋯ek}→{0,1}.
This is because the support of f is
contained in (x1=x3)∧(x2=x4).
Thus, at each vertex v∈V,
fv(σ∣E(v))=0 only if each pair of edge twins in E(v) is assigned value (0,1) or (1,0).
The same is true for any vertex u∈U of degree 2,
which is labeled (=2).
Thus, if the leader edge ei in Ci takes value [math]
or 1 respectively,
then all edges on Ci must
take values (0,1,0,1,⋯,0,1) or
(1,0,1,0,⋯,1,0) respectively
on successive successor edges,
starting with ei.
In particular, all pairs of 4-ary edge twins in Ci take assignment (0,1) when ei=0 and (1,0) when ei=1
(listing the value of the successor second).
Then, we have
[TABLE]
where σ′ denotes the unique extension of σ′.
For all 1≤i<j≤k,
let Vi,j=Ci∩Cj denote
the set of all intersection vertices between Ci and Cj. Denote by σ(ei,ej)′ an assignment
{ei,ej}→{0,1}.
Define a binary function gi,j on ei and ej as follows:
For any b,b′∈{0,1}, let
[TABLE]
where σ(ei,ej)′ is the unique extension
of σ(ei,ej)′
on the union of edge sets of Ci and Cj as described above,
and σ(ei,ej)′ is the unique assignment on
{ei,ej} such that ei↦b and ej↦b′.
Since all edges incident to vertices in Vi,j are either in Ci or Cj, the assignment values of these edges are determined by
σ(ei,ej)′.
Hence, gi,j is well-defined.
We show that gi,j∈Gf by induction on the number n of self-intersection vertices in Ci.
Note that in this proof, i and j (with i<j)
are not treated symmetrically.
For each vertex v∈Vi,j, consider the two pairs of edge twins
incident to it.
We label the edge twins in Ci by the variables (x1,x3)
such that x3 is the successor of x1 in the orientation of Ci.
Hence, for all v∈Vi,j,
these variables (x1,x3) take the same assignment (0,1) when ei=0 and (1,0) when ei=1.
Then, label the edge twins in Cj at v
by (x2,x4) so that the 4 edges at v are
ordered (x1,x2,x3,x4) in counterclockwise order.
This choice of (x2,x4) is unique given the labeling (x1,x3).
As we traverse Ci according to the orientation of Ci,
locally there is a notion of the left side of Ci.
At any vertex v∈Ci∩Cj,
if we take the traversal of Cj
according to the orientation of Cj,
it either
comes into or goes out of the left side of Ci.
We call v∈Ci∩Cj
of the former kind “entry-vertices”, and the latter
kind “exit-vertices” (see Figure 4.4).
At any entry-vertex v∈Vi,j,
the variable x4 is the successor of x2,
while at any exit-vertex x2 is the successor of x4.
Therefore, at entry-vertices, variables (x2,x4) take assignment (0,1)
when ej=0 and (1,0) when ej=1, while at exit-vertices they take assignment (1,0) and (0,1) respectively instead.
Table 4.1 summarizes the values of f and its rotated copies
at intersection vertices Vi,j.
According to the 4 different assignments of (ei,ej)
as listed in column 1 of the table, column 2 and column 7 (indexed by (x1,x2,x3,x4)) list the assignments of (x1,x2,x3,x4) at entry-vertices and exit-vertices separately. With respect to this local labeling of
(x1,x2,x3,x4), the signature f has four rotated forms:
[TABLE]
columns 3, 4, 5, 6 and columns 8, 9, 10, 11 list the corresponding values of
the signature f in four forms f, f2π, fπ and f23π respectively.
Suppose there are k1,k2,k3 and k4 many entry-vertices
assigned f, f2π, fπ, and f23π,
respectively,
and there are ℓ1,ℓ2,ℓ3 and ℓ4 many exit-vertices
assigned f2π, fπ, f23π and
f, respectively.
Then, according to the assignments of (ei,ej), the values of gi,j are listed in Table 4.2, and its signature matrix is given below:
[TABLE]
Our proof that gi,j∈Gf is based on the
assertion that the number of “entry-vertices” and “exit-vertices”
are equal, namely ∑i=14ki=∑i=14ℓi.
•
First, consider the base case n=0. That is, Ci is a simple cycle without self-intersection. By the Jordan Curve Theorem, Ci divides the plane
into two regions, an interior region and an exterior region.
In this case, as we traverse Ci according to the orientation of Ci,
the left side of the traversal is always the same region; we call
it Li
(which could be either the interior or the exterior region,
depending on the choice of the leader edge ei).
If we traverse Cj according to the orientation of Cj,
we enter and exit the region Li an equal number of times.
Therefore there is an equal
number of “entry-vertices” and “exit-vertices”.
Hence
∑i=14ki=∑i=14ℓi.
It follows that gi,j∈Gf by the definition
of Gf.
•
Inductively, suppose gi,j∈Gf holds for any circuit Ci with at most n self-intersections. Let Ci have n+1 self-intersections.
We decompose
Ci into two edge-disjoint circuits,
each of which has at most n self-intersections (See Figure 4.5).
Take any self-intersection vertex v∗ of Ci.
There are two pairs of 4-ary edge twins
{e,e′} and {e,e′},
where e′ is the successor of e
and e′ is the successor of e.
Note that e and e are oriented toward v∗,
and e′ and e′ are oriented away from v∗.
By the definition of edge twins,
{e,e} are adjacent,
and {e′,e′} are adjacent.
We can break Ci into two
oriented circuits Ci1 and Ci2,
by splitting v∗ into two vertices, and let e′ follow e
and let e′ follow e.
Let the mapping γ:[0,1]→R2,
such that γ(0)=γ(1/2)=γ(1)=v∗,
represent the traversal of Ci.
Then we can define two mappings γ1,γ2:[0,1]→R2,
such that γ1(t)=γ(t/2) and
γ2(t)=γ((t+1)/2).
Then {γ1,γ2} represent {Ci1,Ci2}
respectively.
It follows that Ci is the edge disjoint union of
Ci1 and Ci2 and they both inherit
the same orientation from that of Ci.
Any vertex in Vi,j is distinct from
a self intersection point of Ci
and thus Vi,j is a disjoint union
Vi,j1∪Vi,j2, where
Vi,j1=Ci1∩Cj and Vi,j2=Ci2∩Cj.
Since Ci1 inherits the orientation from Ci,
the orientation on Ci1 is consistent
with the orientation starting by choosing a leader edge on Ci1.
The same is true for the orientation on Ci2.
Thus, by induction, on each Ci1∩Cj and Ci2∩Cj
there are an equal number of
“entry-vertices” and “exit-vertices”.
Hence
∑i=14ki=∑i=14ℓi,
and so gi,j∈Gf, completing the
induction.
Let Vi be the set of all self-intersections of Ci.
Let σ(ei)′ denote the restriction of σ′ on {ei}.
Define a unary function hi on ei as follows:
For any b∈{0,1}, let
[TABLE]
where σ(ei)′ is the unique extension
of σ(ei)′ on the edge set of Ci,
and σ(ei)′ is the unique assignment on
{ei} such that ei↦b.
The assignment of those edges incident to vertices in Vi can be
uniquely extended from the assignment σ(ei)′.
Hence, hi is well-defined. We show that hi∈Hf.
For each vertex in Vi, since it is a self-intersection vertex, the two pairs of edge twins incident to it are both in Ci.
We still first
label each pair of edge twins by a pair of variables (x1,x3) obeying the orientation of Ci. That is, x3 is always the successor of x1.
Now by the definition of 4-ary edge twins,
the two edges labeled x1 are adjacent. Hence
at each vertex in Vi, starting from one x1,
the four incident edges are labeled by (x1,x1,x3,x3)
in counterclockwise order.
We pick the pair of variables (x1,x3) that appear in
the second and fourth positions in this listing
and change them to (x2,x4),
so that the four edges are now labeled by (x1,x2,x3,x4)
in counterclockwise order. Clearly, (x2,x4) and (x1,x3) take the same assignment. That is, at each vertex in Vi, the assignment of (x1,x2,x3,x4) is (0,0,1,1) when ei=0, and (1,1,0,0) when ei=1. Under this labeling, the
signature f still has four rotated forms. The values of
these four forms are listed in Table 4.3.
Suppose on Vi
there are m1,m2,m3 and m4 many vertices assigned f,
f2π, fπ and f23π respectively.
Then, we have
[TABLE]
It follows that hi∈Hf.
For any vertex v∈V, it is either in some Vi,j or some Vi.
Thus,
[TABLE]
where gi,j∈Gf and hi∈Hf.
Therefore, Pl-Holant(=2∣f)⩽T#CSP(Gf∪Hf).
Here, we give some examples for the reduction (4.3).
Example 1.
The signature grid Ω=(G,π)
for Pl-Holant(=2∣f) in Figure 4.6
has two circuits C1 (the Square) and
C2 (the Horizontal Eight) in G.
We have chosen (arbitrarily) a leader edge ei for
each circuit Ci.
In Figure 4.6 they are near the top left corner.
Given the leader, the direction from its endpoint of degree 2 to the endpoint of degree 4 gives a default orientation of the circuit.
Given a nonzero term in the sum Pl-HolantΩ, as a consequence of the support of f, the assignment of edges in each circuit is uniquely determined by the assignment of its leader. That is, any assignment of the leaders σ′:{e1,e2}→{0,1} can be uniquely extended to an assignment of all edges σ:E→{0,1} such that on each circuit
the values of 0,1 alternate.
Consider the signatures fv1,fv2,fv3 and fv4
on the intersection vertices
between C1 and C2.
Assume C1 does not have self-intersection (as is The Square);
otherwise, we will decompose C1 further and reason inductively.
Without self-intersection, C1 has an interior and exterior region
by the Jordan Curve Theorem.
For the chosen orientation of C1, its left side happens to
be the interior region.
With respect to C1, the circuit C2 enters and exits the interior of C1 alternately. Thus, we can divide the intersection vertices into an equal number of “entry-vertices” and “exit-vertices”.
In this example, fv1 and fv4 are on “entry-vertices”, while fv2 and fv3 are on “exit-vertices”.
By analyzing the values of each f when e1 and e2 take assignment [math] or 1, we can view each f as a binary constraint on (C1,C2).
Depending on the 4 different rotation forms of f and whether f is on “entry-vertices” or “exit-vertices”, the resulting binary constraint has 8 different forms (See Table 4.1). By multiplying these constraints, we get the binary constraint g1,2. This can be viewed as a binary edge function on the circuits C1 and C2. The property of g1,2
crucially depends on there are an equal number of “entry-vertices”
and “exit-vertices”.
For any b,b′∈{0,1},
[TABLE]
where σ(e1,e2)′ uniquely extends to C1 and C2
the assignment
σ(e1,e2)′(e1)=b and σ(e1,e2)′(e2)=b′.
If the placement of fv1 were to be rotated clockwise 2π,
then fv1 will be changed to fv12π in the above formula, where Mx1x2,x4x3(fv12π)=Mx2x3,x1x4(fv1).
For the self-intersection vertex fv5, the notions of “entry-vertex” and “exit-vertex” do not apply. fv5 gives rise to a unary constraint
h2 on e2. Depending on the 4 different rotation forms of f, h2 has 4 different forms (see Table 3).
For any b∈{0,1},
[TABLE]
where σ(e2)′ uniquely extends to C2
the assignment
σ(e2)′(e2)=b.
Therefore, we have
[TABLE]
Example 2.
Figure 4.7 is a more complicated example for the reduction (4.3). The graph in Figure 4.7 (a) is an instance of Pl-Holant(=2∣f), where all intersection points are degree 4 vertices labeled by f and we omit the degree 2 vertices labeled by =2.
This graph can be divided into 4 circuits colored by red, blue, purple and green (see Figure 4.7 (b)).
Each circuit can be viewed as a Boolean variable for a #CSP problem (see Figure 4.7 (c), edges are constraints).
The [math]-1 assignment of edges on a circuit Ci is uniquely determined
by the assignment of its leader edge ei, corresponding to two orientations of this circuit.
The binary constraint gi,j on Ci and Cj (for i<j) is determined by the placement of signatures f on intersection vertices between circuits Ci and Cj.
Part @slowromancapii@:
Suppose I is a given instance of #CSP(g1f,g2f).
Each constraint g1f and g2f is applied on certain
pairs of variables.
It is possible that they are applied to a single variable, resulting in two unary constraints. We will deal with such constraints later.
We first consider the case that every constraint is applied on two
distinct variables.
For any pair i<j,
consider all binary constraints on variables xi and xj(i<j).
Note that g1f is symmetric, that is, g1f(xi,xj)=g1f(xj,xi). We assume all the constraints between xi and xj
are: si,j many constraints g1f(xi,xj), ti,j many constraints g2f(xi,xj) and ti,j′ many constraints g2f(xj,xi).
Let gi,j(xi,xj) be the function product of these constraints. That is,
[TABLE]
Then, we have
[TABLE]
We prove the reduction (4.4) in two steps.
We first reduce #CSP(I) to both instances Ωi (for i=1,2)
of Pl-Holant(=2∣f,χi) respectively, where χ1=[0001010000101000] and χ2=[000−1010000101000]. The instance Ωi is constructed as follows:
Draw a cycle D1, i.e.,
a homeomorphic image of S1, on the plane.
For 2⩽j⩽k successively draw cycles Dj,
and for all 1⩽i<j
let Dj intersect
transversally with Di at least 2(si,j+ti,j+ti,j′) many times.
This can be done since we can let Dj enter and exit the interiors of Di
successively.
A concrete realization is as follows:
Place k vertices Di on a semi-circle in the order of i=1,…,k.
For 1⩽i<j⩽k,
connect Di and Dj by a straight line segment Lij.
Now thicken each vertex Di into a small disk, and deform
the boundary circle of Dj so that, for every 1⩽i<j,
it reaches across to Di along the
line segment Lij, and intersects the boundary circle of Di
exactly
2(si,j+ti,j+ti,j′) many times. (There are
also other intersections between these cycles Di’s due to
crossing intersections between those line segments. This is why we
say “at least” this many intersections in the overall description.
We will deal with those extra intersection vertices later.)
We can draw these cycles to satisfy the following conditions:
a.
There is no self-intersection for each Di.
2. b.
Every intersection point is between exactly two cycles.
They intersect transversally. Each intersection creates a vertex of degree 4.
These intersecting cycles define a planar 4-regular graph G′,
where intersection points are the vertices.
2. 2.
Replace each edge of G′ by a path of length two.
We get a planar bipartite graph G=(V,E). On one side, all vertices have degree 2, and on the other side, all vertices have degree 4. We can still define edge twins as in Part @slowromancapi@. Moreover, we still divide the graph into some circuits C1,…,Ck. In fact, Ci is just the cycle Di after the replacement of each edge by a path of length two.
Let Vi,j=Ci∩Cj(i<j) be the intersection vertices between
Ci and Cj. Clearly, ∣Vi,j∣ is even and
at least 2(si,j+ti,j+ti,j′). Since there is no self-intersection, each circuit is a simple cycle.
As we did in Part @slowromancapi@, we pick an edge ei as the leader edge of Ci and this gives an orientation of Ci.
We can define “entry-vertices” and “exit-vertices” as in Part @slowromancapi@. Among Vi,j, half are entry-vertices and the other half are exit-vertices.
(This notion is defined in terms of Cj with respect to Ci;
the roles of i and j are not symmetric.)
List the edges in Ci according to the orientation of Ci
starting with the leader edge ei.
After we place copies of f on each vertex,
the support of f, which is contained in (x1=x3)∧(x2=x4), ensures that every 4-ary twins can only take
values (0,1) or (1,0), since the 4-ary twin edges are non-adjacent.
Then all edges in Ci can only take assignment
(0,1,0,1,⋯,0,1) when ei=0
and (1,0,1,0,⋯,1,0) when ei=1.
3. 3.
Label all vertices of degree 2 by (=2).
For any vertex in Vi,j (i<j), as we showed in Part @slowromancapi@, we can label the four edges incident to it by variables (x1,x2,x3,x4) in a way such that when σ′:(ei,ej)↦(b,b′)∈{0,1}2,
we have (x1,x2,x3,x4)=(b,b′,1−b,1−b′) at any entry-vertex, and (x1,x2,x3,x4)=(b,1−b′,1−b,b′) at any exit-vertex (See Table 4.1).
Note that f has four rotation forms under this labeling.
We have (at least) si,j+ti,j+ti,j′ many entry-vertices
and as many exit-vertices.
Let Vi,j′ be the set of these 2(si,j+ti,j+ti,j′) vertices.
For vertices in Vi,j′,
we label si,j many entry-vertices by f and si,j many exit-vertices by f2π, ti,j many entry-vertices by f and ti,j many exit-vertices by f23π, and ti,j′ many entry-vertices by fπ and ti,j′ many exit-vertices by f2π.
Refer to Table 4.2, this choice amounts to taking
[TABLE]
and all other ki, ℓi’s equal to 0.
Recall that g1f(x1,x2) corresponds to choosing
k1=ℓ1=1 and the others all 0,
g2f(x1,x2) corresponds to choosing
k1=ℓ3=1 and the others all 0,
and g2f(x2,x1) corresponds to choosing
k3=ℓ1=1 and the others all 0,
then we have
[TABLE]
For all vertices in Vi,j\Vi,j′, if
we label them by an auxiliary signature χ1=[0001010000101000], then, referring to Table 4.2 (Here a=x=b=y=1), we have
[TABLE]
for all assignments σ′ on {ei,ej}.
We can also label the vertices in Vi,j\Vi,j′ by
an auxiliary signature χ2=[000−1010000101000].
By our (semi-circle) construction,
in Vi,j\Vi,j′, the number of entry-vertices is equal to the number of exit-vertices. We label all entry-vertices by χ2 and label all exit-vertices by its rotated form χ22π=[0001010000−101000]. Refer to Table 4.2 (here a=b=y=1,x=−1, and k=k1=ℓ1=ℓ, and the crucial equation is gi,j(1,1)=xk1+ℓ1=(−1)2=1), we have
[TABLE]
for all assignments σ′ on {ei,ej}.
Then, consider the case that g1f and g2f are applied to
the pair variables (w,w),
in which case g1f and g2f effectively become
unary constraints [a2,x2] and [ax,ax] on the variable xi. The latter
is a constant multiple of [1,1] and can be ignored.
The unary constraint [a,x], and hence also [a2,x2],
can be easily realized by f
in Pl-Holant(=2∣f,χi), by creating a self-loop
for the cycle representing the variable w, denoted by Cw (See Figure 4.8).
Note that the self-loop is created locally on the cycle Cw such that it does not affect other cycles.
As we did in Part I, we label the four edges incident to a self-intersection vertex by (x1,x2,x3,x4) such that x3 is the successor of x1 and x4 is the successor of x2 depending on the default orientation of Cw, and (x1,x2,x3,x4) are labeled in counterclockwise order.
Then, we have (x1,x3)=(x2,x4)=(0,1) when w=0 and (1,0) when w=1.
That is, g1f(0,0)=a2=f00112 and g1f(1,1)=x2=f11002.
Now, we get an instance Ωs(s=1,2) for each problem Pl-Holant(=2∣f,χs) respectively. Note that χs has the support (x1=x3)∧(x2=x4)
as f. As we have showed in Part @slowromancapi@,
for any nonzero term in the sum Pl-HolantΩs, the assignment of all edges σ:E→{0,1} can be uniquely extended from the assignment of all leader edges σ′:{e1,e2,…,ek}→{0,1}. Therefore, we have
[TABLE]
for s=1,2.
That is, #CSP(g1f,g2f)⩽TPl-Holant(=2∣f,χs),
(s=1,2).
From the hypothesis of the reduction (4.4), we have a=±x=0,b=±y=0, and (b/a)8=1.
We show by interpolation
[TABLE]
when a=ϵx,b=ϵy,
and
[TABLE]
when a=ϵx,b=−ϵy, where ϵ=±1.
•
If a=x and b=y, since they are all nonzero, and (ab)8=1, by normalization we may assume M(f)=[00010b0000b01000], where b=0 and b8=1.
If b is not a root of unity, by Lemma 2.3, we have Pl-Holant(=2∣f,χ1)⩽TPl-Holant(=2∣f).
Otherwise, b is a root of unity. Construct
a gadget f⊠ as shown in Figure 4.9.
Given an assignment (x1,x2,x3,x4) to f⊠,
and suppose f⊠(x1,x2,x3,x4)=0.
Then because of the support of fv1,fv5 and fv3
we must have x1=x3. Similarly x2=x4.
Also fv5 receives the same input as f⊠.
Hence the support of f⊠ is
contained in (x1=x3)∧(x2=x4),
i.e., contained in {(0,0,1,1),(1,1,0,0),(0,1,1,0),(1,0,0,1)}.
In particular,
the edges on each Diagonal Line of this gadget can only take assignments
(0,1,0,1,0,1) or (1,0,1,0,1,0), otherwise the we get zero.
On the other hand,
the Square cycle in this gadget is a circuit itself, so that
the edges in it can only take two assignments (0,1,0,1,0,1,0,1) or (1,0,1,0,1,0,1,0). We simplify the notation to (0,1) and (1,0)
respectively.
On (x1=x3)∧(x2=x4),
the value of f⊠ is the sum over these two terms.
For the signature f, if one pair of its edge twins flips its assignment between (0,1) and (1,0), then the value of f changes from 1 to b, or
from b to 1. If both pairs of edge twins flip their assignments,
then the value of f does not change.
According to this property, we give the Table 4.4.
Here, we place a suitably rotated copy of f
at vertices vi to get fvi (for 1≤i≤5)
so that
the values of fvi are all 1 under the assignment (x1,x2,x3,x4)=(0,0,1,1) and the Square is assigned =(0,1) (row 2 of Table 4.4).
When the assignment of Square flips from (0,1) to (1,0), one pair of edge twins of each vertex except v5 flips its assignment.
So the values of f on these vertices except v5 change from 1 to b (row 3).
When (x1,x3) flips its assignment, one pair of edge twins of v1,v3 and v5 flip their assignments.
When (x2,x4) flips its assignment, one pair of edge twins of v2,v4 and v5 flip their assignments. Using this fact, we get other rows correspondingly.
Hence, f⊠ has the signature matrix
M(f⊠)=[0001+b402b300002b301+b4000]. Since b8=1, we have
1+b4=0, by normalization we can write
M(f⊠)=000101+b42b300001+b42b301000.
Since ∣b∣=1 and b4=1, we have ∣1+b4∣<2. Then ∣1+b42b3∣>∣b3∣=1, which means 1+b42b3 is not a root of unity.
By Lemma 2.3, we have Pl-Holant(=2∣f,χ1)⩽TPl-Holant(=2∣f,f⊠). Since f⊠ is constructed by f, we have Pl-Holant(=2∣f,χ1)⩽TPl-Holant(=2∣f).
•
If a=−x and b=−y, then M(f)=[000−a0b0000−b0a000]. Connect the variable x4 with x3 of f using (=2), and we get a binary signature g′, where
[TABLE]
Since b=0, g′ can be normalized as (0,1,−1,0)T.
Modifying x1=1 of f by −1 scaling, we get a signature f′ with the signature matrix
M(f′)=[000a0b0000b0a000].
As we have proved above, Pl-Holant(=2∣f,χ1)⩽TPl-Holant(=2∣f,f′). Since f′ is constructed by f, we have Pl-Holant(=2∣f,χ1)⩽TPl-Holant(=2∣f).
•
If a=−x, b=y or a=x, b=−y, by normalization and rotational symmetry, we may assume M(f)=[000−10b0000b01000], where b=0 and b8=1.
If b is not a root of unity, by Corollary 2.4, we have Pl-Holant(=2∣f,χ2)⩽TPl-Holant(=2∣f).
Otherwise, b is a root of unity.
Construct the gadget f⊠ in the same way as shown above.
Our discussion on the support of f⊠ still holds:
It is contained in (x1=x3)∧(x2=x4);
on (x1,x2,x3,x4) with (x1=x3)∧(x2=x4),
fv5 receives the same input, and the value of f⊠
is the sum over two assignments (0,1) and (1,0) for the
Square.
For the signature f, if one pair of its edge twins flips its assignment between (0,1) and (1,0),
then the value of f changes from ±1 to b, or b to ∓1.
If two pairs of edge twins both flip their assignments, then the value of f
does not change if the value is b, or changes its sign if the value is ±1.
According to this property, we have the following Table 4.5.
Here, we place a suitably rotated copy of f
at vertices vi to get fvi (for 1≤i≤5)
so that
the values of fvi are all 1 under the assignment
(x1,x2,x3,x4)=(0,0,1,1) and the Square is assigned =(0,1)
(row 2 of Table 4.5).
When the assignment of Square flips from (0,1) to (1,0),
one pair of edge twins at each vertex except v5 flips its assignment.
So the values of f at these vertices except v5 change from 1 to b (row 3).
When (x1,x3) flips its assignment, one pair of edge twins at v1,v3 and v5 flips their assignments.
When (x2,x4) flips its assignment, one pair of edge twins at v2,v4 and v5 flips their assignments.
Using this fact, we get other rows correspondingly.
Hence, f⊠ has the signature matrix
[000−(1+b4)02b300002b301+b4000].
Since ∣b∣=1 and b8=1, we have b4=±1,
therefore 0<∣1+b4∣<2, and so
1+b42b3 is not a root of unity.
By Corollary 2.4, Pl-Holant(=2∣f,χ2)⩽TPl-Holant(=2∣f,f⊠), and hence Pl-Holant(=2∣f,χ2)⩽TPl-Holant(=2∣f).
In summary, we have
[TABLE]
Therefore, we have #CSP(g1f,g2f)⩽TPl-Holant(=2∣f) when a2=x2=0, b2=y2=0 and (ab)8=1.
∎
Remark:
A crucial point in the reduction (4.3)
is the fact that the given instance graph G of Pl-Holant(=2∣f)
is planar so that ∑iki=∑iℓi. Otherwise
this does not hold in general; for example the
latitudinal and longitudinal
closed cycles on a torus intersect at a single point.
The equation ∑iki=∑iℓi is crucial
to obtain tractability in the following theorem.
Theorem 4.6**.**
Let f be a 4-ary signature of the form (4.2),
where (a,x)=(0,0) and (b,y)=(0,0).
Then Pl-Holant(=2∣f)
is #P-hard unless
(i)
(ax)2=(by)2, or
2. (ii)
x=aiα,b=aiβ,y=aiγ, where α,β,γ∈N, and β≡γ(mod2),
in which cases, the problem is tractable in polynomial time.
Proof of Tractability:
•
In case (i), if ax=by=0, then f has support of size at most 2. So we have f∈P, and hence Pl-Holant(=2∣f) is tractable by Theorem 2.13.
Otherwise, (ax)2=(by)2=0.
For any signature g in Gf,
we have g00⋅g11=(ax)k1+ℓ1+k3+ℓ3(by)k2+ℓ2+k4+ℓ4 and g01⋅g10=(ax)k2+ℓ2+k4+ℓ4(by)k1+ℓ1+k3+ℓ3.
Since (k1+ℓ1+k3+ℓ3)−(k2+ℓ2+k4+ℓ4)≡k+ℓ≡0(mod2),
we have
[TABLE]
That is, g∈P. Since any signature h in Hf is unary, h∈P. Hence, we have Gf∪Hf⊆P. By Theorem 2.25, #CSP(Gf∪Hf) is tractable.
By reduction (4.3) of Lemma 4.5, we have Pl-Holant(=2∣f) is tractable.
•
In case (ii), for any signature g∈Gf
defined in Definition 4.4, M(g) is of the form
[TABLE]
where p00,p01,p10 and p11 denote the integer
exponents of i in the respective entries of g.
Since β≡γ(mod2), if they are both even,
then p00≡p01≡p10≡p11≡0(mod2);
if they are both odd, then p00≡p11≡k2+ℓ2+k4+ℓ4≡k1+ℓ1+k3+ℓ3≡p01≡p10(mod2).
If these exponents are all odd, we can take out a i.
Hence, g is of the form a′(iq00,iq01,iq10,iq11)T, where a′=ak+ℓ or ak+ℓi,
and either qij=2pij for all i,j∈{0,1} are integers,
or qij=2pij−1 for all i,j∈{0,1} are integers. Thus,
[TABLE]
Moreover, since p00+p01+p10+p11=(k+ℓ)(β+γ+2α)≡0(mod4), using the assumption that
β≡γ(mod2) and k≡ℓ(mod2),
we conclude that q00+q01+q10+q11≡0(mod2).
Therefore, g∈A by Lemma 2.10.
In this case, for any signature h in Hf, M(h) is of the
form
[TABLE]
Since β≡γ(mod2), we
have βm4+γm2≡βm2+γm4(mod2). Hence, h is of the
form a′′[iq0,iq1],
for some integers q0,q1, where a′′=am or ami. That is, h∈A by Lemma 2.11. Hence, Gf∪Hf⊆A.
By Theorem 2.25, #CSP(Gf∪Hf) is tractable.
By reduction (4.3) of Lemma 4.5, we have Pl-Holant(=2∣f) is tractable.
Proof of Hardness: We are given that f does not belong to
case (i) or case (ii).
Note that Mx4x1,x3x2(f)=[000y0x0000a0b000]
and Mx2x3,x1x4(f)=[000b0a0000x0y000].
Connect variables x3,x2 of a copy of the signature f with variables
x2,x3 of another copy of signature f both using (=2). We get a signature f1 with the signature matrix
[TABLE]
Similarly, connect x3,x2 of a copy of signature f with x4,x1 of another copy of signature f both using (=2). We get a signature f2 with the signature matrix
[TABLE]
Notice that M(f12π)=[00000byx200a2by00000],
M(f22π)=[00000b2ax00axy200000], M(g1f)=[a2bybyx2] and M(g2f)=[axy2b2ax].
Recall that M(fi2πIn)=MIn(fi2π)[0110].
We have gif=fi2πIn.
That is, fi(x1,x2,x3,x4)=gif(x2,x4)⋅χx1=x4⋅χx2=x3. Now, we analyze g1f and g2f.
•
If {g1f,g2f}⊆P, then either (ax)2=(by)2 if
either signature is degenerate, or g1f and g2f are
each generalized Equality or generalized Disequality respectively.
In the latter case, since (a,x)=(0,0) and (b,y)=(0,0), it forces that ax=by=0. So we still have (ax)2=(by)2. That is, {a,b,x,y} belongs to case (i). A contradiction.
•
If {g1f,g2f}⊆A, there are two subcases.
Note that the support of a function in A has size a power of 2.
–
If both g1f and g2f have support of size at most 2, then we have ax=by=0 due to (a,x)=(0,0) and (b,y)=(0,0). This
belongs to case (i). A contradiction.
–
Otherwise, at least one of g1f or g2f
has support of size 4. Then abxy=0
and therefore both g1f and g2f
have support of size 4. Let x′=ax,b′=ab and y′=ay. By normalization, we have
[TABLE]
Since g1f∈A, by Lemma 2.10, x′2 and b′y′ are both powers of i, and the sum of all exponents is even. It forces that x′2=i2α for some α∈N. Then, we can choose α such x′=iα. Also, we have
[TABLE]
Since g2f∈A and x′ is already a power of i, y′2 and b′2 are both powers of i. That is, b′=iβ and y′=iγ.
Also, since g1f∈A, b′y′=iβ+γ is a power of i, which means β≡γ(mod2). That is, {a,b,x,y}
belongs to case (ii). A contradiction.
•
If {g1f,g2f}⊆M,
then by Lemma 2.19, we have
both a2=ϵx2,by=ϵby
and ax=ϵ′ax,y2=ϵ′b2, for some
ϵ,ϵ′∈{1,−1}.
If ϵ=−1 then by=0, and then by the second set
of equations b=y=0, contrary to assumption that (b,y)=(0,0).
So ϵ=1.
Similarly ϵ′=1. Hence
[TABLE]
and it also follows that all 4 entries are nonzero.
Therefore, if {a,b,x,y} does not satisfy (4.5)
then {g1f,g2f}⊈P,A or M. By Theorem 2.25, Pl-#CSP(g1f,g2f) is #P-hard. Then by Lemma 4.2,
Pl-Holant(=2∣f12π,f22π) is #P-hard,
and hence Pl-Holant(=2∣f) is #P-hard.
Otherwise, the 4 nonzero entries
{a,b,x,y} satisfy (4.5).
If (ab)8=1, i.e.,
b=aiβ
for some γ∈N, then
x=±a=aiα,
and y=±b=aiβ+4δ for some α,δ∈N.
It follows that {a,b,x,y} satisfies (ii),
a contradiction.
So (ab)8=1, and we can apply reduction (4.4)
of Lemma 4.5.
By the reduction (4.4),
we have #CSP(g1f,g2f)⩽TPl-Holant(=2∣f).
Moreover, since {a,b,x,y} does not belong to case (i) or case (ii), we have {g1f,g2f}⊈P or A. By Theorem 2.25, #CSP(g1f,g2f) is #P-hard.
Therefore, we have Pl-Holant(=2∣f)
is #P-hard. ∎
Corollary 4.7**.**
Let f be a 4-ary signature of the form (4.2),
where (a,x)=(0,0) and (b,y)=(0,0).
If ∣ax∣=∣by∣ then
Pl-Holant(=2∣f)
is #P-hard.
5 Case @slowromancapiii@: N=2 with No Zero Pair or
N=1 with Zero in an Outer Pair
If there are exactly two zeros N=2 with no zero pair,
then the two zeros
are in different pairs, at least one of them must be in an outer pair.
So in Case @slowromancapiii@ there is a zero in an outer pair
regardless N=1 or N=2.
By rotational symmetry, we may assume a=0,
and we prove this case in Theorem 5.2.
We first give the following lemma.
Lemma 5.1**.**
Let f be a 4-ary signature with the signature matrix
M(f)=[00000bz00cy00000], where detMIn(f)=by−cz=0.
Let g be a 4-ary signature with the signature matrix
M(g)=[0000001001000000].
Then for any signature set F containing f, we have
[TABLE]
Proof.
We construct a series of gadgets fs by a chain of s copies of f linked by double Disequality N. fs has the signature matrix
[TABLE]
The inner matrix of
NM(f) is NInMIn(f)=[zbyc].
Suppose its spectral decomposition is Q−1ΛQ, where Λ=[λ10μλ2] is the Jordan Canonical Form.
Note that λ1λ2=detΛ=det(NInMIn(f))=0.
We have M(fs)=NP−1ΛsP, where
[TABLE]
Suppose μ=0, and λ1λ2 is a root of unity,
with (λ1λ2)n=1. Then
Λn=[00000λ1n0000λ2n00000]=[00000λ1n0000λ1n00000], and
M(fn)=[000000λ1n00λ1n000000]=λ1n[0000001001000000]. After normalization, we can realize the signature g.
2. 2.
Suppose μ=0, and λ1λ2 is not a root of unity. The matrix Λs=[00000λ1s0000λ2s00000] has a good form for interpolation.
Suppose g appears m times in an instance Ω of Pl-Holant(=2∣F∪{g}).
Replace each appearance of g by a copy of the gadget fs to get an instance Ωs of Pl-Holant(=2∣F∪{fs}), which is also an instance of Pl-Holant(=2∣F).
We can treat each of the m appearances of fs as a new gadget composed of four functions in sequence
N, P−1, Λs and P, and denote this new instance by Ωs′.
We divide Ωs′ into two parts.
One part consists of m signatures Λs⊗m. Here Λs⊗m is expressed as a column vector.
The other part is the rest of Ωs′ and its signature is represented by A which is a tensor expressed as a row vector.
Then the Holant value of Ωs′ is the dot product ⟨A,Λs⊗m⟩, which is a summation over 4m bits. That is, the value of the 4m edges connecting the two parts.
We can stratify all 0,1 assignments of these 4m bits having a nonzero evaluation of a term in Pl-HolantΩs′ into the following categories:
•
There are i many copies of Λs receiving inputs 0110;
•
There are j many copies of Λs receiving inputs 1001;
where i+j=m.
For any assignment in the category with parameter (i,j), the evaluation of
Λs⊗m is clearly λ1siλ2sj=λ1sm(λ1λ2)sj.
Let aij be the summation of values of the part A over all assignments in the category (i,j). Note that aij is independent from the value of s since we view the gadget Λs as a block.
Since i+j=m, we can denote aij by aj. Then we rewrite the dot product summation and get
[TABLE]
Note that M(g)=NP−1(NM(g))P, where NM(g)=[0000010000100000].
Similarly, divide Ω into two parts. Under this stratification, we have
[TABLE]
Since λ1λ2 is not a root of unity,
the Vandermonde coefficient matrix
[TABLE]
has full rank,
where ρ=λ1λ2.
Hence, by oracle querying the values of Pl-HolantΩs, we can solve for aj, and thus
obtain the value of Pl-HolantΩ in polynomial time.
3. 3.
Suppose μ=1, and λ1=λ2 denoted by λ. Then Λs=[00000λs000sλs−1λs00000]. We use this form to give a polynomial interpolation. As in the case above, we can stratify the assignments of Λs⊗m of these 4m bits having a nonzero evaluation of a term in Pl-HolantΩs′ into the following categories:
•
There are i many copies of Λs receiving inputs 0110 or 1001;
•
There are j many copies of Λs receiving inputs 0101;
where i+j=m.
For any assignment in the category with parameter (i,j), the evaluation of
Λs⊗m is clearly λsi(sλs−1)j=λsm(λs)j.
Let aij be the summation of values of the part A over all assignments in the category (i,j). aij is independent from s. Since i+j=m, we can denote aij by aj. Then, we rewrite the dot product summation and get
[TABLE]
for s≥1.
We consider this as a linear system for 1≤s≤m+1.
Similarly, divide Ω into two parts.
Under this stratification, we have
[TABLE]
The Vandermonde coefficient matrix
[TABLE]
has full rank, where
ρs=s/λ are all distinct.
Hence, we can solve a0 in polynomial time and it is the value of Pl-HolantΩ.
Therefore, we have Pl-Holant(=2∣F∪{g})⩽TPl-Holant(=2∣F). ∎
Let f be a 4-ary signature with the signature matrix
M(f)=000x0bz00cy00000,
where x=0 and there is at most one number in {b,c,y,z} that is [math].
Then Pl-Holant(=2∣f)
is #P-hard unless
f∈M,
in which case the problem is tractable.
Suppose f∈/M. By Lemma 2.14, detMIn(f)=detMOut(f)=0, that is det[bzcy]=by−cz=0.
Note that Mx1x2,x4x3(f)=[000x0bz00cy00000], Mx3x4,x2x1(f)=[00000yz00cb0x000], and Mx2x3,x1x4(f)=[000b00c00zx0y000].
Connect variables x4,x3 of a copy of signature f with variables x3,x4 of another copy of signature f both using (=2). We get a signature f1 with the signature matrix
[TABLE]
where [b1z1c1y1]=[bzcy]⋅[zybc].
This f1 has the form in Lemma 5.1.
Here, det[b1z1c1y1]=−(by−cz)2=0.
By Lemma 5.1, we have
[TABLE]
where g has the signature matrix M(g)=[0000001001000000].
•
If bcyz=0, connect variables x1,x4 of signature f with variables x1,x2 of signature g both using (=2).
We get a signature f2 with the signature matrix
[TABLE]
•
Otherwise, connect variables x4,x3 of signature f with variables x1,x2 of signature g both using (=2).
We get a signature f2 with the signature matrix
[TABLE]
and there is exactly one entry in {b,c,y,z} that is zero.
In both cases, the support of f2 has size 3, which means f2∈/P,A or M.
By Theorem 4.3, Pl-Holant(=2∣f2) is #P-hard.
Since
[TABLE]
we have Pl-Holant(=2∣f) is #P-hard. ∎
6 Case @slowromancapiv@: N=1 with Zero in the
Inner Pair or N=0
By rotational symmetry, if there is one zero in the inner pair, we may
assume it is c=0, and abxyz=0. We first consider the case
that x=ϵa,y=ϵb and z=ϵc, where ϵ=±1.
Lemma 6.1**.**
Let f be a 4-ary signature with the signature matrix
[TABLE]
Then Pl-Holant(=2∣f) is #P-hard if f∈/M.
Proof.
If ϵ=−1 we first transform the case to ϵ=1
as follows. Connecting the variable x4 with
x3 of f using (=2) we get a binary signature g1, where
[TABLE]
Also connecting the variable x1 with x2 of f using (=2) we get a binary signature g2, where
[TABLE]
Since bc=0, b+c and b−c cannot be both zero. Without loss of generality, suppose b+c=0. By normalization, we have g1=(0,1,−1,0)T.
Then, modifying x1=1 of f with −1 scaling we get a signature with the signature matrix [000a0bc00cb0a000]. Therefore, it suffices to show #P-hardness for the case ϵ=1.
Since f∈/M, by Lemma 2.14, c2−b2=a2. We prove #P-hardness in the following three Cases depending on the values of a,b and c.
Case 1: Either
c2−b2=0 and ∣c+b∣=∣c−b∣, or c2−a2=0 and ∣c+a∣=∣c−a∣.
By rotational symmetry, we may assume c2−b2=0 and ∣c+b∣=∣c−b∣.
We may normalize a=1 and assume
M(f)=[00010bc00cb01000], where c2−b2=0 or 1.
We construct a series of gadgets fs by a chain of s copies of f linked by double Disequality N. fs has the signature matrix
[TABLE]
We diagonalize [cbbc]s using
H=21[111−1] (note that H−1=H),
and get M(fs)=NPΛsP, where
[TABLE]
The signature matrix Λs has a good form for polynomial interpolation.
In the following, we will
reduce Pl-Holant(=2∣f^)
to Pl-Holant(=2∣f),
for suitably chosen M(f^)=[00010b^c^00c^b^01000],
and use that to prove that Pl-Holant(=2∣f)
is #P-hard.
Suppose f^ appears m times in an instance Ω^ of Pl-Holant(=2∣f^).
We replace each appearance of f^ by a copy of
the gadget fs to get an instance Ωs
of Pl-Holant(=2∣f).
We can treat each of the m appearances of fs as a new gadget composed of four functions in sequence
N, P, Λs and P, and denote this new instance by Ωs′.
We divide Ωs′ into two parts.
One part consists of m occurrences of
Λs,
which is Λs⊗m,
and is written as a column vector of dimension 24m.
The other part is the rest of Ωs′ and its signature is expressed by
a tensor A, written as a row vector of dimension 24m.
Then the Holant value of Ωs′ is the dot product ⟨A,Λs⊗m⟩, which is a summation over 4m bits, i.e., the values of the 4m edges connecting the two parts.
We can stratify all 0,1 assignments of these 4m bits having a nonzero evaluation of a term in Pl-HolantΩs′ into the following categories:
•
There are i many copies of Λs receiving inputs 0000 or 1111;
•
There are j many copies of Λs receiving inputs 0110;
•
There are k many copies of Λs receiving inputs 1001;
where i+j+k=m.
For any assignment in the category with parameter (i,j,k), the evaluation of
Λs⊗m is clearly (c+b)sj(c−b)sk.
Let aijk be the summation of values of the part A over all assignments in the category (i,j,k). Note that aijk is independent of the value of s. Since i+j+k=m, we can denote aijk by ajk. Then we rewrite the dot product summation and get
[TABLE]
Under this stratification, correspondingly we can define Ω^′ and Λ^ from Ω^. Then we have
[TABLE]
where the same set of values ajk appear.
Let ϕ=c^+b^ and ψ=c^−b^. If we can obtain the value of p(ϕ,ψ)=0⩽j+k⩽m∑ajkϕjψk
from oracle queries to Pl-HolantΩs (for s≥1)
in polynomial time, then we will have proved
[TABLE]
Let α=c+b and β=c−b. Since c2−b2=0 or 1, we have α=0, β=0 and αβ=1. Also, by assumption ∣c+b∣=∣c−b∣, we have ∣α∣=∣β∣. Define L={(j,k)∈Z2∣αjβk=1}.
This is a sublattice of Z2. Every lattice has a basis.
There are 3 cases depending on the rank of L.
•
L={(0,0)}. All αjβk are distinct. It is an interpolation reduction in full power. That is, we can interpolate p(ϕ,ψ) for any ϕ and ψ in polynomial time. Let ϕ=4 and ψ=0, that is b^=2 and c^=2, and hence M(f^)=[0001022002201000].
That is, f^ is non-singular redundant. By Theorem 2.23, Pl-Holant(=2∣f^) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
L contains two independent vectors (j1,k1) and (j2,k2)
over Q.
Then the nonzero vectors j2(j1,k1)−j1(j2,k2)=(0,j2k1−j1k2) and
k2(j1,k1)−k1(j2,k2)=(k2j1−k1j2,0) are in L. Hence, both α and β are roots of unity. This implies that ∣α∣=∣β∣=1,
a contradiction.
•
L={(ns,nt)∣n∈Z}, where s,t∈Z and
(s,t)=(0,0). Without loss of generality, we may assume t⩾0, and s>0 when t=0.
Also, we have s+t=0, otherwise ∣α∣=∣β∣, a contradiction.
By Lemma 2.7, for any numbers ϕ and ψ satisfying ϕsψt=1, we can obtain p(ϕ,ψ) in polynomial time.
Since ϕ=c^+b^ and ψ=c^−b^, we have b^=2ϕ−ψ and c^=2ϕ+ψ. That is M(f^)=[000102ϕ−ψ2ϕ+ψ002ϕ+ψ2ϕ−ψ01000]. There are three cases depending on the values of s and t.
–
s⩾0 and s+t⩾2. Consider the function q(x)=(2−x)sxt−1. Since s⩾0 and t⩾0, q(x) is a polynomial. Clearly, 1 is a root and [math] is not a root.
If q(x) has no other roots, then for some constant λ=0,
[TABLE]
(In fact by comparing leading coefficients, λ=(−1)s.)
Notice that xt∣q(x)+1, while xt∤λ(x−1)s+t+1 for t⩾2.
Also, notice that (2−x)s∣q(x)+1, while (2−x)s∤(−1)s+tλ((2−x)−1)s+t for s⩾2.
Hence, t=s=1, which means αβ=1. Contradiction.
Therefore, q(x) has a root x0, with x0=1 or [math]. Let ψ=x0 and ϕ=2−x0. Then ϕsψt=1 and M(f^)=[000101−x010011−x001000].
Note that Mx2x3,x1x4(f^)=[0001−x0011001101−x0000]. Since 1−x0=0, f^ is non-singular
redundant.
By Theorem 2.23, Pl-Holant(=2∣f^) is #P-hard and hence Pl-Holant(=2∣f) is #P-hard.
–
s<0 and t>0. (Note that s<0 implies t>0.)
Consider the function q(x)=xt−(2−x)−s. Since t>0 and −s>0, it is a polynomial. Clearly, 1 is a root, but neither [math] nor 2 is
a root.
Since t+s=0, the highest order term of q(x) is either xt or −(−x)−s, which means the coefficient of the highest order term is ±1.
While the constant term of q(x) is −2−s=±1.
Hence, q(x) cannot be of the form λ(x−1)max(t,−s) for some constant λ=0.
Moreover, since t+s=0, max(t,−s)⩾2, which means q(x) has a root x0, where x0=0,1,2.
Dividing by the nonzero term (2−x0)−s we have
(2−x0)sx0t=1. Now we
let ψ=x0 and ϕ=2−x0,
and we have Pl-Holant(=2∣f) is #P-hard by the same proof as above.
–
s⩾0 and s+t=1. In this case, we have (s,t)=(0,1) or (1,0)
since t⩾0.
(s,t)=(1,0). Let ϕ=1 and ψ=21. Then we have ϕ1ψ0=1 and M(f^)=[00010414300434101000]. Let M(f′)=4Mx2x3,x1x4(f^)=[0001043003401000]. Clearly, Pl-Holant(=2∣f′)⩽TPl-Holant(=2∣f).
For M(f′), correspondingly we define α′=3+4=7 and β′=3−4=−1. Obviously, α′=0, β′=0, α′β′=1, and ∣α′∣=∣β′∣. Let L′={(j,k)∈Z2∣α′jβ′k=1}.
Then we have L′={(ns′,nt′)∣n∈Z}, where s′=0 and t′=2. Therefore, s′⩾0 and s′+t′⩾2. As we have showed above, we have Pl-Holant(=2∣f′) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
*
(s,t)=(0,1). Let ϕ=3 and ψ=1. Then we have ϕ0ψ1=1 and M(f^)=[0001012002101000].
By Theorem 2.24, Pl-Holant(=2∣f^) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
Case 2: If c2−b2=0 and ∣c+b∣=∣c−b∣, or c2−a2=0 and ∣c+a∣=∣c−a∣.
By rotational symmetry, we may assume c2−b2=0 and ∣c+b∣=∣c−b∣.
Normalizing f by assuming c=1, we have M(f)=[000a0b1001b0a000], where 12−b2=0 and 12−b2=a2 due to f∈/M. Since ∣1+b∣=∣1−b∣, b is a pure imaginary number
(as b=0).
Connect variables x4, x3 of a copy of signature f with variables x1, x2 of another copy of signature f both using (=2).
We get a signature f1 with the signature matrix
[TABLE]
a.
If c2−a2=0, that is a2=1, and then M(f1)=[000102bb2+100b2+12b01000]. Since b2<0, we have (b2+1)2−(2b)2=(b2−1)2>1=(a2)2, which means f1∈/M.
•
If b2=−1, then M(f1)=[00010±2i0000±2i01000]. By Corollary 4.7, Pl-Holant(=2∣f1) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
If b2=−2, then M(f1)=[00010±22i−100−1±22i01000]. Connect two copies of f1,
and we have a signature f2 with the signature matrix
[TABLE]
It is easy to check that f2∈/M, by Lemma 2.14.
Then, f2 belongs to Case 1. Therefore, Pl-Holant(=2∣f2) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
If b2=−1 or −2, then b2+1=±1 due to b=0,
hence 12−(b2+1)2=0.
Also, since b2+1 is a real number and b2+1=0, we have ∣(b2+1)+1∣=∣(b2+1)−1∣.
Then f1, which is not in M as shown above,
has a signature matrix of the form [000a10b1c100c1b10a1000], where a1=a2=1, b1=2b,
and c1=b2+1, and
a1b1c1=0, c12−a12=0 and ∣c1+a1∣=∣c1−a1∣.
That is, f1 belongs to Case 1. Therefore, Pl-Holant(=2∣f1) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
2. b.
If c2−a2=0 and ∣c+a∣=∣c−a∣,
i.e., ∣1+a∣=∣1−a∣, then a=0 is also a pure imaginary number.
Connect variables x1, x4 of a copy of signature f with variables x2, x3 of another copy of signature f.
We get a signature f3 with the signature matrix
[TABLE]
Note that f3∈M implies (a2−1)2=(b2)2. Since f∈/M, 1−a2=b2. Hence, f3∈M implies a2−1=b2. Similarly, f1∈M implies b2−1=a2. Clearly, f1 and f3
cannot both be in M. Without loss of generality, we may assume f3∈/M.
•
If a2=−1, then there are two subcases.
–
(a2+1)2−(b2)2=0. Since a is a pure imaginary number, ∣a2+1+2a∣=∣a+1∣2=∣a−1∣2=∣a2+1−2a∣. Then f3 has the signature matrix of the form [000a30b3c300c3b30a3000],
where a3b3c3=0, c32−b32=(a2−1)2=0 since a is pure imaginary, ∣c3+b3∣=∣c3−b3∣ and c32−a32=0. That is, f3 belongs to Case 2.a. Therefore, Pl-Holant(=2∣f3) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
–
(a2+1)2−(b2)2=0. Since a2+1 and b2 are both nonzero real numbers due to a and b are both pure imaginary numbers, we have ∣a2+1+b2∣=∣a2+1−b2∣. Then f3 has the signature matrix of the form [000a30b3c300c3b30a3000], where a3b3c3=0, c32−a32=0 and ∣c3+a3∣=∣c3−a3∣. That is, f3 belongs to Case 1.
Therefore, Pl-Holant(=2∣f3) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
If a2=−1 and b2=−2, then M(f3)=[000b202a00002a0b2000], where ∣2a∣=2=∣b2∣. By Corollary 4.7, Pl-Holant(=2∣f3) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
If a2=−1 and b2=−2, then M(f1)=[000−10±22i−100−1±22i0−1000]. Note that Mx2x3,x1x4(f1)=[000±22i0−1−100−1−10±22i000], which means f1 is non-singular redundant.
Therefore, we have Pl-Holant(=2∣f1) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
3. c.
If c2−a2=0 and ∣c+a∣=∣c−a∣. This is Case 1. Done.
Case 3:c2−b2=0 and c2−a2=0.
If c=b or c=a, then f is non-singular redundant, and hence Pl-Holant(=2∣f) is #P-hard. Otherwise, a=b=−c. By normalization, we have M(f)=[000−10−11001−10−1000], and then M(f1)=[00010−22002−201000]. Notice that 22−12=0 and ∣2+1∣=∣2−1∣. That is, f1 belongs to Case 1.
Therefore, Pl-Holant(=2∣f1) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
Case 1 to Case 3 cover all cases for (a,b,c):
Suppose (a,b,c) does not satisfy Case 3. Then either
c2−b2=0 or c2−a2=0.
If c2−b2=0, then
either ∣c+b∣=∣c−b∣ (Case 1)
or
∣c+b∣=∣c−b∣ (Case 2).
Similarly if c2−a2=0 it is either Case 1 or Case 2.
This completes the proof of the lemma.
∎
Lemma 6.2**.**
Let f be a 4-ary signature with the signature matrix
M(f)=000x0bz00cy0a000, where abcxyz=0.
If by−cz=0 or ax−cz=0, then Pl-Holant(=2∣f) is #P-hard.
Proof.
By rotational symmetry, we assume by−cz=0. By normalization, we assume b=1, and then y=cz. That is,
Mx1x2,x4x3(f)=[000x01z00ccz0a000].
•
If 1+c=0. Connecting the variables x4 with x3 of f using (=2) we get a binary signature g1, where
[TABLE]
Note that g1(x1,x2) can be normalized as (0,z−1,1,0)T. That is g(x2,x1)=(0,1,z−1,0)T.
Modifying x1=1 of f with z−1 scaling
we get a signature f1 with the signature matrix [000x/z01100cc0a000].
Connecting the variable x1 with x2 of f1 using (=2) we get a binary signature g2, where
[TABLE]
and g2(x1,x2) can be normalized to g2(x2,x1)=(0,1,c−1,0)T.
Modifying x4=1 of f1 with c−1 scaling we get a signature f2 with the signature matrix [000x/z01100110a/c000]. It is non-singular redundant. By Lemma 2.23, we have Pl-Holant(=2∣f2) is #P-hard, and hence Pl-Holant(=2∣f) is #P-hard.
•
If 1+z=0, then
connecting the variable x1 with x2 of f using (=2) we get a binary signature g1′, where
[TABLE]
g1′(x1,x2) can be normalized to (0,c−1,1,0)T.
By the same analysis as in the case 1+c=0, we have Pl-Holant(=2∣f) is #P-hard.
•
Otherwise, 1+c=0 and 1+z=0, that is c=z=−1. Then Mx1x2,x4x3(f)=[000x01−100−110a000], and Mx3x4,x2x1(f)=[000a01−100−110x000]. Connecting variables x4,x3 of a copy of signature f with variables x3,x4 of another copy of signature f, we get a signature f3 with the signature matrix
[TABLE]
Clearly, ax=0 and so f3∈/M
by Lemma 2.14. By Lemma 6.1, Pl-Holant(=2∣f3) is #P-hard and hence Pl-Holant(=2∣f) is #P-hard. ∎
In the following Lemmas 6.3, 6.4, 6.6 and Corollaries 6.5, 6.7, let f be a 4-ary signature with the signature matrix
[TABLE]
where abxyz=0, det[bzcy]=by−cz=0 and det[aczx]=ax−cz=0. Moreover f∈/M, that is cz−by=ax.
These lemmas handle “generic” cases
of this section and will culminate in Theorem 6.8,
which is a classification for Case @slowromancapiv@.
It is here we will use Möbius transformations
to handle interpolations where it is
particularly difficult to get desired signatures of “infinite order”.
Lemma 6.3**.**
Let g=(0,1,t,0)T be a binary signature, where t=0 is not a root of unity.
Then Pl-Holant(=2∣f,g) is #P-hard.
Proof.
Let B={g1,g2,g3} be a set of three binary signatures gi=(0,1,ti,0)T, for some ti∈C. By Lemma 2.5, we have Pl-Holant(=2∣{f}∪B)⩽Pl-Holant(=2∣f,g). We will show that Pl-Holant(=2∣{f}∪B) is #P-hard and it follows that Pl-Holant(=2∣f,g) is #P-hard.
Modifying x1=1 of f with ti(i=1,2) scaling separately,
we get two signatures fti with the signature matrices
M(fti)=[000tix0btiz00ctiy0a000]. Note that
[TABLE]
Connecting variables x4,x3 of f with variables x1, x2 of ft1 both using (=2), we get a signature f1 with the signature matrix
[TABLE]
We first show that there is a t1=0 such that b1y1c1z1=0 and (b1z)(y1c)−(c1b)(z1y)=0. Consider the quadratic polynomial
[TABLE]
We have p(t1)=(b1z)(y1c)−(c1b)(z1y).
Notice that the coefficient of the quadratic term in p(t) is byz2(cz−by).
It is not equal to zero since byz2=0 and cz−by=0. That is, p(t) has degree 2, and hence it has at most two roots. Also we have the following
three implications by the definitions of b1,y1,c1,z1: b1y1=0⟹t1=−zc,
c1=0⟹t1=−byc2, and z1=0⟹t1=−z2yb.
Therefore we can choose such a t1 that does not take these values 0,−zc,−byc2 and −z2yb, and t1 is not a root of p(t).
Then, we have t1=0, b1y1c1z1=0 and (b1z)(y1c)−(c1b)(z1y)=0.
Connecting variables x4,x3 of f1 with variables x1, x2 of ft2 both using (=2), we get a signature f2 with the signature matrix
[TABLE]
Since b1z=0 and c1b=0, we can let t2=−b1zc1b and t2=0. Then b2=t2b1z+c1b=0. Since (b1z)(y1c)−(c1b)(z1y)=0, we have y2=t2z1y+y1c=0.
Notice that
[TABLE]
We have detMIn(f2)=b2y2−c2z2=−c2z2=0.
Similarly, we have detMOut(f2)=−a2x2=t1t2detMOut(f)3=0. Therefore, M(f2) is of the form [000x200z200c2y20a2000], where a2x2y2c2z2=0. That is, f2 is a signature in Case @slowromancapiii@. If f2∈/M, then Pl-Holant(=2∣f2) is #P-hard by Theorem 5.2, and hence Pl-Holant(=2∣{f}∪B) is #P-hard.
Otherwise, f2∈M, which means detMOut(f2)detMIn(f2)=1. Thus detMOut(f)3detMIn(f)3=1. Since f∈/M, detMOut(f)detMIn(f)=1, and hence detMOut(f)7detMIn(f)7=1. Similar to the construction of f1, we construct f3. First, modify x1=1 of f1 with t3 scaling.
We get a signature f1t3 with the signature matrix
M(f1t3)=[000t3x10b1t3z100c1t3y10a1000]. Note that detMIn(f1t3)=t3detMIn(f1) and detMOut(f1t3)=t3detMOut(f1). Then connect variables x4,x3 of f1 with variables x1, x2 of f1t3 both using (=2). We get a signature f3 with the signature matrix
[TABLE]
Since c1=0 and z1=0, we can define t3=−z1c1 and t3=0.
Then b3=b1(t3z1+c1)=0 and y3=y1(t3z1+c1)=0.
Notice that
[TABLE]
We have detMIn(f3)=−c3z3=0 and similarly, detMOut(f3)=−a3x3=−t3t12detMOut(f)4=0.
That is, M(f3) is of the form [000x300z300c300a3000] where a3x3c3z3=0.
Connect variables x4,x3 of f2 with variables x1, x2 of f3 both using (=2). We get a signature f4 with the signature matrix
[TABLE]
Clearly, f4 is a signature in Case @slowromancapiii@. Also, notice that
[TABLE]
and
[TABLE]
We have
[TABLE]
which means f4∈/M. By Theorem 5.2, Pl-Holant(=2∣f4) is #P-hard, and hence Pl-Holant(=2∣{f}∪B) is #P-hard. ∎
Lemma 6.4**.**
Let g=(0,1,t,0)T be a binary signature where t is an n-th primitive root of unity, and n≥5.
Then Pl-Holant(=2∣f,g) is #P-hard.
Proof.
Note that Mx1,x2(g)=[0t10]. Connect the variable x2 of a copy of signature g with the variable x1 of another copy of signature g using (=2). We get a signature g2 with the signature matrix
[TABLE]
That is, g2=(0,1,t2,0)T.
Similarly, we can construct gi=(0,1,ti,0)T for 1⩽i⩽5. Here, g1 denotes g. Since the order n⩾5, gi are
distinct (1≤i≤5).
Connect variables x4, x3 of signature f with variables x1, x2 of gi for 1⩽i⩽5 respectively. We get binary signatures hi, where
[TABLE]
Let φ(z)=b+czz+yz. Since det[bzcy]=by−cz=0, φ(z) is a Möbius transformation of the extended complex plane C.
We rewrite hi in the form of (b+cti)(0,1,φ(ti),0)T, with the understanding that if b+cti=0, then φ(ti)=∞, and we define (b+cti)(0,1,φ(ti),0)T to be (0,1,z+yti,0)T.
If there is a ti such that φ(ti) is not a root of unity, and φ(ti)=0 and φ(ti)=∞, by Lemma 6.3,
we have Pl-Holant(=2∣f,hi) is #P-hard, and hence
Pl-Holant(=2∣f,g1) is #P-hard.
Otherwise, φ(ti) is [math], ∞ or a root of unity for 1⩽i⩽5.
Since φ(z) is a bijection of C, there is at most one ti such that φ(ti)=0 and at most one ti such that φ(ti)=∞. That means, there are at least three ti such that ∣φ(ti)∣=1. Since a Möbius transformation is determined by any 3 distinct points, mapping 3 distinct points from S1 to S1 implies that this φ(z) maps S1 homeomorphically onto S1
(so in fact there is no ti such that
φ(ti)=0 or ∞).
Such a Möbius transformation has a special form: M(α,eiθ)=eiθ1+αˉz(z+α), where ∣α∣=1.
(It cannot be of the form eiθ/z,
since b=0.)
By normalization in signature f,
we may assume b=1. Compare the coefficients, we have c=αˉ, y=eiθ and z=αeiθ.
Here α=0 due to z=0.
Also, since
Mx2x3,x1x4(f)=[000b0ac00zx0y000]
and det[aczx]=ax−cz=0,
we have another Möbius transformation ψ(z)=a+zzc+xz.
Plug in c=αˉ and z=αeiθ, we have
[TABLE]
By the same proof for φ(z), we get Pl-Holant(=2∣f,g) is #P-hard, unless ψ(z) also maps S1 to S1. Hence, we can assume ψ(z) has the form M(β,eiθ′)=eiθ′1+βˉz(z+β), where ∣β∣=1. (It is clearly not of the form eiθ′/z.)
Compare the coefficients, we have
[TABLE]
Solving these equations, we get a=eiθα/βˉ
and x=αˉ/β. Let γ=α/βˉ, and we have a=γeiθ and x=γˉ,
where ∣γ∣=∣α∣ since ∣β∣=1 and γ=0 since x=0.
Then, we have signature matrices Mx1x2,x4x3(f)=[000γˉ01αeiθ00αˉeiθ0γeiθ000],Mx2x3,x1x4(f)=[00010γeiθαˉ00αeiθγˉ0eiθ000],Mx3x4,x2x1(f)=[000γeiθ0eiθαeiθ00αˉ10γˉ000]
and
Mx4x1,x3x2(f)=[000eiθ0γˉαˉ00αeiθγeiθ01000].
Connect variables x4,x3 of a copy of signature f with variables x3,x4 of another copy of signature f using (=2). We get a signature f1 with the signature matrix
[TABLE]
•
If α+αˉ=0,
normalizing Mx1x2,x4x3(f1) by dividing by (α+αˉ)eiθ,
we have
[TABLE]
Note that (α+αˉ)(1+α2)eiθ and (α+αˉ)(1+αˉ2)e−iθ are conjugates.
Let δ=(α+αˉ)(1+α2)eiθ,
and then δˉ=(α+αˉ)(1+αˉ2)e−iθ.
We have ∣δ∣2=δδˉ=(α+αˉ)2(1+α2)(1+αˉ2)=1 due to detMIn(f1)=0, and δ=0 due to ∣α∣=1.
Consider the inner matrix of M(f1), we have MIn(f1)=[1δδˉ1].
Notice that the two eigenvalues of MIn(f1) are 1+∣δ∣ and 1−∣δ∣,
and obviously 1+∣δ∣1−∣δ∣=1,
which means there is no integer n>0 and complex number C such that MInn(f1)=CI. Note that φ1(z)=1+δˉzδ+z is a Möbius transformation of the form M(δ,1) mapping S1 to S1.
Connect variables x4, x3 of signature f1 with variables x1, x2 of signatures gi.
We get binary signatures g(i,φ1), where
[TABLE]
Since φ1 is a Möbius transformation mapping S1 to S1 and ∣ti∣=1, we have ∣φ1(ti)∣=1, which means 1+δˉti=0. Hence, g(i,φ1) can be normalized as (0,1,φ1(ti),1)T.
Successively construct binary signatures g(i,φ1n) by connecting f1 with g(i,φ1n−1). We have
[TABLE]
where C(i,n)=0⩽k⩽n−1∏(1+δˉφ1k(ti)). We know C(i,n)=0, because for any k, 1+δˉφ1k−1(ti)=0 due to
∣φ1k(ti)∣=∣1+δˉφ1k−1(ti)∣∣δ+φ1k−1(ti)∣=1.
Hence, g(i,φ1n) can be normalized as (0,1,φ1n(ti),0)T.
Notice that the nonzero entries (1,φ1n(ti))T of g(i,φ1n) are completely decided by the inner matrix MIn(f1).
That is
[TABLE]
If for each i∈{1,2,3}, there is some ni⩾1 such that (1,φ1ni(ti))T=(1,ti)T,
then φ1n0(ti)=ti, where n0=n1n2n3 for 1⩽i⩽3, i.e.,
the Möbius transformation φ1n0 fixes three distinct complex numbers t,t2,t3.
So the Möbius transformation is the identity map,
i.e., φ1n0(z)=z for all z∈C.
This implies that MInn0(f1)=C[1001] for some constant C=0.
This contradicts the fact that the ratio of the eigenvalues of MIn is not a root of unity.
Therefore, there is an i such that (1,φ1n(ti))T are all distinct for n∈N.
Then, we can realize polynomially many distinct binary signatures of the form (0,1,φ1n(ti),1)T.
By Lemma 2.6, we have Pl-Holant(=2∣f,g) is #P-hard.
•
Otherwise α+αˉ=0, which means α is a pure imaginary number. We already have α=0 due to z=0.
Also ∣α∣=1 from the form of
M(α,eiθ).
Let α=ri, where r∈R and ∣r∣=0 or 1.
Connect variables x1, x4 of a copy of signature f with variables x4, x1 of another copy of signature f, we get a signature f2 with the signature matrix
[TABLE]
–
If −γ+γˉ=0,
normalizing M(f2) by dividing the quantity
(−γ+γˉ)rieiθ, we have
[TABLE]
Note that (−γ+γˉ)ri(γ2−r2)eiθ and (−γ+γˉ)ri(γˉ2−r2)e−iθ are conjugates.
Let ζ=(−γ+γˉ)ri(γˉ2−r2)e−iθ,
and then ∣ζ∣=1 due to detMIn(f2)=0, and ζ=0 due to ∣γ∣=∣α∣=∣r∣ (as ∣β∣=1).
With the same analysis as for MIn(f1) in the case α+αˉ=0,
the ratio of the two eigenvalues of MIn(f2)=[1ζζˉ1] is also not equal to 1, which means there is no integer n and complex number C such that MInn(f2)=CI.
Notice that φ2(z′)=1+ζˉz′ζ+z′ is also a Möbius transformation of the form M(ζ,1) mapping S1 to S1.
Similarly, we can realize polynomially many distinct binary signatures, and hence Pl-Holant(=2∣f,g) is #P-hard.
–
Otherwise, −γ+γˉ=0, which means γ is a real number.
We have γ∈R,
∣γ∣=0 or ∣r∣. Connect variables x4, x3 of a copy of signature f with variables x1, x2 of another copy of signature f, we get a signature f′ with the signature matrix
[TABLE]
If eiθ=1, then M(f)=[000γˉ01α00αˉ10γ000], and MIn(f)=[1ααˉ1]. Since ∣α∣=1, same as the analysis of MIn(f1), we can realize polynomially many binary signatures, and hence Pl-Holant(=2∣f,g) is #P-hard.
*
Otherwise eiθ=1, normalizing M(f′) by dividing by (eiθ−1)ri, we have
[TABLE]
Note that (eiθ−1)ri1−eiθr2 and (eiθ−1)rieiθ−r2 are conjugates, and (eiθ−1)riγ2eiθ and (eiθ−1)riγ2 are conjugates.
Let α′=(eiθ−1)ri1−eiθr2 and γ′=(eiθ−1)riγ2eiθ.
Then
[TABLE]
Notice that M(f′) and M(f) have the same form. Similar to the construction of f2, we can construct a signature f2′ using f′ instead of f.
Since −γ′+γ′ˉ=−(eiθ−1)riγ2eiθ+(eiθ−1)riγ2=−riγ2=0,
by the analysis of f2, we can still realize polynomially many binary signatures and hence Pl-Holant(=2∣f,g) is #P-hard. ∎
Remark:
The order n⩾5 promises that there are at least three points mapped to points on S1, since at most one point can be mapped to [math] and at most one can be mapped to ∞. When the order n is 3 or 4, if no point is mapped to [math] or ∞, then there are still at least three points mapped to points on S1. So, we have the following corollary.
Corollary 6.5**.**
Let g=(0,1,t,0)T be a binary signature where t is an n-th primitive root of unity, and n=3 or 4.
Let gm denote (0,1,tm,0)T.
For any cyclic permutation (i,j,k,ℓ) of (1,2,3,4),
if there is no gm such that Mxixj,xℓxk(f)gm=d1(0,1,0,0)T or d2(0,0,1,0)T, where d1,d2∈C , then Pl-Holant(=2∣f,g) is #P-hard.
Let g=(0,1,0,0)T be a binary signature.
Then Pl-Holant(=2∣f,g) is #P-hard.
Proof.
Connecting variables x4, x3 of the signature f with variables x2 and x1 of g both using (=2) we get a binary signature g1, where
[TABLE]
g1(x1,x2) can be normalized to (0,z−1,1,0)T since z=0.
So we have g1(x2,x1)=(0,1,z−1,0).
Then, modifying x1=1 of f with z−1 scaling,
we get a signature f1 with the signature matrix M(f1)=[000x/z01100cy/z0a000]. We denote it by [000x101100cy10a000], where x1y1=0.
•
If c=0, connecting variables x4, x3 of f1 with variables x1, x2 of g both using (=2) we get a binary signature h1, where
[TABLE]
Also, connecting the variable x4 with x3 of f1 using (=2) we get a binary signature h2, where
[TABLE]
h2 can be normalized to (0,1,2y1,0)T. Clearly, ∣y1∣=∣2y1∣, so they cannot both be roots of unity. By Lemma 6.3, Pl-Holant(=2∣f,h1,h2) is #P-hard, and we conclude that
Pl-Holant(=2∣f,g) is #P-hard.
•
Otherwise c=0. Connecting variables x2, x1 of g with variables x1, x2 of f both using (=2) we get a binary signature g2, where
[TABLE]
which can be normalized to
g2(x2,x1)=(0,1,c−1,0)T.
Then, modifying x4=1 of f1 with c−1 scaling,
we get a signature f2 with the signature matrix
M(f2)=[000zx011001zcy0ca000] which we denote by [000x2011001y20a2000], where a2x2y2=0. Notice that Mx2x3,x1x4(f2)=[00010a21001x20y2000]. Connect variables x1, x4 of signature f2 with variables x2, x1 of g both using (=2). We get a binary signature h3, where
[TABLE]
h3 can be normalized as (0,1,a21,0)T. Also connect variables x1, x4 of signature f2 with variables x1, x2 of g both using (=2). We get a binary signature h4, where
[TABLE]
If ∣a2∣=1 or ∣x2∣=1, then a2 or x2 is not a root of unity. By Lemma 6.3, Pl-Holant(=2∣f,h3,h4) is #P-hard, and hence Pl-Holant(=2∣f,g) is #P-hard.
Otherwise, ∣a2∣=∣x2∣=1.
Same as the construction of h1 and h2, construct binary signatures h1′ and h2′ using f2 instead of f1. We get
[TABLE]
and
[TABLE]
Note that h2′ can be normalized as (0,1,21+y2,0)T.
–
If y2 is not a root of unity,
then by Lemma 6.3, Pl-Holant(=2∣f,h1′) is #P-hard, and hence Pl-Holant(=2∣f,g) is #P-hard.
–
If y2 is an n-th primitive root of unity and n⩾5, then by Lemma 6.4, Pl-Holant(=2∣f,h1′) is #P-hard, and hence Pl-Holant(=2∣f,g) is #P-hard.
–
If y2=2−1±3i or ±i, then 0<∣21+y2∣<1, which means it is not zero neither a root of unity. By Lemma 6.3, Pl-Holant(=2∣f,h2′) is #P-hard, and hence Pl-Holant(=2∣f,g) is #P-hard.
–
If y2=1, then f2 is non-singular redundant and hence Pl-Holant(=2∣f,g) is #P-hard.
–
If y2=−1. Connect two copies of f2, we get a signature f3 with the signature matrix
[TABLE]
Since ∣a2∣=∣x2∣=1, ∣a22x22∣=1=4.
Therefore, applying Corollary 4.7 to
{a22,2,x22,−2},
we get Pl-Holant(=2∣f3) is #P-hard, and hence Pl-Holant(=2∣f,g) is #P-hard. ∎
Combining Lemma 6.4, Corollary 6.5 and Lemma 6.6, we have the following corollary.
Corollary 6.7**.**
Let g=(0,1,t,0)T be a binary signature where t is an n-th primitive root of unity, and n⩾3.
Then Pl-Holant(=2∣f,g) is #P-hard.
Now, we are able to prove the following theorem for Case @slowromancapiv@.
Theorem 6.8**.**
Let f be a 4-ary signature with the signature matrix
M(f)=000x0bz00cy0a000,
*where abxyz=0.
Pl-Holant(=2∣f) is #P-hard unless f∈M, in which case, Pl-Holant(=2∣f) is tractable.
*
Now suppose f∈/M.
Connect the variable x4 with x3 of f using (=2), and we get a binary signature g1, where
[TABLE]
Connect the variable x1 with x2 of f using (=2), and we get a binary signature g2, where
[TABLE]
•
If one of g1 and g2 is of the form (0,0,0,0)T, then by=(−c)(−z)=cz. That is by−cz=0. Here c=0 due to by=0.
By Lemma 6.2, Pl-Holant(=2∣f) is #P-hard.
•
If one of g1 and g2 can be normalized as (0,1,0,0) or (0,0,1,0). By Lemma 6.6, Pl-Holant(=2∣f) is #P-hard.
•
If one of g1 and g2 can be normalized as (0,1,t,0)T, where t=0 is not a root of unity, then by Lemma 6.3, Pl-Holant(=2∣f) is #P-hard.
•
If one of g1 and g2 can be normalized as (0,1,t,0)T, where t is an n-th primitive root of unity and n⩾3, then by Corollary 6.7, Pl-Holant(=2∣f) is #P-hard.
•
Otherwise, g1 and g2 do not belong to those cases above, which means both g1 and g2 both can be normalized as (0,1,ϵ1,0) and (0,1,ϵ2,0), where ϵ1=±1 and ϵ2=±1. That is, b+c=ϵ1(z+y)=0 and b+z=ϵ2(c+y)=0.
–
If b+c=z+y and b+z=c+y, then b=y and c=z.
This case will be proved below.
–
If b+c=−(z+y) and b+z=c+y, then b+z=c+y=0,
so g2=(0,0,0,0)T, a contradiction.
–
If b+c=z+y and b+z=−(c+y), then
b+c=z+y=0, so g1=(0,0,0,0)T, a contradiction.
–
If b+c=−(z+y) and b+z=−(c+y),
we get b+c+y+z=0.
But b+c=0, otherwise g1=(0,0,0,0)T, a contradiction.
So we can normalize g1 to (0,1,−1,0)T. Modify x1=1 of f with −1 scaling, and we get a signature f′ with the signature matrix M(f′)=[000−x0b−z00c−y0a000].
Connect the variable x1 with x2 of f′ using (=2), and we get a binary signature g′=(0,b−z,c−y,0)T.
Same as the analysis of g1 and g2 above, we have Pl-Holant(=2∣f′) is #P-hard unless g′ can be normalized as (0,1,ϵ3,0), where ϵ3=±1. That is, b−z=ϵ3(c−y)=0, ϵ3=±1.
If b−z=c−y, combined with b+c=−(z+y), we have b=−y and c=−z.
This case will be proved below.
*
If b−z=−(c−y), combined with b+c=−(z+y), we have b+c=z+y=0,
and so g1=(0,0,0,0)T, a contradiction.
Therefore, Pl-Holant(=2∣f′) is #P-hard and hence Pl-Holant(=2∣f) is #P-hard.
To summarize, except for the cases b=ϵy and c=ϵz, where ϵ=±1, we have proved that Pl-Holant(=2∣f) is #P-hard.
We can connect the variable x2 with x3 of f using (=2), and get a binary signature g3=(0,a+c,z+x,0)T. Connect the variable x1 with x4 of f using (=2), and we get a binary signature g4=(0,a+z,c+x,0)T. Same as the analysis of g1 and g2, we have Pl-Holant(=2∣f) is #P-hard unless a=ϵ′x and c=ϵ′z, where ϵ′=±1. By both c=ϵz and c=ϵ′z
and z=0 we get ϵ=ϵ′.
Therefore, Pl-Holant(=2∣f) is #P-hard unless a=ϵx, b=ϵy and c=ϵz, where ϵ=±1. In this case, since z=0, we have abc=0. By Lemma 6.1, Pl-Holant(=2∣f) is #P-hard, since we have assumed
f∈/M. ∎
7 Proof of the Main Theorem
Now we are ready to prove the main theorem, Theorem 3.1.
Proof of Tractability:
•
If f satisfies condition 1 or 2, then by Theorem 2.26, Holant(=2∣f) is tractable without the planarity
restriction. Obviously, Pl-Holant(=2∣f) is tractable.
•
If f satisfies condition 3, then by Theorem 2.16, Pl-Holant(=2∣f) is tractable.
•
If f satisfies condition 4, then by Theorem 4.6, Pl-Holant(=2∣f) is tractable.
Proof of Hardness:
Since f does not satisfy condition 2, f does not belong to Case @slowromancapi@. Therefore it belongs to Cases @slowromancapii@, @slowromancapiii@, or @slowromancapiv@.
•
Suppose f belongs to Case @slowromancapii@.
–
If an outer pair is a zero pair, since f does not satisfy condition 1 or condition 3, then by Theorem 4.3, Pl-Holant(=2∣f) is #P-hard.
–
If the inner pair is a zero pair and no outer pair is zero, since f does not satisfy condition 4, then by Theorem 4.6, Pl-Holant(=2∣f) is #P-hard.
•
Suppose f belongs to Case @slowromancapiii@. Since f does not satisfy condition 3, then by Theorem 5.2, Pl-Holant(=2∣f) is #P-hard.
•
Suppose f belongs to Case @slowromancapiv@. Since f does not satisfy condition 3, then by Theorem 6.8, Pl-Holant(=2∣f) is #P-hard. ∎
Acknowledgment
We thank Martin Dyer, Heng Guo, Dana Randall and Mingji Xia for their comments and
interests.
Bibliography51
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] Dorit Aharonov, Itai Arad, Elad Eban, and Zeph Landau. Polynomial quantum algorithms for additive approximations of the Potts model and other points of the Tutte plane. ar Xiv preprint quant-ph/0702008 , 2007.
2[2] Lars Ahlfors. Complex analysis: an introduction to the theory of analytic functions of one complex variable . Mc Graw-Hill, 1979.
3[3] Miriam Backens. A new Holant dichotomy inspired by quantum computation. In Proceedings of the 44th International Colloquium on Automata, Languages, and Programming , pages 16:1–16:14, 2017.
4[4] Miriam Backens. A complete dichotomy for complex-valued Holant c . In Proceedings of the 45th International Colloquium on Automata, Languages, and Programming , pages 12:1–12:14, 2018.
5[5] Rodney J. Baxter. The six and eight-vertex models revisited. Journal of statistical physics , 116(1-4):43–66, 2004.
6[6] Rodney J. Baxter. Exactly solved models in statistical mechanics . Elsevier, 2016.
7[7] David M. Bressoud. Proofs and confirmations: the story of the alternating-sign matrix conjecture . Cambridge University Press, 1999.
8[8] Andrei Bulatov. The complexity of the counting constraint satisfaction problem. Journal of the ACM (JACM) , 60(5):1–41, 2013.