Physics-inspired derivations of some algorithms for computing the permanent
Johan Nilsson

TL;DR
This paper introduces physics-inspired methods for computing the matrix permanent, using Grassmann integrals and fermion problems, leading to new derivations and estimators that connect to existing algorithms.
Contribution
It formulates the permanent computation as a Grassmann integral, deriving known and new algorithms through physics-inspired techniques.
Findings
Derivation of the Godsil-Gutman and Karmarkar estimators from Grassmann integrals.
Connection of the permanent computation to interacting many-fermion problems.
Introduction of new estimators and formulas based on gauge invariance.
Abstract
We provide physics-inspired derivations of a number of algorithms for computing the permanent of a matrix. In particular we formulate the computation of the permanent as a Grassmann integral that may be viewed as an interacting many-fermion problem. Applying a discrete Hubbard-Stratonovich decoupling then gives approximation schemes that are equivalent to the familiar determinant Monte Carlo algorithm. This leads to elementary derivations of the well-known estimators of Godsil-Gutman and Karmarkar et al. Another straightfoward manipulation of the Grassmann integral, making use of gauge invariance, gives the efficient exact formula of Glynn. In addition to these known results we also give some additional estimators and formulas that are natural in our formulation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Markov Chains and Monte Carlo Methods · Matrix Theory and Algorithms
Physics-inspired derivations of some algorithms for computing the permanent
Johan Nilsson
Department of Physics and Astronomy, Uppsala University, P.O. Box 516, SE-75120, Uppsala, Sweden
(June 21, 2017)
Abstract
We provide physics-inspired derivations of a number of algorithms for computing the permanent of a matrix. In particular we formulate the computation of the permanent as a Grassmann integral that may be viewed as an interacting many-fermion problem. Applying a discrete Hubbard-Stratonovich decoupling then gives approximation schemes that are equivalent to the familiar determinant Monte Carlo algorithm. This leads to elementary derivations of the well-known estimators of Godsil-Gutman and Karmarkar et al. Another straightfoward manipulation of the Grassmann integral, making use of gauge invariance, gives the efficient exact formula of Glynn. In addition to these known results we also give some additional estimators and formulas that are natural in our formulation.
The permanent of an matrix is defined by
[TABLE]
where the sum is over all permutations of . The permanent is therefore somewhat similar to the determinant, but the signature of the permutation is absent in the formula. In physics the permanent appears for example when calculating the overlap of two bosonic many-body wave functions Negele and Orland (1988). It is widely appreciated that it is much more difficult to evaluate the permanent than the determinant. This was put on firm ground in the context of complexity theory when Valiant proved that even the computation of the permanent of a matrix with - valued entries is in the class of -complete problems Valiant (1979). On a less sophisticated level one important reason for this complexity is that the permanent is not invariant under similarity transformations, whereas the determinant is.
Grassmann integrals are commonly used as a device in many-body quantum mechanics to study interacting fermions in the path integral formalism. They were introduced by Berezin Berezin (1965), and are since long textbook material, see e.g. Negele and Orland (1988). The Gaussian Grassmann integral is of particular importance and results in a determinant
[TABLE]
The permanent of a matrix can also be expressed as a Gaussian Grassmann integral
[TABLE]
over even Grassmann numbers that satisfy and . This simple formula is one of crucial importance for this letter. It has appeared before in the physics literature Palumbo (1994a, b), but is apparently not well-known. The even Grassmann numbers have also been introduced in the applied mathematics and computer science communities where they go under the name of “zeons”, see e.g. Schott and Staples (2012). The proof of (3) is easily obtained by direct expansion sup . Since different commute no sign is generated upon rearranging the ’s into canonical order after expanding the exponential. The final result is therefore the permanent instead of the determinant. For our purposes it is important to note that these Grassmann numbers can also be viewed as composite objects Palumbo (1994a, b); sup , defined as products of two ordinary anti-commuting Grassmann numbers and : . We will also use that the integration measure for may be separated into a product of two independent ones over and
[TABLE]
In the language of physics integrals of the type (3) can describe an interacting fermion theory of doublons in the Hubbard model, and lattice models involving spin-1/2 objects and hard-core bosons nil . In this case the matrix is of large dimension but has a specific block structure and is very sparse.
.1 Formulas from discrete Hubbard-Stratonovich transformations
The Hubbard-Stratonovich (HS) transformation is another standard method used in many-body physics to reformulate an interacting theory in terms of a weighted sum of non-interacting ones Stratonovich (1957); Hubbard (1959). This transformation is important both in analytical approaches as well as in the construction of the determinant quantum Monte Carlo formalism Blankenbecler et al. (1981). For our Grassmann numbers we will first use a discrete variant of this transformation which is the simple identity
[TABLE]
which is valid for any real or complex number . Similar discrete HS transformations can also be introduced at the operator level Hirsch (1983), but the Grassmann version is almost trivial in comparison. Introducing a sign for each of the non-vanishing matrix elements of we may rewrite the exponential using
[TABLE]
where the matrix elements of are
[TABLE]
and is a shorthand for the particular configuration of all of the . Using the behavior of the measure (4) we can now perform the Gaussian Grassmann integrals over and independently to get
[TABLE]
This is the unbiased Godsil-Gutman estimator for the permanent Godsil and Gutman (1978). This estimator unfortunately has a very large variance in the worst case, and does therefore not work well for all matrices.
The Godsil-Gutman estimator has been generalized in many different ways to reduce its variance. In the physics language some such generalizations may be generated by using other HS transformations than the one of (5). In particular we could instead use a decoupling
[TABLE]
where is a -th root of unity. Note that (5) is a special case of this corresponding to . Clearly this can again be done independently on each non-vanishing element of . Introducing and in analogy with and above with
[TABLE]
we may perform the Grassmann integrals with the result
[TABLE]
when has only positive semi-definite elements. This includes (setting ) the KKLLL esimator of Karmarkar et al. Karmarkar et al. (1993). The bounds on the variance of this estimator is much better than that of Godsil-Gutman. The variance is however still extremely large in the worst case Karmarkar et al. (1993). On the other hand it has been proven that the KKLLL estimator is very efficient for sufficiently dense matrices Frieze and Jerrum (1995).
In the physics context it is well-known that it is possible to decouple interaction terms in different channels Negele and Orland (1988). This leads to additional formulas for the permanent. So far we have only considered decouplings that preserves “spin rotational symmetry” around one axis, meaning that we have only made use of bilinears of the types and . In (5) we only use the density channel, meaning that the bilinears are of the form . It is also possible to do decouplings in the spin channel, the simplest of these involves , which do appear in (9). We may also decouple using the bilinears and , which breaks the spin rotational symmetry. These are all examples of time-reversal invariant decoupling schemes Wu and Zhang (2005), which have the appealing property that the weights are always positive semi-definite when the elements of are. In fact it is also possible to use different forms of the decoupling on different matrix elements. A simple example demonstrates that using this freedom may be extremely fruitful. Consider the following matrix and one particular decoupled version
[TABLE]
The corresponding estimator has zero variance! Of course it is a very difficult problem to pick the “best” decoupling scheme for a given large matrix. In the physics context the choice of decoupling is supposed to be “guided by the physics of the problem” Negele and Orland (1988), something that seems quite difficult in the abstract setting of a generic matrix .
It is also possible to perform a decoupling in the pairing channel, which is equivalent to a decoupling in and . Let us use a general decoupling (
[TABLE]
Implementing this on each non-vanishing term in as above and performing the Grassmann integral we obtain
[TABLE]
Specializing to matrices with - valued entries and taking we get
[TABLE]
This formula treats rows and columns symmetrically and has a simple interpretation: all non-zero elements of are substituted with with equal probability, and the contribution from a configuration is the product of all row sums and columns sums.
.2 Efficient exact formulas
Efficient exact formulas may also be easily obtained from manipulations of the Grassmann integral. Let us first note that because the Gaussian exponent may be expanded as
[TABLE]
Sticking this into the formula for the permanent and performing the integrals over the ’s we get
[TABLE]
Now let us attach a local gauge freedom to each . We implement this by making a change of variables inside the integral, taking with . The Jacobian generated by going to the new measure is simply . Since nothing can depend on the variable change, and hence the ’s, we may average over all possibilities
[TABLE]
In this formula the role of the Grassmann integral, i.e., to pick out the terms for which each appears exactly once, is superfluous since terms in the integrand without this property are anyway set to zero by the sum over . Getting rid of the integral we therefore get
[TABLE]
This is an exact formula that is a sum of terms. A more efficient albeit less symmetric formula involving a sum of terms may easily be obtained from this one by noting that each term is invariant upon inverting all signs. This implies that for each configuration with say there is another one with with equal weight. We may therefore write
[TABLE]
where the primed sum is over all sign configurations but with always. This is the formula of Glynn Glynn (2010). Glynn has also formulated this in terms of polarization identities Glynn (2013), this makes clear the connection to the Ryser formula Ryser (1963), which is of comparable efficiency. It is interesting to note that in the physics language (19) is a result of local gauge invariance, and the reduction to (20) a consequence of global gauge invariance.
.3 Additional formulas
The derivations above motivates us to consider a few generalizations that gives some additional formulas for the permanent.
It is clear that the formulas (19) and (20) may also be considered as unbiased estimators for the permanent when the signs are treated as random variables Aaronson and Hance (2014). We now note that in the discrete HS case the variance was reduced in going from Godsil-Gutman to KKLLL, which may be viewed as changing the decoupling variable from to in our language. In the derivation of the exact formula above we may easily implement the same idea, i.e., we make the gauge transformation (variable change) . This leads to the formula
[TABLE]
Viewed as an exact formula this is obviously less efficient than (19) since the number of terms is instead of . From the point of view of an unbiased estimator the variance is however substantially reduced for certain matrices. A common example is the block-diagonal matrix with the matrix in (12) repeated on the diagonal. For such a matrix the second moment is reduced by a factor of in going from (19) to (21) for all . This is similar to the reduction in going from Godsil-Gutman to KKLLL Karmarkar et al. (1993); Chien et al. (2003).
It is also possible to consider various continuous decoupling schemes. Let us decompose our matrix as where is a permutation matrix and () are lower (upper) triangular matrices (such a decomposition is always possible). does not affect the permanent and we only need to consider the permanent of the matrix . Now we may perform a conventional continuous HS transformation on the exponent as follows
[TABLE]
with a normalized Gaussian integration measure
[TABLE]
Sticking this into the formula for the permanent it is easy to perform the Grassmann integral with the result
[TABLE]
This formula is a -dimensional integral representation of the permanent. Similar formulas also works for other types of matrix decompositions. Consider for example a singular value decomposition , with and orthogonal and diagonal. Let us denote the diagonal elements of by , then the same manipulations gives
[TABLE]
These -dimensional integral representations involves an unbounded integration region. It is also possible to reformulate these on bounded integration regions, what is needed is a weight function that satisfy
[TABLE]
The integrals (24) and (25) may be estimated using stochastic Monte Carlo methods. The last version is particularly efficient for matrices of low rank since the number of necessary ’s is equal to the rank of the matrix.
Let us finally mention another possibility that may be obtained from a slight extension of our formalism. Suppose that we represent , with and a pair of even Grassmann numbers. We may then decouple a term in the exponent as before
[TABLE]
and the measure still factorizes in analogy with (4). Doing this for each of the non-vanishing elements of like we did to get to (8) and using (3) we arrive at the formula
[TABLE]
This formula should be compared with (8) and can be used recursively to generate additional formulas.
Acknowledgements.
.4 Acknowledgements
Funding from the Knut and Alice Wallenberg Foundation and the Swedish research council Vetenskapsrådet is gratefully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Negele and Orland (1988) J. W. Negele and H. Orland, Quantum Many-Particle Systems (Westview Press, 1988).
- 2Valiant (1979) L. Valiant, Theoretical Computer Science 8 , 189 (1979) . · doi ↗
- 3Berezin (1965) F. A. Berezin, The Method of Second Quantization (Academic Press. New York, 1965).
- 4Palumbo (1994 a) F. Palumbo, Phys. Rev. D 50 , 2826 (1994 a) . · doi ↗
- 5Palumbo (1994 b) F. Palumbo, Physics Letters B 328 , 79 (1994 b) . · doi ↗
- 6Schott and Staples (2012) R. Schott and G. S. Staples, Operator Calculus on Graphs (Imperial College Press, 2012).
- 7(7) Supplementary material.
- 8(8) J. Nilsson et al., unpublished.
