Automated construction of $U(1)$-invariant matrix-product operators from graph representations
Sebastian Paeckel, Thomas K\"ohler, and Salvatore R. Manmana

TL;DR
This paper introduces an automated method to construct $U(1)$-invariant matrix-product operators from finite-states machine representations, simplifying the process and reducing computational costs in tensor network algorithms.
Contribution
The authors develop an algorithmic scheme that automates the construction of $U(1)$-invariant MPOs from FSM representations, handling quantum number shifts and phase factors efficiently.
Findings
Automated MPO construction reduces computational complexity.
Exact MPO representations for Hamiltonian variances are generated.
Method applicable to various $U(1)$-invariant operators.
Abstract
We present an algorithmic construction scheme for matrix-product-operator (MPO) representations of arbitrary -invariant operators whenever there is an expression of the local structure in terms of a finite-states machine (FSM). Given a set of local operators as building blocks, the method automatizes two major steps when constructing a -invariant MPO representation: (i) the bookkeeping of auxiliary bond-index shifts arising from the application of operators changing the local quantum numbers and (ii) the appearance of phase factors due to particular commutation rules. The automatization is achieved by post-processing the operator strings generated by the FSM. Consequently, MPO representations of various types of -invariant operators can be constructed generically in MPS algorithms reducing the necessity of expensive MPO arithmetics. This is demonstrated by generating…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 0
Figure 1
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 2
Figure 20
Figure 21
Figure 22
Figure 23
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Semantic Web and Ontologies · Graph Theory and Algorithms
Automated construction of -invariant matrix-product operators from graph representations
S. Paeckel*, T. Köhler, and S.R. Manmana
Institut für Theoretische Physik, Universität Göttingen, 37077 Göttingen, Germany
Abstract
**We present an algorithmic construction scheme for matrix-product-operator (MPO) representations of arbitrary -invariant operators whenever there is an expression of the local structure in terms of a finite-states machine (FSM). Given a set of local operators as building blocks, the method automatizes two major steps when constructing a -invariant MPO representation: (i) the bookkeeping of auxiliary bond-index shifts arising from the application of operators changing the local quantum numbers and (ii) the appearance of phase factors due to particular commutation rules. The automatization is achieved by post-processing the operator strings generated by the FSM. Consequently, MPO representations of various types of -invariant operators can be constructed generically in MPS algorithms reducing the necessity of expensive MPO arithmetics. This is demonstrated by generating arbitrary products of operators in terms of FSM, from which we obtain exact MPO representations for the variance of the Hamiltonian of a Heisenberg chain. **
Contents
-
2 -invariant tensor networks and application of two-site gates
-
3.2 Maximally branched representation and local transformations on graphs
-
4 Graph arithmetics and MPO representation of the variance of operators
-
5 Numerical behavior of the variance for antiferromagnetic Heisenberg chains
-
A Construction of a graph representation for lattice-ordered string operator products
1 Introduction
The density-matrix renormalization group (DMRG) [1, 2, 3, 4, 5] has proven to be one of the most powerful tools to treat low-dimensional problems in strongly correlated quantum systems. Inspired by quantum information theory, a formulation in terms of matrix-product states (MPS) [6] and matrix-product operators (MPO) [5] boosted the development of related algorithms (e.g., [7, 8, 9, 10]). Numerous different techniques have been developed in the framework of MPS, most of which require the representation of the Hamiltonian of the system and of further observables in terms of MPOs [11, 12, 13]. Even though finding such a representation is a well-known and generally solved problem [14, 15, 16], it can become arbitrarily complicated as systems may incorporate complex lattice geometries, site-dependent interaction strengths, or transformations of local operators due to their commutation relations.
It is possible to build generic construction schemes for MPO representations of operators by means of finite-state machines (FSM) [15]. However, the numerical realization of these FSMs can be quite involved, especially when exploiting global symmetries [14, 17, 18].
In this paper, we use the fact that FSMs have an underlying graph structure to obtain a generic algorithmic construction scheme for MPO representations of operators conserving symmetries. Consequently, most of the complexity can be unwrapped into tensor-network manipulations of strings of local operators, which can be mapped one-to-one to graph representations. The obtained construction scheme can be automatized so that an implementation is capable to efficiently create MPO representations. In particular, this also allows us to include commutation rules as well as conservation laws. We demonstrate how to map the operator arithmetics to operations on graph representations, leading to an algorithm for computing the sum as well as the product of two arbitrary operators. We use this algorithm to demonstrate how to compute the variance of a Hamiltonian with very high accuracy. This is achieved by reducing the effects of catastrophic cancellation [19] and the total amount of required floating-point operations by using exact operator arithmetics in terms of FSM graphs. In addition, this also minimizes the overall computational costs.
2 -invariant tensor networks and application of two-site gates
Consider an operator acting on a Hilbert space , which decomposes into a tensor product of -identical -dimensional local Hilbert spaces (),
[TABLE]
and assume that this operator is invariant under a global symmetry generated by local observables 111Note that by we denote the local generators of the symmetry and not the local density operators, which play this role only in the case of conservation of the total particle number.,
[TABLE]
A state can be expanded into an MPS with local basis states spanned by the generators of the global symmetry
[TABLE]
with being rank- tensors acting on site , which carry a physical bond index and two virtual bond indices . The coefficients can be reobtained by contracting the corresponding tensors over all their virtual indices. In a graphical notation, a tensor is represented by drawing a shape (circles, squares, …) with as many legs attached to it as there are tensor indices (see fig. 1). Contractions over indices are denoted by connecting the corresponding edges. This yields a tensor network.
Due to conservation of the global quantum number , these rank- site tensors are irreducible representations of the global symmetry. The physical indices label the basis states of the local generators. As a consequence, the site tensors can be decomposed into blocks that are subject to an on-site conservation law [17, 18]
[TABLE]
with the block indices labeling the irreducible representations of the global quantum number on the particular site . These tensor blocks in general are complex matrices of dimensions .222For brevity, in the following, we suppress bond indices whenever it is clear, which matrix contractions have to be performed, or if the contractions themselves are irrelevant for the discussion. Hence, thinking in terms of block-diagonal MPS, each site matrix is decomposed into the block diagonal form with the non-vanishing blocks having block dimensions and labeling the irreducible representations at site .
There is a simple graphical representation for these local conservation laws acting on the tensor blocks, which is derived from the tensor-network framework [17]. Whenever there are indices with a (hidden) plus sign in the function, the corresponding bonds in the network are labeled by an ingoing arrow, whereas, for indices with a minus sign, an outgoing arrow is placed on the respective bond (see fig. 1).
Next, we take a more elaborate look at the action of a two-site gate on the symmetry conserving state, ignoring the common framework of irreducible representations and block diagonal structures for a moment (later we generalize the considerations to arbitrary operator expressions). Introducing operators acting only on the sites , we start from a generic -invariant expression of the form
[TABLE]
where the function ensures that the total quantum number is conserved when applying the operator. A dummy index with is introduced to factorize the function. Thus, suppressing the sums over the physical indices for a moment, we obtain
[TABLE]
which nearly restores the factorized shape of generic MPS, even though the physical sites are still connected by the sum over the dummy index . It is desirable to restore a tensor-network form so that the action of an operator pair can be written as contraction of tensors acting on the local Hilbert spaces. Therefore, we need to take a closer look at the matrix elements of the local operators . If the total operator expression is -invariant, then the local operators themselves have to be one-dimensional representations in the physical indices (acting on the local Hilbert space) so that each operator carries a unique mapping between states . In other words, for -invariant operator pairs each local operator is non-vanishing only for a certain value of the dummy index . For instance, in case of spin-ladder operators the total value of is locally changed by . Hence,
[TABLE]
where we introduced the change of local quantum numbers . Having this in mind, block-index conservation laws for the local operators can be realized via
[TABLE]
The case of two-site gates is obtained by setting as well as ; the non-vanishing operator blocks now contain the reduced operator matrix elements acting on the local basis states . The contraction of the block index over the intermediate site tensors can be recast into a matrix contraction using the matrix identity
[TABLE]
Therefore, the application of the above two-site gate is factorized completely if we define intermediate shift tensors that act on the sites with as identity, but are contracted over auxiliary bond indices ,
[TABLE]
Hence, the action of the -invariant operator pair can conveniently be written as tensor network,
[TABLE]
with the definitions
[TABLE]
When further local operators to the left and right of are absent, the auxiliary indices are shrunk to dummy indices . Thus, the sum over all remaining auxiliary block indices restores the global conservation law. Note that we have implicitly fixed a gauge freedom carried by the auxiliary indices to (we can even go further and permit any global change of the overall quantum number by setting ).
Even though these expressions look a bit tedious, they can be represented compactly in form of a tensor network, which also reveals how useful this decoupling turns out to be (see fig. 2(a)). For example, it is possible to identify the well-known transformation law for -invariant MPO site tensors in the expressions above [17], yet the block indices are related to the change of the quantum number, captured by the shift tensors .
Generally, contractions over physical bond indices of such shift tensors correspond to a mapping from one block of an irreducible representation on a site to another block . Hence, given a decomposition into the different quantum-number sectors of an -invariant MPS or MPO tensor
[TABLE]
the contraction along a chain of shift tensors over physical indices (with the changes in the local quantum numbers) is given via
[TABLE]
From this point on it is clear how to generalize the considerations to arbitrary strings of local operators. In a given expression of local operators, we need to identify pairs of operators that conserve the global quantum number but locally change the on-site quantum numbers . For each pair, shift tensors then have to be inserted acting on the intermediate sites . Note that in a code there is no need to explicitly implement the shift tensors. It suffices to implement only their action on the virtual bonds of either the MPS or of another MPO, i.e., to collect all vertical strings of shift tensors in the network and to apply the shifting in the auxiliary block indices (see fig. 2(b)).
3 -invariant MPO representation from FSMs
As discussed in [15, 16], the MPO representation of an operator on a tensor-product space can be obtained from the transition amplitudes of FSMs. In the following, the underlying graph structure of FSMs is used extensively. Thus, we first give a brief review on how to identify FSMs with MPO representations, following [15].
3.1 MPO construction from FSMs
Let be a set of local operators . Any global operator is of the general form
[TABLE]
with being strings of local operators coupling lattice sites with range , amplitude , and abbreviating the index set . For later convenience we will call lattice ordered -point -ranged operator strings.
The set of all lattice-ordered -point -ranged operator strings defines a language of a FSM, i.e., there is a set of states and a transition function so that with a proper initial and final state , the FSM is obtained with a generated language . The corresponding graph is then a representation of and is denoted by .
An example for the graph representation of the XX model is given in fig. 3(a) and in the following, we shortly explain how to obtain this graph.
At first, we have to rewrite the global operator in terms of lattice-ordered -point -ranged operator strings333In this example we change our convention and write site indices as lower indices.
[TABLE]
i.e., we have two distinct -point -ranged operator strings. Then, we define a default set of states with being the initial and final state of the FSM, respectively. For convenience, we will highlight them in the corresponding graphs with green () and red () background. In general, FSMs are capable to generate sequences of symbols by transitioning between states via permitted transitions, i.e., those with non-vanishing transition function . Thus, we define the initial/final state by demanding that all the previous/subsequent transitions have to be identities.
Next, we build the graph fig. 3(a) by constructing each operator string separately, so we may start with . We therefore add intermediate states (here only one state ) to the set of states: and define permitted transitions and . Thus, the FSM can transition from the initial state (after placing an arbitrary number of identities) into state by appending an operator onto the so far constructed operator string. Being in state , the FSM has no other choice than to transition into state by appending a subsequent operator . A generalization of the construction of such local operator strings is shown in fig. 3(b) for the case of a general -point -ranged operator string.
From this point on, we will call graphs that generate only one distinct type of -point -ranged operator string single-branched graphs and identify them with the corresponding contribution in the global operator .
Returning to the XX model, we can finish the construction of the FSM by adding another single-branched graph transitioning from the initial state into an additional state via a local operator and a transition into the final state via a local operator .
Finally, we can construct the MPO representation, i.e., the operator-valued matrices , by assigning matrices of dimension for all non-vanishing symmetry blocks (i.e., those with at least one local operator block ). Figure 4(a) sketches a general FSM and fig. 4(b) shows the generated MPO bulk tensor, in which the rows and columns are labeled by the states of the FSM. The corresponding boundary tensors are obtained by projecting out (a) the transition from the initial state into the bulk for and (b) the transitions from the bulk into the final state for . We emphasize that the site-dependent coefficients are free parameters and therefore can be chosen independently for every site.
3.2 Maximally branched representation and local transformations on graphs
As already mentioned, every global operator on can be formulated as a sum over lattice-ordered -point -ranged operator strings
[TABLE]
Note that the graph representation via FSM is not unique; for every operator , there is a set of corresponding FSMs . Therefore, we are free to choose one representation , which makes it easier to perform operator arithmetics and then switch to another representation to find the most compact MPO.
Referring to eq. 27, a natural translation to the graph representations of can be obtained by introducing a commutative map between graph representations of sums of operators via
[TABLE]
The realization of in terms of graphs is obtained by taking the graph representations of the operators and by merging the initial and final states as depicted in fig. 5.
Next, we can define the notion of a maximally branched graph representation, which is given by the graph satisfying the conditions: a) the initial state is the only state with more than one child state, b) the final state is the only state with more than one parent state. satisfies the equation
[TABLE]
This representation has several advantages; most importantly for our discussion, any local transformation of the operator can be mapped one-to-one to the transitions of the graph representation. Here local transformations are those transformations that map a lattice-ordered -point -ranged operator string into another lattice-ordered -point -ranged operator string without changing . To clarify what is meant by this mapping we emphasize that each branch generates exactly one type of local-operator string and therefore the tensor representation of these strings factorizes on the local Hilbert spaces. Hence, if we can give a factorization of the local transformation in terms of tensors acting on the local Hilbert spaces (e.g., ), then we can represent the transformation directly by contracting the transformation tensors with the local operators over their physical indices
[TABLE]
Note that the transformations in eq. 19 forcing conservation of quantum numbers for two-site gates are of exactly this kind and so is their generalization to arbitrary strings of local operators. Thus, conservation of quantum numbers can be implemented by transforming the graph representation of an operator into its maximally branched representation. The local operator strings in each branch are transformed by applying shift tensors . For example, let us consider the transformation for a next-to-nearest-neighbor spin-flip term
[TABLE]
with being the usual angular-momentum ladder operators with lower site indices. The transformed graph representation is given in fig. 6(a).
Conveniently, these rules can be extended to anticommuting operators by applying a Jordan-Wigner transformation444Note that this argument also holds in higher dimensions, because in MPS approaches a 1D path is used to sweep through the system, and the Jordan-Wigner transformation is also applicable in the presence of long-range interactions.. For -invariant operators, the Jordan-Wigner transformation is also local, as there has to be a corresponding annihilation operator for every fermionic creation operator appearing in a string operator – and vice versa – because of quantum-number conservation. The Jordan-Wigner transformation can be implemented as a product of parity operators via
[TABLE]
with annihilation (creation) operators for hard-core bosons at site . Then we find that, for any -conserving product of fermionic creation and annihilation operators, the transformations act only within the operator strings. For instance, for a next-to-nearest-neighbor-hopping term we find
[TABLE]
Again, these transformations have a simple graph representation, see fig. 6(b).
To sum up, we have derived a construction scheme for MPO representations of generic -invariant operators that takes a FSM as input. Specifying the phase factor for the commutation relations of the local operators (e.g., for bosons, for fermions), the scheme automatizes the construction of the MPO site tensors so that we can identify the graph representation of the FSM with the MPO. This permits us to take advantage of the graph representation to improve operator arithmetics, which will be discussed in the following section.
4 Graph arithmetics and MPO representation of the variance of operators
Having derived a construction scheme that permits us to automatize the generation of MPO representations for -invariant operators, we now make use of the established connection between graphs and operators by replacing operator arithmetics with graph manipulations. We then demonstrate the power of this approach by employing the graph algebra to derive various expressions for the variance of a Hamiltonian . As we will show, this not only allows for more efficient calculations, but also addresses the problem of catastrophic cancellation that comes up in a naïve evaluation of due to the need to subtract large numbers, which scale roughly as in the system size [19].
Let us consider the product of two global operators in terms of their maximally branched representations
[TABLE]
and in particular a single summand that is the product of two lattice-ordered string operators555We use the notation introduced in eq. 25.
[TABLE]
Note that we have introduced superscripts to distinguish the index sets of the global operator666Expanding the indices one would have . A representation of this product in terms of a FSM and therefore its graph representation requires a reformulation in terms of lattice-ordered string operators. Although the product of two lattice-ordered operators is no longer lattice-ordered, a careful inspection of the terms violating the lattice order reveals how to build a graph representation generating the product . It turns out to be useful to define a non-commutative product, which maps two single-branched graphs to a single-branched graph via
[TABLE]
A graph realization of is obtained by identifying the final state of with the initial state of . For instance, see fig. 7(a) for an exemplary evaluation of .
As carried out in appendix A, an algorithm can be constructed that yields the following graph representation
[TABLE]
with the symmetrized wedge product
[TABLE]
and the sign of the commutation relation between local operators and acting on different sites. Employing linearity of the graph representation for addition, the product of two general operators can then be formulated via
[TABLE]
Despite the compact form, we emphasize that eq. 37 and eq. 39 describe a graph representation that is maximally expanded, so that there is no branching below the initial node, and the bond dimension of the generated MPO is very large. However, the size of the graph can be reduced very efficiently by shrinking it into its most compact form. The idea behind the shrinking is best demonstrated with a concrete example. Consider an expanded graph generated from merging two branches
[TABLE]
The number of nodes can be reduced by fusing edges sharing one node and carrying the same transitions. For example, in fig. 7(b) these are the edges connecting the states and . In the same way, entire branches that are completely equal with respect to their transitions can be fused together by employing the linearity of
[TABLE]
But we can go even further by defining which local operator pairs acting on the same lattice site vanish identically. For example, this is the case for for models. Generated branches that contain this type of transitions can be discarded completely. For more complex graphs, these shrinking procedures can be applied iteratively, so that only the reduced graph is stored and used in the actual calculations.
An illustrative example demonstrating the advantages of the approach above is to construct two different expressions for the variance of the Hamiltonian of a system, where the goal is to avoid catastrophic cancellation as best as possible while keeping calculations as cheap as possible. For instance, the variance can be used as a control parameter in numerical simulations to test whether a state is close to an eigenstate of . A naïve evaluation is obtained by directly calculating the expectation values in
[TABLE]
which eventually vanishes, if is an exact eigenstate. However, as the expectation value of scales as , explicit evaluation of eq. 42 has major drawbacks when it comes to numerical calculations with finite-precision arithmetics such as catastrophic cancellation. Every (exact) number is numerically represented up to a certain precision [20], which is usually measured in orders of magnitudes so that we can mimic limited numerical precision by replacing with a random variable . Let be numbers represented with the same numerical precision and their exact difference. In finite-precision arithmetics, we then obtain
[TABLE]
For , the second term cannot be represented due to finite precision. In our case, the values for are obtained from expectation values of operators acting on the whole system. Hence, if we estimate them by their leading-order contribution with magnitude , with , and , we obtain
[TABLE]
where is a random variable of order . It follows that we need in order to have a reasonable numerical outcome. In case of the variance, i.e., , with double-precision arithmetics, , the naïve evaluation, , yields an upper bound of a maximally possible precision of for a lattice with sites. However, the numerical precision in general is not only bound by the exact numerical precision , but it is also subject to round-off errors of preceding calculations. Thus, in actual calculations, we have an effective precision that depends on simulation parameters . Returning to eq. 42, we realize that the calculated variance is not only bound, but may even become negative (as in eq. 44 can be negative).
The problem can be addressed by a minimization of , for instance by constructing an MPO representation for the operator , (e.g., by distributing the expectation value over the lattice sites). Unfortunately, this comes at the cost of having constructed an operator that is dependent on its expectation value. Hence, its MPO representation has to be rebuilt for every new state , which requires an efficient way of obtaining MPO representations of powers of operators while keeping the numerical effort low.
To analyze both problems, we investigate two MPO representations (see fig. 8(a) for the corresponding graph) and , which are obtained from the graph representations
[TABLE]
Let us briefly discuss the properties of the two graph representations and their generated MPO. We start with the numerical implementation and its costs. Both representations are obtained by loading pre-computed graph products at runtime. Then, we explicitly calculate with numerical costs scaling as with and being the maximal matrix dimensions of the MPS and MPO site tensors, respectively. Construction of the MPO representation for then only requires allocation of the site tensors with costs scaling as . Here, denotes the maximal bond dimensions of the MPO site-tensor matrices generated from the graphs . Subsequently, the MPO tensors of the graphs are compressed by using an SVD with numerical costs scaling as .
To sum up, the numerical costs for obtaining the MPO representation of the graphs are to leading order governed by the expenses when calculating the expectation value , as long as the MPS bond dimensions are the cost-determining factor. This demonstrates a major advantage of mapping the MPO arithmetics onto the graph representations: As the expectation value , in general, is already available, the additional numerical costs for constructing the MPO representation are not significant.
Next, we take a look at the numerical stability. The graph generates the MPO representation of the operator multiplied by itself. Thus, the graph of is expanded so that the expectation value is equally distributed over all lattice sites with an on-site value of . As operator arithmetics are represented exactly by means of the constructed graph , the only relevant source of catastrophic cancellation is the evaluation of along the lattice that compares terms of order , hence . Thus, we expect the variance to be bound from below by . Yet, the graph in general also suffers from catastrophic cancellation with , as in the naïve evaluation of the variance. This is best seen by evaluating the structure of the generated matrix representation for non-vanishing tensor blocks (see fig. 8(b)). As the latter contains the complete matrix representation of , we end up at the final site by comparing numbers of order when performing the tensor contractions to evaluate the variance. Therefore, yields the variance to be bound from below by .
In addition, graph arithmetics are exact, whereas MPO arithmetics need a vast amount of numerical operations completed before expectation values can be calculated. Therefore, using MPO arithmetics is much more prone to collecting round-off errors, which, in drastic cases, reduces the numerical precision by orders of magnitudes [5].
5 Numerical behavior of the variance for antiferromagnetic Heisenberg chains
In this section, we test the behavior of the variance obtained from site-tensor representations of for antiferromagnetic Heisenberg chains [21, 22]
[TABLE]
with the vector of spin operators on lattice site . For this purpose, we exploit the total magnetization of a system with lattice sites as the conserved quantity. For the ground state search, we sweeps through the system (using open boundary conditions) and optimize the site tensors via a standard Lanczos algorithm, see [23, 24, 25]. All MPS contractions are formulated in two-site representation [5].
The largest observed bond dimensions of the site-tensor matrices for the MPO representations of are and , respectively. Note that these bond dimensions are generally smaller than MPS bond dimensions used during the simulation. We conclude from that the distribution of the constant energy term over the lattice sites in ensures a more efficient compression of the resulting MPO representation. Consistently, for common MPS bond dimensions, we find that is evaluated faster than . In addition, fig. 9 displays the build time of the MPO representation from the graphs for various system sizes. We find a perfect linear scaling for the dependency of the build time, which moreover has no impact on the overall computation time.
We have performed simulations, in which we varied either the total bond dimension per MPS site tensor or the discarded weight while keeping the respective other fixed.777We define the total bond dimension as the number of singular values kept per site and the discarded weight as the sum over all squares of neglected singular values . Figures 10(a) and 10(b) show results for various values of and . An important criterion for consistency of the calculations is the independence of the threshold at which catastrophic cancellation sets in. We find that varying both parameters and yields a constant value of
[TABLE]
which corresponds to the value at which the graph representation saturates. Employing eq. 44, these results suggest for the actual numerical precision an ansatz of the form with a correction , which to first order depends only on the lattice size . Repeating the calculations for Heisenberg chains with various system sizes, we can thus extract from eq. 44 by estimating from the saturated value for the variances. For the two graph representations we consider the estimator for the actual numerical precision
[TABLE]
For both graphs, we perform a linear fit obtaining
[TABLE]
which is shown in Fig. 11. We emphasize that the observed behavior is perfectly consistent with the ansatz above for constant double-precision arithmetics and residual numeric precision . We hence find that, aside from catastrophic cancellation, the dominating contribution to the loss of numerical precision is proportional to the lattice size, which can be associated with inevitable rounding errors generating an error per lattice site.
To complete this section, we now turn to the method discussed by Hubig et al. in [14], which suggests a complementary access for introducing operator arithmetics for MPOs. In short, Hubig et al. present a scheme to build MPO representations numerically via in-code evaluation of direct sums and Kronecker products of local operator strings acting on individual lattice sites. To account for the growth in the MPO bond dimension, they discuss various numerical compression schemes and benchmark the obtained MPOs for a -site Heisenberg chain. This scheme is somewhat more straightforward, as it only involves elementary matrix operations.
However, the FSM approach has additional advantages when it comes to numerical calculations. In particular, when comparing our results for the variance with those obtained by Hubig et al. for the same numerical simulation parameters888Note that Hubig et al. employ conservation of non-abelean quantum numbers. Therefore, the MPS bond dimensions used there do not directly compare to the ones used here. However, this does not affect the maximally achievable resolution of due to finite precision arithmetics., we find that the maximally obtained precision is two orders of magnitudes larger when using the MPOs obtained from graph arithmetics. The reason lies in the above observation that the graph representations only require a minimal number of floating-point operations. In fact, the MPO representation for constructed from numerical MPO arithmetics as suggested by Hubig et al. already requires at least Kronecker products and sums to be evaluated for the advantageous MPO representation of the variance. In contrast, the FSM is generated from abstract graph operations and is therefore an exact representation.
Thus, before the actual variance calculation (namely the contraction of the tensor network representation of ) can begin, the MPO representation obtained from numerical MPO arithmetics already picked up numerical round-off errors of magnitude . Therefore, in the example of the -site Heisenberg chain, the decrease in precision can be estimated to be magnitudes, which is exactly what we found in our calculations.
Another nice side effect is that performing graph compression as described in section 4 already covers the deparallelisation compression method which was first introduced in [26].
Finally, note that implementing local transformations on an abstract level as discussed in section 3 maps the whole complexity of index shifting, as required by quantum-number conservation or in an implementation of fermionic anticommutation rules, from the code to an input level. In other words, there is no need for the programmer to hard-code these features when MPOs are generated from FSMs. Instead, they can be incorporated by designing the corresponding graphs, which in turn enormously increases the flexibility of the code.
6 Conclusions
We have formulated an optimized algorithmic construction scheme for efficient MPO representations of -invariant operators generated by FSMs. This scheme allows implementations to automatize the application of local transformations of operators in MPO representations, e.g., while exploiting symmetries by propagating local changes in quantum numbers or by tackling the fermionic sign via a Jordan-Wigner transformation. As a consequence, graph representations for FSMs can be interpreted directly as representations of operators. Based on this, operator arithmetics are then mapped to transformations of the underlying graphs to generate exact MPO representations of operator sums and products. This permits us to exactly calculate operator arithmetics, which can be stored and quickly loaded in the course of simulations. We demonstrated the effectiveness of our approach by considering two graph representations of the variance of a system’s Hamiltonian . Investigating their numerical properties in a ground-state calculation for a Heisenberg chain with lattice sites, both representations behave numerically consistent and stable with a resolution for the variances of the graph representations of and , respectively. Investigating the dependence of the numerical breakdown on the lattice size shows that the graph representations achieve a numerical precision of in the calculations. We conclude that this high numerical precision is due to the exact graph representation of the operator arithmetics and comes without any significant additional computational costs during runtime.
Finally, we note that wrapping operator arithmetics into re-usable graph representations helps to obtain efficient and exact MPO representations, which are useful for various applications. For instance, consider variational problems of the form
[TABLE]
to find highly excited eigenstates and eigenvalues. Solving this minimization problem benefits significantly from the increased numerical precision and gains in efficiency of the calculation of the expectation value of .
Further applications of the introduced approach cover the efficient representation of long-ranged swap gates, and the concepts can be generalized to higher-dimensional tensor networks (e.g. PEPS) [27]. In this case, the transition amplitudes get additional degrees of freedom (’color’), corresponding to either transversal or longitudinal auxiliary indices. Similar benefits are to be expected as in 1D MPS, allowing for more precise and more flexible implementations of tensor network methods in higher dimensions.
Acknowledgements
We thank M. Marahrens and B. Lenz for helpful discussions, and I. Köhler for carefully proofreading the manuscript. We gratefully acknowledge financial support by the Deutsche Forschungsgemeinschaft (DFG) through Research Unit FOR 1807 (project P7) and SFB/CRC 1073 (project B03).
Appendix A Construction of a graph representation for lattice-ordered string operator products
Consider two lattice-ordered string operators as introduced in the main text (eqn. 25) with single-branch graph representations and , which are parts of maximally branched operators . In the following, we present an algorithm to construct the graph representation of the operator product in terms of generating a new graph. This procedure can then be applied to all branches to construct a new graph for .
Let be the set of all single-branch graphs representing lattice-ordered -point operators. Then, with a proper , we look for a realization of the non-commutative map
[TABLE]
with denoting the symmetrized on-site tensor-product set of and . For this purpose, we apply the definition of in eq. 28 and search for a graph representation of by ordering the appearing types of terms in the resulting sum of the operator product according to the lattice treated. We construct single-branch graph representations for all different types of generated lattice-ordered operator strings, which we denote by . Then, a graph representation is obtained by summing up all these strings
[TABLE]
From now on, we focus on the special case of -point operators, i.e., operators of the form
[TABLE]
Nevertheless, the generalization to arbitrary -point string operators is straightforward: Simply replace identities with additional local operators. Decomposing the operator product, we find
[TABLE]
Next, the commutation relation of local operators acting on different sites fulfills
[TABLE]
with the sign according to the commutation relation. Then, we can decompose the product into lattice-ordered sums by commuting local operators acting on strictly unequal sites.
The first lattice-ordered contribution is given via . The corresponding diagram is a single-branch graph obtained by identifying the final state of the graph with the initial state of the graph by introducing an intermediate state (see fig. 12(a)). We now make use of the wedge product for single-branched graphs as defined in eq. 36 to rewrite in short as
[TABLE]
Swapping the operators, we obtain another lattice-ordered sum by commuting all local operator contributions, so that the corresponding graph picks up two factors, for commuting and for commuting with (see fig. 12(b)). Again, the corresponding graph can be expressed via a wedge product,
[TABLE]
where we have introduced for brevity. Note that, for -invariant operators, the identity holds.
The remaining sums over correspond to overlapping interaction terms, i.e., all those lattice indices that fulfill
[TABLE]
In order to generate all these terms using one algorithm, we write the -point operator expressions graphically by representing a local operator with a cross (“”) and the intermediate vacant operator sites with a circle (“”), e.g.,
[TABLE]
Employing this condensed notation, all lattice-ordered combinations of local operator strings can be generated by placing the graphical representations of and next to each other by aligning the last operator of with the first operator of (see fig. 12(c)). Subsequently, the right string is shifted upward until the initial operator of is aligned with the final operator of (see fig. 12(f)). While the right string’s position is shifted by one step, the local operators of the left operator string pick up a sign factor whenever they pass a local operator in the right string, which is denoted by adding a factor to the left condensed representation (see figs. 12(d) and 12(e)).
For each such step , with , this algorithm generates a string of local operators by merging the shifted right string into the left and resubstituting the original operators: . Missing sites are replaced with identities, whereas two local operators per site are contracted into a new local site operator . Note that, in the latter case, the local operators are ordered in a way that avoids evaluation of on-site commutators . Finally, each string is converted into a single-branch graph by introducing a set of states with the transitions between the ’s properly chosen from the corresponding new local site operators (see fig. 12(g) for an example).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. R. White, Density matrix formulation for quantum renormalization groups , Phys. Rev. Lett. 69 (19), 2863 (1992), 10.1103/Phys Rev Lett.69.2863 . · doi ↗
- 2[2] S. R. White, Density-matrix algorithms for quantum renormalization groups , Phys. Rev. B 48 (14), 10345 (1993), 10.1103/Phys Rev B.48.10345 . · doi ↗
- 3[3] I. Peschel, X. Wang, M. Kaulke and K. Hallberg, eds., Density Matrix Renormalization - A New Numerical Method in Physics , Springer Verlag, Berlin, 10.1007/B Fb 0106062 (1999). · doi ↗
- 4[4] U. Schollwöck, The density-matrix renormalization group , Rev. Mod. Phys. 77 , 259 (2005), 10.1103/Rev Mod Phys.77.259 . · doi ↗
- 5[5] U. Schollwöck, The density-matrix renormalization group in the age of matrix product states , Annals of Physics 326 , 96 (2011), 10.1016/j.aop.2010.09.012 . · doi ↗
- 6[6] S. Östlund and S. Rommer, Thermodynamic limit of density matrix renormalization , Phys. Rev. Lett. 75 (19), 3537 (1995), 10.1103/Phys Rev Lett.75.3537 . · doi ↗
- 7[7] F. Verstraete, D. Porras and J. I. Cirac, Density matrix renormalization group and periodic boundary conditions: A quantum information perspective , Phys. Rev. Lett. 93 (22), 227205 (2004), 10.1103/Phys Rev Lett.93.227205 . · doi ↗
- 8[8] F. Verstraete, J. J. Garcia-Ripoll and J. I. Cirac, Matrix product density operators: Simulation of finite-temperature and dissipative systems , Phys. Rev. Lett. 93 (20), 207204 (2004), 10.1103/Phys Rev Lett.93.207204 . · doi ↗
