Higher order tensor renormalization group for relativistic fermion systems
Ryo Sakai, Shinji Takeda, Yusuke Yoshimura

TL;DR
This paper introduces a Grassmann higher order tensor renormalization group method for simulating relativistic fermion systems on lattices, demonstrating its effectiveness through comparisons with exact and existing methods.
Contribution
The paper develops a novel Grassmann higher order tensor renormalization group algorithm for fermionic systems, enabling efficient coarse-graining of tensor networks with Grassmann variables.
Findings
The new method accurately reproduces known results for relativistic fermion systems.
It extends tensor network techniques to include Grassmann variables.
Validation against exact and previous methods confirms its reliability.
Abstract
We apply the higher order tensor renormalization group to two and three dimensional relativistic fermion systems on the lattice. In order to perform a coarse-graining of tensor networks including Grassmann variables, we introduce Grassmann higher order tensor renormalization group. We test the validity of the new algorithm by comparing its results with those of exact or previous methods.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\preprintnumber
KANAZAWA-17-03
Higher order tensor renormalization group
for relativistic fermion systems
Ryo Sakai
Institute for Theoretical Physics, Kanazawa University, Kanazawa 920-1192, Japan
Shinji Takeda
Institute for Theoretical Physics, Kanazawa University, Kanazawa 920-1192, Japan
Yusuke Yoshimura
Center for Computational Sciences, University of Tsukuba, Tsukuba 305-8577, Japan
Abstract
We apply the higher order tensor renormalization group to two and three dimensional relativistic fermion systems on the lattice. In order to perform a coarse-graining of tensor networks including Grassmann variables, we introduce Grassmann higher order tensor renormalization group. We test the validity of the new algorithm by comparing its results with those of exact or previous methods.
1 Introduction
Monte Carlo simulations of lattice gauge theories have been shown successful as a non-perturbative numerical approach since the celebrated formulation by Wilson Wilson:1974sk and the first simulation by Creutz Creutz:1980zw . Such a stochastic approach, however, is generally suffering from the sign problem when the Boltzmann weight is complex. For example, finite quark density systems, term included systems, or chiral gauge theories are not easily accessible due to the sign problem. To avoid the problem, one may rely on other methods using a deterministic algorithm, say the tensor renormalization group (TRG) Levin:2006jai .
The basic procedure of the TRG is as follows. A target quantity which the TRG can compute is a partition function in the path integral representation. For that purpose, first of all, one has to rewrite a partition function in terms of a tensor network. In general, this step can be done by expanding the Boltzmann weight with some proper expansion scheme depending on the type of physical degrees of freedom (non-compact boson fields, compact boson fields, or fermion fields). And then after carrying out the integration of the original fields, it turns out that tensors live at lattice sites and the partition function is expressed as a contraction of them. The number of terms in the summation is exponentially large as a function of the system size. To avoid such an expensive computational cost, one may rely on a coarse-graining of the tensor network. In this step, the singular value decomposition is used to reduce the d.o.f of the system while preserving important information. By repeating local blocking transformations, one can approximately compute a value of the partition function. Actually, thanks to the coarse-graining algorithm, the cost now is proportional to the logarithm of system size.
Since the original idea of TRG was introduced by Levin and Nave Levin:2006jai , it has been applied to many two dimensional models associated with elementary particle physics: the lattice model Shimizu:2012wfa , the lattice Schwinger model Shimizu:2014uva ; Shimizu:2014fsa , and the finite density lattice Thirring model Takeda:2014vwa . In the studies of the relativistic fermion systems, the Grassmann TRG (GTRG) proposed by Gu et al. Gu:2010yh ; Gu:2013gba was used. On the other hand, although the original idea of TRG was limited to two dimensional systems, a new coarse-graining method suited for any higher dimensional system was proposed in Ref. 2012PhRvB..86d5139X . The new method uses the higher order singular value decomposition; thus it is called higher order tensor renormalization group (HOTRG). The authors in Ref. 2014ChPhL..31g0503W analyzed the three dimensional Potts models by using the HOTRG and obtained the critical temperature and exponents precisely. The HOTRG was also applied in two dimensional systems: the XY model Yu:2013sbi , the O(3) model Unmuth-Yockey:2014iga , and the CP(1) model Kawauchi:2016xng .
It is natural to extend the idea of TRG to higher dimensional systems and more complicated systems. Our final target is lattice QCD which is a four dimensional relativistic field theory of quarks and gluons. The tensor network representation of non-Abelian gauge theories was already attempted in Ref. Liu:2013nsa , and the HOTRG for boson systems was already done as mentioned above. Therefore, the final missing piece to achieve the goal is to treat fermions in higher dimensional systems. In this paper, we formulate HOTRG for fermions applicable to any dimensional system, and we call this Grassmann higher order tensor renormalization group (GHOTRG). As a concrete example, we will provide some details of GHOTRG in a two dimensional system 111The extension to four dimensional systems is straightforward although both memory and computational costs are extremely demanding..
This paper is organized as follows. In Sec. 2 we introduce GHOTRG for a two dimensional fermion system with a focus on the Grassmann part in the tensor network. In Sec. 3 we show numerical results and compare them with exact or previous ones. A summary and an outlook for the future work are given in Sec. 4.
2 Grassmann higher order tensor renormalization group
In this section, after briefly reviewing the original HOTRG, we explain GHOTRG in two dimensions in detail. The three dimensional version will be obtained straightforwardly. Finally we discuss how to treat the anti-periodic boundary conditions for fermion fields. Lattice units are assumed in the following.
2.1 Model and notation in two dimensional systems
The Lagrangian density of the free Wilson fermion 222 Inclusion of interaction terms is straightforward as seen in Shimizu:2014uva and Takeda:2014vwa for the Schwinger model and the Thirring model, respectively. in two dimensions is given by
[TABLE]
where is the lattice coordinate, the fermion fields , have two spinor components, and and denote the mass and the chemical potential. Following the prescription described in Refs. Shimizu:2014uva ; Takeda:2014vwa , one can obtain tensor network representation of the partition function
[TABLE]
The tensor , an elementary building block of the tensor network, is defined as
[TABLE]
where the original fermion fields , have been already integrated out while the another set of Grassmann variables , , , and () has been introduced. They are independent each other and satisfy
[TABLE]
The graphical expression of the tensor is shown in Fig. 1 (left) together with the index assignment. In Eq. (4), is a normal tensor whose components are normal numbers, and we call it bosonic tensor. The elements of the bosonic tensor are explicitly given in Refs. Shimizu:2014uva ; Takeda:2014vwa for the Schwinger model and the Thirring model, respectively. A bunch of Grassmann variables in Eq. (4)
[TABLE]
is called Grassmann part. Note that the indices of tensor in Eq. (4) should be read as
[TABLE]
reflecting the fact that the original Grassmann variable has two components. Each index runs from 0 to 1; thus the tensor
[TABLE]
has elements at the initial stage. Due to the Grassmann nature, the element of the tensor takes a non-trivial value only when the indices satisfy
[TABLE]
otherwise the element is zero.
In the following we discuss the renormalization procedure for the bosonic tensor and the Grassmann part separately.
2.2 Normal HOTRG procedure for the bosonic tensor
In this subsection, let us see the renormalization of bosonic tensors, especially focusing on the coarse-graining along the -direction. This is a brief review of the normal HOTRG 2012PhRvB..86d5139X .
First we consider the new tensor by contracting two bosonic tensors placed next to each other along the -direction (see Fig. 1)
[TABLE]
where the integrated indices are defined as
[TABLE]
Then one defines and as
[TABLE]
with
[TABLE]
Next we apply the eigenvalue decomposition to and obtain a unitary matrix and the eigenvalues :
[TABLE]
where the new index will be regarded as the second component of the index of the new tensor as it will be clear soon (see Eq. (18)). Similarly one obtains and from , and then we define and as
[TABLE]
with a given one can choose. represent an amount of truncation error and are used to select a unitary matrix which maintains better precision. That is, if , then is adopted and vice versa. Now one obtains the new tensor by using the selected unitary matrix (denoted by ) and restricting the new indices ,
[TABLE]
Actually, when dealing with the Grassmann variables, this part should be modified in order to incorporate the sign factors originated from the coarse-graining of them. See Eq. (31) for details.
The indices of the new tensor now contain two kinds of coordinates and . At this point, we make clear the relationship between and as shown in Fig. 2. Following the relationship, the indices are changed as follows:
[TABLE]
In the last step, has been renamed since all coordinates are labeled by . To remember that the lattice spacing of the -direction is doubled, however, the new unit vector defined as will be used.
2.3 Coarse-graining of the Grassmann part
In this subsection, we discuss the coarse-graining of the Grassmann part. Our objective here is to coarse-grain the Grassmann parts of the two tensors and , which are located next to each other along the -direction, and obtain a new Grassmann part with new Grassmann variables.
For that purpose, first of all, we collect their Grassmann parts
[TABLE]
Note that since the indices of tensor satisfy Eq. (10), the Grassmann part of each tensor can freely move around without invoking awkward sign factors. Next we can simply integrate out the Grassmann variables , , , and since the associated measure is contained in Eq. (20). This integration corresponds to the summation of in Eq. (11) for the bosonic part, and Eq. (20) turns out to be
[TABLE]
The sign factors have arisen along with the interchange of Grassmann variables.
The next step is to integrate out the “old” Grassmann variables in the hopping factors for the -direction. For , , , and , one can simply integrate out them since the corresponding measures are already there, and then Eq. (21) turns out to be
[TABLE]
where the additional sign factors arise when breaking the hopping factors for . On the other hand, for , , , and , there are no corresponding measures in Eq. (22). This mismatch, however, can be resolved by noting that the desired measures are obtained by shifting a site to the -direction. In other words, the corresponding measures 333Note that the Grassmann part for other sites are there as written in Eq. (3). exist at neighboring site . This statement can be easily seen if one arranges the ordering of the Grassmann variables and their measures
[TABLE]
Needless to say, when moving a bunch of the measures for in Eq. (23)
[TABLE]
one should take care of numerous sign factors. In order to control them, we introduce a new index and a new set of Grassmann variables 444Note that the new Grassmann variable has a single component. satisfying
[TABLE]
where the new index is defined as
[TABLE]
Thanks to Eq. (26), the combination of Grassmann measures
[TABLE]
is Grassmann-even; thus we can now freely move it. By inserting Eq. (25) into the proper position in Eq. (23) and moving the combination of Grassmann measures in Eq. (27) as shown in Fig. 3, one can execute the integration of the old Grassmann variable :
[TABLE]
where the hopping factors for the new Grassmann variable have been shifted . Note that new constraints including the new indices hold:
[TABLE]
This constraint is a consequence of Eq. (10) and Eq. (26). From a practical point of view, it is convenient to explicitly multiply the factor
[TABLE]
to the new tensor.
As mentioned around Eq. (18), the sign factor in Eq. (28) should be incorporated into the new bosonic tensor (before renaming the indices),
[TABLE]
One can include the phase factors when contracting and consist of purely bosonic in Eq. (11) or include them when making . We observe that the latter shows a slightly better approximation. In Sec. 3, we show the results in the latter case. For later use, we summarize the new bosonic tensor including the constraint in Eq. (30),
[TABLE]
where the index in the LHS should be understood as and while and are the same as before in Eq. (7).
Following the same rule in Eq. (19) for the bosonic part, the Grassmann part of the new tensor in Eq. (28) is renamed as follows,
[TABLE]
The ordering of the Grassmann variables and the index assignment are consistent with that of the initial tensor in Eq. (4).
By combining the coarse-graining of the bosonic part and the Grassmann part along the -direction, we obtain a coarse-grained tensor after the renaming
[TABLE]
The Grassmann part of the above form is similar to that of the initial tensor in Eq. (4). Actually, by replacing , , and (a single component) in Eq. (4), one obtains the above structure.
The second half of the coarse-graining along the -direction can be done as follows. First, let us interchange the - and the -axis and obtain a flipped tensor
[TABLE]
By applying a similar coarse-graining procedure for the -direction to this tensor, one can carry out the coarse-graining for the -direction. The coarse-grained tensor is then obtained by
[TABLE]
where the all indices of the tensor have the structure . The sign factor and the constraint have been already included in the bosonic tensor. Now, the scale of the -direction is equal to that of the -direction. We define an “iteration” of coarse-graining which consists of the coarse-graining for both directions. Then Eq. (36) is recognized as the coarse-grained tensor after the first iteration as well as the starting tensor of the second iteration. Actually, it turns out that from the second iteration, the structure of tensor in Eq. (36) does not change. The actual coarse-graining procedure from the second iteration can be done by replacing the second components of the indices to zero in the first iteration. For example, for the -direction one only has to replace
[TABLE]
in the Grassmann part.
The GHOTRG algorithm presented here can be immediately extended to any dimensional system. We do not show any details of a coarse-graining for higher dimensional systems, but we present an initial tensor for the three dimensional free Wilson fermion with two spinor components in App. A.
2.4 Anti-periodic boundary conditions
If anti-periodic boundary conditions are imposed, the sign factor arises when a fermion line passes the boundary for the -direction. Therefore, one can realize anti-periodic boundary conditions on the tensor network by inserting a boundary tensor into each link along the -direction as shown in Fig. 4,
[TABLE]
This insertion affects only the tensors around the boundary.
Even when there is a boundary, the coarse-graining along the -direction is simple. As shown in Fig. 5, there is no difficulty for the coarse-graining as long as the number of lattice sites for this direction is even. The coarse-graining along the -direction is also straightforward although one needs to care about the additional sign factor in . In the coarse-graining step involving the boundary shown in Fig. 6, as a result of the insertion of , a sign factor
[TABLE]
appears. By using the new index in Eq. (26), the factor is rewritten as
[TABLE]
Therefore we define a new boundary tensor for the next coarse-graining,
[TABLE]
From the second iteration, the structure of boundary tensor does not change. In this way, the boundary effect is not involved in the coarse-graining procedure for the tensor in the bulk and appears only in the contraction procedure to compute the partition function.
3 Numerical results
3.1 Two dimensional systems
3.1.1 The error of the free energy and the hierarchy of eigenvalues
Here we exclusively consider the massless free Wilson fermion. Figure 7 shows the free energy as a function of the chemical potential on the space-time lattice computed by using the GHOTRG. For the maximal , it agrees with the analytical exact result up to the machine precision. On the other hand, when decreases, the difference becomes larger especially in the region .
Figure 8 shows the error defined as
[TABLE]
For larger , the error rapidly reduces as expected. In the figure, we observe that the error becomes larger for while it is smaller for . This tendency is seen for all . In order to qualitatively understand this behavior, we investigate the hierarchy of eigenvalues of 555Note that the phase factors which have arisen in the Grassmann part are included in .. We show them at and in Fig. 9. For , the hierarchy is milder than that of . Therefore we confirm the strong correlation between the error of the free energy and the hierarchy of eigenvalues; a low compression in the coarse-graining procedure causes a large error in a physical quantity.
We also investigate the error for a larger lattice volume as shown in Fig. 10. The error tends to be large around . The eigenvalues of is shown in Fig. 11 where the hierarchy at is quite poor compared with that of . This behavior is very similar to that of GTRG Takeda:2014vwa .
3.1.2 The fermion number density of the lattice Thirring model
Figure 12 shows the fermion number density
[TABLE]
for the lattice Thirring model whose tensor network representation is presented in Ref. Takeda:2014vwa . In the large region, the fermion number density reaches the saturation density. When switching on the interaction , the Silver Blaze like phenomenon seems to occur around the small region where the fermion number density is constantly zero while the onset is observed at finite . Such a behavior was already seen in Ref. Takeda:2014vwa where GTRG is used. In any case, the exact massless free results are well reproduced by GHOTRG with in a wide range of although the approximation gets worse around ; this tendency was also observed in the GTRG analysis Takeda:2014vwa .
3.2 Three dimensional system
3.2.1 The free energy
We apply GHOTRG to the three dimensional massless free Wilson fermion and calculate the free energy. Since it is very demanding to carry out the full calculation even for the lattice, we have to compromise to use an anisotropic lattice, say, to check the algorithm and the code. As a check, we carry out the coarse-graining from to with several as shown in Fig. 13. With the full , the GHOTRG result is consistent with the exact result up to the machine precision. Even for smaller , the accuracy is reasonable, and for larger the free energy rapidly converges to the exact one. Note that at the smaller , the free energy may be complex due to the truncation in the coarse-graining procedure; thus the real part of the free energy is shown in the figure.
4 Summary and outlook
In this paper we have formulated Grassmann higher order tensor renormalization group and applied it to the free fermion system and the lattice Thirring model. The numerical results are consistent with analytical or previous ones. Thus we conclude that GHOTRG is a correct algorithm.
In the study of two dimensional finite chemical potential systems, we observed the very poor hierarchy of eigenvalues at , and this is the reason why the accuracy gets worse around this parameter region. This shows that the situation of accuracy for GHOTRG is similar to that of the GTRG.
The algorithm of GHOTRG can be straightforwardly extended to three dimensions or more. In principle, now one can deal with any dimensional fermion system by using GHOTRG. It is, however, still hard to study four dimensional fermion systems using GHOTRG owing to a huge computational cost. We hope that our formulation is a starting point towards the four dimensional lattice QCD. Apart from lattice QCD, there are some interesting models in low dimensions such as lattice chiral gauge theories and lattice SUSY which are generally suffering from the sign problem. We are now in the position to approach such models in two or three dimensions by using GHOTRG.
The bottleneck of (G)HOTRG is a tensor contraction whose cost is proportional to with the dimensionality . This cost could be reduced by randomizing the contraction. So far, many random algorithms have been proposed in the frameworks of MPS, PEPS, and TTN 2007PhRvL..99v0602S ; schuch2008simulation ; 2009NJPh…11h3026M . Recently, new algorithms combined with MERA are proposed 2012PhRvB..85p5146F ; 2012PhRvB..85p5147F . A tensor network Monte Carlo method introduced by Ferris 2015arXiv150700767F may be applied to the HOTRG. It seems worthwhile to pursue this direction.
Acknowledgments
We acknowledge helpful discussions with H. Kawauchi. R. S. would like to thank Prof. Y. Kuramashi for giving an opportunity to stay at RIKEN AICS. R. S. also acknowledge valuable discussions with Dr. Y. Nakamura and Dr. Y. Shimizu. This work is supported by JSPS KAKENHI Grant Numbers 17K05411, Kanazawa University SAKIGAKE Project, and MEXT as “Exploratory Challenge on Post-K computer” (Frontiers of Basic Science: Challenging the Limits).
Appendix A Tensor network representation of the free fermion system in three dimensions
We explain how to obtain a tensor network representation for the free Wilson fermions with two spinor components on the three dimensional lattice. The content of this appendix follows the strategy in Ref. Takeda:2014vwa .
By using an explicit representation of the matrices
[TABLE]
one can write down the lattice action as
[TABLE]
As you can see, the hopping terms along the -direction and the -direction mix the different spinor components. Here, we introduce new linear combinations as
[TABLE]
By using the new fields, one obtains a diagonal representation in the spinor space
[TABLE]
Thanks to the nilpotency of Grassmann variables, each exponential factor in the partition function can be expanded binomially:
[TABLE]
In order to integrate out the original fields , (, , , , and ), we insert new Grassmann variables , , , , , and in the hopping factors
[TABLE]
Note that these insertions do not generate sign factors. Now, we can collect all factors which include the original fields at a site , namely , , , , , and , without invoking sign factors. By integrating out the original fields at the site , we define the bosonic tensor by
[TABLE]
By repeating this procedure for all other sites, we define the full tensor as
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) K. G. Wilson, Phys. Rev., D 10 , 2445, (1974).
- 2(2) M. Creutz, Phys. Rev., D 21 , 2308 (1980).
- 3(3) M. Levin and C. P. Nave, Phys. Rev. Lett., 99 , 120601 (2007), ar Xiv:cond-mat/0611687 [cond-mat.stat-mech].
- 4(4) Y. Shimizu, Chin. J. Phys., 50 , 749 (2012).
- 5(5) Y. Shimizu and Y. Kuramashi, Phys. Rev., D 90 , 014508 (2014), ar Xiv:1403.0642 [hep-lat].
- 6(6) Y. Shimizu and Y Kuramashi, Phys. Rev., D 90 , 074503 (2014), ar Xiv:1408.0897 [hep-lat].
- 7(7) S. Takeda and Y. Yoshimura, Prog. Theor. Exp. Phys. 2015 , 043B 01 (2015), ar Xiv:1412.7855 [hep-lat].
- 8(8) Z. -C. Gu, F. Verstraete, and X. -G. Wen (2010), ar Xiv:1004.2563 [cond-mat.str-el].
