Gradient optimization of fermionic projected entangled pair states on directed lattices
Shao-Jun Dong, Chao Wang, Yongjian Han, Guang-can Guo, Lixin He

TL;DR
This paper extends a stochastic gradient optimization method to fermionic PEPS, introducing a fermi arrow notation, and demonstrates its effectiveness in simulating complex fermionic models with improved accuracy and scalability.
Contribution
The authors develop a fermionic PEPS gradient optimization method with a new fermi arrow notation, enabling efficient simulation of fermionic systems with larger bond dimensions.
Findings
Gradient optimization improves simple update results.
Larger bond dimensions are needed for convergence.
Method offers lower scaling than direct contraction methods.
Abstract
The recently developed stochastic gradient method combined with Monte Carlo sampling techniques [PRB {\bf 95}, 195154 (2017)] offers a low scaling and accurate method to optimize the projected entangled pair states (PEPS). We extended this method to the fermionic PEPS (fPEPS). To simplify the implementation, we introduce a fermi arrow notation to specify the order of the fermion operators in the virtual entangled EPR pairs. By defining some local operation rules associated with the fermi arrows, one can implement fPEPS algorithms very similar to that of standard PEPS. We benchmark the method for the interacting spinless fermion models, and the t-J models. The numerical calculations show that the gradient optimization greatly improves the results of simple update method. Furthermore, much larger virtual bond dimensions () and truncation dimensions () than those of boson and spin…
| SU | GO | Exact | relative error | |
|---|---|---|---|---|
| 0.1 | -0.66590 | -0.67124 | -0.67125 | 110-5 |
| 0.8 | -0.59255 | -0.59309 | -0.59312 | 510-5 |
| 2 | -0.48136 | -0.50643 | -0.50646 | 510-5 |
| SU(D=8) | GO(D=6) | GO(D=8) | ||||||
|---|---|---|---|---|---|---|---|---|
| Size | Energy | Energy | relative error | Energy | relative error | exact | ||
| 44 | - | -0.68398 | 16 | 410-5 | -0.68401 | 32 | 110-5 | -0.68402 |
| 66 | -0.67721 | -0.73269 | 24 | 510-4 | -0.73305 | 52 | 510-5 | -0.73309 |
| 88 | -0.74763 | -0.75414 | 40 | 110-3 | -0.75492 | 84 | 210-4 | -0.75510 |
| 1010 | -0.75387 | -0.76619 | 55 | 110-3 | -0.76705 | 110 | 510-4 | -0.76748 |
| 1212 | - | -0.77094 | 80 | 510-3 | - | - | - | -0.77538 |
| size | SU | GO |
|---|---|---|
| 44 | -0.55108 | -0.56420 |
| 48 | -0.57994 | -0.59055 |
| 68 | -0.59431 | -0.60349 |
| 88 | -0.60849 | -0.61184 |
| 810 | -0.61068 | -0.61738 |
| 812 | -0.61707 | -0.62164 |
| 1212 | -0.62307 | -0.62973 |
| -0.66757 | -0.67008 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
.
Gradient optimization of fermionic projected entangled pair states on directed lattices
Shao-Jun Dong
Key Laboratory of Quantum Information, University of Science and Technology of China, Hefei, 230026, China
Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei, 230026, China
Chao Wang
Key Laboratory of Quantum Information, University of Science and Technology of China, Hefei, China
Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei, 230026, China
Yongjian Han
Key Laboratory of Quantum Information, University of Science and Technology of China, Hefei, China
Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei, 230026, China
Guang-can Guo
Key Laboratory of Quantum Information, University of Science and Technology of China, Hefei, China
Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei, 230026, China
Lixin He
Key Laboratory of Quantum Information, University of Science and Technology of China, Hefei, China
Synergetic Innovation Center of Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei, 230026, China
Abstract
The recently developed stochastic gradient method combined with Monte Carlo sampling techniques [PRB 95, 195154 (2017)] offers a low scaling and accurate method to optimize the projected entangled pair states (PEPS). We extended this method to the fermionic PEPS (fPEPS). To simplify the implementation, we introduce a Fermi arrow notation to specify the order of the fermion operators in the virtual entangled EPR pairs. By defining some local operation rules associated with the Fermi arrows, one can implement fPEPS algorithms very similar to that of standard PEPS. We benchmark the method for the interacting spin-less fermion models, and the t-J models. The numerical calculations show that the gradient optimization greatly improves the results of simple update method. Furthermore, very large virtual bond dimensions () and truncation dimensions () are necessary to converge the results of these models. The method therefore offer a powerful tool to simulate fermion systems because it has much lower scaling than the direct contraction methods.
I introduction
Interacting quantum many-body systems pose some of the most exciting open problems in physics. Particularly, fermion systems are central to many of the most fascinating effects in condensed matter physics, such as high-temperature superconductivity,Lee et al. (2006) the fractional quantum Hall effect,Stormer et al. (1999) and Mott insulator transitions Edwards and Hewson (1968); Imada et al. (1998). The simulation of the strongly correlated fermion system plays the critical role to understand these system and is also one of the most challenging problems in condensed matter physics.
The Quantum Monte Carlo (QMC)Foulkes et al. (2001) method as one of the leading methods in studying many-body physics has achieved great success in bosonic and spin systems since its first proposed. However, except in some special cases, Li and Yao (2018) the fermion systems are extremely difficult to treat using QMC simulationsLoh et al. (1990); Troyer and Wiese (2005) because of the notorious “sign problems”.
Recently, the methods based on tensor network states (TNS), especially the projected entangled states (PEPS) Schollwöck (2011); Perez-Garcia et al. (2007); Verstraete et al. (2008); Jiang et al. (2008); Vidal (2008); Verstraete and Cirac (2004); Sfondrini et al. (2010); Verstraete et al. (2006) have shown their power on simulation of the strongly correlated many-particle systems. The PEPS is sign-problem free and has achieved great successes in studying the frustrated spin models Wang et al. (2016); Vidal (2007); M.-H. et al. (2012). The PEPS method has been extended to study fermion models (namely fPEPS) by different approaches Barthel et al. (2009); Corboz et al. (2010, 2011); Gu et al. (2010, 2013); Kraus et al. (2010). Apperently, the fPEPS are more complicated than PEPS because of the anti-commutation properties of the fermion operators. In addition, fermion systems are highly frustrated. It has been proven that the entanglements of the ground states of some fermion systems are beyond the area law Hastings (2007); Wolf (2006). Therefore, to faithfully simulate such models, it usually requires very large bond dimensions (). Furthermore, it has been shown that the imaginary time evolution with simple update Jiang et al. (2008) method may have large errors because the environment effects are oversimplified. To exactly consider the environment, the traditional methods, e.g., the full update method Lubasch et al. (2014a, b), suffer from extremely high computational scaling to the bond dimensions. This problem is more serious for the fermion models when large is required. We note that the recently developed infinite PEPS (iPEPS) with full update method has achieved great success,Corboz et al. (2014, 2010) by making use of the translation symmetry, which may greatly reduce the number of independent tensors. However, not all systems have such symmetry, e.g., defects, disorders and systems with spontaneous symmetry broken, etc. In these cases, the finite PEPS method is essential.
The recently developed Monte Carlo sampling techniques for PEPS can greatly reduce the computational scalingSandvik and Vidal (2007); Schuch et al. (2008); Sfondrini et al. (2010); Bañuls et al. (2017); Dong et al. (2017); Liu et al. (2017). By combining with stochastic gradient optimization (GO) method, one can achieve great precision in obtaining the ground states. Liu et al. (2017); He et al. (2018) In this work, we extended the stochastic gradient method Liu et al. (2017); He et al. (2018) to optimize the fPEPS wave functions for fermion systems. To simplify the implementation of the fPEPS algorithms, we introduce a “Fermi arrow” notation to specify the order of the fermion operators in the entangled EPR pairs. With this notation and some local operation rules associated with the Fermi arrows, we can greatly simplify the implementation of the stochastic gradient optimization method (and other methods) for fPEPS. We implement this local operation rules for fPEPS in our recently developed TNSpack Dong et al. (2018), in which the anti-commutation properties of the fermion operators are automatically taken account of. Therefore, one can implement fPEPS algorithm very similar to that of the standard PEPS without worry too much about the details of the anti-commutation between the fermion operators.
We benchmark the stochastic gradient method for fPEPS on the interacting spin-less fermion models, and the t-J models. The numerical calculations show that the gradient optimization greatly improves the results of simple update method. Furthermore, for these models, very large virtual bond dimensions and truncation dimensions are necessary to converge the results which is the dominate difficult to simulate the fermion systems. Therefore the present method is advantageous because it has much lower scaling than the traditional direct contraction method.
II Definition of fPEPS based on directed network
The definition of the fPEPSKraus et al. (2010) on a lattice is similar to that of the standard PEPSVerstraete and Cirac (2004); Verstraete et al. (2006). Without lose of generality, we take a fermion system on a square lattice as an example, where the physical dimension of each site is . For each horizontal bond connecting sites and , there is a EPR pair, i.e., a Bell type entangled state,
[TABLE]
where and are the fermion states on site and site . States are generated as, , where ) is the binary representation of and is the vacuum state. s and s are the fermion operators that satisfy . For convenience, we denote the state . Similarly, for each vertical bond connecting site and , there is also a Bell type entangled state, (in short) . Therefore, a standard virtual mother state of a fPEPS can be defined as,
[TABLE]
To define a quantum state in the real physical space, we project to the physical space. The projector on site is defined as:
[TABLE]
Here, is the creation operator of the physical particle on site whereas ( and ) are the annihilation operators of the state . The fPEPS is then defined as,
[TABLE]
To make the fPEPS well defined, the state should be independent of the order of the projectors up to a global phase, i.e, the parity of all elements in a projector should be the same. The parity of the element of the projector is obtained by , where =-1, if the parity of is odd, and =+1 if the parity of is even. Therefore, the parity of all elements can be obtained by the lower indices of tensor . Without lose of generality, we assume all nonzero projector elements have even parity in this paper. As a consequence, the elements with odd parity vanish, i.e., =0, if ++++ is odd. In this definition of fPEPS, we may interchange the positions of any two projectors and EPR pairs, because they all have even parity.
One of the key issues in the fPEPS is the order of the fermion operators, including the operators in the projectors and in EPRs. We define the standard order of the fermion operators in each projector operators on the square lattice as followings, physical creation operator, left, down, right, and up virtual operators (i.e., anti-clockwise order), which is the same as the order of the lower indices in the tensor (see Eq. II). When changing the order of fermion operators, a sign which is determined by the parity of the indices will appear. For example, if we exchange the two adjacent fermion operators and , there will be an extra phase, i.e., , where
[TABLE]
Beside the fermion operators appeared in projector , we also need to specify the operators’ order in the EPR pairs, which is not given in the tensors explicitly. In this work, we introduce a Fermi arrow notation to specify the order of the EPR pairs. For example, as shown in Fig. 1(a), the arrow points from site A to site B , and the corresponding EPR state is , whereas in Fig. 1(b), the arrow points from B to A, and the corresponding EPR state is . The two states can be transformed to each other as follows,
[TABLE]
We may also assign Fermi arrows to the physical indices: the Fermi arrows point into the sites for the physical creation operators, and pointing out of the sites for the annihilation operators. This definition is equivalent to insert EPR pairs between the physical operators when contracting the physical indices e.g., . With this definition, we can treat the physical indices and the virtual indices in the same manner, and do not need to distinguish the real fermions and virtual fermions during operations. We can now uniquely define a fPEPS on a directed lattice, as shown for example in Fig. 2, on a 44 lattice.
By defining some calculation rules associated with Fermi arrows, we are able to perform fPEPS calculations. Contraction is one of the most important operations in PEPS algorithms. When we contract the tensors on two sites in a fPEPS, we need to consider the Fermi arrow direction. We take the two situations in Fig. 1 as an example, which gives two different contraction formula,
[TABLE]
for Fig. 1(a) and,
[TABLE]
for Fig. 1(b). The anti-commutation relation of fermions has been used. The Fermi arrow helps to distinguish the two situations of contraction in the graphical notions of Fig. 1. Using the graphic representation may greatly simplify the notation.
More generally, giving two tensors and , connected via multi virtual bonds (EPRs), where are the joint bonds to be contracted. Assume that bonds have Fermi arrows pointing from to , and the rest bonds have Fermi arrows pointing from to . We first reshape to and reshape to , where , are the bonds that are not to be contracted in and respectively. For the convenience of notation, we assume the signs due to the change of bond order in the tensors according to Eq. 7 have been absorbed into the tensors, then the resulting tensor is,
[TABLE]
Other often used operations associated with Fermi arrows are given in the Appendix. We implement these operation rules for fPEPS in our recently developed TNSpack Dong et al. (2018), in which the anti-commutation properties of the fermion operators are automatically taken account of by these rules. Therefore, one can implement fPEPS algorithm very similar to that of standard PEPS without worrying too much about the details of the anti-commutation between the fermion operators.
The Fermi arrows define the fermionic order for the fPEPS. In some previous methods, Corboz et al. (2010) the EPS pairs are not explicitly used. We note that in this work, the EPR pairs are only used in the derivation of the operation rules associated with Fermi arrows. Once we have these rules, one may ignore the underlying EPR pairs, and use only Fermi arrows for all operations. In Ref. Barthel et al., 2009, the authors proposed a general fermionization procedure using so called fermionic operator circuits (FOCs), in a bra and ket notation, instead of EPR pairs. Our Fermi arrows are similar to the contraction arcs defined in Ref. Barthel et al., 2009 albeit the starting point and detailed implementations of the two methods are different.
III Stochastic gradient optimization of fPEPS
In order to find the ground state of a given Hamiltonian using fPEPS, different methods have been introduced. The leading method is the imaginary time evolution (ITE) method.Jiang et al. (2008) However, due to the high computation complexity to obtain the exact environment during the time evolution, some kinds of approximations are necessary. The simple update methodJiang et al. (2008) has been widely used, which however may have large errors because the environment is over simplified. Several methods have been developed to treat the environment more rigorously, such as the full update methodLubasch et al. (2014a, b), and the gradient methodVanderstraeten et al. (2016); Liu et al. (2017), which may significantly improve the results, but the scaling to of these methods is rather high.
We recently developed stochastic gradient optimization method for PEPS, combined with Monte Carlo sampling techniques. Liu et al. (2017); He et al. (2018) This method gives remarkable accuracy of the results which may be even better than the results of full update method at given .Liu et al. (2017) The method has two advantages. First, the environments of tensors are treated rigorously, and therefore, the results are more accurate than SU and even FU methodsLiu et al. (2017). Secondly, the Monte Carlo sampling technique may greatly reduce the scaling of the method to the virtual bond dimension from to for OBC,Sandvik and Vidal (2007); Schuch et al. (2008); Sfondrini et al. (2010); Bañuls et al. (2017) which is even more crucial for fPEPS, where larger is often needed to converge the results. In this work, we extended this method to fPEPS.
The fPEPS wave functions of a many-particle system in Eq. 4 can be rewritten as,
[TABLE]
where = is the site index of the lattice, and means to contract all the entangled virtual fermions according to the rules defined in Sec. II. is the coefficient of the physical state = in the particle number representation. The energy of the system can be written as,
[TABLE]
where,
[TABLE]
The total energy of the system for a given fPEPS can be evaluated via Monte Carlo sampling over the physical configurations space. Sandvik and Vidal (2007); Schuch et al. (2008); Sfondrini et al. (2010); Bañuls et al. (2017); Liu et al. (2017)
To optimize the energy function, we need the derivatives of the energy with respect to the tensor elements,
[TABLE]
where denotes the MC average. is defined as
[TABLE]
and the derivative of is
[TABLE]
The derivatives can be also evaluated by the MC samplings.Liu et al. (2017)
Once we have the energy and its gradients, we can optimize the system energy using stochastic gradient methodSandvik and Vidal (2007); Liu et al. (2017), which has been successfully applied to the standard PEPS method.
The overall algorithm for fPEPS is similar to that of PEPS. We need to contract the fPEPS tensors at given particle configuration to obtain and the gradients. However, contracting a fPEPS is much more complicate than contacting a standard PEPS, because of the anti-commutation relation of fermions. One must be very careful about the contraction order and underlying fermions’ order in EPR pairs. We show here that with the help of Fermi arrows and the operation rules associated with them, the contraction can be done easily as in the standard PEPS algorithms.
To obtain , we need to contact a single layer of fPEPS with fixed particle configuration . We adopt the boundary-MPO method, Lubasch et al. (2014a, b)where we need to find a fermionic matrix product operator (fMPO) denoted as with bond dimension [see Fig.3(b)] to approximate the two rows of fPEPS with bond dimension , denoted as [see Fig.3(a)]. To find such , we minimize
[TABLE]
which lead to the linear equation for each tensor on site ,
[TABLE]
where is obtained by taking the tensor out of , as graphically shown in Fig. 4.
To solve the equation, we first contract the tensors on the left side of Fig. 4. We change the arrow directions from Fig. 3(b) to Fig. 5(a), i.e., all arrows are pointing into site . The rule of changing the directions of Fermi arrows are given in the Appendix. As will be seen in the following text, the change of Fermi arrow directions is to take the advantages of the canonic form of fMPO.Perez-Garcia et al. (2007)
We next do QR decomposition to the tensor on the first site of , resulting in two tensors, and as shown in Fig. 5(b). The rules for QR (and other decompositions) in the presence of Fermi arrows are also given in the Appendix. We then contract the tensor with the second tensor on the right site, and perform QR decomposition on the second site again to obtain the tensor. We repeat this process until reach the tensor . We contract the last tensor with the , resulting in a new tenor . Similarly, we perform the LQ decomposition on the right side of , starting from the last site to the site (, ), and contract the last tensor with to get . During the LQ (QR) decompositions, new Fermi arrows have been inserted between L (Q) tensors and Q (R) tensor. After these processes, we obtain in Fig. 5(c).
We perform the same operations for . After these operations, the left side of Fig. 4 become that of Fig. 6. By using the orthogonality of and , i.e., =, which is discussed in the Appendix for the fPEPS with Fermi arrows, we obtain the right side of Fig. 6. The original equation Fig. 4 become of Fig. 7, which can be solved iteratively as in standard boundary MPO method,Lubasch et al. (2014a, b) which usually converges in a few sweeps.
The contraction in Eq. 17 can be calculated in the same procedures. Once we have , and , the energies and their gradients can be easily calculated.
In our calculations, we first perform ITE with simple update method to obtain a approximate ground state,Liu et al. (2017) which usually have energy errors around . We further optimize the fPEPS via gradient decent method to obtain the highly accurate ground state.
IV benchmark results
We benchmark our method for two typical fermion models on finite size square lattices, including the interacting spin-less fermions model and the t-J model. We demonstrate that our method can give highly accurate results compared with the exact results.
IV.1 The spin-less fermions model
The interacting spin-less fermions model reads,
[TABLE]
where and are the creation and the annihilation operators, and denotes the nearest neighbor pairs. The chemical potential is set to zero here. We set hopping parameter =1, and the interaction strength . In general this model is not exactly solvable, and has been numerically investigated in Ref. de Woul and Langmann, 2010 by the mean field theory and in Ref. Corboz et al., 2010 by the iPEPS method. Both methods give similar phase diagrams. For the parameters we used, the ground state is in a uniform metallic phase when is small and moves towards the phase boundary between the uniform phase and the phase separation with the increasing of . Therefore the ground state of the model is expected to have entanglement beyond area law. Wolf (2006)
We firstly calculate this model on a 44 square lattice so we can compare the fPEPS results with those obtained from the exact diagonlization method. In the calculations, we take , and . The convergence of these parameters will be discussed in details in Sec.V. The results are presented in Table 1 for various . As seen from Table I, the SU method may give the results with errors around 510*-3* when is small, but the errors increase for larger . When =2, the error of SU is about 10*-2*. The GO may significantly improve the ground state energies. By using the given and , we are able to obtain an impressive highly accurate ground state with relative error near .
We now consider a special case of =0, where the model reduces to the free fermion model. Although in this case, the model is exactly solvable, it is a challenging model for the fPEPS method because the free fermions have strong entanglement in real space that violates the area law Wolf (2006). Especially at =0, the Fermi surface is very large, making the problem more difficult. One may expect that to obtain the high accuracy results it requires very large and . Furthermore, the required parameters and will generally increase rapidly with the increasing of the size of the system to keep the given accuracy. In Table 2, we list the ground states energies of the free fermion model on the square lattice with different sizes obtained from the SU and GO methods, compared with the exact results . We see that the relative errors of the SU are usually about for =8, but sometimes the SU method may have numerical instability in some small systems. The performance of GO is much better, and we always get stable results. Even with a small =6, the relative errors are about , and reduce to when =8 is used. On the other hand, the violation of the area law is also showed in this table, that the accuracy gets lower in larger systems for a given .
From the above tests, we find that the SU method sometimes may give rather accurate results ( 10*-2* - 10*-3*), but the situation may change from case to case. On the other hand the GO always gives reliable and highly accurate results ( 10*-5*).
IV.2 t-J model
In this section, we benchmark our method on the t-J model,
[TABLE]
where is the spin index and is the spin operator on site . is the number of electrons on site . In t-J model, the electron double occupancy is forbidden. The t-J model is one of the key models to understand many important physical phenomenaZhang and Rice (1988), such as high superconductivity Lee et al. (2006). Here, we calculate the model with and hole filling of 0.125. The U(1) symmetry is adopted to enforce the particle number conservation. But true physics of the system at this point, whether the ground state is a stripe state Hellberg and Manousakis (1999); White and Scalapino (1998); Corboz et al. (2011, 2014) or an uniform phaseSherman and Schreiber (2003); Vineet Mallik et al. (2018), is still under debate. Without doubt, the energy is one of the critical criterions to determine the ground state of the system. We calculate the ground energies of different system sizes, using , , and the results are shown in Table 3 for both SU and GO methods. Again we see GO method greatly improves the energies obtained from SU method. By extrapolating the energy to thermodynamic limit, we obtain that the ground state energy -0.6701, which is lower than the value -0.6693 Corboz et al. (2014) from state of art DMRG calculations White and Scalapino (1998) with , and -0.6619 obtained from variation quantum Monte Carlo plus -step Lanczos methods. Hu et al. (2012) More results of t-J model Don will be published in a separate paper.
V convergence of fPEPS
Fermion systems may have large entanglement that beyond the area law Wolf (2006) and therefore it may need large to represent the many-particle state. One may also expect that the and will increase with the size of the system. The speed of the increasing of and along with the size of the system indicates the efficiency of the simulation methods. It is important to understand how the fPEPS calculations converge respect to and . For finite systems, we can explicitly exam what and are needed to converge the results as the size of the systems grow up. In this section, we will discuss the convergence of the parameters and respectively. We show that the behavior depends strongly on the models.
We first investigate the convergency of the ground state energies to in a given system with fixed parameter . We calculate the error of energy defined as,
[TABLE]
where is the energy with a giving truncation parameter and is the converged energy where the maximal is used.
Figure 8(a) depicts the results of the spin-less fermion model with (free fermion) and ; and different system sizes, the and lattices. We fix the bond dimension at . We use for the system and for the system. We first note that s approach 0 in a non-trivial way, which are not always from above (i.e., 0). This means that the ground state energy is not variational to , and therefore one must be very careful to extrapolate to infinite. The convergency of energy is model dependent. As shown in the figure, converge much faster for (correlated electrons) than for =0 (free electrons). In both cases, the convergency of energy strongly depend on the size of the systems. In the cases of small sizes =4, the energies converge rather fast with . However, for the system, converge much slower as functions of . For , the energy is well converged at (about 3), whereas the energy of free fermions is not well converged even at .
We investigate the convergence of the t-J model at hole doping =0.125, and the results are shown in Fig. 8(b). In the calculations, is used, and the result of =50 is used as a reference. Interestingly, we find the ground state energies converge rather fast with . The errors reduce to 3 for =2, and the errors reduce to 1 for =3, More importantly, unlike the interacting fermion model, is only slightly dependent on the size of the system.
The energy errors in the calculations are induced by the contraction errors due to bond dimension truncation. We further test the relationship between the convergent truncation dimension and the size of the system, i.e. we exam the minimal needed to ensure the relative contracting error (see Eq. 18) along with the increasing of the system size. The bond dimension used here is fixed to a relatively small one . We compare the truncation errors for the spin-less interacting electron model at =0 and =2. For the t-J model, we compare two situations, the hole doping =0.125, and the =0, and the later one reduces to the Heisenberg model. The results are shown in Fig. 9(a). We find that the required is almost independent of the size of the system for the Heisenberg model, and for the t-J model with hole filling =0.125. However, the required increases rapidly with the size of the system for the interacting electron model, especially for the free electrons. At =12, =80 is required to ensure the desired contraction accuracy for the free electron model and =40 for the =2 model.
We also exam the relationship between and the bond dimension . In this test, we fixed the size of the system to =10. The results are shown in Fig. 9(b). We see that the required increase roughly linearly with for these models. For the Heisenberg model (and even J1-J2 model) Liu et al. (2018) and the t-J model with hole filling 0.125, D_{c}$$\sim 2 - 3 is enough to ensure the accuracy of contraction, whereas for the interacting electron model, D_{c}$$\sim 9 - 15 are required to ensure the desired contraction accuracy, which becomes the major difficult to simulate these models. We note that in the standard contraction method, the bond truncation dimension for a double layer tensor network should scale as , Lubasch et al. (2014a, b) making the simulation of fermions with large even more difficult.
With the convergent for each , we can analyze the convergence of the energy against for a given system size. The energy of a model in the thermodynamic limit can be further extracted by finite size scaling method. The convergence of the energy against the parameter are shown in Fig. 10, where is defined as the energy differences compared to those of maxima , which are =8 for the free fermion model and =10 for the interacting fermion model with =2. A =12 is used for the t-J model with =0.125. Surprisingly, for the free fermion model (=0), we have for =7 on the 1010 lattice, as shown in Fig. 10(a), where one may expect a much larger error. For the interacting fermion with , which is shown in Fig. 10(b), the energy is also converged to at =8. On the other hand, for the t-J model, the energies converge rather slowly with . For the system, the energy errors reduces to about at . The non-trivial dependence of and for different models may pose some interesting questions to understand the structure of fPEPS. We leave these problems for future studies.
VI Summary
In this work, we extend the stochastic gradient optimization method combined with Monte Carlo sampling techniques to optimize the fPEPS wave functions for fermion systems. The Monte Carlo sampling techniques may greatly reduce the scaling of the calculation, and therefore allow using larger bond dimensions () and bond truncation dimensions () in the calculations, which is important for the faithful simulations of fermion systems.
We benchmark the method on the interacting spinless fermion models, and the t-J models. The numerical calculations show that the gradient optimization may greatly enhance the accuracy of the results over the simple update method. We further investigate the converge of fPEPS calculation with respect to and for the models. The free fermion model is most challenging to simulate with fPEPS, because the increase very rapidly with and the size of the system. For t-J models, we find that large s are needed to converge the results. Our method therefore offer a powerful tool to simulate fermion systems because it has much lower scaling in both computational time and memory than direct contraction methods.
VII Acknowledge
This work was funded by the National Key Research and Development Program of China (Grant No. 2016YFB0201202), the Chinese National Science Foundation (Grants No. 11774327, No. 11874343, No. 11474267), and the Strategic Priority Research Program (B) of the Chinese Academy of Sciences (Grant No. XDB01030200). China Postdoctoral Science Foundation funded project (Grant No. 2018M632529). The numerical calculations have been done on the USTC HPC facilities.
Appendix A Rules for Fermi arrows
In this Appendix, we give the rules of operations associated with the Fermi arrows in fPEPS. These rules are straightforward to prove.
A.0.1 Reversing Fermi arrows and the Hermitian conjugate
Sometimes, we need to reverse the direction of a Fermi arrow. The rule of reversing Fermi arrows is giving as follows. Suppose,
[TABLE]
are two projectors in a fPEPS that are connected by a Fermi arrow pointing from to , as shown on the left side of Fig. 11. We may reverse the Fermi arrow, pointing from to , and resulting in two possible (but equivalent) forms that are given on the right side of Fig. 11. It is easy to prove that,
[TABLE]
When we calculate the expectation value of a physical quantity, , we need take the Hermitian conjugate of ket state to get the bra state . When taking the Hermitian conjugate of the projectors in a fPEPS, we need to (i) reserve the orders of the indices of the tensor associated with the projectors, e.g., change tensor to , as shown in Fig. 12; and (ii) reverse all the Fermi arrows associated with the projectors. Note that here the reversion of the Ferim arrows is required by the Hermitian conjugate, and no change is needed for the tensors during the process.
A.0.2 Matrix decompositions and contractions
The operations such as tensor decompositions also have close relation to the Fermi arrows. For example, in the standard PEPS, when we do SVD to a matrix , we have =. However, in fPEPS, two Fermi arrows should be inserted to the inner bonds after the decomposition, i.e., the Fermi arrow pointing from to , and the one pointing from to as follows, and schematically shown in Fig. 13(a),
[TABLE]
where
[TABLE]
Other matrix decompositions such as LQ/QR decompositions follow the similar rules, i.e., one need to insert Fermi arrows (i.e., directed EPR pairs) between the decomposed matrices, as shown in Fig. 13(b),(c).
In standard PEPS, we often use so called canonical form of MPS in the MPO algorithmLubasch et al. (2014a, b) to contract the PEPS, taking the advantage of the orthogonality of the tensors obtained from LQ/QR decompositions (or the and matrices from SVD decompostions), Schollwöck (2011) i.e., =, where is a unit matrix. However, this relation cann’t be directly used in the fPEPS, where we need to take the Fermi arrows into consideration during the contractions. It is easily prove that only when the Fermi arrows have “consistent directions”, i.e., all Fermi arrows point from to , or from to , we can use the orthogonality condition for matrix. The results after contraction are Fermi arrows pointing to the right or to the left, as schematically shown in Fig. 14. If the Fermi arrows are not “consistent”, we need to rearrange the directions of the Fermi arrows first to make them “consistent”, before we can use the orthogonality condition. This is done in Sec.III, when we contract two rows of fPEPS via a MPO scheme.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lee et al. (2006) P. A. Lee, N. Nagaosa, and X.-G. Wen, Rev. Mod. Phys. 78 , 17 (2006).
- 2Stormer et al. (1999) H. L. Stormer, D. C. Tsui, and A. C. Gossard, Rev. Mod. Phys. 71 , S 298 (1999).
- 3Edwards and Hewson (1968) D. M. Edwards and A. C. Hewson, Rev. Mod. Phys. 40 , 810 (1968).
- 4Imada et al. (1998) M. Imada, A. Fujimori, and Y. Tokura, Rev. Mod. Phys. 70 , 1039 (1998).
- 5Foulkes et al. (2001) W. M. C. Foulkes, L. Mitas, R. J. Needs, and G. Rajagopal, Rev. Mod. Phys. 73 , 33 (2001).
- 6Li and Yao (2018) Z.-X. Li and H. Yao, ar Xiv:1805.08219 v 2 (2018).
- 7Loh et al. (1990) E. Y. Loh, J. E. Gubernatis, R. T. Scalettar, S. R. White, D. J. Scalapino, and R. L. Sugar, Phys. Rev. B 41 , 9301 (1990).
- 8Troyer and Wiese (2005) M. Troyer and U.-J. Wiese, Phys. Rev. Lett. 94 , 170201 (2005).
