Topological Defects in Quantum Field Theory with Matrix Product States
Edward Gillman, Arttu Rajantie

TL;DR
This paper demonstrates how matrix product states can effectively model topological defects in a relativistic scalar field theory, enabling calculation of kink properties and insights into defect dynamics.
Contribution
It introduces a general tensor network approach to study topological defects in quantum field theories, including methods to compute kink mass and analyze defect-antikink contributions.
Findings
Successfully approximated kink states as matrix product states
Calculated kink mass and compared it with theoretical expectations
Showed the method's applicability to non-equilibrium defect physics
Abstract
Topological defects (kinks) in a relativistic scalar field theory in are studied using the matrix product state tensor network. The one kink state is approximated as a matrix product state and the kink mass is calculated. The approach used is quite general and can be applied to a variety of theories and tensor networks. Additionally, the contribution of kink-antikink excitations to the ground state is examined and a general method to estimate the scalar mass from equal time ground state observables is provided. The scalar and kink mass are compared at strong coupling and behave as expected from universality arguments. This suggests that the matrix product state can adequately capture the physics of defect-antidefect excitations and thus provides a promising technique to study challenging non-equilibrium physics such as the Kibble-Zurek mechanism of defect formation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Topological Defects in Quantum Field Theory with Matrix Product States
Edward Gillman
Arttu Rajantie
Department of Physics, Imperial College London, SW7 2AZ, UK
(31 October 2017)
Abstract
Topological defects (kinks) in a relativistic scalar field theory in are studied using the matrix product state tensor network. The one kink state is approximated as a matrix product state and the kink mass is calculated. The approach used is quite general and can be applied to a variety of theories and tensor networks. Additionally, the contribution of kink-antikink excitations to the ground state is examined and a general method to estimate the scalar mass from equal time ground state observables is provided. The scalar and kink mass are compared at strong coupling and behave as expected from universality arguments. This suggests that the matrix product state can adequately capture the physics of defect-antidefect excitations and thus provides a promising technique to study challenging non-equilibrium physics such as the Kibble-Zurek mechanism of defect formation.
I Introduction : Topological Defects and Tensor Networks
A significant problem within quantum field theory (QFT) is the calculation of non-perturbative and non-equilibrium problems. Standard perturbation theory can be used for near-equilibrium weak coupling problems but fails in other cases. For equilibrium calculations, lattice theory provides a powerful approach but has limited applicability to non-equilibrium problems Smit (2002). On the non-equilibrium side, the use of 2-particle irreducible (2PI) or more generally nPI effective actions is common Berges (2004) and can be used in conjunction with large expansions for a power series approach while stochastic quantisation allows for the extension of non-perturbative lattice techniques to the case of real-time Berges and Stamatescu (2005). Recently, Hamiltonian truncation techniques have also emerged as an alternative to the available functional/lattice techniques and have been applied to both equilibrium Rychkov and Vitale (2015) and non-equilibrium calculations Rakovszky et al. (2016).
A particularly good test for the various available techniques is offered by the study of topological defects in QFT. Topological defects are naturally non-perturbative and cannot be included by perturbative expansions about classical vacuum solutions Rajaraman (1987). Instead, the QFT is split into different topological sectors, associated to a particular topological charge, and each sector must be treated separately in perturbative expansions. For non-perturbative equilibrium calculations involving defects, lattice techniques have been successful Groeneveld et al. (1981); Rajantie and Weir (2009, 2010, 2012) and more recently Hamiltonian truncation has also been used Bajnok and Lajer (2016). However, non-equilibrium calculations can be more problematic as highlighted in the study of defect formation during phase transitions. Despite having a simple and general description via the Kibble-Zurek mechanism Rajantie (2002), naive applications of the effective action techniques fail to include the contribution of defects Rajantie and Tranberg (2006) requiring the use of less standard techniques Berges and Roth (2011). Additionally, simple observables such as the quasi-particle excitation density in general fail to capture the relevant physics in such problems Uhlmann et al. (2010) .
In this paper we apply a recent approach based on the use of matrix product states (MPS) to the study of topological defects. These techniques deal with states directly and are most commonly used to approximate the ground states of strongly-coupled quantum systems in . While originally specialised to the study of gapped, finite size lattice systems with open boundary conditions Schollwöck (2011), these techniques have seen significant developments in recent years and have been applied to e.g. periodic systems Verstraete et al. (2004); Pippan et al. (2010); Rossini et al. (2011), infinite volume systems Haegeman et al. (2013); Zauner-Stauber et al. (2017) and non-equilbrium calculations Haegeman et al. (2011); Wall and Carr (2012). More generally, matrix product states are part of a broader class of tensor networks (TN) which allow further extensions to e.g. critical/gapless systems Vidal (2008); Evenbly and Vidal (2014) and the equilbrium study of excited states (including kink excitations) Haegeman et al. (2012); Porras et al. (2006). Matrix product states have also been applied previously to the study of field theories in a number of settings. For example, they have been applied to lattice regularised relativistic field theories in with both global symmetries Weir (2010); Milsted et al. (2013) and gauge symmetries with substantial progress being made in the Schwinger model () Bañuls et al. (2013); Rico et al. (2014); Buyens et al. (2014); Pichler et al. (2016); Buyens et al. (2016), including an explcit example overcoming the sign problem Bañuls et al. (2017), in addition to studies in the gauge theory Kühn et al. (2015). More recently, there has also been a focus on the description of field theories using MPS and TN in the continuum, providing an alternative regularisation scheme to the lattice Ganahl et al. (2017); Verstraete and Cirac (2010); Jennings et al. (2015). Finally, while many of the studies involving TN have been for , there have been a number of recent developments towards the application of tensor network methods to higher dimensions, particularly to using the projected-entangled-pairs states (PEPS) TN Vanderstraeten et al. (2016); Lubasch et al. (2014). These developments include work towards the application of TN to gauge theories in , Zapp and Orús (2017), making TN a promising tool for the future study of high energy physics.
In the following, we apply matrix product states to the study of topological defects (kinks) in the theory in . We study the equilibrium physics by obtaining MPS approximations to the ground state and one kink state in the lattice regularised setting. The main goal is to assess to what degree the MPS can capture the physics of defects non-perturbatively, both in the sense of providing a direct approximation to the one kink state and in capturing the contribution of kink-antikink excitations to the ground state observables. To achieve this, we focus on studying the kink mass and the scalar mass at strong and weak couplings where we can compare with analytic results. Confirming that the MPS can capture the physics of defects, particularly of the kink-antikink excitations, is essential if they are to be used to study non-equilibrium phenomena such as defect formation.
The structure of our discussion will be as follows : Firstly, we will introduce the theory in Section II and review some weak coupling and strong coupling results before introducing the lattice regularisation used throughout. Secondly, we review the construction of matrix product states in Section III and discuss their relationship to entanglement and tensor networks more generally. Following this, we provide an overview of the standard methods used to find MPS approximations to the ground state and show how they apply directly to one kink state in Section IV. More details on the information outlined in Sections III and IV , along with a useful guide to implementation, can be found in the review Schollwöck (2011). In Section V we provide a discussion of how to estimate the scalar mass from the equal time two point functions before examining some weak and strong coupling results in Section VI and concluding in Section VII.
II Scalar Field Theory in
II.1 Classical and Semi-Classical Results
In this section, we review some classical and semi-classical results for the scalar field theory in as defined by the action
[TABLE]
When , the classical potential density
[TABLE]
has two degenerate minima (vacua) corresponding to the spontaneous breaking of the global symmetry . Such static, uniform field configurations solve the classical equations of motion but there are additional topologically non-trivial static solutions called kinks given by
[TABLE]
with the corresponding antikink given by . Such solutions interpolate between the two minima and are not spatially uniform. Additionally, they have a degree of freedom (zero-mode) such that they form a continuous family of solutions. These kink solutions are the simplest possible example of a topological defect in quantum field theory. To relate these solutions to a topological charge, all the finite energy field configurations of the theory can be classified according to their homotopy. Configurations in distinct homotopy classes cannot be continuously deformed into one another (e.g. by time evolution) and the theory is split into distinct topological sectors. These sectors are labelled by a topological charge determined by the boundary conditions of the configurations as
[TABLE]
The minima have while the kink and antikink have respectively and they provide the lowest energy configurations for each of these sectors. The existence of distinct topological sectors can be confirmed from a few general features of a theory so that one can check easily for the possibility of topological defects in a wide variety of cases Vachaspati (2006).
The classical kink mass can be calculated from the classical energy
[TABLE]
by subtracting the classical vacuum energy to give
[TABLE]
In the quantum theory, the kink and anti-kink will appear as charged particles. Since they lie outside the vacuum sector, they are non-perturbative in the sense one cannot include their contribution by using perturbation theory starting from the vacuum sector. Instead, to gain semi-classical information about the defects, one must begin from the appropriate sector e.g. with classical kink configurations. Results in these sectors tends to require more work than the topologically trivial sector both due to the fact the classical kink is not spatially uniform and due to the presence of the zero-mode .
The one-loop order calculation of the kink mass is a classic result known as the ‘DHN’ result following the work of Dashen, Hasslacher and Neveu Dashen et al. (1974a, b). This can be written in terms of the scalar mass up to to give
[TABLE]
From this expression one can see that in the semi-classical region the kink appears as a heavy particle such that . As such, in this region kink-antikink excitations will provide a negligible contribution to ground state observables. More recently, zeta-function regularisation has allowed for one-loop results in finite size systems Pawellek (2009) and dimensional regularisation has provided a more systematic approach to one-loop corrections allowing for results at finite temperatures and in higher dimensions Rebhan et al. (2009). While giving equivalent results to one-loop order, a rigorous treatment of the zero-mode requires more work through e.g. the use of canonical coordinates Christ and Lee (1975), which also allow for the computation of the scalar field -point functions in the presence of the kink Gervais et al. (1975); Papageorgakis and Royston (2014).
While semi-classical expansions can provide information about the weak-coupling region , at stronger couplings they will breakdown. This is particularly important for the theory in since the phase transition corresponds to strong couplings. Therefore, near the critical point perturbation theory can no longer be used and one must turn to alternative methods.
II.2 Strong-Coupling Results : Universality
On approach to the critical point, the correlation length of the theory diverges. In this regime, the microscopic (lattice) details are irrelevant and the critical behaviour is described by a simple field theory. The nature of this critical field theory depends only a few general features of the underlying theory, e.g. symmetry or dimensionality, so that many different theories have the same critical description and can be separated into universality classes. In the theory falls into the same universality class as the classical Ising model. The critical field theory can be determined by the fixed points of a suitable renormalisation group (RG) flow and in this case corresponds to a Wilson-Fisher fixed point Peskin and Schroeder (1995). Since the corresponding critical field theory is not a free scalar field theory, standard perturbation theory cannot be used and we can say the phase transition is strongly coupled. Analytic results can instead be gained for the universality class by using RG or studying the Ising Model which is integrable. The most familiar universal results are the critical exponents and for the Ising universality class the correlation length in the symmetric and symmetry broken phases is given by
[TABLE]
where is the reduced temperature such that indicates the symmetric (high temperature) phase , the symmetry broken (low temperature) phase and the critical point. Additionally, universal amplitude ratios Delfino (1998) can be derived and in particular
[TABLE]
These results can be related to the topological defects in the theory through the Kramers-Wannier duality present in the Ising Model. While explicit kink creation operators cannot be constructed for the theory, the corresponding disorder operators can be introduced in the classical Ising model Fradkin (2017). The Kramers-Wannier duality relates the disorder operator two point correlation function on the dual lattice at coupling (temperature) to the spin operator two point function at a coupling as
[TABLE]
This duality establishes the relation between the critical exponents where is the critical exponent associated to the diverging correlation length of the disorder two point function in the symmetry broken phase. When combined with the universal amplitude ratio Equation (10) one can establish
[TABLE]
which uses the hyperscaling relation . This result corresponds in the theory to the relationship between the scalar mass and the kink mass . While universality establishes this result rigorously in the vicinity of the critical point, physically it should hold from the point where first down to the critical point, since in this region the scalar excitation will decay into a kink-antikink pair excitation which is the lightest excitation for the sector in this region. Since this result incorporates a simple linear scaling with strong coupling physics and topological defects it provides a good test for non-perturbative calculation methods.
II.3 Lattice Methods for Non-Perturbative Calculations
For general non-perturbative calculations it is common to use lattice methods. Essentially, one approximates observables of the full theory by observables of a theory defined on a lattice. An appropriate lattice theory can be obtained for a particular QFT by discretising the continuum action. Usually, this discretisation takes place in both time and space with Euclidean spacetime being used to allow for sampling, effectively transforming the problem of quantum field theory into statistical mechanics. Since analytically continuing back to real time requires further assumptions in the discrete case and tends to dramatically increase the errors from sampling, this method is essentially restricted to equilibrium physics and has been highly successful in this regard, for an introduction to these techniques see Smit (2002). Here, we will be interested in working with states at a particular time so that the lattice discretisation need only be applied in the spatial dimension leaving time continuous. In this case, the full QFT Hilbert space is truncated down to a lattice quantum theory and we can write schematically where the lattice spacing and size act as truncation parameters. For spatial discretisation, the Hamiltonian formalism can be used and the continuum theory can be written as
[TABLE]
An appropriate lattice theory can then be constructed by discretisation of this Hamiltonian. A good option is to simply replace the spatial derivatives by first order finite difference approximations leading to the Hamiltonian
[TABLE]
which is given in lattice units with . Additionally, the time-derivatives have been replaced by which obeys the canonical commutation relation . The finite lattice spacing, which we will set to one throughout, serves to restrict the possible momentum modes so that low momentum (long distance) observables of the full theory can be well approximated in the lattice theory, while higher momentum (short distance) behaviour will be modified significantly by the lattice. We will be particularly interested in the equal-time observables of the ground state and the one kink state . These are approximated in the lattice theory by the corresponding lattice observables and where the state is simply the minimum energy state of the lattice theory and is the lattice one kink state. To approximate the one kink state on the lattice it is typical to write it explicitly as the minimum energy state of the topological sector which can be selected out by enforcing twisted periodic boundary conditions (TPBC) in the spatial dimension of the theory Groeneveld et al. (1981). Both states of interest can then be written in terms of energy minimisation problems as
[TABLE]
for the lattice ground state and
[TABLE]
for the one kink lattice state (note that higher kink number excitations are suppressed exponentially with the lattice size ). Specifying states further requires the choice of a basis. The field eigenbasis is a natural choice for mean field theory type approximations but to go beyond these it is better to pick a numerically stable basis of real space harmonic oscillators Drell et al. (1976). Introducing real space creation and annihilation operators via and , a natural basis set is then the tensor products where are the eigenstates of the number operator at each site. All basis states are now labelled by a -tuple with and a general state can be written as
[TABLE]
where the state coefficient (wavefunction) now specifies the state in this basis.
III Tensor Networks and Matrix Product States
III.1 Matrix Product States
To study the ground state and one kink state observables in the theory, we would like to solve the minimisation problems Equations (15) and (16) directly. Of course, this is not possible in general since even a finite dimensional Hilbert space grows exponentially with the number of lattice sites. However, matrix product states provide a method to truncate the Hilbert space further down to a tiny subset where states can be specified efficiently i.e. with a cost that rises at most polynomially in the number of lattice sites. Within this subset, the minimisation problems can be solved numerically and direct approximations of the ground state and one kink state can be obtained.
To proceed, one first rewrites the expression for a generic state in the theory Equation (17) in the matrix product form. For the lattice MPS used here, this first requires an additional truncation in the local Hilbert space dimension such that the resulting total state space is finite. This can be achieved by simply keeping the first basis states at each site. A state can then be expressed as
[TABLE]
When using the field eigenbasis, the error associated with a similar truncation can be rigourously bound by the magnitude of local expectation values and or alternatively the energy expectation value Jordan et al. (2014). In practice, since the limit tends to be well behaved, it is usually possible to simply increase the value of in calculations until convergence in the desired observables is reached. To keep notation standard, we will let so that the total dimension of the regularised theory is now given by . The matrix product form can now be introduced by considering the wavefunction as a rank- tensor. Such a tensor can always be decomposed using a tensor train decomposition into a matrix product state form so that
[TABLE]
Here, the state coefficient has been re-expressed as the nearest-neighbour contraction of a set of rank- tensors, one per site. The size of the tensors can vary site-to-site but we fix them for simplicity to all be . The first index provides the physical index corresponding to the local basis state while the latter two provide the internal or virtual degrees of freedom. All the internal indices are contracted over while the physical indices remain uncontracted. The MPS form is made more intuitive by the convenient diagrammatic notation available for tensor networks. In this notation, tensors are represented by shapes with legs corresponding to the indices of the tensor. Contractions between indices are then indicated by the joining of two legs. For example, the rank-3 tensor is represented by
such that the PBC lattice MPS with sites is represented by
.
Often, as in the original DMRG algorithm, MPS with open boundary conditions (OBC) are used. These can be written by considering the first and last tensors as rank- tensors so that
[TABLE]
where are rank- tensors of size and the trace is no longer needed. The diagrams for OBC MPS are somewhat simpler than their PBC counterparts e.g. for
.
Due to this simplification, for convenience we will often use OBC in diagrams, though calculations will tend to involve PBC.
We emphasise that this MPS form is complete in the sense that all states can be expressed in this manner. However, the real power of MPS is revealed by considering the subset of states that can be efficiently parametrised by the MPS form. The number of parameters required to specify a MPS is simply given by the lattice size multiplied by the size of the tensor i.e . A general state requires an exponentially large number of parameters to specify so that at most . However, some states, often simply called “matrix product states” require only and these states make up the natural efficiently parametrised subset of the MPS form. By working with the MPS form and picking some value of , we can truncate the Hilbert space sufficiently so that we can deal with states directly, i.e. without the need for sampling. This allows one to approximate the states of interest by solving minimisation problems within this subset. Schematically, the series of truncations can then be shown as where the subset of states no longer retains the structure of but can be shown in many cases to form a smooth manifold, for details see Haegeman et al. (2014).
While states in the subset can be represented efficiently by a tensor network, the central question is which states actually belong to the subset and what physics is well approximated within it. The answer to this is given by considering the entanglement entropy of a reduced state for a state in the subset, defined over a spatial subregion . The entanglement entropy is then defined as
[TABLE]
A generic, random state in the Hilbert space displays an entanglement entropy that is extensive . However, states in the MPS subset have an entanglement entropy that is bound by a constant . This is an example of a low entanglement law and is extremely restrictive making matrix product states highly atypical. However, perhaps surprisingly at first, many of the physical states of interest are also low entanglement states. This can be seen somewhat intuitively from the more familiar fact that often physical states are actually quite special in that they do not have arbitrarily long distance correlations. In particular, the ground states of gapped, local systems tend to have exponentially decaying correlations while the ground states of gapless/critical systems tend to have algebraically decaying correlations. As such, one would not expect the entire subregion of a system to be correlated with the rest of the system but for the dominant contribution to come from the boundary of the subsystem . This intuition is indeed correct in many cases and e.g. it has been shown that states with exponentially decaying correlations defined on a ring obey entanglement area laws Brandão and Horodecki (2015), see Eisert et al. (2010) for a review of entanglement area laws.
The understanding that a lot of relevant physics is indeed low entanglement physics has been a driving force behind the development of tensor networks generally, see the review Eisert (2013). By using a tensor network form to parametrise the Hilbert space of interest, one can truncate down to a tiny low entanglement subset throwing out the vast majority of (highly entangled) states. In this way tensor networks provide a low entanglement effective theory with low entanglement observables being well approximated while high entanglement observables/contributions are lost. In the MPS case, the fact that means that they are suited to the description of area law states in , since in that case the area law is simply a constant. In fact, one can show that all ground states of local, gapped, lattice Hamiltonians in can be efficiently represented as MPS and that MPS always have exponentially decaying correlations asymptotically, as expected of an area law state Eisert (2013).
In terms of the observables of interest here, the effect of the entanglement cutoff can be seen easily in the connected equal-time two-point function . In the simple lattice truncation, such an observable is approximated by which will agree with the full observable in the region . Outside this region the lattice effects will be significant and the approximation poor. Since the relevant physics of the full theory takes place on a scale determined by the correlation length , this means that the physics of the full theory will be well approximated by the lattice theory whenever . The additional truncation to the low entanglement subset then restricts this region of validly further and one can think of an additional infrared length scale being introduced after which the inevitable exponential decay of the MPS will dominate and the observable will be heavily modified by the entanglement restriction. Additionally, the truncation parameter provides a short distance ultraviolet cutoff which, while essential when studying MPS formulated in the continuum Haegeman et al. (2010), is made irrelevant by the lattice spacing . Similarly, the lattice size will tend to be less relevant that the long distance scale and so we can summarise that will be well approximated by the MPS theory within the range so that to capture the relevant physics of the full theory we require the hierarchy .
III.2 Representation of Observables as Tensor Networks
Once a MPS representation for a state has been obtained, one would like to be able to calculate observables in an efficient way. This can often be achieved by representing the observable as a tensor network. The simplest example of this is the representation of the overlap between two states which can be found explicitly from the MPS form as
[TABLE]
This expression is clearer in the diagrammatic representation and can be written by introducing the convention that the conjugation of a tensor is represented by flipping the vertical, physical index such that
.
The overlap can then be represented as
.
In words, one simply contracts the physical indices of the states together site-by-site. This tensor network representation for the overlap contains no uncontracted indices so that contracting it fully will produce a single number i.e. the value of the overlap. To represent more general operator matrix elements as tensor networks it is standard to introduce a form for operators that corresponds to the MPS form. Such a representation for lattice systems is called a matrix product operator (MPO) form which can be found for a particular operator using some tricks (see Section III.3) or using more generic construction methods Hubig et al. (2017). To specify an operator, a natural basis choice is the transition basis consisting of tensor products of local operators so that the basis element is which is labelled by 2 L-tuples . In this basis an operator can be expanded as
[TABLE]
A matrix operator form is then given by rewriting the coefficient as a set of rank-4 tensors of size contracted in nearest-neighbour fashion as
[TABLE]
A diagrammatic expression for operators in (OBC) MPO form is then
such that operator matrix elements can be represented as a tensor network by sandwiching the MPO between two MPS as
.
Once an observable has been expressed as a tensor network in terms of MPS and MPO, it must be evaluated by contracting the tensors together in the pattern indicated. However, not all patterns will be equally efficient and can be exponentially expensive in the number of sites. In the present case, an efficient contraction ordering is given by proceeding horizontally from the left or right boundary. This can be expressed in terms of transfer matrices by defining the object , or if no operator is included, where and label the rank-3 tensors placed in the upper and lower positions respectively. Diagrammatically we have
and
.
In this notation, the matrix element is given by
[TABLE]
Viewing as matrix of size the cost of this matrix multiplication would naively be though this can be lowered in a variety of ways which we discuss further in Section IV.2.
Since any state can be expressed as an MPS and any operator as an MPO this representation for observables can always be used. However, we will only be interested in those observables which can be represented efficiently. For the operators this means finding which can be represented with sufficiently small . The set of operators that can be represented trivially, i.e. with , are simply operators of the tensor product form e.g. all equal-time -point functions. It is not so obvious that other important operators such as the lattice Hamiltonian can also be expressed in an efficient MPO form. However, this is indeed possible and we review the construction in the next section.
III.3 Representation of Lattice Hamiltonian as MPO
To represent the lattice Hamiltonian as an MPO it is helpful to first simplify the expression by collecting all one site terms into a single operator so that the Hamiltonian takes the form
[TABLE]
with
[TABLE]
Clearly, this has the same nearest neighbour structure as the Ising model Hamiltonian and we can use the same MPO construction methods as in that case Wall and Carr (2012). A correct MPO representation can then be obtained by building the Hamiltonian up iteratively from the rightmost site to the left. To see this, consider the matrix of operators
[TABLE]
The matrix is tri-diagonal so that multiplying gives
[TABLE]
The general structure remains the same while the Hamiltonian is built up in the bottom-left corner. The bulk terms of the Hamiltonian can be built up iteratively in this way. It then only remains to pick a single boundary term (e.g. at site ) to correctly select out the bottom-left corner, where the bulk of the Hamiltonian has been built up, and encode any remaining boundaries using the trace appearing in the MPO definitions. For PBC a consistent choice is
[TABLE]
Open boundary conditions can also be encoded by setting and to be vectors e.g. by the choice
[TABLE]
To specialise to TPBC such that , we can introduce the field variable
[TABLE]
for . This means we can treat the finite size Hamiltonian as having an impurity (the twist) at a particular lattice site and observables such as correlations functions must be transformed back to the original variables to keep the periodicity intact. The boundary matrix then takes the form
[TABLE]
Through the above constructions we have found that indeed the nearest-neighbour Hamiltonian can be expressed efficiently as a MPO with , reflecting its local nature. Longer range interactions can also be considered, as would be the case if higher-order finite difference approximations were used. However, in that case larger values of would be required reflecting the less local nature of the operator. Having an efficient representation of the lattice Hamiltonian as an MPO allows for the efficient computation of observables such as the energy expectation value or energy variance, which are central to obtaining approximations of the ground state and one kink state as MPS.
IV Approximation of ground state and one kink state via variational search.
IV.1 Variational Energy Minimisation for Matrix Product States
We now turn to the approximation of the ground state and one kink state using MPS. While the minimisation problems given by Equations (15) and (16) are too difficult to solve in the full lattice Hilbert space, we can restrict them to the MPS subset. The approximations to the states then become
[TABLE]
for the ground state and
[TABLE]
for the one kink state, where only the most important truncation parameters and have been kept explicit. Despite the restriction, such a problem is still too hard to solve globally but one can try an iterative procedure that minimises the energy at each step, converging to a best estimate for the global solution. This is now a standard procedure for approximating ground states with MPS and has been highly successful in a variety of cases, see Schollwöck (2011) for a detailed guide to implementation. The tensor network structure of the MPS provides a natural way to proceed : one can minimise the energy with respect to just a single tensor (i.e. at a single site) while keeping all other tensors fixed. One then proceeds tensor by tensor minimising the energy iteratively. This is most efficiently performed in a sweeping pattern moving from site to site in a given direction until some convergence criteria are met. In the tensor network representation of observables this local minimisation has a useful form in terms of a generalised eigenvalue problem. To see this, consider the tensor network representation of the energy expectation value with in MPO form with
.
If we are only interested in varying a single tensor at a particular site, e.g. the central site , all other tensors can be contracted together, giving
for OBC while in PBC the boundary tensors would include additional indices to be traced over. This expression can then be thought of as the action of an effective Hamiltonian on the tensor at the uncontracted site. In terms of the tensor network, the effective Hamiltonian then takes the form
which is a linear operator on the space of rank-3 tensors. As such, it can be considered a matrix of size that acts on vectors of size (i.e. the rank-3 tensors). In a similar way, an effective normalisation matrix can be constructed by replacing the MPO representing the Hamiltonian by the identity operator via where . To emphasise this structure we can use the notation and write the two effective operators as matrices on this space, and . In this notation the (ground state) minimisation problem at this site can be written as
[TABLE]
which can be solved by finding the minimum eigenvector of the generalised eigenvalue problem
[TABLE]
To find an approximation to the ground state one then initiates a (random) MPS, chooses a site , forms the effective operators , finds the minimum eigenvector of the generalised eigenvalue problem Equation (37) and updates the current MPS by replacing the rank-3 tensor at the site with the rank-3 tensor corresponding to the minimum eigenvector . The updated MPS will have a lower energy and so by proceeding to the next site the energy can be lowered iteratively until convergence is achieved.
The above method can also be used to find an MPS approximation to the one kink state by simply exchanging the PBC Hamiltonian with the TPBC Hamiltonian. However, there are important differences to consider that make the approximation of the one kink state more difficult than the ground state. In particular, the matrix product state techniques described here are naturally inhomogeneous and during the minimisation procedure translational invariance will be broken numerically leading to spatial dependence of the tensors . As such, the translational invariance of observables is only approximated. In the case of the ground state, this is no problem since the MPO representation of the lattice Hamiltonian is reasonably homogeneous and translational invariance can be easily approximated with a low MPS. However, for the one kink state, the Hamiltonian appears quite inhomogeneous with a particular location being selected for the twist. This makes it much harder to approximate translational invariance and the kink must be “delocalised” by using a sufficiently high .
The approximation of translational invariance tends to happen quickly so that one can think of a threshold after which the spatial variance of local observables drops dramatically. The value of will depend on the observable in question as well as the values of and with higher and larger leading to an increased . The dependence on is particularly important since it means that in regions of parameter space requiring high it will become impossible to approximate translational invariance for the kink state with this method. Since the size of is determined by the value of the field expectation value (or to allow for invariant cases) we see that it is the semi-classical region that will be hard to approximate in this sense, while the strong coupling region will be less problematic. Of course, this is not too much of an issue since the semi-classical region can be treated perturbatively and observables that include contributions from the entire lattice, e.g. the kink mass, do not depend strongly on the translational invariance of the state.
Similar issues are present when approximating excited states with MPS more generally and have led to the development of various tensor network excitation ansatz, including one for kinks, that enforce translational invariance explicitly Haegeman et al. (2011). However, there is some evidence that an inhomogeneous representation could better capture certain aspects of the kinks, which are naturally non-linear field configurations Berges and Roth (2011). This would likely make little difference when considering an observable such as the kink mass, but might be important with observables that have significant contributions from the kink “width” e.g. the form factor or the equal time two point functions in the presence of a kink near the critical point. We do not explore this further but the excitation ansatz is at least more efficient when determining the mass of excitations near the critical point and we will use them for comparison with the methods here.
IV.2 Computational efficiency, numerical stability and uMPS minimisation
The computational efficiency and numerical stability of the minimisation procedure must be considered at two main stages. Firstly, observables must be calculated efficiently corresponding to the correct choice of contraction ordering. This also covers the construction of the effective operators which are built up by the partial contraction of a similar tensor network. Secondly, the generalised eigenvalue problem Equation (37) must be solved.
In the first case, an efficient contraction pattern is given by simply multiplying the transfer matrices as matrices for a computational cost . A more efficient way to multiply the transfer matrices can be found by making use of their tensor network structure, reducing the cost to . In the case of OBC MPS and MPO, a significant speed up is possible since the boundaries act as vectors and only matrix-vector multiplications are needed. In this case, the naive scaling (i.e. without taking advantage of the tensor network structure) is simply , which can be reduced to when exploiting the tensor network structure. When considering sufficiently long chains of transfer matrices, the cost of PBC contractions can be reduced as the boundaries become less relevant (becoming completely irrelevant in the infinite distance limit), see Pippan et al. (2010) for details and implementation. In principle this reduces the cost to , equal to OBC, but numerical stability tends to require .
The second case, that of solving the generalised eigenvalue problem, also demonstrates the significant computational advantage of OBC vs PBC. Naively, the cost of solving a generalised eigenvalue problem scales as matrix-matrix multiplication . However, if a sparse implementation is possible then only matrix-vector multiplications are required for a naive cost of . For OBC this can again be reduced to . Unfortunately, solving a generalised eigenvalue problem tends to be ill-conditioned and additional stabilisation steps must be taken. In MPS and tensor networks more generally, stabilisation is often achieved by exploiting the significant gauge freedom in the MPS representation. This freedom can be easily seen since any MPS can be equivalently rewritten by inserting identity matrices of size between any of the rank- tensors. Decomposing the identities as then leads to an equivalent MPS form of the state as
[TABLE]
Equivalently, and more commonly in practice, one can think of performing a matrix decomposition on the tensors which can be chosen so that the eigenvalue problem to be solved is better conditioned, see Schollwöck (2011) for details. In the case of OBC it is possible to choose the gauge such that the effective normalisation matrix simply becomes equal to the identity matrix i.e. the generalised eigenvalue problem is transformed into a standard eigenvalue problem which is considerably more stable. For PBC, this transformation is not possible and we have instead followed the stabilisation strategy outlined in Pippan et al. (2010). Unfortunately, in the case of TPBC this was not sufficient to be able to solve the generalised eigenvalue problem with sparse methods and we have used dense methods at a cost of . Despite the relative expense, we find that the reachable are sufficient for studying the kink mass at strong couplings though for studying other observables e.g. the two point function in the presence of the kink, higher would be needed and an alternative stabilisation strategy would be useful such as the one in Rossini et al. (2011) which was applied to a spin system with TPBC.
For the approximation of the ground state, since TPBC are not required, it is possible to gain the computational advantage of OBC by using a MPS with an explicitly translationally invariant representation which can be achieved by simply requiring all tensors to be identical i.e. for all . Since there is no spatial variation this MPS can be defined in the infinite size limit such that the boundaries are irrelevant and the computational savings of OBC can be taken advantage of. Such MPS are called uniform matrix product state (uMPS) and, due to the greatly decreased number of free parameters, more standard minimisation procedures can be used to obtain approximations to the ground state Haegeman et al. (2013); Vanderstraeten et al. (2015). Due to the greatly increased efficiency we use uMPS to approximate the ground state following the conjugate gradient procedure outlined in Milsted et al. (2013). The use of uMPS allows for much higher values of which is essential for capturing the relevant physics of the ground state near the critical point due to the diverging correlation length. Additionally, we note there is an efficient time evolution procedure associated to uMPS known as the time dependent variational principle (TDVP) Haegeman et al. (2011) making uMPS a good candidate for the study of non-equilibrium physics. For an open source uMPS code, which aided the development of the code used here, see Milsted (2017).
V Scalar Mass
V.1 Long-Distance Behaviour of
Since tensor networks represent the state of the quantum system directly, a possible method for obtaining the scalar mass is to try and directly approximate the one particle excitation in the system. However, since tensor networks are often particularly suited to the description of ground states, it is useful to have a general procedure for extracting the scalar mass from ground state observables alone. While it is possible to extract the scalar mass from the long time behaviour of two point functions, for MPS it is much easier to consider the ground state equal time two point functions which can be directly calculated once a ground state approximation is found.
To see how the scalar mass can be extracted, we can consider the Källén-Lehmann spectral representation of the time ordered ground state two point function which can be constructed quite generally for a Lorentz invariant theory Peskin and Schroeder (1995). This representation relates the full two point function to the two point function of the non-interacting theory, specifically the Feynman propagator via
[TABLE]
where and are space-time coordinates, is the spectral density given by
[TABLE]
and is a zero-momentum energy eigenstate. We can evaluate the Feynman propagator easily in at equal times to give
[TABLE]
where is a modified Bessel function of the second kind. The equal time two point function can then be written as
[TABLE]
If the spectrum contains an isolated pole then we can extract this contribution and write schematically
[TABLE]
This suggests that at sufficiently long distances, the non-interacting form of the two-point function will be dominant and depend on the dimensionless combination .
V.2 Extracting from
The above arguments motivate the use of the ansatz
[TABLE]
to extract the scalar mass. This can be achieved by taking the appropriate ratios (finite differences) to cancel overall factors as
[TABLE]
This equation can then be solved numerically to extract which depends on due the fact that is not a pure Bessel function. Following the previous arguments we can expect for some initial the value of will vary due to the lattice effects and higher eigenstate contributions before becoming uniform such that can be extracted (in practice, the uniform region must be selected by some criteria e.g. the gradient of falling below some specified tolerance, with then estimated by averaging over the selected region). As discussed previously, the use of MPS will modify the long-distance behaviour of the observable due to finite entanglement effects ultimately leading to a pure exponential decay. The distance where this occurs is determined by the truncation parameter and we denote the length scale associated to this as . We can therefore expect a constant region of to occur in some intermediate distance region above the scale of higher mass contributions and below the scale of finite entanglement corrections. A method to diagnose the finite entanglement effects is to repeat the procedure used to extract but now use an exponentially decaying form as
[TABLE]
so that
[TABLE]
One again, will vary with since only at long distances where the finite entanglement effects dominate will become a true exponential. This expression can be used to quickly get a sense of the distance where the correlation function becomes strongly modified by the finite entanglement effects of the MPS.
V.3 Extracting for Lattice MPS at Strong Couplings
At strong couplings, the lightest excitations will become kink-antikink pairs. This motivates the use of a Bessel-squared function ansatz for the two point function
[TABLE]
corresponding to the form for two non-interacting excitations. The behaviour of can be established more rigorously in the critical region by considering the critical behaviour of the classical Ising model which is described by a field theory of free massive Majorana fermions Yurov and Zamolodchikov (1991).
At strong couplings, the Bessel-squared ansatz can then be used in a similar way to the Bessel function form. In fact, both and have similar exponentially decaying asymptotic forms
[TABLE]
As such, in principle either form can be used to estimate . However, unlike , we can expect to be valid outside the asymptotic regime which is essential since then shorter distances of will need to be approximated, requiring smaller .
VI Results
We study the QFT using MPS in the lattice regularised setting. An approximation to the ground state is obtained using uMPS while a finite size lattice MPS is used to approximate the one kink state. We study both weak and strong coupling behaviour by fixing the value of or while using a range of . When the effective coupling is small we compare the MPS approximation to the classical continuum. The lowest order mass renormalisation is included numerically by fitting the classical forms to the MPS data, replacing the bare mass with where is treated as a free parameter. In the region the lattice effects will be small and the comparison with the classical continuum is appropriate. In the strong coupling region the MPS approximations can be compared with universal results and we will focus on a comparison of the mass ratio to the universal result .
The accuracy of our approximations will depend on the observable in question. Essentially, the important features are the observation distance and to what degree the observable represents an average over the system. In the first case, only the truncation parameter is important and increasing will allow longer distances to be better approximated. This can be seen clearly in ground state connected two point function where larger are required to approximate the observable at larger distances . We can associate this behaviour to a length scale corresponding to the distance at which the approximation of is dominated by finite entanglement effects and decays as a pure exponential. In the second case, one can think of a particular threshold being required before the translational invariance of observables is well approximated. This will depend strongly on the observable/state in question and on the truncation parameters and .
The issues surrounding the approximation of translational invariance can be seen clearly when calculating the field expectation values and . Since both observables are local, they will converge quickly in so long as the threshold is met. In the ground state case, is essentially negligible and the approximation of converges rapidly. However, in the one kink state is important and, for sufficiently large values of or , local expectation values such as will show significant spatial variations and cannot be accurately approximated. We note that, even in the ground-state case, the field expectation value is in princible zero, respecting the symmetry but is broken numerically during the approximation and must be enforced explicitly if desired Singh and Vidal (2013). The uMPS approximation of the vacuum expectation value is shown in Figure 1 for perturbative and non-perturbative bare couplings . In the first case, the results (blue triangles) can be compared with the classical continuum result (red line) . In the stronger coupling case, there is no analytic comparison but the expected symmetry breaking pattern can be seen. In principle, such plots can be used to determine the location of the critical point e.g. by using the critical exponent associated with the vanishing field expectation value. However, as discussed in Milsted et al. (2013), such fits are highly sensitive and it is much better to use observables with a simpler scaling, e.g. the kink mass, to determine the location of the critical point.
The more problematic behaviour of is shown in Figure 2. In the semi-classical case with bare coupling , the high field expectation value requires a relatively high value of to converge and the threshold is higher than the shown . This means that a classical-like kink profile can be seen and increasing achieves only very slight changes to the width such that the correct value is not obtained. Moreover, the zero-mode means that the point at which crosses zero is independent of the energy making convergence in or difficult to quantify. However, at stronger couplings the field expectation value is much lower, corresponding to a lower , which makes it easy to approximate translational invariance and obtain even for the modest values of shown.
Observables that average over the whole system can be much less sensitive to spatial variations in the MPS representation than observables evaluated at a particular point. For example, the behaviour of displays similar spatial variations at weak coupling as for the case of , see Figure 3 . However, the spatial average of this expectation value has a much weaker dependence on . This is shown in Figure 4 where the spatial variation of the expectation values of both and are shown by error bars corresponding to their standard deviation with . Despite the strong spatial variation in , the spatial average changes only very weakly with in both cases indicating that this observable can be well approximated even in the weak coupling case. This behaviour can be compared with that of ; while the spatial average does not display much variation in the weak coupling case, since the operator is anti-symmetric, it still gives the incorrect non-zero value and is only correctly approximated in the stronger coupling region where translational invariance is approximated. In general, observables corresponding to anti-symmetric operators cannot be reliably approximated outside the translational invariant region, while the spatial average of those corresponding to symmetric operators can be.
The kink mass is calculated from the difference of the one kink energy expectation value , obtained from the finite size lattice MPS, and the ground state energy density, obtained from the uMPS. The latter converges quickly in and the only potential issue is the approximation of . However, since the kink mass both includes contributions from the whole system, is fairly local and symmetric, we can expect a reasonable convergence with even in the weak coupling case. In the case of the scalar mass, since it is estimated from the ground state observable , approximating translational invariance should not be an issue and we can focus on the need to increase so that the region up to is well approximated. At weak couplings, is relatively small so that the required should not be too high allowing for an estimate of to be extracted relatively easily. The kink mass and scalar mass are shown for a variety of weak couplings in Figure 5 along with the classical continuum results for comparison.
At stronger couplings the correlation length increases so that longer distances of need to be approximated requiring larger . Ultimately, this means that this method cannot be used with MPS arbitrarily close to the critical point where the scalar mass vanishes. This is reflected in the fact that at the critical point the correlation length diverges leading to algebraically decaying correlations which correspond to a logarithmic violation of the entanglement area law i.e. . While an MPS can still be used to approximate short distance observables in the critical region Pirvu et al. (2012) an alternative tensor network , e.g. the multi-scale entanglement renormalisation ansatz (MERA) Vidal (2008), that obeys the correct low entanglement law will also allow the approximation of the long range physics. Of course, this means that MPS are not especially suited to the study of universal physics and we can expect difficulty when trying to reproduce the strong coupling behaviour e.g. the universal mass ratio.
The scalar mass at strong couplings is plotted along with the kink mass in Figure 6 and the qualitative change in the scaling can been seen in the left-hand plot at the point when as expected. However, the estimate of with in the critical region extracted from the Bessel function tends to be somewhat higher than the value of suggesting that, as might be expected, it is inaccurate in this region. The scalar mass extracted from the excitation ansatz with is also plotted (red dashed line) and is somewhat closer to the value of suggesting that it can provide a more efficient and accurate method to extract the scalar mass in the critical region. To increase the accuracy of the uMPS method one can simply increase the value of but it is also possible to use the Bessel-squared ansatz Equation (49). A comparison of these methods is shown for the strong coupling region in the right-hand plot. The estimate of the scalar mass is closer to the expected behaviour when is increased (red dots and diamonds) but the use of the Bessel-squared ansatz improves the estimate again (black squares) agreeing fairly well with the excitation ansatz. The significant improvement of the Bessel-squared method over the single Bessel method suggests that the uMPS is able to capture the contributions coming from the kink-antikink excitations in this observable. To achieve higher accuracies than obtained here, larger can be used by following more recently developed algorithms than the conjugate gradient minimisation used here Vanderstraeten et al. (2015). Alternatively, one can also turn to better suited tensor networks such as MERA and both the methods to obtain the kink mass and scalar mass should be easily adaptable to this case. Of course, if one is only interested in the equilibrium physics chosen here, the excitation ansatz provides good accuracy and efficiency. However, it is less flexible and cannot be readily applied to different tensor networks and instead must be built explicitly for each case.
VII Conclusion
We have studied the topological defects (kinks) of the relativistic quantum field theory (QFT) in using matrix product states (MPS). We have shown how a finite size lattice MPS approximation to the one kink state can be obtained by making use of twisted periodic boundary conditions (TPBC) and that the resulting kink mass agrees with expectations. While alternative specific excitation ansatz provide a more efficient method to calculate the kink mass, the TPBC method can be easily adapted to other theories and tensor networks while also allowing for the easy calculation of a wide variety of observables e.g. equal time field n-point functions in the presence of the kink. We have also outlined a general method to extract the scalar mass from the ground state equal time two point functions. A comparison of the one kink and ground state approximations with universal results suggests that the MPS (specifically the uMPS) is able to capture the contribution of kink-antikink excitations to observables making it an interesting candidate to study challenging non-equilibrium phenomena such as defect formation via the Kibble Zurek mechanism during quantum phase transitions.
Acknowledgements
We have made use of the Imperial College London High Performance Computing Service. E.G. was supported by the EPSRC Centre for Doctoral Training in Controlled Quantum Dynamics and A.R. by STFC grant ST/L00044X/1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Smit (2002) J. Smit, Introduction to Quantum Fields on a Lattice (Cambridge University Press, 2002). · doi ↗
- 2Berges (2004) J. Berges, in AIP Conference Proceedings , Vol. 739 (AIP, 2004) pp. 3–62, ar Xiv:0409233 [hep-ph] . · doi ↗
- 3Berges and Stamatescu (2005) J. Berges and I. O. Stamatescu, Physical Review Letters 95 , 202003 (2005) , ar Xiv:0508030 [hep-lat] . · doi ↗
- 4Rychkov and Vitale (2015) S. Rychkov and L. G. Vitale, Physical Review D 91 , 085011 (2015) , ar Xiv:1412.3460 . · doi ↗
- 5Rakovszky et al. (2016) T. Rakovszky, M. Mestyán, M. Collura, M. Kormos, and G. Takács, Nuclear Physics B 911 , 805 (2016) , ar Xiv:1607.01068 . · doi ↗
- 6Rajaraman (1987) R. Rajaraman, Solitons and Instantons (North Holland, 1987).
- 7Groeneveld et al. (1981) J. Groeneveld, J. Jurkiewicz, and C. P. K. Altes, Physica Scripta 23 , 1022 (1981) . · doi ↗
- 8Rajantie and Weir (2009) A. Rajantie and D. J. Weir, Journal of High Energy Physics 2009 , 068 (2009) , ar Xiv:0902.0367 . · doi ↗
