A minimal-variable symplectic method for isospectral flows
Milo Viviani

TL;DR
This paper introduces a simple, second-order numerical integrator for isospectral flows that is intrinsically defined on quadratic Lie algebras and symmetric matrices, preserving spectral properties and Lie--Poisson structure.
Contribution
It presents a minimal-variable, structure-preserving integrator that is computationally efficient and applicable to a broad class of isospectral flows, improving upon existing methods.
Findings
The integrator is isospectral for general flows.
It preserves Lie--Poisson structure when Hamiltonian.
It is simple, second-order, and structure-preserving.
Abstract
Isospectral flows are abundant in mathematical physics; the rigid body, the the Toda lattice, the Brockett flow, the Heisenberg spin chain, and point vortex dynamics, to mention but a few. Their connection on the one hand with integrable systems and, on the other, with Lie--Poisson systems motivates the research for optimal numerical schemes to solve them. Several works about numerical methods to integrate isospectral flows have produced a large varieties of solutions to this problem. However, many of these algorithms are not intrinsically defined in the space where the equations take place and/or rely on computationally heavy transformations. In the literature, only few examples of numerical methods avoiding these issues are known, for instance, the \textit{spherical midpoint method} on . In this paper we introduce a new minimal-variable, second order, numerical integrator for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A minimal-variable symplectic method for isospectral flows
Milo Viviani
Abstract.
Isospectral flows are abundant in mathematical physics; the rigid body, the the Toda lattice, the Brockett flow, the Heisenberg spin chain, and point vortex dynamics, to mention but a few. Their connection on the one hand with integrable systems and, on the other, with Lie–Poisson systems motivates the research for optimal numerical schemes to solve them. Several works about numerical methods to integrate isospectral flows have produced a large varieties of solutions to this problem. However, many of these algorithms are not intrinsically defined in the space where the equations take place and/or rely on computationally heavy transformations. In the literature, only few examples of numerical methods avoiding these issues are known, for instance, the spherical midpoint method on . In this paper we introduce a new minimal-variable, second order, numerical integrator for isospectral flows intrinsically defined on quadratic Lie algebras and symmetric matrices. The algorithm is isospectral for general isospectral flows and Lie–Poisson preserving when the isospectral flow is Hamiltonian. The simplicity of the scheme, together with its structure-preserving properties, makes it a competitive alternative to those already present in literature.
Key words and phrases:
isospectral flow Lie–Poisson integrator symplectic Runge–Kutta methods generalized rigid body Brockett flow Heisenberg spin chain Point-vortex on the hyperbolic plane
1. Introduction
The numerical integration of isospectral flows is a classical subject of study in numerical analysis [6, 9]. The interest in this problem is motivated by the numerical simulation of integrable systems, which are deeply related to isospectral flows via the Lax pair formulation. The quasi-periodic dynamics of integrable systems depends on the presence of a large number of first integrals. In the Lax pair formulation, some of these first integrals can be presented as a linear combination of the eigenvalues of the dynamical variable. Therefore, the preservation of the spectrum of the dynamical variable is a key feature of a numerical scheme applied to isospectral flows, in order to expect the right qualitative behaviour of the discrete approximate solutions [6]. Furthermore, as a special case, Lie–Poisson systems on the dual of reductive Lie algebras can be seen as isospectral flows [18]. A reductive Lie algebra is defined as the direct sum of a semisimple Lie algebra and an abelian Lie algebra. In this paper, the crucial property of real semisimple Lie algebras is that they can be represented as matrix Lie algbras which are closed under conjugate transpose [11, Prop. 6.28]. Moreover, any real matrix Lie algebra which is closed under conjugate transpose is reductive [11, Prop. 1.56].111In view of [11, Sec. I.8], there is no restriction in looking at real matrix Lie algebras, since the complex ones can be seen as real matrix Lie algebras of double dimension. Because of this, Lie–Poisson systems on the dual of a reductive Lie algebra can be equivalently seen as isospectral flows of the form (2) below. Lie–Poisson systems originate from the Poisson reduction of canonical Hamiltonian systems on the cotangent bundle of a Lie group [14]. Classical examples of Lie–Poisson systems are the rigid body [21], the heavy top and the incompressible Euler equations [1]. It is known from the backward error analysis, that Lie–Poisson preserving numerical schemes are superior to standard methods when applied to Lie–Poisson systems, especially for long-time simulations. As state-of-the-art, well established theories on numerical methods for both isospectral and Lie–Poisson systems exist in the literature (see for example [4, 9, 20]). For Lie–Poisson systems various symplectic algorithms have been developed (see [15] for a recent survey). However, few examples of numerical schemes that are intrinsically defined in the space where the dynamics takes place are known (e.g. [16]). This issue often causes a lack of efficiency for these schemes, which rely on group to algebra maps (e.g. the matrix exponential or the Cayley map) or a large number of unknowns. Before presenting our result, let us introduce the mathematical setup used throughout the paper.
Isospectral flows are first order ODEs of the form:
[TABLE]
Here, denotes the matrix commutator, is a linear subspace of the Lie algebra , and the function maps into its -normalizer algebra 222 is the largest Lie subalgebra of such that (see Definition 1 in Section 2).. The most studied case in literature is when is the space of symmetric real matrices, for which the normalizer is the Lie algebra of skew-symmetric real matrices . For Lie–Poisson systems on the dual of a reductive Lie algebra, we have that is the dual of a reductive Lie subalgebra of , for which the normalizer is (see Definition 2 and Lemma 2, in Section 2). Throughout the paper, we identify with , via the Frobenius inner product , where † is the conjugate transpose. Via this identifications, Lie–Poisson systems on the dual of a reductive Lie algebra take the form:
[TABLE]
for , a smooth function called Hamiltonian.
A class of numerical methods to solve (1)-(2), called Isospectral Symplectic Runge-Kutta (IsoSyRK), has been introduced in [18]. In the case of Lie–Poisson systems these schemes are symplectic. In this paper, we focus on the IsoSyRK associated to the implicit midpoint method, which turns out to have a specially nice structure. On the one hand, we provide a simpler proof (avoiding the use of the B-series theory) that for the implicit midpoint method, the respective IsoSyRK defined in [18] is isospectral for any . On the other hand, we derive a simpler scheme reducing the number of unknowns up to minimality, revealing an intrinsic relation between the implicit midpoint method and the Cayley transform. The resulting integrator, although implicit, is second order, isospectral and symplectic when the isospectral flow is Lie–Poisson. The scheme is also intrinsically defined on , for a large class of isospectral flows (see section 2 for details) and, when the isospectral flow is Lie–Poisson, it preserves the coadjoint orbits. Furthermore, only one evaluation of and two matrix multiplications per iteration are required, making the scheme very efficient. In the last section of this paper, we show some numerical examples of our scheme and we compare it with the spherical midpoint method, which is another minimal-variable Lie–Poisson integrator on . Finally, we show how our scheme looks on , defining what we call the hyperbolic midpoint method.
Acknowledgements. The author was supported by EU Horizon 2020 grant No 691070, by the Swedish Foundation for International Cooperation in Research and Higher Eduction (STINT) grant No PT2014-5823, by the Swedish Foundation for Strategic Research grant ICA12-0052, and by the Swedish Research Council (VR) grant No 2017-05040. The author would like to thank Klas Modin for the support and the enlightening discussions during the work on this paper.
2. Main result
Let us consider an isospectral flow of the form (1). In order to present our result, we need a short detour on some concepts and basic results on Lie algebras. As already mentioned in section 1, for (1) to be well defined, we have to require to take values in the -normalizer algebra of . We recall here the definition of the normalizer Lie group and normalizer Lie algebra:
Definition 1**.**
Let be a Lie group and its Lie algebra. Furthermore, let be a linear subspace. Then the two sets
[TABLE]
are respectively called the -normalizer and the -normalizer of . Notice that is a subgroup of and is a Lie subalgebra of .
A related concept to normalizer is the centralizer Lie algebra.
Definition 2**.**
Let be a Lie algebra and let be a linear subspace. Then the set
[TABLE]
is called the -centralizer of . Notice that is a Lie subalgebra of .
We now recall the definition of a quadratic Lie algebra.
Definition 3**.**
A Lie subalgebra of is called quadratic Lie algebra if there exists an invertible matrix , such that
[TABLE]
Lemma 1**.**
Let be a Lie subalgebra of such that there exists a matrix for which
[TABLE]
Then implies . Moreover, if is invertible and , then .
Proof.
Suppose . Then, for all , both the following identities hold:
[TABLE]
The second of these implies that and the first one . Subtracting these identities, we get . Hence .
Now assume that is quadratic and . Then, for all we have and hence , being invertible. Therefore and are defined by the same identity and they coincide. ∎
Lemma 2**.**
Let be a Lie subalgebra of such that . Then the normalizer of is , where is the semisimple ideal of such that , for the center of .333Such a decomposition always exists being reductive by [11, Prop. 1.56].
Proof.
Let be the orthogonal complement of in with respect to the Frobenius inner product. It is not hard to check that the following properties hold:
[TABLE]
Hence, if it must be . Therefore, has to be in . Moreover, we have that the following inclusions always hold . Therefore, , being centerless. ∎
Notice that, since always , we have that , whenever . In conclusion, we have the following corollary:
Corollary 1**.**
Let be a quadratic Lie subalgebra of such that . Then the normalizer of and are respectively and . In particular, under the identification of with via the Frobenius inner product, any Lie–Poisson system on can be written in the form (2).
Notice that by [11, Prop. 1.56] any such as in Corollary 1 is a reductive Lie algebra. As mention in the introduction, Lie–Poisson systems on the dual of reductive Lie algebras can be written in the form (2). In particular, this is true for Lie–Poisson systems on the dual of , where is an Abelian Lie algebra. We can now state the main result of this paper.
Theorem 1**.**
Let , for a domain in the linear subspace . Assume that the normalizer splits as , for some Lie algebra , which satisfies
[TABLE]
for some constant matrix . Furthermore, let be continuously differentiable. Then, for some , there exists such that the numerical scheme , implicitly defined by:
[TABLE]
is a second order isospectral integrator for (1), for any .444Here denotes the identity matrix.. Moreover, when (1) is a Lie-Poisson system on or on the dual of some quadratic Lie algebra such that (or even on , where is an Abelian Lie algebra), then (5) is a Lie-Poisson integrator for (1) which preserves the coadjoint orbits in .
Remark 1**.**
The main contribution of Theorem 1, with respect to the results presented in [18], is that the scheme (5) is a minimal-variable isospectral (Lie–Poisson) integrator. Minimal-variable means here that the only unknown is , which lives in a vector space of dimension . Hereafter, we will refer to the scheme (5) as the isospectral minimal midpoint. Moreover, the proof of the properties of (5), unlikely to [18, Cor. 1], does not require any application of the B-series theory and reveals a deep connection with the Cayley transform (see the proof of Lemma 4). The latter is a quite interesting fact because the Cayley transform arises as a necessary consequence of the use of the implicit midpoint scheme and not, as it has always appeared in literature, as a prescribed choice to construct a certain numerical scheme. We also emphasize that the condition (4) and the ones on in Theorem 1 to get a Lie–Poisson integrator is slightly more general than the one considered in [18, Thm. 1-2].
We will give the proof of Theorem 1 in some lemmas.
Lemma 3**.**
Let continuously differentiable in the domain . Then, for every , there exist such that the equation
[TABLE]
has a solution for any .
Proof.
In order to get a solution to (6), we consider the function , such that (6) is equal to . In order to determine , we consider the initial value problem:
[TABLE]
Since is continuous, if we prove that the operator is continuous and invertible, then the Peano existence theorem will ensure a solution , for any in some interval , for . Indeed, , where is polynomial in its variables and is continuous by hypothesis. Hence, , for , therefore there exist some such that is invertible, for any . ∎
Lemma 4**.**
Let continuously differentiable in the domain and let as in lemma 3. Then, for every , the numerical scheme is isospectral. Moreover, if , for domain in the linear subspace , and , where , for some Lie algebra which satisfies (4), then . Furthermore, when is a quadratic Lie algebra such that , then , where is the coadjoint orbit of which belongs.
Proof.
Clearly and hence and are similar. Furthermore, we notice that , where Cay is the Cayley transform. Therefore we have that:
[TABLE]
Assuming , for some Lie algebra which satisfies (4), by [6, Lemma IV.8.7], is in the normalizer group of and therefore is in as well. When for a quadratic Lie algebra such that , the transformation (7) coincides with the coadjoint action of on , where is the respective connected component to the identity of a Lie group with Lie algebra . Therefore, (7) fixes the coadjoint orbits. ∎
Remark 2**.**
We point out that the equation (7) reveals an interesting relation between the Cayley transform and the implicit midpoint method. Indeed, the fact that the Cayley transform appears as a consequence of the reduction of the implicit midpoint from to may indicate a deeper, perhaps canonical, relation between symplectic Runge–Kutta methods and the Cayley transform. The former are associated to conservation of quadratic first integrals of ODEs and the latter to transforming quadratic Lie algebras in quadratic Lie groups.
Corollary 2**.**
Let the hypotesis of Lemma 3 hold. Then, if and is a compact Lie algebra, there exists independent from such that the scheme (5) has solution.
Proof.
Since is a compact Lie algebra, the associate connected Lie group is compact and therefore the orbits of the action (7) are compact. Hence, we can find a minimum in Lemma 3 independent from the iteration . ∎
Lemma 5**.**
Let continuously differentiable in the domain and let as in lemma 3. Then, for every , the numerical scheme in (5) descends from the method defined in [18, Def. 1] associated with the implicit midpoint method. In particular, if for some function , then the method is Lie–Poisson in .
Proof.
Consider the second order method as defined in [18, Def. 1] associated with the implicit midpoint method:
[TABLE]
for with unknowns . It is not hard to check that the following identities hold:
[TABLE]
Applying these to (8) we get, after some computations, the scheme (5). In [18, Thm. 3], it has been proven that when , for some functions , the method is a Lie-Poisson integrator in . The scheme defined in (5) coincides with (8), but with the elimination of the intermediate variables . Therefore, (5) is a Lie–Poisson integrator. ∎
Proof.
. The proof simply follows from the lemmas. Lemma 3 says that the method (5) has solution for sufficiently small. Under the assumptions of Lemma 4 proves the isospectrality of the scheme and its intrinsic restriction , when , for some Lie algebra which satisfies (4). Finally, in Lemma 5 is shown that the scheme descends from the isospectral midpoint method defined in [18] and therefore it is a second order Lie–Poisson integrator in , when (1) is. Putting together Lemma 4, Lemma 5 and Corollary 1, we have that when a is the dual of a quadratic Lie algebra such that (possibly plus a commutative Lie algebra) and (1) is Lie–Poisson, the scheme (5) is a Lie–Poisson integrator on , which preserves the coadjoint orbits. ∎
Remark 3**.**
We notice that the isospectral minimal midpoint (5) is somehow similar to the modified implicit midpoint rule introduced in [4]. However, in their scheme, was set to be which does not hold in general while solving the isospectral minimal midpoint (5). In fact, even though the scheme in [4] is isospectral, it is not symplectic.
Remark 4**.**
The isospectral minimal midpoint (5) can be derived in a different way, as proposed in [15]. The construction there is more general and (5) can be recovered choosing as a retraction map the Cayley transform instead of the exponential map. This surprising connection opens up a question about a geometrical description of the methods proposed in [18, Def. 1]. Let us consider for any and real matrix, a retraction map . Then, similarly to [15], for each , it is implicitly defined by the differential of the retraction map a discrete map . Finally, we define our integrator as:
[TABLE]
for some real numbers . The question is whether, for any -stage symplectic Runge–Kutta method, there exists a retraction map , such that any Lie–Poisson integrator defined in [18] can be obtained in this way.
3. Numerical examples
In this section we present some applications of the isospectral minimal midpoint (5) on isospectral flows and Lie–Poisson systems found in literature. We also compare our method in the case of with the spherical midpoint method, showing that the isospectral minimal midpoint (5) has the same computational cost. Finally, we show explicitly how the isospectral minimal midpoint (5) looks on , applying it to the the point vortex equations on the hyperbolic plane. For the example considered in this section, we plot the variation of the first integrals of (1). As expected, we get exact conservation (up to round-off errors) of the Casimir functions and, when the flow is Hamiltonian, near conservation of the Hamiltonian.
3.1. The generalized rigid body
A classical example among Hamiltonian isospectral systems is the generalized rigid body. It represents a class of completely integrable systems on , for every , [13]. The Hamiltonian is given by
[TABLE]
where is a symmetric positive definite inertia tensor. The equations of motion are then
[TABLE]
We discretize this system for . Our implementation uses Newton iterations for the non-linear system.555Fixed-point iteration could be used as well, no numerical issues have arisen in our experiments with it. The inertia tensor is given by
[TABLE]
and we use the stepsize . The initial conditions are given by
[TABLE]
As shown in Figure 1, the Hamiltonian is nearly conserved and the Casimir functions are conserved up to the accuracy of the Newton iterations.
3.2. The Brockett flow
In this section we specify the isospectral minimal midpoint (5) for the Brockett flow, or double bracket flow:
[TABLE]
where are self-adjoint complex matrices. In [3], Brockett shows that for diagonal matrix with distinct entries and self-adjoint matrix with distinct eigenvalues, for , converges exponentially fast to a diagonal matrix with the eigenvalues sorted accordingly to the order of the entries of . In figure 2, we plot the eigenvalue variation for a randomly generated666Throughout the whole paper, a randomly generated matrix is understood as a matrix whose components are defined as pseudorandom values drawn from the standard uniform distribution on the open interval , generated via the MATLAB function rand. self-adjoint initial matrix of dimension and . The asymptotic stationarity of the eigenvalues variation reflects the fact that is close to a diagonal matrix.
3.3. Lie–Poisson systems on
On the isospectral minimal midpoint (5) can be written as:
[TABLE]
for and . We want to compare the isospectral minimal midpoint (14) with another minimal-variable symplectic integrator on introduced in [17], i.e. the spherical midpoint method:
[TABLE]
Remark 5**.**
Let be orthogonal with respect to the rays, i.e. , for every . It is immediate to check that then (14) coincides with the classical midpoint scheme:
[TABLE]
In [17] it is shown that also (15) coincides with (16) when , for some Hamiltonian function constant on the rays (which implies to be orthogonal to the rays). In this case, (16) is known to be symplectic, whereas this fails for general Hamiltonian . Therefore, (14) can be seen as the second order correction of (16) to be symplectic for any Hamiltonian .
Let us now consider the two schemes (14) and (15). Both methods are implicit and therefore an implicit solver has to be used. Here we show that they exhibit the same computational cost. The example we consider is the Heisenberg spin chain on . For this one has to extend both the isospectral minimal midpoint (14) and the spherical midpoint (15) to direct products of (see [17] and [18]).
The Heisenberg spin chain of micromagnetics is defined as:
[TABLE]
where , for and . It corresponds, up to scaling, to spatial discretization of the Landau-Lifschitz PDE:
[TABLE]
for smooth. We notice that (17) is a Lie–Poisson system on , with Hamiltonian:
[TABLE]
Clearly, to get a good approximation of (18), has to be large. In figure 3 we show the average time cost for time-step with respect to the number of spin particles for both the isospectral minimal midpoint (14) and the spherical midpoint (15). We conclude that the complexity grows similarly.
In terms of conservation properties, the two schemes exactly preserve the linear invariants and nearly conserve the the quadratic first integrals of the form . Moreover, the spherical midpoint has the advantage of exactly conserving all the quadratic first integrals of the form , for some square matrix , whereas the isospectral minimal midpoint (14) exactly conserves only the quadratic invariants . In figure 4 we compare the isospectral minimal midpoint (14) and the spherical midpoint (15) for the initial data given as am equispaced discretization in points of the closed spherical curve:
[TABLE]
for .
We can conclude from figure 4 that the spherical midpoint performs slightly better than the isospectral minimal midpoint (14). However, both the schemes show the desired conservation properties due to their symplecticity.
Finally, in figure 5, we present the error diagram for the isospectral minimal midpoint (14) and the spherical midpoint (15). The curves in figure 5 show the expected second order of the schemes, with no significant difference.
3.4. Lie–Poisson systems on
In this section we specify the isospectral minimal midpoint (5) in the case of the Lie algebra , i.e. the matrices with zero trace. The first observation is that , which is a non-compact quadratic Lie algebra with respect to From this, it is straightforward to see that, when and , in (5) is also in . We notice that this is no longer true for , for . On the other hand, any element in can be written as a vector in , via the vector spaces isomorphism:
[TABLE]
for any . In this coordinates, we can express the isospectral minimal midpoint (5) for as:
[TABLE]
for and , where :
[TABLE]
and
[TABLE]
for any .
We notice that the map (19) is a Lie algebra isomorphism from to , where , for any . Furthermore, the tensor defines also the hyperbolic inner product , for any . We recall that the coadjoint orbits in are hyperboloids of the form . Hence, in analogy with the spherical midpoint method in [16], we will call (20) the hyperbolic midpoint method.
We illustrate an application of the hyperbolic midpoint method (20) on the point vortex equations on the hyperbolic plane (see for example [7, 8, 19]). The interest in this equations is motivated also by the studies on ideal hydrodynamics on hyperbolic spaces [5, 10], and in particular on the Euler equations, for which the point vortices can be seen as a finite dimensional approximation [2]. These equations are a Lie–Poisson system on , with initial values on the coadjoint orbit determined by the equations , for . The Hamiltonian is given by:
[TABLE]
The equations of motion are then:
[TABLE]
Equations (22) constrain the vortices to move on the hyperboloid . Furthermore, the symmetry of (22) gives the conservation of the momentum vector:
[TABLE]
Equations (22) and their -relative equilibria have been studied in [7, 8, 19]. In particular, for two and three vortices most of the stability issues have been worked out in [19]. However, unlike to point vortex equations on a sphere [12], it is still unknown a general result on the stability of a relative equilibrium of point vortices on the hyperbolic plane.
Here we present the results of some numerical simulations of (22) with the hyperbolic midpoint method (20). In particular, we consider as initial values some of the relative equilibria found in [8, 19]. Let us take two different initial values in , and , whose columns represent the initial position of three vortices with strengths respectively equal to and , as defined here below:
[TABLE]
[TABLE]
The initial condition is an equilateral relative equilibrium, whereas is a geodesic relative equilibrium, as defined in [19]. The fist initial condition is known to be stable, whereas it is not known for the second one. However, from figure 6 we can see that both the initial conditions evolve in close trajectories, which proves numerically the stability for both of them.
We conclude showing in figure 7 the conservation properties for the hyperbolic midpoint method (20), concerning the first integrals (23) and (21).
Acknowledgements The author would like to thank Klas Modin for the support and the enlightening discussions during the work on this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Arnold and Khesin [1998] Vladimir I. Arnold and Boris A. Khesin. Topological Methods in Hydrodynamics , volume 125 of Applied Mathematical Sciences . Springer-Verlag, New York, 1998.
- 2Beale and Majda [1982] J.T. Beale and A. Majda. Vortex methods. ii: Higher order accuracy in two and three dimensions. Mathematics of Computation , 39(159):29–52, 1982.
- 3Brockett [1991] R. W. Brockett. Dynamical systems that sort lists, diagonalize matrices, and solve linear programming problems. Lin. Alg. Appl. , 146:79–91, 1991.
- 4Calvo et al. [1997] M. P. Calvo, Arieh Iserles, and A Zanna. Numerical solution of isospectral flows. Math. Comput. , 66:1461–1486, 10 1997. doi: 10.1090/S 0025-5718-97-00902-2.
- 5Chan and Czubak [2013] Chu Hsiang Chan and Magdalena Czubak. Non-uniqueness of the leray-hopf solutions in the hyperbolic setting. 2013.
- 6Hairer et al. [2006] E. Hairer, C. Lubich, and G. Wanner. Geometric Numerical Integration , volume 31. Springer, Berlin, Heidelberg, 2006. ISBN 978-3-540-30663-4.
- 7Hwang and Kim [2009] S. Hwang and S.-C. Kim. Point vortices on hyperbolic sphere. J. Geom. Phys. , 59:475–488, 2009.
- 8Hwang and Kim [2013] S. Hwang and S.-C. Kim. Relative equilibria of point vortices on the hyperbolic sphere. J. Geom. Phys. , 54:063101/1–15, 2013.
