The Maslov and Morse Indices for Sturm-Liouville Systems on the Half-Line
Peter Howard, Alim Sukhtayev

TL;DR
This paper establishes a relationship between the Morse index and the Maslov index for Sturm-Liouville systems on the half-line, incorporating boundary conditions, and applies it to a nonlinear Schrödinger eigenvalue problem.
Contribution
It provides a new formula linking Morse and Maslov indices for Sturm-Liouville systems with boundary conditions, extending previous results and illustrating with a Schrödinger eigenvalue application.
Findings
Morse index expressed via Maslov index and boundary terms
Formulas valid for solutions in L^2 and boundary condition spaces
Application to eigenvalue problem in nonlinear Schrödinger equation
Abstract
We show that for Sturm-Liouville Systems on the half-line , the Morse index can be expressed in terms of the Maslov index and an additional term associated with the boundary conditions at . Relations are given both for the case in which the target Lagrangian subspace is associated with the space of solutions to the Sturm-Liouville System, and the case when the target Lagrangian subspace is associated with the space of solutions satisfying the boundary conditions at . In the former case, a formula of H\"ormander's is used to show that the target space can be replaced with the Dirichlet space, along with additional explicit terms. We illustrate our theory by applying it to an eigenvalue problem that arises when the nonlinear Schr\"odinger equation on a star graph is linearized about a half-soliton solution.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Maslov and Morse Indices for Sturm-Liouville
Systems on the Half-Line
Peter Howard and Alim Sukhtayev
Abstract
We show that for Sturm-Liouville Systems on the half-line , the Morse index can be expressed in terms of the Maslov index and an additional term associated with the boundary conditions at . Relations are given both for the case in which the target Lagrangian subspace is associated with the space of solutions to the Sturm-Liouville System, and the case when the target Lagrangian subspace is associated with the space of solutions satisfying the boundary conditions at . In the former case, a formula of Hörmander’s is used to show that the target space can be replaced with the Dirichlet space, along with additional explicit terms. We illustrate our theory by applying it to an eigenvalue problem that arises when the nonlinear Schrödinger equation on a star graph is linearized about a half-soliton solution.
1 Introduction
We consider Sturm-Liouville systems
[TABLE]
with the one-sided self-adjoint boundary conditions
[TABLE]
Here, , and we assume:
(A1) The matrices , , and are defined and self-adjoint for a.e. , with also and . Moreover, there exist constants and so that
[TABLE]
for a.e. . Here, denotes the standard inner product on , and denotes the standard norm on the same space. We emphasize that is included in our local designations, so the boundary condition at is regular.
(A2) We assume that , , and all approach well-defined asymptotic endstates at exponential rate. That is, we assume there exist self-adjoint matrices , with positive definite, and constants and so that
[TABLE]
and similarly for and . In addition, we assume for a.e. .
(A3) For the boundary conditions, we take , and for notational convenience, set . We assume , , which is equivalent to self-adjointness in this case. Here, denotes the standard symplectic matrix
[TABLE]
with denoting the usual identity matrix.
We can think of (1.1) in terms of the operator
[TABLE]
with which we associate the domain
[TABLE]
and the inner product
[TABLE]
With this choice of domain and inner product, is densely defined, closed, and self-adjoint, so .
Our particular interest lies in counting the number of negative eigenvalues of (i.e., the Morse index). We proceed by relating the Morse index to the Maslov index, which is described in Section 3. We find that the Morse index can be computed in terms of the Maslov index and an additional term associated with the boundary condition at .
As a starting point, we define what we will mean by a Lagrangian subspace of . For comments about working in rather than , the reader is referred to Remark 1.1 of [10], and the references mentioned in that remark.
Definition 1.1**.**
We say is a Lagrangian subspace of if has dimension and
[TABLE]
for all . Here, denotes the standard inner product on . In addition, we denote by the collection of all Lagrangian subspaces of , and we will refer to this as the Lagrangian Grassmannian.
Any Lagrangian subspace of can be spanned by a choice of linearly independent vectors in . We will generally find it convenient to collect these vectors as the columns of a matrix , which we will refer to as a frame for . Moreover, we will often coordinatize our frames as , where and are matrices. Following [4] (p. 274), we specify a metric on in terms of appropriate orthogonal projections. Precisely, let denote the orthogonal projection matrix onto for . I.e., if denotes a frame for , then . We take our metric on to be defined by
[TABLE]
where can denote any matrix norm. We will say that a path of Lagrangian subspaces is continuous provided it is continuous under the metric .
Suppose denote continuous paths of Lagrangian subspaces , for some parameter interval . The Maslov index associated with these paths, which we will denote , is a count of the number of times the paths and intersect, counted with both multiplicity and direction. (In this setting, if we let denote the point of intersection (often referred to as a conjugate point), then multiplicity corresponds with the dimension of the intersection ; a precise definition of what we mean in this context by direction will be given in Section 3.)
In order to place our analysis in the usual Hamiltonian framework, we express (1.1) as a first order system for , with and . We find
[TABLE]
where
[TABLE]
which can be expressed in the standard linear Hamiltonian form
[TABLE]
with
[TABLE]
Let denote the matrix solution to
[TABLE]
We will verify in Section 4 that for each , is the frame for a Lagrangian subspace of , . Likewise, let denote the matrix solution to
[TABLE]
We will verify in Section 2 that for
[TABLE]
we have , where denotes essential spectrum, as defined in Section 2. Subsequently, we verify in Section 4 that for each , is the frame for a Lagrangian subspace of , , and moreover that for any , the asymptotic space
[TABLE]
is well-defined and Lagrangian (with convergence in the metric described above). Finally, we will establish that the map is continuous.
There are two different ways in which we can formulate a relation between the Maslov index and the Morse index, depending upon whether we view as our target or as our target. We state these results respectively as Theorems 1.1 and 1.2. Prior to these statements, we set some terminology with the following lemma.
Lemma 1.1**.**
Let Assumptions (A1), (A2), and (A3) hold, and let . Then there exists so that
[TABLE]
In this case, we refer to as boundary inconjugate.
We emphasize that for any , is the Lagrangian subspace with frame , independent of . Likewise, in Theorems 1.1 and 1.2 below, the Lagrangian space is independent of . In all such cases, the appearance of a spectral coordinate is only for notational consistency, since does in general depend on for all .
In the following statements, we use the notation to indicate the number of eigenvalues that has, including multiplicities, on the interval .
Theorem 1.1**.**
Let Assumptions (A1), (A2), and (A3) hold, and fix any (with defined in (1.7)). Then there exists a value sufficiently large so that for any boundary inconjugate , we have
[TABLE]
Theorem 1.2**.**
Let Assumptions (A1), (A2), and (A3) hold, and fix any (with defined in (1.7)). Then there exists a value sufficiently large so that for any boundary inconjugate , we have
[TABLE]
Remark 1.1**.**
For Theorem 1.2, the target space can be replaced by the Dirichlet space (with frame ), at the cost of additional terms that can be expressed explicitly. See Corollary 5.1. We also note that by combining Theorems 1.1 and 1.2 we see that
[TABLE]
2 ODE Preliminaries
In this section, we develop preliminary ODE results that will serve as the foundation of our analysis. This development is standard, and follows [18], pp. 779-781 (see, e.g., [2] for similar analyses). We begin by clarifying our terminology.
Definition 2.1**.**
We define the point spectrum of , denoted , as the set
[TABLE]
Elements of the point spectrum will be referred to as eigenvalues. We define the essential spectrum of , denoted , as the values in (and so , by self-adjointness) that are not in the resolvent set of and are not isolated eigenvalues of finite multiplicity.
We note that the total spectrum is , and the discrete spectrum is defined as . Since our analysis takes place entirely away from essential spectrum, the eigenvalues we are counting are elements of the discrete spectrum.
If we consider (1.1) as , we obtain the asymptotic system
[TABLE]
For operators such as posed on , it’s well-known that the essential spectrum is entirely determined by the associated asymptotic problems at (see, for example, in [5, 11]). As we will verify at the end of this section, it’s straightforward to show that a similar result holds true in the current setting. In particular, if we look for solutions of (2.1) of the form , for some scalar constant and (non-zero) constant vector then the essential spectrum will be confined to the allowable values of . For (2.1), we find
[TABLE]
and upon taking an inner product with we see that
[TABLE]
Since and are positive definite, we see that
[TABLE]
for all , and consequently , where
[TABLE]
In order to describe the Lagrangian subspaces , we need to characterize the solutions of (1.6) in . As a starting point for this characterization, we fix any and look for solutions of (2.1) of the form , where in this case is a scalar function of , and is a vector function of (in ). Computing directly, we obtain the relation
[TABLE]
which we can rearrange as
[TABLE]
Since is positive definite, we can work with the inner product
[TABLE]
and it’s clear that for , the operator is self-adjoint with respect to this inner product, and moreover positive definite for . We conclude that for , the eigenvalues will be positive real values, and that the associated eigenvectors can be chosen to be orthonormal with respect to the inner product (2.2). For each of the values of (counted with multiplicity), we can associate two values . By a choice of labeling, we can split these values into negative values and positive values with the correspondence (again, by labeling convention)
[TABLE]
For , we denote by the eigenvector of with associated eigenvalue . I.e.,
[TABLE]
Recalling (1.4), we note that under our asymptotic assumptions on , , and , the limit
[TABLE]
is well-defined. The values described above comprise a labeling of the eigenvalues of . Each of these eigenvalues is semi-simple, and so we can associate them with a choice of eigenvectors so that
[TABLE]
We see that for , we have relations
[TABLE]
If we set
[TABLE]
and
[TABLE]
then we can express a frame for the eigenspace of associated with negative eigenvalues as , and likewise we can express a frame for the eigenspace of associated with positive eigenvalues as .
Lemma 2.1**.**
Assume (A1) and (A2), and let and be as described above. Then there exists a -dependent family of bases , , for the spaces of solutions of (1.4), chosen so that
[TABLE]
where
[TABLE]
for some , and where the term is uniform for .
Moreover, a basis for the space of non- solutions of (1.4) can be chosen so that
[TABLE]
with satisfying the same properties as .
Finally, for each and each , .
Proof.
For any , we follow [18] and write (1.4) as
[TABLE]
where
[TABLE]
We can now fix a particular index , and look for solutions to (2.5) of the form
[TABLE]
for which
[TABLE]
Based on , let satisfy . Then there exists a neighborhood of on which we can define a continuous projector onto the direct sum of all eigenspaces of with eigenvalues satisfying , and likewise a projector projecting onto the direct sum of all eigenspaces of with eigenvalues satisfying
[TABLE]
For some fixed value taken sufficiently large, we will look for solutions to (2.6) of the form
[TABLE]
We proceed by contraction mapping, defining an operator action as the right-hand side of (2.7). For this, we use the following fact, which is immediate from the definitions of and : there exist constants and so that
[TABLE]
We check that is a contraction on the space . To see this, we note that given any , there exist constants and so that
[TABLE]
Combining terms, we see that for some constant ,
[TABLE]
from which it’s clear that by taking sufficiently large, we can ensure that we have a contraction. Invariance of on follows similarly, and we conclude that there exists a unique satifying (2.7). Upon direct differentiation of (2.7), we see that solves (2.6). Solutions to (2.6) are absolutely continuous, so in fact . But then we can continue from back to 0 by standard ODE continuation, so that we have .
We can now substitute back into (2.7) to obtain the asymptotic estimates we’re after. Proceeding similarly as in our verification that is a contraction, we find that
[TABLE]
Finally, differentiability in is obtained by differentiating (2.7) with respect to and proceeding with a similar argument for the resulting integral equation. ∎
We see from Lemma 2.1 that for each fixed , we can create a frame for the Lagrangian subspace of solutions of (1.4), namely
[TABLE]
If we set
[TABLE]
then can be replaced by the frame . From this latter frame, it’s clear that we can take to obtain an asymptotic frame comprising the eigenvectors as its columns.
We can now verify directly that
[TABLE]
First, for , we can directly construct a Green’s function satisfying . In particular, we obtain
[TABLE]
where
[TABLE]
(The verification that is independent of proceeds almost precisely as the verification that and are Lagrangian subspaces.)
According to Lemma 2.2 in [10], for , exists if and only if the Lagrangian subspaces and do not intersect, and these Lagrangian subspaces intersect if and only if is an eigenvalue of (i.e., an element of the point spectrum). Moreover, for , the frames and are analytic in (see, e.g., Theorem 2.1 in [17], and this can also be seen with an approach essentially identical to our proof of Lemma 2.1). It follows that is meromorphic in , and so there can be no accumulation of eigenvalues on this interval. This allows us to conclude in fact that for , can only fail to exist if .
In the case that exists, it can be shown (e.g., as in the proof of Proposition 7.1 in [18]) that there exist constants , so that
[TABLE]
for all . We can conclude that for any that is not an eigenvalue of , the resolvent map
[TABLE]
defines a bounded, linear operator on . In particular, .
Although it’s not required for the current analysis, we can also readily verify that in fact . In order to see this, we first not that for any , the matrix will have one or more non-positive eigenvalues. It follows that will have two or more eigenvalues with zero real part. The proof of Lemma 2.1 proceeds essentially unchanged in this case, and we see that for the space of solutions of has dimension less than . It follows immediately from Theorem 11.4.c of [17] that in these cases.
3 The Maslov Index
Our framework for computing the Maslov index is adapted from Section 2 of [10], and we briefly sketch the main ideas here. Given any pair of Lagrangian subspaces and with respective frames and , we consider the matrix
[TABLE]
In [10], the authors establish: (1) the inverses appearing in (3.1) exist; (2) is independent of the specific frames and (as long as these are indeed frames for and ); (3) is unitary; and (4) the identity
[TABLE]
Given two continuous paths of Lagrangian subspaces , , with respective frames , relation (3.2) allows us to compute the Maslov index as a spectral flow through for the path of matrices
[TABLE]
In [10], the authors provide a rigorous definition of the Maslov index based on the spectral flow developed in [16]. Here, rather, we give only an intuitive discussion. As a starting point, if for some , then we refer to as a conjugate point, and its multiplicity is taken to be , which by virtue of (3.2) is equivalent to its multiplicity as an eigenvalue of . We compute the Maslov index by allowing to run from [math] to and incrementing the index whenever an eigenvalue crosses in the counterclockwise direction, while decrementing the index whenever an eigenvalue crosses in the clockwise direction. These increments/decrements are counted with multiplicity, so for example, if a pair of eigenvalues crosses together in the counterclockwise direction, then a net amount of is added to the index. Regarding behavior at the endpoints, if an eigenvalue of rotates away from in the clockwise direction as increases from [math], then the Maslov index decrements (according to multiplicity), while if an eigenvalue of rotates away from in the counterclockwise direction as increases from [math], then the Maslov index does not change. Likewise, if an eigenvalue of rotates into in the counterclockwise direction as increases to , then the Maslov index increments (according to multiplicity), while if an eigenvalue of rotates into in the clockwise direction as increases to , then the Maslov index does not change. Finally, it’s possible that an eigenvalue of will arrive at for and stay. In these cases, the Maslov index only increments/decrements upon arrival or departure, and the increments/decrements are determined as for the endpoints (departures determined as with , arrivals determined as with ).
One of the most important features of the Maslov index is homotopy invariance, for which we need to consider continuously varying families of Lagrangian paths. To set some notation, we denote by the collection of all paths , where are continuous paths in the Lagrangian–Grassmannian. We say that two paths are homotopic provided there exists a family so that , , and is continuous as a map from into .
The Maslov index has the following properties.
(P1) (Path Additivity) If and , with , then
[TABLE]
(P2) (Homotopy Invariance) If are homotopic, with and (i.e., if are homotopic with fixed endpoints) then
[TABLE]
Straightforward proofs of these properties appear in [6] for Lagrangian subspaces of , and proofs in the current setting of Lagrangian subspaces of are essentially identical.
3.1 Exchanging Target Spaces
Suppose we have a continuous path of Lagrangian subspaces , along with two fixed target Lagrangian subspaces and . Our goal in this section is to relate the two Maslov indices and . This question goes back at least to Hörmander [8], and has also been discussed in our primary references [3] and [19].
We suppose intersects neither nor and likewise intersects neither nor , and we also suppose . Then the difference
[TABLE]
does not depend on the specific path (see, e.g., [3, 8, 19], as discussed below), and we define the Hörmander index by the relation
[TABLE]
With slight adjustments for notation, this is equation (2.9) in [3] and Definition 3.9 in [19]. We will evaluate the Hörmander index with an expression from [8], and for this we need to define an associated bilinear form.
Definition 3.1**.**
Fix any with . Then any -dimensional linear subspace (i.e., not necessarily Lagrangian) with can be expressed as
[TABLE]
for some matrix that maps to . We define a bilinear form
[TABLE]
by the relation
[TABLE]
for all .
Remark 3.1**.**
Although we will only utilize the bilinear forms in combination, it’s worth noting how we should interpret the meaning of an individual form. Given three Lagrangian subspaces , , and , provides information about the relative orientation of these three spaces. For the case , the nature of this information is particularly clear. In that setting, we can associate to any Lagrangian subspace with frame a unique point on ,
[TABLE]
If , then and correspond with distinct points on . Given any third Lagrangian plane distinct from both and , will lie either on the arc going from to in the clockwise direction or on the arc going from to in the counterclockwise direction. In the former case, we will have , while in the latter case we will have . Using this observation, we can readily derive Hörmander’s formula ((3.5), just below) for the case , and it can subsequently be established that (3.5) is valid for as well.
Hörmander’s Q-form is precisely the form defined in [3], and aside from a sign convention is also the same form specified in Section 3.1 of [19]. Suppose intersects neither nor and likewise intersects neither nor . Then if , Hörmander’s formula for can be expressed as
[TABLE]
where denotes the usual signature of a bilinear form (number of positive eigenvalues minus the number of negative eigenvalues).
One immediate consequence of this formula is that if is a closed path (i.e., ), then . We see that if , then for any closed path so that intersects neither nor and likewise intersects neither nor , the target space can be changed from to without affecting the Maslov index.
In practice, we would often prefer the Dirichlet plane as our target (e.g., when the target is Dirichlet, all crossings will necessarily be in the same direction), and so let’s check the calculation associated with exchanging a general Lagrangian target space with the Dirichlet plane. For notational convenience, we will think of this the other way around, taking and in our general formulation. Following our general development, we assume , and also that intersects neither nor and likewise intersects neither nor . Since the analysis of and are the same, we will proceed with each replaced by the general notation .
Our starting point is to characterize the maps . If , then for some , and consequently . In particular, if maps onto , then will be a frame for . We denote the set of all such maps by . Next, we must be able to find some so that given any there will exist so that . I.e., we must have . Under our assumption that , we can only have if is 0, and so is indeed a frame for .
For , we can now compute
[TABLE]
from which it’s clear that
[TABLE]
and moreover if is invertible
[TABLE]
Since is a frame for , we must have that for any other frame , there exists an invertible matrix so that
[TABLE]
Likewise, if is any frame for , then any other frame for can be expressed as for some invertible matrix . In this way, we can express the relation
[TABLE]
in terms of frames
[TABLE]
First, if is invertible (as will be the case in the current analysis), we can write , and subsequently
[TABLE]
I.e., we have , from which we see that is the inverse of , and so . We conclude,
[TABLE]
Remark 3.2**.**
We note that in the event that is also invertible, we obtain the expression
[TABLE]
so that
[TABLE]
On the other hand, suppose is not invertible. In this case, we observe that the system (3.6) can be combined into the form
[TABLE]
The advantage of this formulation is simply that we can be sure that every matrix is invertible. In this case, we obtain from the first equation in (3.9),
[TABLE]
and upon substitution into the second equation in (3.9),
[TABLE]
Rearranging terms, we can write
[TABLE]
By assumption, does not intersect , and so the matrix multiplying on the left (i.e., the entire matrix, including the square brackets) must be invertible. In this way, we arrive at
[TABLE]
where
[TABLE]
In this case,
[TABLE]
We summarize these observations in the following lemma.
Lemma 3.1**.**
Fix any , and let be such that and . Then
[TABLE]
where is specified in (3.10). Moreover, if is invertible, then is given in (3.7), and if in addition is invertible then is given in (3.8).
Remark 3.3**.**
We will use these considerations in Section 5 to establish Corollary 5.1.
4 Proof of Theorem 1.1
In this section, we prove Theorem 1.1. Our starting point is to verify that and are indeed frames for Lagrangian subspaces. According to Proposition 2.1 of [10], a matrix is the frame for a Lagrangian subspace if and only if the following two conditions both hold: (1) ; and (2) .
For , we have . According to (A3), , and it follows immediately that . Moreover,
[TABLE]
For , we fix and temporarily set . (Our notation here doesn’t assert that is independent of , but rather that is fixed in the ensuing calculations.) Since , we see that
[TABLE]
In addition, we can compute directly to find,
[TABLE]
where in obtaining the final equality to [math] we have used the fact that is self-adjoint. Combining these observations, we can conclude that on . Since this argument holds for any , we conclude that is the frame for a Lagrangian subspace for any .
Finally, we recall from Section 2 that the Lagrangian subspaces with frames can be extended as tends to infinity to the Lagrangian subspaces with frames . Here, and are specified respectively in (2.3) and (2.4). In order to verify that is indeed Lagrangian, we compute
[TABLE]
where we have observed that and are self-adjoint. Recalling the normalization identity , we see that for all , from which we can conclude that is Lagrangian.
We proceed now by considering the Maslov box, for which we fix , and work with a value that will be chosen sufficiently large during the proof, and certainly large enough so that . The Maslov box in this case will refer to the following sequence of four lines, creating a rectangle in the -plane: we fix and let run from to (the bottom shelf); we fix and let run from [math] to (the right shelf); we fix and let run from to (the top shelf); and we fix and let run from to [math] (the left shelf).
For Theorem 1.1, we view the bottom shelf at as our target, and the Lagrangian subspace we associate with the target is , with frame . Clearly, does not depend on , and only appears as an argument for notational consistency. In this case, the evolving Lagrangian subspace is , which we recall corresponds with the space of solutions that decay as . As our frame for , we use the matrix constructed in (1.6). We set
[TABLE]
The Maslov index computed with will detect intersections between and . For expositional convenience, we consider the sides of the Maslov box in the following order: bottom, top, left, right.
Bottom shelf. Beginning with the bottom shelf, we observe that our Lagrangian subspaces have been constructed in such a way that conjugate points correspond with eigenvalues of , with the multiplicity of as an eigenvalue of matching the multiplicity of the intersection. This means that if each conjugate point along the bottom shelf has the same direction then the Maslov index along the bottom shelf will be (up to a sign) a count of the total number of eigenvalues that has between and . We will show below that as ranges from toward on the bottom shelf, the conjugate points are all negatively directed, and so the corresponding Maslov index is a negative of this count. In addition, we will show during our discussion of the left shelf that we can choose sufficiently large so that has no eigenvalues on the interval . We will be able to conclude, then, that the Maslov index along the bottom shelf is negative a count of the total number of eigenvalues, including multiplicity, that has below ; i.e.,
[TABLE]
According to Lemma 3.1 of [10] (also Lemma 4.2 of [6]), rotation of the eigenvalues of as varies—for any fixed —can be determined from the matrix in the following sense: If this matrix is positive definite at some point , then as increases through (with fixed), all eigenvalues of will monotonically rotate in the counterclockwise direction.
For this calculation, we temporarily set
[TABLE]
for which we can compute (with prime denoting differentiation with respect to )
[TABLE]
Integrating, we see that
[TABLE]
where convergence of the integral is assured by the exponential decay of the elements in our frame . In this case,
[TABLE]
so that
[TABLE]
This matrix is clearly non-positive (since is positive definite), and moreover it cannot have [math] as an eigenvalue, because the associated eigenvector would necessarily satisfy for all , and this would contradict linear independence of the columns of (as solutions of (1.1)).
Since is negative definite, we can conclude that as increases, the eigenvalues of rotative monotonically clockwise. It follows immediately that for the bottom shelf, (4.2) holds.
Top shelf. For the top shelf (obtained in the limit as ), we set
[TABLE]
and note that detects intersections between and . Our frames for these Lagrangian subspaces are explicit, and , and we can use these frames to explicitly compute .
We observe that the monotonicity that we found along horizontal shelves does not immediately carry over to the top shelf (since that calculation is only valid for ). Nonetheless, we can conclude monotonicity along the top shelf in the following way: by continuity of our frames, we know that as increases along the top shelf the eigenvalues of cannot rotate in the counterclockwise direction. Moreover, eigenvalues of cannot remain at for any interval of values. In order to clarify this last statement, we observe that the Lagrangian subspaces and intersect if and only if is an eigenvalue for the constant coefficient equation
[TABLE]
(Due to the appearance of in the boundary condition rather than , this equation may not be self-adjoint, but that doesn’t affect this argument.) If is an eigenvalue of (4.3) that is not isolated from the rest of the spectrum, then it must be in the essential spectrum of (4.3), but by an argument essentially identical to the one given at the end of Section 2, we see that the essential spectrum for (4.3) is confined to the interval , so there can be no interval of eigenvalues below .
Left shelf. For the left shelf, intersections between and at some value will correspond with one or more non-trivial solutions to the truncated boundary value problem
[TABLE]
where
[TABLE]
For this calculation, it’s useful to use the projector formulation of our boundary conditions, developed in [1, 14] (see also [9] for an implentation of this formulation in circumstances quite similar to those of the current analysis). Briefly, there exist three orthogonal (and mutually orthogonal) projection matrices (the Dirichlet projection), (the Neumann projection), and (the Robin projection), and an invertible self-adjoint operator acting on the space such that the boundary condition
[TABLE]
can be expressed as
[TABLE]
Moreover, can be constructed as the projection onto the kernel of and can be constructed as the projection onto the kernel of .
Suppose is an eigenvalue for (4.4), with corresponding eigenvector , and consider an inner product of with (4.4). Integrating once by parts, we obtain (suppressing dependence on for notational brevity)
[TABLE]
Using uniform positivity of the matrices and , we can assert that for the positive constants and described in (A1), we have
[TABLE]
In addition, with as described in (A1), we have
[TABLE]
For the boundary term, we can use our projection formulation to write
[TABLE]
We have, then,
[TABLE]
where depends only on the boundary matrices and . For , we can write
[TABLE]
so that the Cauchy-Schwarz inequality leads to
[TABLE]
for any .
Combining these observations, we see that (4.6) leads, for any , to the inequality
[TABLE]
We choose so that to obtain the inequality
[TABLE]
from which we conclude the lower bound
[TABLE]
We see that we can choose sufficiently large so that has no eigenvalues on the interval for any . Consequently, there can be no conjugate points along a left shelf at .
Remark 4.1**.**
We contrast this observation with the case of Sturm-Liouville systems on , for which conjugate points are possible on the left shelf. In the -setting, if the boundary conditions at either [math] or are Dirichlet, then there are no crossings along the left shelf (for sufficiently large). The boundary condition often has the same effect on unbounded domains as Dirichlet conditions have on bounded domains, and this is an example of that observation.
We note that this analysis leaves open the possibility that the asymptotic point at is conjugate. In the event that it is conjugate, can be increased slightly to break the conjugacy. This is an immediate consequence of monotonicity along the top shelf, and serves to establish Lemma 1.1.
Right shelf. For the right shelf, we leave the Maslov index as a computation,
[TABLE]
Combining these observations, and using catenation of paths along with homotopy invariance, we find that the sum
[TABLE]
respectively becomes
[TABLE]
and Theorem 1.1 is a rearrangement of this equality.
5 Proof of Theorem 1.2
We established in our proof of Theorem 1.1 that is Lagrangian for all , and we can proceed similarly to verify that the same is true for . We omit the details.
As with our proof of Theorem 1.1, we work with the Maslov box, but in this case, we place the top shelf at , for chosen sufficiently large during the analysis. We proceed in this way, because the Lagrangian subspace
[TABLE]
(which is well-defined for each ) is not generally continuous as a function of . In particular, it is discontinuous at each eigenvalue of (see [7] for a discussion in the context of Schrödinger operators on ).
We will use the Maslov index to detect intersections between our evolving Lagrangian subspace and our target Lagrangian subspace . Re-defining for this section, we now set
[TABLE]
For expositional convenience, we consider the sides of the Maslov box in the following order: left, top, bottom/right (together).
Left shelf. In this case, conjugate points along the left shelf correspond with values for which is an eigenvalue for the ODE
[TABLE]
where for notational brevity we are suppressing dependence of on . By taking sufficiently large, we can make as close as we like to the invertible matrix , so that in this case is also invertible, and we can write,
[TABLE]
Moreover, we have
[TABLE]
where the error on this approximation is for some . The matrix
[TABLE]
is self-adjoint, and since the entries of are the negative eigenvalues of , it is negative definite. Also, the entries of approach as approaches , so the eigenvalues of approach as approaches .
Let denote a solution to (5.2). Upon taking an inner product of with (5.2), we obtain
[TABLE]
For the first integral in this last expression, we compute
[TABLE]
Using (5.3), we see that
[TABLE]
For the boundary term at , we proceed using the projectors , , and determined by and (as specified in (4.5)). Proceeding as in the proof of Theorem 1.1, we find
[TABLE]
Combining these observations, we see that the boundary terms can be expressed as
[TABLE]
For sufficiently small, , so that we approximately have
[TABLE]
which is positive for and both chosen sufficiently large (by the discussion following (5.4)). We conclude that there exists sufficiently small so that
[TABLE]
for all .
Similarly as in the proof of Theorem 1.1, we have
[TABLE]
For , this allows us to write (still for )
[TABLE]
from which we can immediately conclude
[TABLE]
for all .
For , we scale the independent variable by setting
[TABLE]
Our system becomes
[TABLE]
Suppose solves (5.6) for . Taking an inner product of with (5.6), we get
[TABLE]
For the first integral, we have
[TABLE]
For the boundary term at , we have
[TABLE]
where the inequality follows for sufficiently large from our prior discussion of
[TABLE]
For the boundary term at , we have
[TABLE]
According to Lemma 1.3.8 in [1], we can compute the upper bound
[TABLE]
For , this allows us to compute
[TABLE]
For each , we choose . This ensures
[TABLE]
which leads immediately to
[TABLE]
We conclude a lower bound on ,
[TABLE]
Combining these observations, we can conclude that for any value chosen so that
[TABLE]
we will have no crossings along the left shelf. Similarly as in the proof of Theorem 1.1, this leaves open the possibility of a conjugate point at , corresponding with an intersection between and . Precisely as in the proof of Theorem 1.1, we can increase (if necessary) to ensure that , and then we can choose sufficiently large to ensure that this implies . For these choices of and , we have
[TABLE]
Top shelf. In the case of Theorem 1.2, has been constructed so that conjugate points along the top shelf correspond precisely with eigenvalues of . In order to verify that the Maslov index along the top shelf corresponds with a count of eigenvalues, we need to check that the eigenvalues of rotate monotonically counterclockwise as decreases. In this case, both and depend on , so according to Lemma 3.1 of [10] (also Lemma 4.2 of [6]), rotation of the eigenvalues of —for any —can be determined from the matrices and in the following sense: If both of these matrices are non-positive, and at least one is negative definite at some point , then as increases through (with fixed), all eigenvalues of will monotonically rotate in the clockwise direction.
We have already established during the proof of Theorem 1.1 that the matrix
[TABLE]
is negative definite, so we only need to check that is non-positive. In fact, this latter matrix is negative definite as well, and since the proof is essentially identical to the proof for , we omit the details.
We can conclude, similarly as for the bottom shelf in the proof of Theorem 1.1, that
[TABLE]
Bottom and right shelves. We will need to compute Maslov indices along the bottom and right shelves, so it’s natural to address the two of them together. Our approach is based substantially on the proofs of Claims 4.11 and 4.12 in [7].
As a starting point, we introduce the new unitary matrix
[TABLE]
which detects intersections between and the asymptotic Lagrangian subspace
[TABLE]
Likewise, we specify the asymptotic matrix
[TABLE]
which is well-defined for each , but not generally continuous as a function of . (See the appendix in [7] for a discussion of this discontinuity.) Since and can be written down explicitly, it is much more convenient to work with than it is to work with . In light of this, we will show that our calculations can be carried out entirely in terms of the former matrix. In particular, we have the following claim:
Claim 5.1**.**
Under the assumptions of Theorem 1.2, we have the relation
[TABLE]
Proof.
First, it’s clear that we have the relation
[TABLE]
Recalling from Lemma 2.1 that
[TABLE]
for some , we see that by choosing sufficiently large, we can ensure that the eigenvalues of are as close as we like to the eigenvalues of for all . (Here, exponential decay in allows us to compactify with the usual one-point compactification.) In particular, we can ensure that no eigenvalue of can complete a loop of unless a corresponding eigenvalue of completes a loop of , with the converse holding as well.
Following our discussion of the left shelf, we have chosen so that
[TABLE]
and sufficiently large to ensure that this implies
[TABLE]
With these choices, we see that does not have as an eigenvalue, and also does not have as an eigenvalue.
Case 1. First, suppose is not an eigenvalue for . Then does not have as an eigenvalue, and also does not have as an eigenvalue. By continuity, we can take large enough so that does not have as an eigenvalue, and additionally so that does not have as an eigenvalue for any . Since the eigenvalues of and remain uniformly close, the total spectral flow associated with the bottom and right shelves for must be the same as for . Specifically, we have
[TABLE]
and the claim for Case 1 follows immediately from the specification that is taken large enough so that and do not intersect for .
Case 2. Next, suppose is an eigenvalue for . Then certainly has as an eigenvalue, and its multiplicity corresponds with the multiplicity of as an eigenvalue of . Likewise, will have as an eigenvalue, and its multiplicity corresponds with the multiplicity of as an eigenvalue of . As in the case when is not an eigenvalue, we can choose large enough so that for the eigenvalues of that do not approach as remain bounded away from as .
We now proceed precisely as in Case 1 for the eigenvalues of other than , and we note that an eigenvalue of will approach as if and only if an eigenvalue of approaches as . Moreover, despite possible transient crossings, the net number of crossings associated with these eigenvalues must coincide, because otherwise, an eigenvalue of would complete a full loop of without a corresponding eigenvalue of also completing such a loop (or vice versa). ∎
Combining now our observations for the four shelves, we find that the sum
[TABLE]
respectively becomes
[TABLE]
and Theorem 1.2 is just a rearrangement of this equality.
5.1 Changing the Target
In this section, we verify that under certain conditions the target frame in the calculation can be replaced with the Dirichlet plane . As noted earlier, one advantage of this replacement is that for a Dirichlet target the rotation of eigenvalues of as increases is monotonically clockwise. (This is straightforward to show, e.g., with the methods of [10].) The key observation we take advantage of here is that if is not an eigenvalue of , then we explicitly know both and
[TABLE]
where denotes the Lagrangian subspace associated with solutions that grow as tends to positive infinity. This allows us to compute both
[TABLE]
and consequently we can compute the Hörmander index .
In order to apply our development from Section 3.1, we need the following five conditions to hold: (i) ; (ii) ; (iii) ; (iv) ; and . We will check below that Items (iii), (iv), and (v) hold under our general assumptions, and we will take Items (i) and (ii) to be additional assumptions for this section.
The first items to check are (iii) and (iv), which we can express as the intersections and . For these, we recall that our frame for is
[TABLE]
where and are as in (2.3) and (2.4). For the Dirichlet plane,
[TABLE]
and since is invertible we have \ker\Big{(}(\tilde{\mathbf{X}}_{2}^{+}(\lambda_{0}))^{*}J\mathbf{X}_{D}\Big{)}=\{0\}. Likewise, the frame for is , so that
[TABLE]
which is positive definite. The verification that (i.e., Item (v) above) is essentially identical to the verification that , and we omit the details.
By definition, the Hörmander index for these Lagrangian subspaces is
[TABLE]
According to Hörmander’s formula (3.5),
[TABLE]
We can now use Lemma 3.1 to compute the two quantities and . First, recalling that the frame for is , and noting that the condition implies that is invertible, we have (from Lemma 3.1)
[TABLE]
Likewise,
[TABLE]
because is negative definite.
Combining these observations, we see that
[TABLE]
In this way, we obtain the following corollary to Theorem 1.2.
Corollary 5.1**.**
Let the assumptions of Theorem 1.2 hold, and suppose additionally that , , and . Then
[TABLE]
6 Application to Quantum Graphs
In this section, we apply our framework to an operator on the half-line that arises through consideration of nonlinear Schrödinger equations on quantum graphs with infinite edges extending from a single vertex (i.e., on star graphs). Our direct motivation for considering this example is the recent analysis of Kairzhan and Pelinovsky (see [12]), and we also note that Kostrykin and Schrader have shown how the symplectic framework fits well with such problems (see [13]) and that Latushkin and Sukhtaiev have recently developed this framework in the case of quantum graphs with edges of finite length (see [15]). Finally, we mention that our general approach to quantum graphs is adapted from the reference [1].
6.1 The Schrödinger Operator on Star Graphs
We consider a star graph with edges, which can be visualized as a single point with distinct half-lines emerging from it. We will associate with each edge of our graph the interval , and our basic Hilbert space associated with the full graph will be
[TABLE]
We will view elements as vector functions , and we specify the linear operator by
[TABLE]
where is a scalar potential for which we will assume the limit
[TABLE]
exists and satisfies the asymptotic relation
[TABLE]
(This is slightly weaker than our Assumption (A2), but sufficient in the current setting (see [7]).) We specify boundary conditions at the vertex as
[TABLE]
with and satisfying the assumptions described in (A3). Under these assumptions, we take as our domain for ,
[TABLE]
With this notation in place, we can consider the eigenvalue problem with boundary conditions (6.1). In order to place this system in the framework of our analysis, we set , with and . In this way, we arrive at our standard Hamiltonian system
[TABLE]
where denotes the diagonal matrix
[TABLE]
Under our assumptions on the scalar potential , it’s well known that for each the scalar equation
[TABLE]
has one non-trivial solution that decays as and one non-trivial solution that grows as . (See, e.g., [7].) If we denote by the solution that decays as , then we can express our frame of solutions of (6.2) decaying as as
[TABLE]
We see that in this case, and in the context of Theorem 1.1,
[TABLE]
(I.e., this is (4.1) for the current case.) In particular, if we denote the eigenvalues of by , then the eigenvalues of will be
[TABLE]
Remark 6.1**.**
We distinguish the Neumann or Neumann-Kirchhoff boundary conditions as those specified by the relations
[TABLE]
(See p. 14 of [1] for a discussion of terminology.) These correspond with
[TABLE]
and
[TABLE]
In this case, the eigenvalues of are and , with simple and occurring with multiplicity . This fact is straightforward to verify directly, and is also an immediate consequence of Corollary 2.3 from [13].
6.2 NLS on Star Graphs
We now consider the nonlinear Schrödinger equation
[TABLE]
where and with taking the values of on edge of the graph. We interpret the notation and in this setting as
[TABLE]
Such equations are known to admit standing wave solutions
[TABLE]
for any . Upon direct substitution into (6.6), we see that
[TABLE]
In [12], the authors observe that by setting
[TABLE]
we arrive at
[TABLE]
This scaling justifies restricting our attention to the case . It’s straightforward to verify that for any (6.7) admits the explicit solution
[TABLE]
We linearize (6.6) about , writing
[TABLE]
where and are both real-valued functions. Dropping off higher order terms, we obtain the linear system
[TABLE]
where
[TABLE]
Our framework can now be used in order to determine the Morse indices of with Neumann–Kirchhoff boundary conditions. We focus on the slightly more interesting case, . (The Morse index of with Neumann–Kirchhoff boundary conditions is [math].) The eigenvalue problem for can be expressed as
[TABLE]
with and as expressed in (6.4) and (6.5).
For this calculation, we will use Theorem 1.1 with . We observe that by construction,
[TABLE]
solves (6.8) for (just differentiate (6.7) to see this; here, is not expected to satisfy the boundary condition at ). This allows us to express our frame for solutions of (6.8) that decay as as
[TABLE]
We set , so that
[TABLE]
According to Remark 6.1, the eigenvalues of are
[TABLE]
with multiplicity and the negative of this with multiplicity 1. (Here, the notation has been introduced simply for expositional convenience).
In [10], the authors have developed a straightforward approach toward determining the direction of rotation for the eigenvalues of as varies, but in the current setting this rotation can be determined directly from the form of . We observe that
[TABLE]
We can write
[TABLE]
for which we focus on the real and imaginary parts of the numerator
[TABLE]
We note that for any ,
[TABLE]
We now consider the motion of as runs from [math] to . First, and , so
[TABLE]
This means that is an eigenvalue of with multiplicity , and is an eigenvalue of with multiplicity . As increases from 0, we see from (6.9) that the imaginary part of is negative, so rotation is in the counterclockwise direction. Moreover, since and are both monotonic in (for ), we see that the imaginary part of remains negative until arrives at the unique value for which
[TABLE]
We see from (6.9) that , so . For , the imaginary part of is positive, and by noting the asymptotic relations , , we see that as , approaches . In summary, we see that as runs from [math] to , rotates from to , leaving in the counterclockwise direction and never crossing . Indeed, with a bit more work, we can verify that the rotation is entirely counterclockwise, but we don’t require that much information to draw our conclusions.
Returning to the matrix , we can conclude that eigenvalues trace out precisely the path described in the previous paragraph, and the final eigenvalue begins at when and rotates in the counterclockwise direction, approaching as . We conclude that
[TABLE]
Finally, in order to use Theorem 1.1, we need to compute . For this, we observe that if we set , with and , then (6.8) can be expressed as , with
[TABLE]
Since as , we see that
[TABLE]
We can readily check that as a choice for the corresponding asymptotic frame , we can take . Thus for the top shelf, we have
[TABLE]
We conclude from Remark 6.1 that the eigenvalues of are with multiplicity and the negative of this with multiplicity . For the value of cannot be , so there are no conjugate points along the top shelf. We conclude that in this case
[TABLE]
Applying Theorem 1.1, we find that
[TABLE]
Remark 6.2**.**
For a more complete discussion of the instability of the half-soliton as a solution to (6.6), including a calculation of by other means, we refer the reader to [12].
Acknowledgments. This work was initiated while P.H. was visiting Miami University in March, 2018. The authors are grateful to the Department of Mathematics at Miami University for supporting this trip.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Berkolaiko and P. Kuchment, Introduction to quantum graphs , Mathematical Surveys and Monographs 186 , AMS 2013.
- 2[2] W. A. Coppel, Stability and asymptotic behavior of differential equations , D. C. Heath and Co., Boston, MA 1965.
- 3[3] J. J. Duistermaat, On the Morse index in variational calculus , Adv. Math. 21 (1976) 173–195.
- 4[4] K. Furutani, Fredholm-Lagrangian-Grassmannian and the Maslov index , Journal of Geometry and Physics 51 (2004) 269 – 331.
- 5[5] D. Henry, Geometric theory of semilinear parabolic equations , Lect. Notes Math. 840 , Springer-Verlag, Berlin-New York, 1981.
- 6[6] P. Howard, Y. Latushkin, and A. Sukhtayev, The Maslov index for Lagrangian pairs on ℝ 2 n superscript ℝ 2 𝑛 \mathbb{R}^{2n} , Journal of Mathematical Analysis and Applications 451 (2017) 794-821.
- 7[7] P. Howard, Y. Latushkin, and A. Sukhtayev, The Maslov and Morse indices for system Schrödinger operators on ℝ ℝ \mathbb{R} , Indiana J. Mathematics 67 (2018) 1765–1815.
- 8[8] L. Hörmander, Fourier integral operators I , Acta Math. 127 (1971) 79–183.
