An Analytical Analogue of Morse's Lemma
Yixuan Wang

TL;DR
This paper establishes an analytical analogue of Morse's lemma, demonstrating that near a non-degenerate critical point, the gradient field of a Morse function can be transformed into a standard linear form in suitable local coordinates.
Contribution
It introduces a method to linearize the gradient field of a Morse function near critical points, extending Morse's topological results to an analytical setting.
Findings
Gradient field near critical points can be expressed in a standard linear form.
Existence of local coordinates transforming the gradient to a unique linear vector field.
Constructive proof for transforming Morse gradient fields into standard form.
Abstract
The Morse function near a non-degenerate critical point is understood topologically, in the light of Morse's lemma. However, Morse's lemma standardizes the function itself, providing little information of how the gradient behaves. In this paper, we prove an analytical analogue of Morse's lemma, showing that there exist smooth local coordinates on which a generic Morse gradient field near the critical point exhibits a unique linear vector field. We show that on a small neighbourhood of the critical point, the gradient field has a natural choice of standard form , and this form only depend on the local behaviour of the Morse function and the Riemannian metric near the critical point. Then we present a constructive proof of the fact that given a generic Morse function ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric Analysis and Curvature Flows · Homotopy and Cohomology in Algebraic Topology · Topological and Geometric Data Analysis
An Analytical Analogue of Morse’s Lemma111This work is supervised by Prof. Dominic Joyce, and supported by the Engineering and Physical Sciences Research Council [EP/L015811/1].
Yixuan Wang
Abstract
The Morse function near a non-degenerate critical point is understood topologically, in the light of Morse’s lemma. However, Morse’s lemma standardizes the function itself, providing little information of how the gradient behaves.
In this paper, we prove an analytical analogue of Morse’s lemma, showing that there exist smooth local coordinates on which a generic Morse gradient field near the critical point exhibits a unique linear vector field. We show that on a small neighbourhood of the critical point, the gradient field has a natural choice of standard form , and this form only depend on the local behaviour of the Morse function and the Riemannian metric near the critical point. Then we present a constructive proof of the fact that given a generic Morse function , for every critical point, there is a local coordinate on which the gradient field reduces to its standard form.
Contents
1 The focus and main results.
Given a Morse function on a Riemannian manifold with a critical point , the Morse Lemma guarantees the existence of a coordinate chart centred at , where the Morse function exhibits a standard formula determined only by the Morse index of . The focus of this paper is to answer the following question: analogously, is there a local coordinate where the gradient of is standardized to a nice vector field?
As a theme, the question of finding the analytical friendly representations of vector fields via choices of coordinates orchestrates its variations in different chords. For example, Takens’ work [12] concerns singularities of vector fields and their codimensions, and the proof utlizes general normal forms for these vector fields; Meanwhile, Palis and Smale [9] formulate the question implicitly in their theory of structural stability, for gradient dynamic systems satisfying what’s now called Axiom A and strong transversality.
In the case of rectifying the Morse gradient field on a Riemannian manifold, we elaborate the answer to this question as follows, which is our main result.
Definition 1.1** (-linearity condition).**
We say that a group of real numbers satisfies the -linearity condition*, if the following equation*
[TABLE]
has no solution for all with , and any .
As generically chosen satisfies the -linearity condition, this requirement is not restrictive.
Theorem 1.2**.**
*Let be an n-dimensional Riemannian manifold equipped with Riemannian metric and a Morse function , , and be a critical point of . Let Morse eigenvalues of be eigenvalues of . Assume that Morse eigenvalues of satisfy the -linearity condition.
Then there exists a local coordinate chart on a small neighbourhood containing , which is a function with open and , such that is smooth with a smooth inverse on , and*
[TABLE]
for all .
Remark:
- •
If we assume in addition that , then is unique up to linear transformation
[TABLE]
where if ; and if the Morse eigenvalue is of multiplicity , with , then .
- •
When the set of Morse eigenvalues fails to satisfy the -linearity condition, there exist some gradient fields with standard form that cannot be standardized to . This is shown in Corollary 3.2. In Example 3.3, we demonstrate this conclusion for the gradient field .
2 The standard form of a Morse gradient field.
In this section, we shall briefly recall Morse theory and the Morse Lemma, and give a rigours definition of the standard form of a Morse gradient field.
2.1 Introduction to Morse theory.
Let be a -dimensional manifold equipped with Riemannian metric .
Definition 2.1** (Morse function).**
For a smooth function , its critical points are those points where , and a critical point is called non-degenerate if the matrix is of full rank. A function is called a Morse function, if it has only non-degenerate critical points.
The existence of Morse functions is guaranteed, and in fact, Morse functions are dense in , as the following result from [6, §6] elaborates:
Theorem 2.2** (Abundance of Morse functions).**
Any bounded smooth function can be uniformly approximated by a smooth Morse function . Furthermore, can be chosen so that the i-th derivatives of on the compact set uniformly approximate the corresponding derivatives of , for all .
Let be a fixed Morse function. Its gradient is a well-defined vector field over , and Morse flow is the negative gradient flow induced by this vector field, which is the solution to
[TABLE]
for any .
Definition 2.3** (Index, stable and unstable manifold of a critical point).**
*Let be a critical point of Morse function . Then the Morse index of is the number of negative eigenvalues of .
We define the stable manifold of to be*
[TABLE]
and the unstable manifold of to be
[TABLE]
As the name suggests, and are indeed submanifolds of . A detailed discussion along with proof can be found in [1, §2.1.d], where a proof is given for so-called pseudo-gradient fields, vector fields that generalize and coincide with the gradient on a local chart containing the critical point. So here we just quote the result as follows.
Theorem 2.4**.**
The stable and unstable manifolds of the critical point are submanifolds of that are diffeomorphic to open disks. More over, we have
[TABLE]
The gateway of Morse theory is the Morse lemma, without doubt. The lemma reveals the fact that for every critical point there exists a local chart where the Morse function is of standard form characterized by its Morse index, and as a result, it is justified to consider the index of a critical point as an invariant of local homeomorphisms. This lemma validates the definition of the Morse complex, a chain complex defined over the set of critical points ranked with respect to their indices, and the definition of the corresponding Morse homology. Some classical upshots of Morse homology are [1, Chapter 4]: a) the alternating sum of the number of critical points of index (sign alters w.r.t. ) of a Morse function is a topological invariant — the Euler characteristic; and b) the number of critical points of index is bounded below by the -th Betti number of the manifold.
The original lemma was proven by Marston Morse in his paper [7], 1925, using Gram–Schmidt orthogonalization method. Later on it was generalized to firstly suit calculus of variations in Hilbert spaces and then for Banach spaces in general, by Richard Palais [8]. The lemma also has a variation for Morse–Bott functions, which are smooth functions with their critical loci being submanifolds of instead of isolated points, and their Hessian at a critical point are non-degenerate along the normal of the corresponding critical locus.
For the convenience of the reader, let us revisit the classical version of Morse lemma, and its proof, seen here mainly paraphrasing [6, §2].
Lemma 2.5** (Morse Lemma).**
Let be a non-degenerate critical point of Morse function , and the index of is . Then there is a local coordinate on a neighbourhood containing , with associated with . On this local coordinate, ,
[TABLE]
Proof.
Firstly, we claim that the following is true. {addmargin}[1em].5em Claim: Let be a function in a convex neighbourhood of in , with . Then
[TABLE]
for some , and .
Proof of claim. Note that
[TABLE]
Therefore we can always let , and the claim follows.
Apparently there exists a local coordinate where the critical point is mapped to in , and we can assume that on this local chart. Applying aforementioned claim to yields
[TABLE]
for in some neighbourhood of . Since is the critical point,
[TABLE]
Once again, applying the claim to , we have
[TABLE]
for some smooth functions . So
[TABLE]
We assume that , as taking reduces the case to our assumption. Moreover, the matrix , and hence it is non-singular.
The coordinate where is of the desired form, possibly on a smaller neighbourhood of , is constructed inductively. Suppose there exists a coordinate on where
[TABLE]
with the matrix symmetric. After a linear exchange in the last coordinates, we may assume that . Now introduce the new coordinate as
[TABLE]
On a small enough neighbourhood of the origin, is a smooth transformation with a smooth inverse, hence it will serve as a coordinate on a sufficiently small neighbourhood . It is easy to verify that
[TABLE]
which completes the induction, and proves this lemma. ∎
2.2 First observations, and the standard form of a gradient field.
The Morse lemma offers a constructive way of finding a local coordinated standardizing the Morse function. However, it is less helpful when the Morse gradient field is of our concern. In fact,
[TABLE]
where . So the local vector field may appear fully non-linear, even if is standard in the sense of the lemma.
What do we expect for the Morse vector field on a local coordinate near the critical point? A first look yields that we can always set the first order terms of to be diagonal, and the coefficient of this diagonal form is coordinate independent:
Proposition 2.6**.**
There exists a local coordinate on a local neighbourhood containing the critical point where is mapped to , and
[TABLE]
*where , , with the Morse index of .
Furthermore, the set of eigenvalues is independent of the choice of coordinates.*
Proof.
A closer look at equation (1) tells us that
[TABLE]
The Hessian as well as the metric are symmetric bilinear forms over . Let be such a bilinear form, then a coordinate transformation of the local coordinate induces a congruence of the matrix of w.r.t. Jacobian of the coordinate change, i.e.
[TABLE]
Consequently, an invertible linear transformation gives the matrix similarity
[TABLE]
Let be the positive definite matrix that diagonalizes both and , such that . The existence of is guaranteed by the fact that two symmetric matrices, with one of which being positive definite, can be diagonalized simultaneously. On this new coordinate,
[TABLE]
Equation (1) gives
[TABLE]
and combining this with (2) and (3),
[TABLE]
As signs are preserved during our construction, we may assume that and , where is the Morse index of .
Furthermore, for the vector field , the Hessian . This indicates that is the set of eigenvalues of , and it is invariant under local diffeomorphisms. Or this can be viewed directly from our proof: a local diffeomorphism induces matrix similarity for the matrix of coefficients of the leading order term of , with its Jacobi , which apparently doesn’t alter the set of eigenvalues. Thus the proposition is proven. ∎
And we give the linear part of a name.
Definition 2.7** (Standard form).**
For a critical point of a Morse function , there exists a local coordinate over a neighbourhood containing , such that
[TABLE]
The linear part of is called the standard form of , denoted by
[TABLE]
The set of eigenvalues is coordinate independent, and they will be called Morse eigenvalues of the critical point .
The discussion above leads to the natural question: Given a Morse function on a Riemannian manifold , is there a local coordinate system on a neighbourhood containing the critical point , with to the origin, such that reduces to its standard form throughout ?
In general, the answer is: yes, if the problem is generically posed, then such local coordinates exist. For the rest of the paper, we will see what “generic” means in strict, analytical sense, and give a constructive proof of such a local coordinate in this case.
3 Proof of main results.
Here is a roadmap to help the reader navigate the proof of our results:
- Step 1:
§3.1.1. Under the -linearity assumption, we find a local coordinate where the general vector field differs from its standard form only by a locally flat function, namely . This is shown in Proposition 3.1. Moreover, when -linearity fails, Corollary 3.2 proves that there exists a gradient field that cannot be standardized, illustrated by Example 3.3, and both demonstrate that the restriction of -linearity condition is sufficient and almost necessary in this process. 2. Step 2:
§3.1.2. There exist local coordinates where the unstable manifold corresponds to , the stable manifold ; furthermore, the relation is preserved. This is Proposition 3.4. 3. Step 3:
Based on the first two results, we modify the choice of local coordinates further, and construct a coordinate chart where on both the unstable submanifold, which corresponds to , and the stable submanifold, which is ; on top of that, is preserved within a small neighbourhood of the critical point. This is proven in Proposition 3.5. 4. Step 4:
§3.1.3. In Theorem 3.6, the final estimate of this subsection §3.1 is given, and it will prove to be crucial later: on a small neighbourhood containing , there exists a coordinate chart such that for every large enough and some positive constants , we have . 5. Step 5:
In §3.2.2, we present an operator with its fixed point closely related to the local diffeomorphism of our concern. The norm of operator relies closely on the weighted Sobolev spaces which it is defined on, as explained in Proposition 3.7 and Lemma 3.8; also, the operator moves the zero function in a controlled manner, which is discussed in Proposition 3.9. 6. Step 6:
By carefully choosing regularity and weight of weighted Sobolev spaces, we find a convex region of the function space where is a contraction operator. This is shown in Theorem 3.10. 7. Step 7:
In §3.2.3, the local diffeomorphism is constructed from the fixed point of , and its uniqueness and regularity are validated by a Banach space version of the Implicit Function Theorem. And we complete the proof of Theorem 1.2.
3.1 Standardizing the vector field with controlled errors.
In this section, we will standardize the vector field of the gradient by finding suitable local coordinates centred at the the critical point, on which the gradient reduces to a form which is reasonably close to its standard form.
3.1.1 Standardising generic to flat functions.
To begin with, we will establish that we can find submanifold diffeomorphisms such that a general gradient field is arbitrarily close to its standard form.
Proposition 3.1**.**
Let and be the standard form of . Assume that satisfy the -linearity condition. Then there exists a local coordinate chart , with the critical point mapped to , such that
[TABLE]
Proof.
Let be a local coordinate chart on a neighbourhood of the critical point , with the critical point mapped to .
We now work with germs near on . A germ of functions at in , denoted by , is the equivalence class of functions that are identical near : Let be a pair, where , open, and function be smooth; then two such pairs and are equivalent if there exists open neighbourhood with , such that .
The germ of diffeomorphisms of fixing is defined with the equivalence relation as follows: For triplets of the form , where are open with , and is a diffeomorphism with , we say that and are equivalent if there exists an open set , , such that . The germ of smooth diffeomorphisms of fixing the origin, denoted by , is an infinite dimensional Lie group. The Lie group structure of this group of germs and its Lie algebra are studied by Robart and Kamran in [10, Theorem 3].
The group of germs of diffeomorphisms fixing the origin, denoted by , has nested normal subgroups . Each consists of those diffeomorphisms of the form , where is a -vector of polynomials of order bigger than or equal to . Then these infinite dimensional normal subgroups are nested as . In addition, for , the quotient of subgroups can be defined . This quotient space is a finite dimensional Lie group, with dimension,
[TABLE]
As the error of compared to its standard form is of at least second order, we will work with , .
In the same manner, we define the vector space of germs of the vector fields of the same standard form as the equivalent class of vector fields that agrees on an open neighbourhood containing the origin, namely,
[TABLE]
where if there exists an open neighbourhood of such that . Apparently, is an infinite dimensional vector space with its origin being the germ of .
Germs of vector fields in the vector space that are identical up to rank form another vector space,
[TABLE]
where for , iff .
Vector space is finite dimensional, and it is easy to see that .
The action of the diffeomorphism group on is well-defined. The lemma is proven if for all , vector field is in the orbit of the standard form up to rank , namely if the orbit contains . If this is true for some , then we get the existence of a diffeomorphism that “straightens” up to order , namely .
The vector field is in the orbit of if acts transitively on . In other words, let be the subgroup that fixes , then we claim that the orbit of is the whole set of if and only if is trivial. {addmargin}[1em].5em Here is why this claim is true.
Firstly, that the stabilizer is trivial is equivalent to , and is equivalent to that the orbit of , , is open in . This is because as a submanifold of , the orbit is of the same dimension as the ambient manifold, .
Secondly, that the orbit is open is equivalent to being the whole space . This is because for any , which is represented by with each polynomial of rank no larger , the dilation leaves invariant and reduces non-linearly by at least a factor of , as are at least quadric in . For every open neighbourhood of , there exists a small enough for , such that is in that neighbourhood; and as is an open neighbourhood of , there exist an and a diffeomorphism such that . Consequently, , and .
As is connected, that is trivial is equivalent to the Lie algebra of being trivial. This Lie algebra is characterized by
[TABLE]
Using the fact that the basis of the Lie algebra is ,
[TABLE]
As a result, is trivial if for all and with . This gives the -linearity condition for ’s at the -th step.
If we require the -linearity condition to hold for all , which is not restrictive as such choices of Morse functions are still generic, then for every there exists a diffeomorphism such that . In other words, .
A closer look at the derivation of yields that
[TABLE]
Hence is a formal power series with being its first terms, and formally . Consequently, there exists a formal sequence of diffeomorphisms that “standardizes” up to an error of order .
In fact, we can choose a local diffeomorphism that has exactly this formal sequence, with the help of the Borel theorem [5, Theorem I.1.3]: {addmargin}[1em].5em Let be the ring of smooth functions over , and be the ideal of functions which are flat on (flat means all partial derivatives vanish at ). Let be the ring of formal power series of with smooth coefficients in . Then the Taylor series gives an isomorphism
[TABLE]
Then for the constructed formal power series , there exists a (non-unique) smooth function , such that the Taylor series of at is , and is a local diffeomorphism near , as is invertible. Moreover, , where is a locally flat function. ∎
Corollary 3.2**.**
When -linearity condition fails to hold for the set of eigenvalues , there exists a Morse function whose gradient cannot be standardized with the method of Proposition 3.1.
Proof.
Assume that there exists an and a set of non-negative integers with such that
[TABLE]
Then we claim that the stabilizer of in the proof of Proposition 3.1 is non-trivial. {addmargin}[1em].5em Let be
[TABLE]
Then is a local diffeomorphism near the origin. Moreover, by the chain rule, that is a stabilizer is equivalent to
[TABLE]
So is a stabilizer of , if and only if
[TABLE]
It is easy to check that the we proposed earlier satisfies this relation, hence it is a non-trivial element of the stabilizer of .
As a result, the dimension argument reveals that the orbit is a genuine subgroup of , so long as . By choosing such that , we find a collection of Morse functions that cannot be standardized by a formal power series, as in Proposition 3.1, which concludes our proof. ∎
Example 3.3**.**
Let
[TABLE]
and its standard form , then there exists no local diffeomorphism of fixing such that
Proof.
To prove that is not in the orbit of and consequently such a diffeomorphism does not exist, it is enough to prove that is not in the orbit generated by acting on . Let ,
[TABLE]
[TABLE]
then
[TABLE]
if and only if
[TABLE]
This set of equation has solution for . However, for , its coefficient has to satisfy , which has no solution. As a result, is not in the orbit of , so cannot be standardised. ∎
3.1.2 on stable and unstable submanifolds.
Now we show that it is possible to build coordinates out of the stable and unstable loci near a critical point, while maintaining the flatness of the difference between the vector field and its standard counterpart . It is useful to recall that the stable and unstable loci of the critical point are in fact submanifolds, intersecting transversally at only .
Proposition 3.4**.**
Let be a critical point, and a small open neighbourhood containing . Let be the -dimensional unstable submanifold of contained in , and be the -dimensional stable submanifold in . Given that are -linearly independent, there exists a local coordinate chart , where is an open neighbourhood of , such that , and
[TABLE]
[TABLE]
In addition, for all .
Proof.
Utilizing Proposition 3.1, let be the coordinate chart covering , with open, and , such that the estimate holds. Now, and are submanifolds of dimension and in , and they transversally intersect at only . Hence there exist charts , , where is an open neighbourhood of and is an open neighbourhood of , s.t. , , and
[TABLE]
[TABLE]
where , and are smooth functions, as they specify the embedding of corresponding submanifolds and in . Moreover, we claim that and are locally flat, namely , and . {addmargin}[1em].5em Let us justify this claim for the unstable submanifold on the coordinate chart. Denote
[TABLE]
where for all . Then flow lines on the unstable submanifold parametrized by , consists exactly of those that are the solutions to
[TABLE]
and the implicit solution is
[TABLE]
For any multi-index , the -th derivative of at is
[TABLE]
because requires , and all derivatives of vanishes at . Hence, attributes its local flatness to that of ’s.
The same conclusion can be drawn following a similar argument for the stable submanifold and diffeomorphism .
Now let us construct an diffeomorphism,
[TABLE]
where for some an open neighbourhood of . For simplicity, we still denote by , and similarly for , then
[TABLE]
[TABLE]
Then there is a local chart near the critical point, , , and
[TABLE]
[TABLE]
Hence we find a local chart where we can always split the stable and unstable manifolds by coordinates, while maintaining the estimate of being flat. ∎
Let us denote that , and , on a open bounded neighbourhood of . Then we write a flow induced by as , and the corresponding standard flow by , namely,
[TABLE]
Then solutions can be written down explicitly, or at least formally for , as families of local diffeomorphisms near parametrized by ,
[TABLE]
where .
With the help of and , now we construct local coordinates where on the stable and unstable submanifolds.
Proposition 3.5**.**
Assume that Morse eigenvalues satisfy the -linearity condition, and , . Let and be open neighbourhoods of the critical point on the stable and the unstable submanifolds of , and let formal conjugate functions on and be
[TABLE]
Then are smooth local diffeomorphisms on and respectively. They fix the critical point, and
[TABLE]
In addition, there exist local coordinates on a small neighbourhood of , induced by and , such that
[TABLE]
[TABLE]
for all
Proof.
As established in Proposition 3.4, there exist local coordinates where the critical point is mapped to , with , and . Denote this local coordinate chart explicitly by , namely, coordinates , with an open neighbourhood covering in .
From now on, we restrict ourselves on within this local coordinate chart, and prove that is a diffeomorphism. With this choice of coordinates,
[TABLE]
and we write in this proof.
The formal conjugate function is
[TABLE]
By dominated convergence theorem, the following equation
[TABLE]
holds for all non-negative multi-indices , so long as the right hand side limit exists for all .
Taking into account that from the defining equation of ,
[TABLE]
we have that for any and multi-index ,
[TABLE]
Take equation (12) into (11), so the -th partial derivative of satisfies the relation
[TABLE]
as long as the the limit on the right hand side of the equation exits.
To prove that is a local diffeomorphism, it is enough to show that is smooth and is invertible near . Apparently, is invertible. For smoothness, it is sufficient to prove that the right hand side of equation (13) is continuous for each , and we will achieve that with decay estimates of -derivatives of when .
** regularity of .
**To prove the existence of , we need decay estimate of on . Note that for every large enough integer , there exists a positive constant such that
[TABLE]
as is locally flat for each . Consider
[TABLE]
Denote , , then the above inequality reduces to
[TABLE]
which has explicit solution
[TABLE]
for some positive constant . In addition, as we only care about for , the above inequality simplifies to
[TABLE]
An upshoot of this decaestimate is the observation
[TABLE]
And because the derivative is absolutely dominated by an integrable function, we know that indeed its intergation on the whole of exists, and consiquently so does , as
[TABLE]
Furthermore, the existence of this limit improves our initial decay estimate to
[TABLE]
** regularity of .
**This will be done using bootstrapping method. Let the -th induction hypothesis be that there exist a constant such that the norm of has exponential decay, namely,
[TABLE]
Then it will be sufficient to show that the corresponding -th decay estimate holds.
We will utilize the following ODE
[TABLE]
and with higher order chain rule
[TABLE]
this ODE can be rewritten as
[TABLE]
note that when , the term vanishes, which otherwise contains derivatives of of order lower than .
Summing up gives
[TABLE]
where , and . For every large enough integer , there exists a constant such that , so there exists some such that whenever ,
[TABLE]
As a result,
[TABLE]
and this amounts to
[TABLE]
Note that using higher order chain rule,
[TABLE]
so the integrand in (3.1.2) has exponential decay, following from that of . Hence the integration converges, and there exists some constant such that
[TABLE]
for all .
Based on this estimate we can proceed with the following observation
[TABLE]
for some positive constants . Because the derivative is absolutely dominated by an integrable function, itself is integrable over . Hence exists, and there exists a constant such that
[TABLE]
which is exactly what we want for the -th induction hypothesis.
**The local diffeomorphism generated by .
** Given the construction as before, it is clear that and are diffeomorphisms on an open neighbourhood covering the origin of and respectively,and both of them fix the origin. Then there exist open neighbourhoods and such that , and
[TABLE]
The diffeomorphism is a local coordinate chart covering , with .
Apparently, over this coordinate chart, , and , . Moreover, as we start with the local coordinate where , it is obvious that this local flatness of is preserved by the diffeomorphism — as the vanishing of all derivatives at the origin is passed along by chain rule. ∎
3.1.3 Estimating with controlled error terms.
Theorem 3.6**.**
Assume that the Morse eigenvalues are -linearly independent. Then on a small neighbourhood containing , for every large enough positive integer , there exist local coordinates with mapped to , and a positive constant such that
[TABLE]
for every .
Proof.
Observe that due to on the stable and unstable manifold (Proposition 3.5), combined with Proposition 3.1, the Taylor expansion of reduces to
[TABLE]
where and are multi-indices, , and , . This is true on a small neighbourhood near the critical point. In addition,
[TABLE]
Apparently, we only need instead of to conclude our proof. To achieve this, we will construct a sequence of local coordinates that get rid of the (m+1)-th order terms at m-th step:
[TABLE]
[TABLE]
where are multi-indices with k and (n-k) entries respectively.
The induction hypothesis is
[TABLE]
Specifically, that and are locally flat is part of our assumption.
Then it is sufficient to prove the (m+1)-th hypothesis from the m-th. In fact, because is dominated by the linear part, all we need is that has vanishing (m+1)-th order term, in terms of .
Applying to in (16) gives us
[TABLE]
For simplicity we omit condition on ’s,
[TABLE]
Note that we only care about the vanishing -order terms, so all higher orders are discarded. Also, using the assumption (17), we know that indeed will at least add an order of to and , rendering the higher order terms that we may omit. By doing this, we alter higher order coefficients at each step. Nevertheless, the local flatness of the coefficients is carried to the -th step because this process is linear and finite in higher order terms. As a result,
[TABLE]
or more explicitly as
[TABLE]
Now we solve the first PDE at point . From a simple calculation we learn that
[TABLE]
so the PDE reduces to
[TABLE]
Combined with the fact that and , we have
[TABLE]
and the actual solution is given by the evaluation of at [math], which is
[TABLE]
Because is flat at , by dominated convergence, exists and is smooth at least for all where is controlled by a large enough power of . Especially, if is supported on a sufficiently small neighbourhood containing , which we later will require, then exists throughout the support of .
Similarly, the solution to the second set of PDEs is
[TABLE]
Repeating this induction process until , we then get the estimate within a small neighbourhood , as claimed. ∎
3.2 The contraction operator and conjugate functions.
An intuitive way of approaching the analytical Morse lemma is to seek for a dynamical system that relates a general Morse flow to its standard counterpart. The conjugate function that we used earlier on submanifolds, can be formally defined on the whole neighbourhood of , and seemingly serve the propose of conjugating the generic flow with the standard one pretty well. The problem is that the existence of such a conjugation is far from straightforward, let alone its regularity. The idea behind our method here is motivated by the 1969 paper [9] by Palais and Smale, in which they discuss the existence of topological conjugation, which is they call structural stability, for a certain type of diffeomorphisms.
In this section, we shall give an analytical justification for the existence and regularity of these conjugate functions, and then construct the analytical Morse coordinates we want from them.
From now on, we assume the following.
As we are only interested in the local behaviour of the flow lines, we will assume that on a open bounded coordinate neighbourhood of , with mapped to , and outside a compact neighbourhood containing . In addition, is smooth on . Such a vector field can be derived by the convolution with smooth bump function that is compactly supported and identically over , with .
Moreover, we shall assume that the conclusion of Theorem 3.6, namely, for all large enough positive integer and constant ,
[TABLE]
holds for coordinates , with as . As commented in the proof of the theorem, this is a reasonable requirement.
3.2.1 Weighted Sobolev spaces on .
The following definitions of weighted Sobolev spaces are introduced by Lockhart and McOwen in their pioneer work [3] and [4], which concern the definition and the analysis of weighted Sobolev spaces with finite open ends. By specifiying the base space to be the open ended line in [4, §3], our notation is as follows.
Let , , and . The Sobolev norm and the weighted Sobolev norm of are
[TABLE]
[TABLE]
The Sobolev space is defined as
[TABLE]
and the local Sobolev space consists of all functions that belong to when restricted to any compact support, namely
[TABLE]
It is easy to check that this definition coincides with the usual definition of Sobolev spaces and local Sobolev spaces. Then the weighted Sobolev space over with weight is
[TABLE]
3.2.2 The contraction operator .
Let , . Then we see that for all , and resembles the conjugate function that we want. In addition, is the solution to the ODE
[TABLE]
This ODE has formal solution . Using this integration formula, we cook up an operator ,
[TABLE]
where is an open coordinate chart containing the critical point, with and the critical point corresponds to , and for simplicity, we assume , the open ball of radius centred at .
We claim that under suitable conditions, is indeed a contracting operator. To prove this, we begin with the fact that is shirking within its range.
Let be the closed convex domain , defined by
[TABLE]
for some constant . This domain owns its convexity to that of the -norm.
Proposition 3.7**.**
For suitably chosen , there exists a positive real number , such that
[TABLE]
for all and all .
To prove this, we will need the following lemma.
Lemma 3.8**.**
Given , . Let . Then for any ,
[TABLE]
where is a constant that only depends on .
Proof.
Throughout this proof, let with for all , given some large enough . Note that using the fact that is dense in , this proof holds without assuming that is compactly supported.
First let .
It is sufficient to prove that for with for all , .
Integrating by parts and using the fact that for all small enough , we get
[TABLE]
For each of the two terms, we apply Cauchy-Schwartz inequality to get
[TABLE]
and
[TABLE]
We relabel the two norms as following
[TABLE]
then the original inequality (29) simplifies to
[TABLE]
In non-trivial cases where , we have
[TABLE]
Let be the positive solution to , then the monotonicity of this polynomial in gives . Let the constant , which obviously only depends on . As a result,
[TABLE]
which is what we claim.
Secondly, . By the definition of ,
[TABLE]
∎
Proof of Proposition 3.7.
With the help of Lemma 3.8, we can control with its integrand, as it is an integration:
[TABLE]
As is globally linear, we fully understand that the second term of (30), which is controlled by
[TABLE]
Hence the problem amounts to finding an upper bound for the first term, .
Here are two helpful observations which we will use extensively in bounding the first term. One observation is the chain rule for higher order derivatives
[TABLE]
where with the summation done over all such possible ’s, and . And the other observation is the pointwise control of the difference of two products,
[TABLE]
which can be proven by a simple argument of induction.
Now, let us examine the first term, which is
[TABLE]
with , and we start with a pointwise estimate for each -derivative. For , we have
[TABLE]
combining constants and taking ,
[TABLE]
and denoting gives
[TABLE]
Given this pointwise estimate, we now place a bound on the corresponding -norm term,
[TABLE]
where is an algebraic constant that only depends on . We may also assume that is big enough in our case, so that by Sobolev embedding, , where is the radius of domain , and is the Sobolev constant. As a result,
[TABLE]
with .
To force this constant , it is sufficient to require to be large. As a result, is a contraction mapping. ∎
We will assume that is fixed from now on.
The following lemma guarantees the choice of nice constants such that the zero function does not get “drifted” too far away by .
Proposition 3.9**.**
Given , such that the positive contraction constant is chosen as before, then
[TABLE]
for all .
Proof.
Recall that
[TABLE]
Then Lemma 3.8 and chain rule combined tell us
[TABLE]
where and should be seen as -vectors with powers taking on each elements, e.g. .
If eigenvalues ’s satisfy -linearity condition, then by Theorem 3.6, we have the pointwise assumption for ,
[TABLE]
for any given positive constants and its associated constant . Based on this assumption we have
[TABLE]
Hence,
[TABLE]
Given that is defined as satisfying
[TABLE]
we have the estimate
[TABLE]
Combining this estimate with the conclusion of Theorem 3.6,
[TABLE]
To achieve , we want to employ parameters that satisfy the following inequality
[TABLE]
It becomes clear that by choosing large enough and sufficiently small, our estimate in concern is achieved. Note that the when is reasonably close to zero, the contraction constant only gets smaller, resulting in the improvement of right hand side of the inequality. So the choice of and the reduction in the radius of the region , or even altering for a larger , will not jeopardise our estimate of , even if the constant depends on and .
As a final comment, in terms of the behaviour of for every fixed , the estimate gets improved near the stable and the unstable manifold as well, for is small. ∎
Theorem 3.10**.**
Assume that the set of eigenvalues satisfies the -linearity condition. Given properly chosen constants , , and , there exists a positive constant , and , such that
[TABLE]
is a contraction operator with contracting constant .
Proof.
From Proposition 3.9, it is straightforward to see that for all , the zero function isn’t mapped too far away, i.e.
[TABLE]
In addition, Proposition 3.7 shows that for all ,
[TABLE]
especially for all ,
[TABLE]
then we have , namely . As a result, is a contraction mapping on . ∎
3.2.3 Proof of Theorem 1.2.
Now we present a proof for the main theorem.
Proof of Theorem 1.2.
Theorem 3.10 establishes the fact that is a contracting operator on , which is a convex subset of the complete metric space and hence also is a convex complete metric space. By contraction mapping theorem, there exists a unique fixed point of , such that
[TABLE]
In fact, the integration indicates that is smooth in . And all large enough is a suitable choice, as the proof of Lemma 3.8 shows. Hence when , for every fixed .
Let , then it solves , with its initial condition determined by . In other words, , and
[TABLE]
As is the difference of two flow lines, both of which is smooth in , it is legitimate to fix in equation (33) and write
[TABLE]
Now let us determine the explicit form of . Denote the generalized real number for any , which is the time of the flow when it first exists the domain ; and similarly be the time parameter of exiting the domain . It follows that equals if and only if , and when , .
Let .
If is finite, then for any ,
[TABLE]
as and have identical flow lines outside of , and is linear with respect to . Consequently, is a flow line of , and when , namely is on the unstable submanifold, and it is contained in for all sufficiently large . Such a flow line can only be the static flow at the critical point. Hence in this case
[TABLE]
[TABLE]
and is a conjugate function between the local standard and general flow lines,
[TABLE]
[TABLE]
Otherwise, is infinite. This implies that at least one of and is on the unstable submanifold of , which is also the unstable submanifold of . In fact, both of them are in the unstable locus: If not, let’s say that but , then for sufficiently small , is at least bounded by the radius of , while can be made as close to as we wish; this contradicts , so both and are in . But as we established before, for any in the stable and unstable loci, so we can always write
[TABLE]
Again using the fact that an unstable flow line that is an element in for all large is indeed trivial, we have in this case
[TABLE]
for all , and it indeed is the conjugate function between and , which in this case are identical. And trivially, it still holds that
[TABLE]
To construct local coordinates by means of conjugation, it is necessary for to have enough regularity. It turns out that this is guaranteed by the Implicit Function Theorem (IFT) for Banach spaces [2, 10.2.1, 10.2.3], which we recall as follows : {addmargin}[1em].5em Let be Banach spaces, a continuously differentiable mapping of an open subset of into . Let be a point of s.t. and that the partial derivative be a linear homeomorphism of onto . Then there is an open neighbourhood of in E such that for every open connected neighbourhood contained in , there is a unique continuous mapping of into such that , and for any . Furthermore, is continuously differentiable in , and its derivative is
[TABLE]
If in addition is times continuously differentiable in a neighbourhood of , then is times continuously differentiable in a neighbourhood of .
Let Banach spaces , , and the open subset , with being contracting and small enough. Consider the operator
[TABLE]
Then what we established before translates to that is the unique solution to . Moreover, if is indeed a linear homeomorphism from onto itself, then IFT gives us the regularity of with respect to – because the uniqueness of the fixed point guarantees that coincides with the local inverse function. Consequently, the fact that is in implies that is in . Specifically, by , the smoothness of follows. Combining the smoothness of with the fact that on the stable and the unstable manifold, which includes the origin, is smooth and invertible on a small neighbourhood of , and this makes a local diffeomorphism.
Now we show that is a surjective linear homeomorphism. Let us denote to be a linear operator from to itself. If is a small variation that is sufficiently smooth, then we have
[TABLE]
As a result, the derivative of is
[TABLE]
Apparently is linear. Moreover, there exists a positive constant , which we may assume less than 1, such that
[TABLE]
This is because
[TABLE]
following from Lemma 3.8; and in addition we claim that , which can be easily achieved by picking larger in Lemma 3.8’s proof. It follows that is a continuous linear map on and reasonably close to the identity. Then this operator is an isomorphism, and its inverse can be constructed as
[TABLE]
which is the unique limit of an absolutely convergent sequence in the dual space of . Hence is a homeomorphism and onto, as the IFT requires, which then concludes our proof.
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Audin, Michèle, and Mihai Damian. “Morse theory and Floer homology”. London: Springer, 2014.
- 2[2] Dieudonn e ´ ´ 𝑒 \acute{e} , Jean. “Foundations of Modern Analysis”. Second Ed. Academic Press, 1969.
- 3[3] Lockhart, Robert B., and Robert C. Mc Owen. ”Elliptic differential operators on noncompact manifolds.” Annali della Scuola Normale Superiore di Pisa-Classe di Scienze 12.3 (1985): 409-447.
- 4[4] Lockhart, Robert. “Fredholm, Hodge and Liouville theorems on noncompact manifolds.” Transactions of the American Mathematical Society 301.1 (1987): 1-35.
- 5[5] Moerdijk, Ieke, and Gonzalo E. Reyes. “Models for smooth infinitesimal analysis.” Springer Science & Business Media, 2013.
- 6[6] Milnor, John. “Morse theory, volume 51 of Annals of Mathematics Studies.” Princeton, NJ, USA (1963).
- 7[7] Morse, Marston. “Relations between the critical points of a real function of n 𝑛 n independent variables.” Transactions of the American Mathematical Society 27.3 (1925): 345-396.
- 8[8] Palais, Richard S. “The Morse lemma for Banach spaces.” Bulletin of the American Mathematical Society 75.5 (1969): 968-971.
