On the topology of nearly-integrable Hamiltonians at simple resonances
L. Biasco, L. Chierchia

TL;DR
This paper demonstrates that averaging near simple resonances in nearly-integrable Hamiltonians results in a one-dimensional cosine-like potential system, with the full Hamiltonian closely approximated by an effective pendulum-like Hamiltonian, advancing understanding of phase space structures.
Contribution
It provides a detailed analysis of the effective Hamiltonian near simple resonances, showing it reduces to a pendulum-like system with exponentially small perturbations, supporting a key conjecture in Hamiltonian dynamics.
Findings
Effective Hamiltonian at simple resonances resembles a pendulum.
Perturbations are exponentially small relative to potential oscillation.
Results support the Arnold–Kozlov–Neishtadt conjecture on measure of non-torus sets.
Abstract
We show that, in general, averaging at simple resonances a real--analytic, nearly--integrable Hamiltonian, one obtains a one--dimensional system with a cosine--like potential; ``in general'' means for a generic class of holomorphic perturbations and apart from a finite number of simple resonances with small Fourier modes; ``cosine--like'' means that the potential depends only on the resonant angle, with respect to which it is a Morse function with one maximum and one minimum. \\ Furthermore, the (full) transformed Hamiltonian is the sum of an effective one--dimen\-sio\-nal Hamiltonian (which is, in turn, the sum of the unperturbed Hamiltonian plus the cosine--like potential) and a perturbation, which is exponentially small with respect to the oscillation of the potential. \\ As a corollary, under the above hypotheses, if the unperturbed Hamiltonian is also strictly convex, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the topology of nearly–integrable Hamiltonians at simple resonances
L. Biasco & L. Chierchia
Dipartimento di Matematica e Fisica
Università degli Studi Roma Tre
Largo San L. Murialdo 1 - 00146 Roma, Italy
[email protected], [email protected]
Abstract
We show that, in general, averaging at simple resonances a real–analytic, nearly–integrable Hamiltonian, one obtains a one–dimensional system with a cosine–like potential; “in general” means for a generic class of holomorphic perturbations and apart from a finite number of simple resonances with small Fourier modes; “cosine–like” means that the potential depends only on the resonant angle, with respect to which it is a Morse function with one maximum and one minimum.
Furthermore, the (full) transformed Hamiltonian is the sum of an effective one–dimensional Hamiltonian (which is, in turn, the sum of the unperturbed Hamiltonian plus the cosine–like potential) and a perturbation, which is exponentially small with respect to the oscillation of the potential.
As a corollary, under the above hypotheses, if the unperturbed Hamiltonian is also strictly convex, the effective Hamiltonian at any simple resonance (apart a finite number of low–mode resonances) has the phase portrait of a pendulum.
The results presented in this paper are an essential step in the proof (in the “mechanical” case) of a conjecture by Arnold–Kozlov–Neishdadt ([3, Remark 6.8, p. 285]), claiming that the measure of the “non–torus set” in general nearly–integrable Hamiltonian systems has the same size of the perturbation; compare [5], [4].
Contents
1 Introduction
Consider a real–analytic, nearly–integrable Hamiltonian given, in action–angle variables, by
[TABLE]
where is a bounded domain in , is the usual flat dimensional torus and is a small parameter measuring the size of the perturbation . The phase space is endowed with the standard symplectic form so that the Hamiltonian flow governed by is the solution of the standard Hamiltonian equations
[TABLE]
(where is time and dot is time derivative).
It is well known that, in general, the –dynamics is strongly influenced by resonances of the (unperturbed) frequencies , i.e., by rational relations
[TABLE]
with ; for general information, compare, e.g., [3]. Indeed, assuming a standard KAM non–degeneracy assumption on , e.g., that the frequency map is a real–analytic diffeomorphism of onto the “frequency space” , then the action space can be covered by three open sets
[TABLE]
so that the following holds. Roughly speaking, is a fully non–resonant set which is filled, up to an exponentially small set, by primary KAM tori, namely, by homotopically trivial, Lagrangian tori –invariant on which the flow is analytically conjugated to the linear flow
[TABLE]
with satisfying a Diophantine condition
[TABLE]
(for some ); denoting the standard inner product and Furthermore, such tori are deformation of integrable tori.
is an open –neighbourhood of simple resonances (i.e., of regions where exactly one independent resonance holds) and is a set of measure ; compare the Covering Lemma (Proposition 2.1) below111This description follows by choosing carefully certain parameters (such as the “small divisor constant” and “Fourier cut–offs” ) as functions of and disregarding logarithmic corrections..
The region contains double (and higher) resonances and, in general, in there are regions where the dynamics is non–perturbative, being “essentially” governed (after suitable rescalings) by an –independent Hamiltonian; compare222 [3, Remark 6.8, p. 285]: “It is natural to expect that in a generic system with three or more degrees of freedom the measure of the “non–torus” set has order . Indeed, the –neighbourhoods of two resonant surfaces intersect in a domain of measure . In this domain, after the partial averaging taking into account the resonances under consideration, normalizing the deviations of the “actions” from the resonant values by the quantity , normalizing time, and discarding the terms of higher order, we obtain a Hamiltonian of the form , which does not involve a small parameter (see the definition of the quantity above). Generally speaking, for this Hamiltonian there is a set of measure that does not contain points of invariant tori. Returning to the original variables we obtain a “non–torus” set of measure .” [3].
The dynamics in the simple–resonance region is particularly relevant and interesting. For example, it plays a major rôle in Arnold diffusion, as showed by Arnold himself [2], who based his famous instability argument on shadowing partially hyperbolic trajectories arising near simple resonances.
On the other hand, in there appear secondary KAM tori, namely –dimensional KAM tori with different topologies, which depend upon specific characteristics of the perturbation . The appearance of secondary tori is a genuine non–integrable effect, since such tori do not exist in the integrable regime.
In the announcement [5] it is claimed that, in the case of mechanical systems – namely systems governed by Hamiltonians of the form – and for generic potentials , primary and secondary tori fill the region up to a set of measure nearly exponentially small, showing that the “non–torus set” is, at most, as conjectured in333See footnote 2 above. Note also that, as it was proved in [10] (in dimension 2) and [13], [15] (in any dimension), the union of primary invariant tori fills the phase space up to a set of measure . This result is optimal: the phase region inside the separatrix of the pendulum , with and , does not contain any primary invariant torus, namely a circle which is a global graph over the angle on , and this region has measure . Indeed this region is filled by secondary tori, corresponding to oscillations of the pendulum. [3] and studied in [12]. In fact, Theorem 2.1 below is one of the building block of the proof (in the mechanical case) of the Arnold–Kozlov–Neishdadt conjecture as outlined in [4].
This paper is devoted to the fine topological and quantitative analysis of the behaviour of generic systems in the simple resonant region .
In this introduction, we briefly discuss the main aspects of this analysis in the particular case of purely positional potentials; precise statements are given in Theorem 2.1 of § 2 below, and, for the general (but more technical and implicit) case, in Theorem 7.2 of § 7.2).
is the union of suitable regions , which are -close to exact simple resonances , and which are labelled by generators of one dimensional, maximal sublattices of (see (21) below); “exact” meaning that does not verify double or higher resonant relations.
In averaging (or normal form) theory, one typically considers a finite but large (possibly, –dependent) number of simple resonances. More precisely, one considers generators with , and can be chosen according to the application one has in mind. Typically, one chooses for a suitable (as in Nekhoroshev theorem [16], [8]) or (as in the KAM theory for secondary tori of [5], [4]).
By averaging theory, in any fixed simple resonant region , one can remove the non–resonant angle dependence, so as to symplectically conjugate , for small enough, to a Hamiltonian of the form
[TABLE]
where is a function of one angle and is a “very small” remainder with444 denotes the Fourier coefficient of . Thus, up to the remainder , the Hamiltonian depends effectively only on the “resonant angle” and therefore the “effective Hamiltonian” is integrable: this is the starting point for (“a priori stable”) Arnold diffusion or for the KAM theory for secondary tori of [5]–[4].
Obviously, there are here two main issues:
- (a)
What is the actual “generic form” of ?
- (b)
How small (and compared to what) is the remainder ?
(a) According to averaging (or normal form) theory555For generalities on Averaging Theory, see, e.g., [3, § 6] and references therein. is “close” to the projection of the potential on the Fourier modes of the resonant maximal sublattice :
[TABLE]
Now, since is real-analytic on , it is holomorphic in a complex strip around of width and its Fourier coefficients decay exponentially fast as666Precise norms will be introduced in the next section § 2. . Hence, typically (i.e., if )
[TABLE]
which, by the reality condition , can be written as
[TABLE]
for a suitable . Thus,
[TABLE]
provided
[TABLE]
In other words, one expects (8) to hold for generic real–analytic potentials and for all generators ’s satisfying (9).
Indeed, this is the case: we shall introduce certain classes of periodic holomorphic functions , for which (choosing suitably the “tail” fuction ) (8) holds for generators satisfying (9).
The class turns out to be “generic” in several ways:
(i) it contains an open dense set in the class of real–analytic functions having holomorphic extension on a complex neighbourhood of size of (in the topology induced by a suitably weighted Fourier norm);
(ii) its unit ball is of measure 1 (with respect to a natural probability measure);
(iii) it is a “prevalent set”.
For precise statements see Definition 2.1 and Proposition 3.1 below.
Next, in order for to be close to in (8), one needs to have a bound of the type
[TABLE]
As well known, averaging methods involve an analyticity loss in complex domains. In particular, the Hamiltonian in (5) and, therefore, , can be analytically defined only in a smaller complex strip with . Therefore, by analyticity arguments, the best one can hope for is an estimate of the type
[TABLE]
for a suitable constant that can be taken to be smaller than any prefixed positive number. But then, for (11) and (10) to be compatible one sees that one must “essentially” have and that standard averaging theory is not enough777Compare, e.g., [16], where . For a more detailed comparison with the averaging lemma of [16], see also Remark 4.1–(iv) below. Compare also [6] and [7].. To overcome this problem, we provide (Section 4) a normal form lemma with small analyticity loss, “small” meaning that one can take
[TABLE]
(compare, in particular, (74)). The value (12) is compatible with (10) for , showing that, indeed, generically, one has
[TABLE]
In particular we prove that: The “effective Hamiltonians” , as vary (), have (up to a phase–shift) the same cosine–like form and, hence, the same topological feature; compare, also, Remark 2.2–(i) below.
Notice also that on low modes this last property, in general, does not holds, as one immediately sees by considering and a potential such that
[TABLE]
which is a Morse function with two maxima and two minima in .
(b) What we just discussed gives also an indication for the question “with respect to what has to be small”. In fact, if, as expected, (8) is the leading behaviour, one should have
[TABLE]
But in order to perform averaging procedures, one has, typically, control on small divisors up to the truncation order , so that the remainder will contain high Fourier modes, , of the potential . Such terms are bounded by , which are of the same size of , at least for .
To overcome this problem, we introduce in § 5, at difference with standard geometry of resonances (such as in [14], [16], [8]), two Fourier cut offs in such a way that on the simple resonant regions one has non–resonance conditions for double and higher resonances up to order , while is the maximum value of the size of the generators (i.e., ). Therefore, we will get an estimate of the remainder of the type
[TABLE]
for suitable constants . The final upshot is the complete normal form
[TABLE]
with and ; compare Theorem 2.1 below and, in particular, formula (42).
Summarizing: for all large enough, is “cosine–like” i.e. a Morse function with one maximum and one minimum (compare Remark 2.2–(i) below) and is exponentially small with respect to the oscillations of (see (42) and (44) below).
As a consequence we get that, if is strictly convex, the effective Hamiltonian has a phase portrait of a pendulum (compare Remark 2.2–(iv) below).
2 Statements
Assume that in (1), for some , admits holomorphic extension on the complex domain , where is the open complex neighbourhood of formed by points such that888We denote by the usual Euclidean norm. , for some and denotes the open complex neighbourhood of given by
[TABLE]
The integrable hamiltonian is supposed to be “KAM non–degenerate” in the following sense.
Assumption A Let be a real-analytic function
[TABLE]
where is a bounded domain of and such that the frequency map
[TABLE]
is a global diffeomorphism of onto with Lipschitz constants given by
[TABLE]
Now, we describe the covering of frequency/action domain, which allows to apply averaging theory (Proposition 4.1 below) to a perturbation of an integrable system with Hamiltonian at non–resonant (modulus a lattice) zones999We use here the term “zone” in a loose way, not in the technical meaning of Nekhoroshev’s Theory; compare, e.g., [8]..
Here, the main point is to find a suitable covering of simple resonances, which are the regions where the averaged Hamiltonian is integrable101010In the sense that the, up to a small remainder, the averaged Hamiltonian depends on one angle., up to a small remainder. All other higher–order resonances are covered by one set, which is of small measure: how small depending on the choice of the various parameters involved and it will vary according to the applications one has in mind.
Let denote the set of integer vectors in such that the first non–null component is positive:
[TABLE]
and denote by the generators of 1d maximal lattices, namely, the set of vectors such that the greater common divisor (gcd) of their components is 1:
[TABLE]
Then, the list of one–dimensional maximal lattices is given by the sets with .
Given we set111111
[TABLE]
Proposition 2.1
(Covering Lemma)* Let and be as in Assumption A (§ 2) and fix and . Then, the domain can be covered by three sets , *
[TABLE]
so that the following holds.
(i)* is completely non–resonant (i.e., non–resonant modulus ), namely,*
[TABLE]
(ii)* , where, for each121212Recall (22). is a neighbourhood of a simple resonance , which is non–resonant modulo , namely,*
[TABLE]
(iii)* contains all the resonances of order two or more and has Lebesgue measure small with : more precisely, there exists a constant depending only on such that*
[TABLE]
Remark 2.1
(i) The neighbourhoods of simple resonances are explicitly defined as follows. Denote by the orthogonal projection on the subspace perpendicular to131313Explicitly, . and, for , define
[TABLE]
Then,
[TABLE]
(ii) The domains are explicitly defined in (146), (130) and (134) below.
(iii) The simply resonant regions in the above Proposition are labelled by generators of 1–d maximal lattices up to size , however, the non–resonance condition (25) holds for integer vectors with up to a (possibly) larger order . This improvement (with respect to having as, e.g., in [16]) is technical but important if one wants to have sharp control over the averaged Hamiltonian in a normal form near simple resonances; in particular in order to obtain (159) and (182), which lead to (44).
(iv) The non–resonance relations (24) and (25) allow to apply averaging theory and to remove the dependence upon the “non–resonant angle variables” up to exponential order; for precise statements, see Theorem 6.1 in § 6.
We proceed, now, to describe the generic non–degeneracy assumption on periodic holomorphic functions, which will allow to state the main theorem (for the case of positional potentials).
If , we denote by the Banach space of real–analytic functions on having zero average and finite –Fourier norm:
[TABLE]
Note that can be uniquely written as:
[TABLE]
For functions141414Not necessarily holomorphic in . we will also use the (stronger) norm151515See Remark 3.1 for details.
[TABLE]
Definition 2.1** (Non degenerate potentials)**
*A tail function is, by definition, a non–increasing, non–negative continuous function *
[TABLE]
Given and a (possibly –dependent) tail function , we define, for , as the set of functions in such that, for any generator , the following holds161616 One could substitute with every in (32); compare Remark 3.3 below. The “weight” is necessary in order to show that in (33) has positive measure in a suitable probability space; compare Proposition 3.1–(ii) below.* *
[TABLE]
*The class is the union over of the classes : *
[TABLE]
The classes contain (if the tail is choosen properly) the non degenerate potentials for which Theorem 2.1 below holds and, as mentioned in the Introduction, satisfy three main genericity properties171717Such properties hold for any tail , which can be chosen differently according to the particular problem at hand., as showed in Proposition 3.1 below (compare, also, Remark 3.2).
Theorem 2.1
*Let , such that *
[TABLE]
Consider a Hamilonian as in (1) where satisfies the non–degeneracy Assumption A (§ 2) and is purely positional (i.e., independent of the –variable) with
[TABLE]
Assume that the potential is non–degenerate in the sense that
[TABLE]
with tail function
[TABLE]
*Let satisfying *
[TABLE]
and where defined in (19). Set
[TABLE]
*Finally assume that *
[TABLE]
Then, for any with , there exists and a symplectic change of variables defined in a neighbourhood of the simple resonance such that the following holds:
[TABLE]
and
[TABLE]
*where for every and *
[TABLE]
Finally,
[TABLE]
Remark 2.2
(i) Recalling (31), estimate (43) means
[TABLE]
This implies that for every the -periodic real function
[TABLE]
behaves like a cosine in the sense that it is a Morse function with only one maximum and one minimum and no other critical points. To prove this, notice that by (45) we have
[TABLE]
Therefore, denoting by the derivative of the function in (46), we have that for and for where . Moreover in the interval the function has a zero and is strictly increasing since Finally in the interval it has a zero and is strictly decreasing since
(ii) As a consequence the phase portrait of the effective Hamiltonian
[TABLE]
and that of the Hamiltonian are topologically equivalent.
(iii) As well know the effective Hamiltonian (47) is an integrable system as it depends only on one angle. Indeed, fix with gcd, then, there exists a matrix such that181818Here, is a row vector.
[TABLE]
where denotes the sup–norm of the matrix and of the vector, respectively. The existence of such a matrix is guaranteed by an elementary result of linear algebra based on Bezout’s Lemma (see Lemma A.1 in Appendix A).
Let us perform the linear symplectic change of variables
[TABLE]
which is generated by the generating function . Note that does not mix actions with angles, its projection on the angles is a diffeomorphism of onto , and, most relevantly, is the “secular angle”.
In the –variables, the secular Hamiltonian in (47) takes the form
[TABLE]
Fix on the exact resonance, namely Let be such that We have
[TABLE]
where is the Hessian matrix of . By Taylor expansion the secular Hamiltonian in (50) takes the form (up to an addictive constant)
[TABLE]
(iv) In particular if the Hamiltonian is convex the coefficient is bounded away from zero and the phase portrait of the secular Hamiltonian in (51) is topologically equivalent, for small191919Namely in the region in the original variables., to that of the pendulum
[TABLE]
The -dependent case.
Let us briefly turn to the -dependent case. First we note that it can happen that, even if the potential satisfies the non-degeneracy condition given in Definition 2.1 at some point , there is no neighborhoud of on which the non-degeneracy condition holds. For example consider the potential
[TABLE]
We have that with and
[TABLE]
However, for every in particular for every odd number and Then for every , tail function and odd we have
[TABLE]
Then, we will prove that the non-degeneracy condition holds in a set of large measure. In particular we fix and prove that, if for a certain point the potential for a suitable and then (42)-(44) holds (with and for a suitable phase ) for every up to a set of relative measure smaller that .
For the precise statement, we refer to Theorem 7.2 below.
3 Functional setting and generic holomorphic classes
3.1 Analytic function spaces
- (a)
Norms and neighbourhoods
In this paper denotes the standard Euclidean norm on and its subspaces, namely for every .
denotes the 1-norm .
For linear maps and matrices (which we shall always identify), denotes the standard “operator norm” .
, with matrix (or vector), denotes the maximum norm (or ).
We shall use three different (non–equivalent) norms on holomorphic functions with domain , or ( being a susbset of ): given a holomorphic function with values in and domain , or , with we denote by
[TABLE]
its Fourier expansion and define the following sup–norm, –Fourier norm and –Fourier norm:
[TABLE]
If the (real) domain need to be specified, we set, respectively,
[TABLE]
Remark 3.1
(i) The space of functions endowed with the sup–norm or the –Fourier norm is a Banach algebra, while is just a Banach space (not a Banach algebra). However, the norm is particularly suited to describe as a probability space; compare item (h) below.
(ii) As already mentioned the three norms in (55) are not equivalent. Indeed, for any , one has202020We have Moreover Indeed for the estimates follows by In the case we have
where in the second inequality we have used that for , while in the last one we exploit
[TABLE]
The Banach subspace of of real–analytic functions with zero average () will be the natural ambient function space. Generic elements of such a space will be the typical potentials to which our uniformaveraging theory applies. We givet it a name:
The following two definitions are needed in order to decompose a holomorphic function on into a sum over generators of 1-d maximal lattices of holomorphic functions on . Later the Fourier modes will be identified with simple resonances.
- (b)
Lattice Fourier projectors
Given and a sublattice of , we denote by the projection on the Fourier coefficients in namely
[TABLE]
and by its “orthogonal” operator (projection on the Fourier modes in ):
[TABLE]
Obviously
[TABLE]
- (c)
Fourier Truncation operators
Given , we introduce the following “truncation” and “high–mode” operators and :
[TABLE]
Note that and commute. Note, also, that
[TABLE]
and that
[TABLE]
- (d)
1d–Fourier projectors
Given and we define the following 1d–Fourier projector
[TABLE]
being the Fourier coefficient of with Fourier index .
It is immediate to see that:
Any can be uniquely written as:
[TABLE]
Notice also that, if and , then
[TABLE]
We are now ready to define the main function spaces.
Finally, we introduce a probability measure on the unit ball in .
- (e)
Denote by the Banach space of complex sequences (over ) given by
[TABLE]
Then, the map
[TABLE]
is an isomorphism of Banach spaces212121Recall that since the functions in are real–analytic one has the reality condition ., which allows to identify functions in with points in and the Borellians of with those of .
Denote by the closed ball of radius one in and by the Borellians in .
On we can introduce the following natural (product) probability measure.
Consider, first, the probability measure given by the normalized Lebesgue–product measure on the unit closed ball of , namely, the unique probability measure on the Borellians of such that, given Lebesgue measurable sets in the unit complex disk with only for finitely many , one has
[TABLE]
where “meas” denotes the Lebesgue measure on the unit complex disk .
Then, the isometry in (66) naturally induces a probability measure222222I.e, . on the Borellians .
3.2 Generic properties of periodic holomorphic classes
Here we discuss some properties of the classes of non-degenerate introduced in Definition 2.1.
Remark 3.2
(i) Since , one has that for all ’s and (32) says that, when is a generator of maximal 1d–lattices (later corresponding to simple resonances), the –Fourier coefficient does not vanish and is controlled in a quantitive way from below: is a suitable weight (needed in the proof of Proposition 3.1 below), while is any number satisfiying
[TABLE]
(ii) It is easy to construct functions in . For example let
[TABLE]
which has Fourier coefficients
[TABLE]
and 1d–Fourier projections
[TABLE]
Then, and, also, for any choice of tail function .
Here, we show that the classes are “general” in several (topological and measure theoretical) ways.
Proposition 3.1
(Properties of )* Let and be a tail function. Then:*
- (i)
The set contains an open dense set.
- (ii)
* and .*
- (iii)
* is a prevalent set*232323We recall that a Borel set of a Banach space is called prevalent if there exists a compactly supported probability measure on the Borellians of such that for all ; compare, e.g., [9] .
Proof
(i) contains an open subset which is dense in the unit ball of .
Let us define as but with the difference that (32) is replaced by the stronger condition242424Note, however, that .
[TABLE]
Let us first prove that is open. Let . We have to show that there exists such that if , then . Fix such that (69) holds and, by continuity of , choose small enough such that , where and denotes integer part. Then, since is not increasing, it is immediate to verify that . Moreover
[TABLE]
namely satisfies (69) (with instead of ).
Let us now show that is dense in the unit ball of . Take in the unit ball of and . We have to find with . Let and denote by and (to be defined) be the Fourier coefficients of, respectively, and . We, then, let unless , and , in which case, . It is, now, easy to check that and is –close to .
(ii) and
We shall prove that, for every , the measure of the sets of potentials that do not satisfy (32) is , the result will follow letting .
By the identification (66), the measure of the set of potentials that do not satisfy (32) with a given is bounded by
[TABLE]
Remark 3.3
Recalling footnote 16, one could impose the condition in (32). Then (70) would become which is still fine if
(iii) is prevalent.
Consider the following compact subset of : let , where , and let be the unique probability measure supported on such that, given Lebesgue measurable sets , with only for finitely many , one has
[TABLE]
The isometry in (66) naturally induces a probability measure on with support in the compact set . Reasoning as in the proof of , one can show that . It is also easy to check that, for every , the translated set satisfies . Thus, one gets , , which means that is prevalent (recall footnote 23).
4 A normal form lemma with “small” analyticity loss
In this section we describe an analytic normal form lemma for nearly–integrable Hamiltonians , which allows to average out non–resonant Fourier modes of the perturbation on suitable non–resonant regions, and allows for “very small” analyticity loss in the angle variables, a fact, which will be crucial in our applications.
We recall ([14], [16]) that, given an integrable Hamiltonian , positive numbers and a lattice , a (real or complex) domain is non–resonant modulo (with respect to ) if
[TABLE]
The main point of the following “Normal Form Lemma” is that the “new” averaged Hamiltonian is defined, in the fast variable (angle) domain, in a region “almost equal” to the original domain, “almost equal” meaning a complex strip of width if is the width of the initial angle analyticity. More precisely, we have:
Proposition 4.1** (Normal form with “small” analyticity loss)**
Let , , , and let be a lattice of . Let
[TABLE]
be real–analytic on with Assume that is (,) non–resonant modulo and that
[TABLE]
Then, there exists a real–analytic symplectic change of variables
[TABLE]
satisfying
[TABLE]
*and such that *
[TABLE]
with
[TABLE]
*Moreover, re-writing (76) as *
[TABLE]
*one has *
[TABLE]
where
[TABLE]
Remark 4.1
- (i)
The “novelty” of this lemma is that the bounds in (77) and the first one in (79) hold on the large angle domain with . In particular the first estimate in (77) (or, equivalently, in (79)) will be important in our analysis in order to obtain (159), (162) and, therefore, (170), (177) and finally (181), which is the key to prove (43) in Theorem 2.1. The drawback of the gain in angle–analyticity strip is that the power of in the smallness condition (73) is not optimal: for example in [16] the power of is one (but , which would not work in our applications).
- (ii)
Having information on non–resonant Fourier modes up to order , the best one can do is to average out the non–resonant Fourier modes up to order , namely, to “kill” the term of the Fourier expansion of the perturbation. This explains the “flat” term surviving in (76) and which cannot be removed in general. Now, think of the remainder term as
[TABLE]
then, is a –perturbation of the part in normal form (i.e., with Fourier modes in ), while is, by (61), a term exponentially small with (see also below) and is a very small remainder bounded by .
- (iii)
We note that (78) follows from (76). Indeed we take
[TABLE]
Then the first estimate in (79) follows by the first bound in (77) and (58). Regarding the second estimate in (79), we first note by (77) and (61) (used with , , , and so that and )
[TABLE]
[TABLE]
- (iv)
Let us compare our results with more standard formulations, such as the Normal Form Lemma in § 2 of [16]. In that formulation, imposing the weaker smallness condition the normal form Hamiltonian writes with exponentially small (of order ) and, regarding one knows that
[TABLE]
For our purposes we need to prove that, when ( indexes the simple resonance we want to consider while indexes the second order resonance beyond ) and the quantity
[TABLE]
is small. Indeed by (79) we have
[TABLE]
which is small when
[TABLE]
Consider, for example, the function with small and defined in (68). We have that , for a suitable constant . In this case (83) writes
[TABLE]
On the other hand by estimate (81) one only have
[TABLE]
which is small only for
[TABLE]
that is a considerably stronger bound than the one in (84).
Since we are considering simple resonances indexed by the non resonant region will be non-resonant only up to order therefore we have that the perturbation, after normal form in the non-resonant region, will be of magnitude
[TABLE]
when the bound (85) applies. This estimate is very bad. On the other hand, in our case, the weaker bound (84) applies and we obtain that the perturbation is exponentially small.
Given a function we denote by the hamiltonian flow at time generated by and by “ad” the linear operator and its iterates:
[TABLE]
as standard, denotes Poisson bracket252525Explicitly, ..
Recall the identity (“Lie series expansion”)
[TABLE]
valid for analytic functions and small . We recall the following technical lemma by [16].
Lemma 4.1** (Lemma B.3 of [16])**
For
[TABLE]
By Lemma 4.1 we get (see also Lemma B.4 of [16])
Lemma 4.2
For
[TABLE]
Summing the Lie series in (86) (see Lemma B5 of [16]) we get, also,
Lemma 4.3
Let and Assume that
[TABLE]
Then for every the time-1-flow of vector field define a good canonical transformation
[TABLE]
satisfying
[TABLE]
Moreover let and set
[TABLE]
Then for any
[TABLE]
*for every function with
In particular when *
[TABLE]
**Proof **We first note that by Lemma 4.1 (applied with , ) for every we have
[TABLE]
Then (89) holds.
For set for brevity
[TABLE]
We get
[TABLE]
and, iterating,
[TABLE]
by Stirling’s formula. Then
[TABLE]
Finally (92) and (93) follows by (91) and since by (87).
Given and a lattice , recall the definition of in (76) and define
[TABLE]
so that we have the decomposition (valid for any ):
[TABLE]
Lemma 4.4
Let and Consider a real–analytic Hamiltonian
[TABLE]
Suppose that is (,) non–resonant modulo for (with ). Assume that
[TABLE]
Then there exists a real–analytic symplectic change of coordinates
[TABLE]
generated by a function with
[TABLE]
satisfying
[TABLE]
such that
[TABLE]
with
[TABLE]
Notice that, by (94) and (100), one has
[TABLE]
Notice also that
[TABLE]
**Proof **Let us define
[TABLE]
and note that solves the homological equation
[TABLE]
Since is (,) non–resonant modulo the estimate (97) holds. We now use Lemma 4.3 with parameters With these choices it is , and, by (96) . Thus, (88) holds and Lemma 4.3 applies. (98) follows by (90). We have
[TABLE]
with
[TABLE]
Since
[TABLE]
we have
[TABLE]
Finally, applying again Lemma 4.3 with , by (92), we get , proving (100) and concluding the proof of Lemma 4.4.
As a preliminary step we apply Lemma 4.4 to the Hamiltonian in (72) with and By (58), (60), (94) and (73) hypothesis (96) holds, namely
[TABLE]
Then there exists a real–analytic symplectic change of coordinates
[TABLE]
satisfying
[TABLE]
such that
[TABLE]
with
[TABLE]
Recalling (94) and (106) we get
[TABLE]
[TABLE]
Then, setting
[TABLE]
we have
[TABLE]
Finally, since by (102) we get
[TABLE]
The idea is to construct by applying times Lemma 4.4.
Let
[TABLE]
Fix and make the following inductive assumptions:
Assume that there exist, for , real–analytic symplectic transformations
[TABLE]
generated by a function with
[TABLE]
satisfying
[TABLE]
such that
[TABLE]
satisfies, for , the estimates
[TABLE]
where
[TABLE]
Let us first show that the inductive hypothesis is true for (which implies ). Indeed by (110) we see that we can apply Lemma 4.4 with and . Thus, we obtain the existence of , generated by a function with
[TABLE]
satisfying (113) and262626Note also that (114), so that and, by (99) and (100),
[TABLE]
We have that Then272727Note that and
[TABLE]
Write
[TABLE]
By (92) (with ) we have
[TABLE]
by (110) and (111). By (91) with
[TABLE]
by (110), (118) and (87) (with ). Analaogously by (87) we get
[TABLE]
Summarizing:
[TABLE]
Then, by (119) and (120) we get
[TABLE]
checking (116) in the case
Now take and assume that the inductive hypothesis holds true for and let us prove that it holds also for . By (116) and (73) we can apply Lemma 4.4 with and . Thus, we obtain the existence of , generated by a function with
[TABLE]
so that and, by (99) and (100),
[TABLE]
since We have that Then282828Note that and
[TABLE]
Writing
[TABLE]
we have
[TABLE]
where . By (87) with , , , we get, by (113) and (117),
[TABLE]
By (91) with reasoning as above we get
[TABLE]
by (116) and (73). By (92) (with ) we have
[TABLE]
by (111), (116), (109) and (73). Analogously, for by (92) (now with )
[TABLE]
[TABLE]
Whence:
[TABLE]
Then by (123) we get
[TABLE]
By (124) we get (116) with This completes the proof of the induction.
Now, we can conclude the proof of Proposition 4.1. Set
[TABLE]
Notice that, by (4), and . By the induction, it is
[TABLE]
with (recall (76)). Note that by (116) and (110)
[TABLE]
Since by (105), (114) and triangular inequality we get
[TABLE]
then (75) follows (the estimate on the angle being analogous).
Since (for any , recall (94)) we have
[TABLE]
proving the second estimates in (77).
Finally, (using that and that )
[TABLE]
which proves also the first estimate in (77).
5 Geometry of resonances
We, first, discuss the Covering Lemma in frequency space in a ball and then we shall pull back through in the action domain.
We define a covering of
[TABLE]
as follows.
- :
The definition of the completely non–resonant zone is nearly tautological:
[TABLE]
- :
Recalling the definition of in (27) we set
[TABLE]
- :
The set is the union of neighbourhoods of exact double resonances292929Recall (22).
[TABLE]
namely:
[TABLE]
where
[TABLE]
Indeed, from these definitions, (129) follows immediately.
Next, let us point out the non–resonance properties satisfied by the frequencies in .
- (i)
If , then there exists and a such that and, therefore,
[TABLE]
- (ii)
Let with and let , . Then, there exist and such that . Hence,
[TABLE]
- (iii)
It remains to evaluate the measure of . To do this, we first prove the following
Lemma 5.1
If with , , then
[TABLE]
Moreover,
[TABLE]
**Proof **Let be the projection of onto , which is the plane generated by and (recall that, by hypothesis, and are not parallel). Then,
[TABLE]
and
[TABLE]
Set
[TABLE]
Then, decomposes in a unique way as
[TABLE]
for suitable . By (140),
[TABLE]
and
[TABLE]
since is a positive integer (recall, that and are integer vectors not parallel). Hence,
[TABLE]
and (137) follows since and .
To estimate the measure of we write as with in the orthogonal complement of the plane generated by and . Since and lies in a rectangle of sizes of length and (compare (142) and (143)) we find
[TABLE]
finishing the proof of Lemma 5.1.
From (133) and (144) it follows immediately (recall that ) that
[TABLE]
for a suitable constant depending only on .
Proof (of Proposition 2.1) Recalling (130), (131) and (133), for set
[TABLE]
Then (129) implies (23), while (135), (136) and (145) imply immediately (24), (25) and303030Recall the definition of in Assumption A, § 2. (26) respectively, proving Proposition 2.1.
6 Averaging Theory
Assumption B
Let and let satisfy Assumption A in § 2.
Let be a holomorphic function with
[TABLE]
and define
[TABLE]
Let , , and be such that
[TABLE]
For , define
[TABLE]
Putting together the Normal Form Lemma (Proposition 4.1) and the Covering Lemma (Proposition 2.1) there follows easily the following averaging theorem for non–resonant and simply resonant zones:
Theorem 6.1
*Let Assumption B hold and assume that satisfies (40) and *
[TABLE]
then the following holds.
(i)* There exists a symplectic change of variables*
[TABLE]
such that
[TABLE]
*where denotes the average with respect to the angles and *
[TABLE]
(ii)* and for any there exists a symplectic change of variables*
[TABLE]
such that
[TABLE]
where
[TABLE]
and
[TABLE]
Remark 6.1
(i) The functions and depend, effectively, only on one angle : more precisely, setting
[TABLE]
we have (recall (63))
[TABLE]
From (159) and (64) it follows
[TABLE]
The function will be called the effective potential since, disregarding the small remainder , it governs the Hamiltonian evolution at simple resonances.
(ii) We have assumed that (see (147)), since this is the natural assumption in term of genericity properties, however the Normal Form Lemma is formulated in term of the stronger norm . We need therefore to restrict slightly the angle–analyticity domain in order to pass to the norm . This can be done through (3.1), which yields (for or and or )
[TABLE]
(iii) The choice of in (149) is not restrictive (since it is done through the introduction of , a new parameter) and it has the effect of making disappear from the smallness conditions and from the definition of the smallness parameters and .
According to the choice of and one will get different kind of statements.
Remark 6.2
(i) Observe that so that assumption (40) ensures the necessary condition:
[TABLE]
(ii) The hypotheses of the Normal Form Lemma (Proposition 4.1) concern a complex domain , while the non–resonance properties of the Covering Lemma (Proposition 2.1) hold on real domains. The following simple observation allows to use directly the Covering Lemma:
If a set is non–resonant modulo for , then the complex domain is non–resonant modulo , provided , where is the Lipschitz constant of on the complex domain .
Indeed, if there exists such that and for all , . Thus, for such ’s, one has
[TABLE]
Proof (of Theorem 6.1) (i): By Remark 6.2–(ii), (24) and the choice of in (150), the domain is completely non–resonant (or non–resonant modulo the trivial lattice ) and, in view of (163) and (152), one can apply Proposition 4.1 to in (148) with313131Recall that the notation “” means “with replaced by ”.
[TABLE]
Thus, recalling (73), using (150) and that , one sees that
[TABLE]
showing that (73) holds and also that in (80). Then, by (79) and (163), one has:
[TABLE]
from which (155) follows.
(ii): By Remark 6.2–(ii), the definition of in (150) and (25), the domain is
[TABLE]
non–resonant modulo .
Using again (163), we can apply Proposition 4.1 with
[TABLE]
and (recall (73) and that )
[TABLE]
showing that (73) holds and also that in (80). From (79), (168) and (163) there follows (159); indeed
[TABLE]
7 Proofs of main results
Under the above standing hypotheses, apart from a finite number of simple resonances the effective potential at simple resonances is close to a (shifted) cosine:
Proposition 7.1
*Let the assumptions of Theorem 6.1 hold, let and let be as in (160), (158). Then, if *
[TABLE]
*one has that *
[TABLE]
**Proof **Observe that by definition of323232Recall (59). and ,
[TABLE]
Now, since ,
[TABLE]
so that
[TABLE]
Next, recalling (160), we have that
[TABLE]
Let us estimate the two sums separately. Since , so that and:
[TABLE]
where in the last inequality we used the assumption .
Then (again, because ), we see that
[TABLE]
Putting (174) and (175) together, by (171) and (169), (170) follows.
Proposition 7.2
*Let the assumptions of Theorem 6.1 hold; let and fix any . Assume (38) and (34). If satisfies *
[TABLE]
*(with defined in (37)) then, *
[TABLE]
*where *
[TABLE]
Remark 7.1
Conditions (38) and (176) are stronger than the ones on in (149), (152) and (169). In particular the assumptions of Proposition 7.1 hold.
Proof of Proposition 7.2 As mentioned in the above remark, Proposition 7.1 holds. Let us estimate the two terms in (170) separately. Recalling the definition of in (151) (and that ), we find:
[TABLE]
As for the second term in (170), we use the following calculus lemma, whose elementary check is left to the reader:
Lemma 7.1
If , and , then .
Indeed, by the lemma (with , and ) and in view of (34) and (176), one has
[TABLE]
The bounds (179) and (180) prove the claim.
The quantity defined in (178) is a “Fourier–measure” for the non–degeneracy of analytic potentials holomorphic on , since such potential will have, in general, Fourier coefficients .
7.1 Positional potentials: proof of Theorem 2.1
In order to conclude the proof of Theorem 2.1 we need the following
Lemma 7.2
*Let and fix any . Let the assumptions of Theorem 6.1 hold; assume (38), (34) and that the positional potential with the tail function defined in (176). Then, for , one has *
[TABLE]
and
[TABLE]
**Proof **Since Proposition 7.2 holds, by (177) and since we get (181). By (159) we get
[TABLE]
Then using that333333Using that for we have
[TABLE]
we prove (182).
Recalling the definition of given in (59), we have that
[TABLE]
for a suitable constant . Setting
[TABLE]
we get (42). Finally Theorem 2.1 follows from Lemma 7.2, in particular (43) and (44) follow from (181) and (182), respectively.
7.2 The general case (-dependent potentials)
For let such that
[TABLE]
For we set
[TABLE]
For definiteness we will fix
[TABLE]
but every other possible choice is fine.
Proposition 7.3
Let and For any let be holomorphic functions on the complex ball with
[TABLE]
Then, for every with , up, at most, to a set of measure343434As usual .
[TABLE]
we have
[TABLE]
The proof relies on the following classical result in function theory (see, e.g., [11]):
Lemma 7.3
(Cartan’s Estimate)* Assume that is holomorphic and bounded by on the complex ball . If then, for *
[TABLE]
for any up to a set of balls of radii satisfying
[TABLE]
Remark 7.2
Note that (184) holds in the complex ball up to a set of measure smaller than . Moreover it holds on the real interval up to a set of (real) measure
Proof of Proposition 7.3 Fix Fix with and We apply Cartan’s estimates simultaneously for every with
[TABLE]
By (184) estimate (183) for the fixed holds on the segment , up, at most, to a set of measure353535Recall Remark 7.2.
[TABLE]
Integrating on the half-sphere , we get that (183) for the fixed holds on the ball up, at most, to a set of measure
[TABLE]
Summing on all we get that (183) holds for all .
Fix Define the following tail function
[TABLE]
where
[TABLE]
Fix and assume that
[TABLE]
Set
[TABLE]
We have that
[TABLE]
Let . Then by Proposition363636With 7.3 there exists a set373737Both and are real sets.
[TABLE]
with
[TABLE]
such that
[TABLE]
Theorem 7.1
Let the assumption of Theorem 6.1 hold. Fix and . Assume that for some we have . Set
[TABLE]
Assume that
[TABLE]
If with then
[TABLE]
where was defined in (189) and383838Note that by (164)
[TABLE]
**Proof **First we note that by (7.2), (188), (195) and Cauchy estimates
[TABLE]
By (170), (187) and (196) we have that for every
[TABLE]
Then, in order to prove (193), it is enough to show that
[TABLE]
Let us consider the first inequality in (198). Since by (191) and recalling (151) we have then, recalling (188) and (7.2), for
[TABLE]
where
[TABLE]
Since
[TABLE]
by (192), we obtain that
[TABLE]
proving the first estimate in (198).
Regarding the second inequality in (198) we have
[TABLE]
with
[TABLE]
We note that
[TABLE]
by (185), indeed393939 Note that if and analogously if and
[TABLE]
Then
[TABLE]
This proves (198) and, therefore, completes the proof of (193).
Let us now show (194). By (159) and (196) we get
[TABLE]
where
[TABLE]
Then (194) follows if we prove that Recalling (149) we get404040Using that for
[TABLE]
by (192).
We rewrite Theorem 7.1 in the fashion of Theorem 2.1.
Theorem 7.2
Let , Fix and . Consider a Hamilonian as in (1) such that satisfies the non–degeneracy Assumption A (§ 2) and has norm one: Assume that for some we have , with defined in (185). Let with satisfying (192) and (152). Let as in (195) and as in (189) . Finally assume that satisfies (40).
Then, for any with , there exists a symplectic change of variables as in (41) such that the following holds.
For every there exist a phase and functions and satisfying
[TABLE]
*with *
[TABLE]
and
[TABLE]
**Proof **It directly follows from Theorem 7.1. We only note that is defined such that
[TABLE]
while
[TABLE]
Note that and are not analytic in (due to the presence of ), but, obviously, is real–analytic in and .
Acknowledgments We are grateful to V. Kaloshin and A. Sorrentino for useful discussions.
Appendix A An elementary result in linear algebra
Lemma A.1
*Given there exists a matrix with integer entries such that , , and . *
**Proof **The argument is by induction over . For the lemma is obviously true. For , it follows at once from414141The first statement in this formulation of Bezout’s Lemma is well known and it can be found in any textbook on elementary number theory; the estimates on and are easily deduced from the well known fact that given a solution and of the equation , all other solutions have the form and with and by choosing so as to minimize .
Bezout’s Lemma Given two integers and not both zero, there exist two integers and such that , and such that .
Indeed, if and are as in Bezout’s Lemma with and one can take . Now, assume, by induction for that the claim holds true for and let us prove it for . Let and and notice that . By the inductive assumption, there exists a matrix with , such that and . Now, let and be as in Bezout’s Lemma with , and . We claim that can be defined as follows:
[TABLE]
First, observe that since divides for , . Then, expanding the determinant of from last column, we get
[TABLE]
Finally, by Bezout’s Lemma, we have that , so that
[TABLE]
which, together with , shows that .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1]
- 2[2] V. I. Arnold. Instability of dynamical systems with several degrees of freedom. Dokl. Akad. Nauk SSSR 156, 9–12 (1964). Engl. transl.: Sov. Math., Dokl. 5, 581–585 (1964)
- 3[3] V. I. Arnold, V. V. Kozlov, and A. I. Neishtadt. Mathematical aspects of classical and celestial mechanics, volume 3 of Encyclopaedia of Mathematical Sciences . Springer-Verlag, Berlin, third edition, 2006. [Dynamical systems. III], Translated from the Russian original by E. Khukhro.
- 4[4] L. Biasco, and L. Chierchia. KAM Theory for secondary tori, ar Xiv:1702.06480 v 1 [math.DS]
- 5[5] L. Biasco, and L. Chierchia. On the measure of Lagrangian invariant tori in nearly–integrable mechanical systems. Rend. Lincei Mat. Appl. 26 (2015), 1–10
- 6[6] A. Delshams; P. Gutiérrez, Effective stability and KAM theory. J. Differential Equations 128 (1996), no. 2, 415–490
- 7[7] A. Giorgilli; L. Galgani, Rigorous estimates for the series expansions of Hamiltonian perturbation theory. Celestial Mech. 37 (1985), no. 2, 95–112
- 8[8] M. Guzzo, L. Chierchia and G. Benettin. The steep Nekhoroshev Theorem, Commun. Math. Phys. 342, 569-601 (2016)
