Krylov--Bogolyubov averaging
Wenwen Jian, Sergei Kuksin, Yuan Wu

TL;DR
This paper introduces a modified Krylov-Bogolyubov averaging method tailored for PDEs, enabling the analysis of Lipschitz perturbations of linear systems with imaginary spectra and extending to PDEs with small nonlinearities.
Contribution
It develops a new averaging approach specifically designed for PDEs, improving the analysis of systems with small nonlinearities and Lipschitz perturbations.
Findings
Effective treatment of Lipschitz perturbations in linear systems
Extension of averaging methods to PDEs with small nonlinearities
Potential for broader application in nonlinear PDE analysis
Abstract
We present the modified approach to the classical Bogolyubov-Krylov averaging, developed recently for the purpose of PDEs. It allows to treat Lipschitz perturbations of linear systems with pure imaginary spectrum and may be generalized to treat PDEs with small nonlinearities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Krylov–Bogolyubov averaging
Wenwen Jian
School of Mathematical Sciences, Fudan University, Shanghai 200433, P. R. China
,
Sergei Kuksin
Université Paris-Diderot (Paris 7), UFR de Mathématiques - Batiment Sophie Germain, 5 rue Thomas Mann, 75205 Paris, France & School of Mathematics, Shandong University, Jinan, PRC & Saint Petersburg State University, Universitetskaya nab., St. Petersburg, Russia
and
Yuan Wu
School of Mathematical Sciences, Fudan University, Shanghai 200433, P. R. China
Abstract.
We present the modified approach to the classical Krylov-Bogolyubov averaging, developed recently for the purpose of PDEs. It allows to treat Lipschitz perturbations of linear systems with pure imaginary spectrum and may be generalized to treat PDEs with small nonlinearities.
1. Introduction
The classical Krylov-Bogolyubov averaging method is a method for approximated analysis of nonlinear oscillating process. Among a number of its equivalent or closely related formulations, we choose the following. In the space , let us consider the differential equation of the form
[TABLE]
where is a linear operator with pure imaginary eigenvalues without Jordan cells, and is a nonlinearity. The task is to study the behavior of solutions for (1.1) on time-interval of order . Let us firstly pass to the slow time and rewrite the equation as
[TABLE]
where now is a time-variable of order one. Secondly, in equation (1.2), let us pass to the interaction representation variables 111under this name the change of variable (1.3) is known in physics.
[TABLE]
and rewrite the equation as
[TABLE]
The Krylov-Bogolyubov averaging theorem is the following result:
Theorem 1.1**.**
Assume that the vector-field is locally Lipschitz continuous. Then
1) the limit
[TABLE]
*exists for all and also is Lipschitz continuous in . *
2) There exists , such that for , a solution of equation (1.4) is -close, as , to a solution of the equation
[TABLE]
We stress that the only restriction imposed on the spectrum of the operator is that it is pure imaginary. Theorem 1.1 and related results were proved by Krylov-Bogolyubov in a number of works in 1930’s. The research was summarized in the book [4], also see in [2]. In our work we present a proof of Theorem 1.1, based on a variation of the Krylov-Bogolyubov argument, developed recently for the purposes of partial differential equations in [7, 8]. It allows to prove the averaging theorem above under minimal restrictions on the smoothness of the nonlinearity – only its Lipschitz continuity is required – and it generalizes to a class of perturbative problems in PDEs. Theorem 1.1 is proved in Sections 3-4. In Section 5, we discuss its applications to the case when (1.2) is a Hamiltonian system. We remind that the Krylov-Bogolyubov averaging method was the first rigorously justified averaging theory. Before the work of Krylov-Bogolyubov the method of averaging existed as a heuristic theory, after that other rigorous averaging theories were created, see in [2]. In particular, now the method of averaging applies to equations with added stochasticity. The approach of our work, enriched with the ideas of the seminal work [9], suits well to the situation when the stochasticity is added to the problem in the form a stochastic force; both in the ODE and PDE settings. See the second half of the paper [8] and references in that work.
Our paper is based on the lecture notes for a course that SK was teaching at a CIMPA School on Dynamical Systems in Kathmandu (October 25 – November 5, 2018), which was attended by WJ and YW.
Notation. Abbreviation l.h.s. (r.h.s.) stands for “left hand side” (“right hand side”). By (by ) we denote the set of non-negative real numbers (non-negative integers), denote by the open ball , , and by – its closure.
Acknowledgements. SK was supported by the grant 18-11-00032 of Russian Science Foundation. WJ and YW were supported by the National Natural Science Foundation of China (No. 11790272 and No. 11421061).
2. preliminaries
Consider again equation (1.2), and assume that the linear operator has eigenvalues, counted with their geometrical multiplicities. Assume also that these eigenvalues are pure imaginary. Then they go in pairs , where (see [5]). So is an even number, . The imposed restrictions on are equivalent to the following conditions (see [5]): and in exists a basis such that in the corresponding coordinates , the matrix of the linear operator has the form
[TABLE]
Thus, the original unperturbed linear system (1.1) reads:
[TABLE]
Note that this linear system can be written in the Hamiltonian form
[TABLE]
where .
2.1. Complex structures in and real analysis in . Systems (2.2) and (1.2) in complex notation.
The systems (2.2) and (1.2) can be written more compactly if we introduce in the space a complex structure and write and the perturbation in its terms. Corresponding construction is performed in this section and is used below to prove Theorem 1.1; the complex language allows to shorten the proof significantly.
Vectors in the space are caracterised by the coordinates . Let us introduce in a complex structure by denoting
[TABLE]
Then the real space becomes a space of complex sequences with . That is, we have achieved that
[TABLE]
In the complex notation, the Euclidean scalar product in reads
[TABLE]
For the real numbers as in (2.2) let us consider the linear operator
[TABLE]
In the real coordinates it reads
[TABLE]
That is, in the complex coordinates the operator with the matrix (2.1) is the operator , so the system of linear equations (1.1) (2.2) reduces to the diagonal complex system
[TABLE]
In the complex notation the perturbed system (1.1) reads
[TABLE]
Below we assume that the vector-field is locally Lipschitz, i.e. its restrictions to bounded balls , are Lipschitz-continuous. The case of polynomial vector-field will be for us of special interest, and we start with its brief discussion.
Definition 2.1**.**
A complex function is a polynomial if it can be written as
[TABLE]
where are multi-indices with the norm , are some complex numbers, and
[TABLE]
Definition 2.2**.**
A vector-field is polynomial if every its component is a polynomial function.
We recall that for a function (real or complex) of a complex variable , the derivatives and are defined as and The lemma below follows by elementary calculation:
Lemma 2.3**.**
Let and consider a complex polynomial . Then and If is a real-valued -smooth function, then , for any .
Example 2.4**.**
For , consider the system of equations
[TABLE]
Now {\color[rgb]{1,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{1,0,0}\lambda}=\omega and in the complex coordinate the system reads
[TABLE]
Let us come back to the general case of locally Lipschitz vector-fields .
Definition 2.5**.**
Let be a non-decreasing continuous function and be a continuous vector-field. We say that , if for any , and .
Example 2.6**.**
Let be a -smooth vector-field. For we denote by the differential of at (this is a linear over real numbers map from to ). Denote . Then defines a continuous function of , and . Indeed, the continuity of is obvious, while the second property follows from the mean-value theorem which implies that
[TABLE]
Lemma 2.7**.**
Let and . Denote . Then a solution of (2.5) exists for and stays in the ball .
Proof.
Since is a locally Lipschitz vector-field, then a solution of equation (2.5) exists till the blow-up time. Taking the scalar product of equation (2.5) with (see (2.4)), we get
[TABLE]
Denote , where equals if the set under the inf-sign is empty. Then for we have
[TABLE]
Thus, and for all . So and the result follows. ∎
2.2. Slow time and interaction representation
Denote . Then , so equations (2.5) reduce to
[TABLE]
Let us substitute in (2.8)
[TABLE]
Then (2.8) becomes
[TABLE]
Denote . For a real vector let be the rotation operator
[TABLE]
It is easy to see that
[TABLE]
and that each is a unitary transformation.
Denote by the vector . Then (2.9) can be written as or Thus, the system (2.10) reads as
[TABLE]
with the initial condition
[TABLE]
Note that
[TABLE]
3. Averaging of vector-fields
We recall that a diffeomorphism transforms a vector-field on to the vector-field , , see in [1]. Accordingly, a linear isomorphism , transforms the vector-field to
[TABLE]
Our goal in this section is to study the averaging in of the vector-field above.
For a continuous vector-field on and a vector , we denote
[TABLE]
if the limit exists (for we understand as the integral ). Denote
[TABLE]
and for set
[TABLE]
Then
[TABLE]
The vector-field is called the averaging of a field in the direction of a vector , and is called the partial averaging. The latter always exists. Our goal in this section is to prove that the former also exists, if the mapping is locally Lipschitz. To indicate the dependence of the two introduced objects on , sometimes we will write them as and . We recall that being written in the special basis the matrix of operator takes the form (2.1), and that after introducing in the complex structure (2.3) the matrix becomes . Since , then the definition of agrees with that in (1.5).
Lemma 3.1**.**
If , then for any .
Proof.
If , then . Since , one obtains , for each , and then
[TABLE]
Similarly, for any , we have
[TABLE]
From (3.4) and (3), one obtains
[TABLE]
and
[TABLE]
Thus, for . ∎
Lemma 3.2** (The main lemma of averaging).**
For any and , the limit of (3.1) exists for any , and . If , then the rate of convergence in (3.1) depends only on and .
Before proving the general case of Lemma 3.2, we firstly consider the case when is a polynomial vector-field. Then,
[TABLE]
and one has
[TABLE]
It follows that
[TABLE]
Definition 3.3**.**
A pair is called -resonant if . The resonant part of the polynomial vector-field (3.6) is another polynomial vector-field such that
[TABLE]
Example 3.4**.**
For equation (2.7) in Example 2.4, we have The monomial is resonant since So now .
Note that
[TABLE]
So,
[TABLE]
Thus, we have
[TABLE]
Therefore, in the polynomial case the limit in (3.1) exists.
Lemma 3.5**.**
If is a polynomial vector-field of the form (3.6), then the limit in (3.1) exists for all , equals to the resonant part of and satisfies . Moreover, if , then the rate of convergence (3.1) depends only on and .
Proof.
The existence of the limit already is proved, and its Lischitz continuity easily follows Lemma 3.1. The second assertion holds since the rate of convergence in (3.7) and (3.8) depends only on the indicated quantities. We omit the details. ∎
Now we begin to prove Lemma 3.2.
Proof.
To show that the limit exists we have to verify that converges to a limit as . It suffices to show that for any , there exists , such that
[TABLE]
For any consider the restriction of to the closed ball . By the Stone-Weierstrass theorem, there exist and a polynomial of degree , depending only on and , such that
[TABLE]
We have got a polynomial vector field , for which the assertions of Lemma 3.2 already are proved.
Since for any , then
[TABLE]
So,
[TABLE]
By Lemma 3.5, there exists such that
[TABLE]
[TABLE]
The same is true for . Therefore (3.9) follows, and the convergence (3.1) is established. The inclusion is a consequence of Lemma 3.1, while the last assertion of the lemma directly follows from the proof. The lemma is proved. ∎
Example 3.6**.**
Let for all and be an anti-holomorphic polynomial . Then no pair is -resonant and .
3.1. Properties of the operator
Proposition 3.7**.**
Let and be locally Lipschitz vector-fields on . Then
(linearity): for any .
- 2)
If , , then .
- 3)
The mapping commutes with all operators .
- 4)
The mapping is measurable. So for every the averaging is a measurable function of .
Proof.
Properties 1) and 2) are obvious. Let us prove 3), assuming for definiteness that . We have:
[TABLE]
To prove 4), we note that for the mappings are continuous, so measurable. By Lemma 3.2, the mapping in question is a point-wise limit, as , of the measurable mappings above; so it also is measurable (see [10], Theorem 1.14). ∎
4. Averaging for solutions of equation (2.11)
In this section we get our main result, describing the behaviour, as , of solutions of equation (2.11) on time-intervals , where const does not depend on . In view of (2.11), this also describes the behaviour of the amplitudes of solutions (2.8), and accordingly, the behaviour of the amplitudes of solutions for (2.5) on long time-intervals .
Let in eq. (2.8) for some function as in Definition 2.5, and let be its solution such that . Denote . Then by Lemma 2.7, for . For the curve satisfies (2.11), (2.12) and for each . So for we have:
[TABLE]
Consider the collection of curves (the solutions of equation (2.11)),
[TABLE]
By (4.1) and the Arzela-Ascoli theorem, the family is precompact in . So there exists a sequence , such that
[TABLE]
for some curve . By (4.1),
[TABLE]
Passing in this relation to the limit as , we obtain
[TABLE]
Now we address the following problem: does the limit depend on ? If it does not, then how to describe it?
A solution of (2.11) satisfies the relation
[TABLE]
and the estimates (4.1). From Lemma 3.2,
[TABLE]
where does not depend on if .
Consider the following effective equation
[TABLE]
that is
[TABLE]
Since is locally Lipschitz, then a solution for (4.6) is unique and exists at least for small .
Lemma 4.1**.**
The curve is a solution of (4.6) for .
To prove the lemma we first perform some additional constructions.
Assume for definiteness that , i.e. , and consider an intermediate scale ; then . Denote . Let , and . Then , .
Let the curves , be defined as in (3.2) with .
Lemma 4.2**.**
For any , denote where . Then uniformly in we have , where as , does not depend on if .
Proof.
Denote
[TABLE]
Then . The term is trivially small. Now consider with . We have
[TABLE]
Consider the term . Since , then by (4.1), we have
[TABLE]
As for any both vector-fields and belong to , then
[TABLE]
Now consider the term . We have
[TABLE]
where . Making in the last inequality the substitution and noting that , we obtain:
[TABLE]
From (4.5), as . Since by item 3) of Proposition 3.7
[TABLE]
and as by Lemma 3.2 the above does not depend on , then
[TABLE]
So
[TABLE]
and therefore,
[TABLE]
The lemma is proved. ∎
Proof of Lemma 4.1.
Consider
[TABLE]
The term (4.7) as in view of (4.2), the term (4.8)=0 by (4.4), the term (4.9) by Lemma 4.2 and (4.10) as by (4.2). Passing to the limit as , we see that . Therefore, is a solution of (4.6) for 0\leq{\color[rgb]{1,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{1,0,0}|\tau|}\leq\theta. Lemma 4.1 is proved. ∎
Since a solution of eq. (4.4) is unique, then the convergence (4.2) holds as , not only as . Thus we have proved
Theorem 4.3**.**
Let , be a solution of (4.4). Then
[TABLE]
where is a solution of (4.6).
Theorem 4.3 proves the second assertion of Theorem 1.1, whose first assertion was already established in Lemma 3.2.
Since , then Theorem 4.3 implies:
Corollary 4.4**.**
The solution of (2.5) satisfies
[TABLE]
Example 4.5**.**
By Examples 2.4 and 3.4, the effective equation for the system (2.6), written in the complex variable and the slow time , is
[TABLE]
In the real variables and the fast time the equation reads
[TABLE]
For its solution is , . So for the solution of (2.6) with initial data satisfies
[TABLE]
5. Hamiltonian equations
Let us provide the space with the usual symplectic structure, given by the form . Then a real-valued Hamiltonian
[TABLE]
gives rise to the Hamiltonian system
[TABLE]
which we rewrite as
[TABLE]
Assume that is locally Lipschitz. It means that
[TABLE]
Then (5.1) is a special case of equation (2.11).
Lemma 5.1**.**
Let be defined as in (3.2). Then
[TABLE]
Proof.
Denote , i.e., . Then
[TABLE]
since . ∎
From Lemma 5.1,
[TABLE]
where we denoted
For the same reason as in Section 3 (see there Lemma 3.2), since is a locally Lipschitz function, then the limit
[TABLE]
exists and is locally Lipschitz (this limit is the averaging of the function in the direction ). Repeating the proof of Proposition 3.7.3) we get that is invariant with respect to the rotations :
[TABLE]
We claim that is a -function and
[TABLE]
for every . Indeed, by the second assertion of Lemma 2.3 and (5.2)
[TABLE]
From here and (5.2) we conclude that is a function, and for each the derivative is a complex linear combination of and . So by Lemma 3.1, for any bounded domain in functions are uniformly in bounded and Lipschitz on . Same is true for , for all . Now the uniform in convergence of to implies the uniform convergence of and to limits, for all . In view of the uniform in convergence of to we get that is -smooth and satisfies (5.5).
We have proved
Theorem 5.2**.**
If equation (2.5) has the hamiltonian form (5.1), where the real Hamiltonian belongs to the space , then the effective equation (4.6) also is hamiltonian: the vector field has the hamiltonian form (5.5), where the Hamiltonian is defined in (5.3).
Example 5.3**.**
If is a real polynomial,
[TABLE]
then
Let us take any and assume that the vector is non-resonant, that is for some implies that . Let us introduce in the action-angle coordinates with where for a vector with we have and . Then , and
[TABLE]
Since now the curve is dense in for each , then (5.4) implies that does not depend on the angles .222For example, if is the polynomial (5.6), then
That is, the Hamiltonian is integrable, and the effective equation reads
[TABLE]
So its solutions are such that const, and Theorem 4.3 implies
Corollary 5.4**.**
If the frequency vector is non-resonant and , then a solution of equation (5.1) satisfies
[TABLE]
for a suitable which depends on .
If , then for a function as in Definition 2.5. For we write
[TABLE]
if for all for a suitable function , independent from . For example, if is a polynomial (5.6) such that the coefficients are nonzero only for , then . Consider the equation
[TABLE]
where . To study its small solutions we substitute and get for equation (5.1) with . We see that if the frequency vector is non-resonant and is a solution of (5.7) with small initial data , then \big{|}|z_{j}(t)|^{2}-\epsilon^{2}|w_{0j}|^{2}\big{|}=o(\epsilon^{2}) for , where . In [6] a more delicate argument is used to show that if the frequency vector satisfies certain diophantine condition and the Hamiltonian is analytic, then the stability interval is much bigger – it is exponentially long in terms of for some positive . Our result is significantly weaker, but it only requires that the vector is non-resonant and the Hamiltonian vector-field is Lipschitz–continuous. We note that the result of [6] generalises to PDEs, e.g. see [3], as well as Theorem 1.1 (and Corollary 5.4), see [7, 8].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V. I. Arnold. Geometrical Methods in the Theory of Ordinary Differential Equations . Springer-Verlag, New York, second edition, 1988.
- 2[2] V. I. Arnold, V. V. Kozlov, and A. I. Neishtadt. Mathematical Aspects of Classical and Celestial Mechanics . Springer-Verlag, New York, third edition, 2006.
- 3[3] D. Bambusi. An averaging theorem for quasilinear Hamiltonian PD Es. Ann. Henri Poincaré , 4(4):685–712, 2003.
- 4[4] N. N. Bogoliubov and Y. A. Mitropolsky. Asymptotic Methods in the Theory of Non-linear Oscillations . Gordon and Breach, New York, 1961.
- 5[5] I. M. Gelfand. Lectures on Linear Algebra . Dover Publications, Inc., New York, 1989.
- 6[6] A. Giorgilli, A. Delshams, E. Fontich, L. Galgani, and C. Simó. Effective stability for a Hamiltonian system near an elliptic equilibrium point, with an application to the restricted three-body problem. J. Differential Equations , 77:167–198, 1989.
- 7[7] G. Huang. An averaging theorem for nonlinear Schrödinger equations with small nonlinearities. Discrete Contin. Dyn. Syst. A , 34:3555–3574, 2014.
- 8[8] G. Huang, S. Kuksin, and A. Maiocchi. Time-averaging for weakly nonlinear CGL equations with arbitrary potentials. In Hamiltonian partial differential equations and applications , volume 75 of Fields Inst. Commun. , pages 323–349. Fields Inst. Res. Math. Sci., Toronto, ON, 2015.
