Three-Scale Singular Limits of Evolutionary PDEs
Bin Cheng, Qiangchang Ju, Steve Schochet

TL;DR
This paper investigates the behavior of solutions to certain evolutionary PDEs with two small parameters, establishing conditions for uniform boundedness and convergence to a limit system as parameters approach zero.
Contribution
It introduces a framework for analyzing three-scale singular limits in evolutionary PDEs, including conditions for solution boundedness and convergence.
Findings
Solutions remain uniformly bounded under specific conditions.
Solutions converge to a limit equation as parameters tend to zero.
A simple example illustrates the necessity of the conditions.
Abstract
Singular limits of a class of evolutionary systems of partial differential equations having two small parameters and hence three time scales are considered. Under appropriate conditions solutions are shown to exist and remain uniformly bounded for a fixed time as the two parameters tend to zero at different rates. A simple example shows the necessity of those conditions in order for uniform bounds to hold. Under further conditions the solutions of the original system tend to solutions of a limit equation as the parameters tend to zero.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Three-Scale Singular Limits of Evolutionary PDEs
Bin Cheng & Qiangchang Ju & Steve Schochet
(Date: 14-04-2019
B. Cheng
Department of Mathematics, University of Surrey, Guildford, GU2 7XH, United Kingdom
E-mail: [email protected]
Q. Ju
Institute of Applied Physics and Computational Mathematics, P.O. Box 8009, Beijing 100088, China
E-mail: [email protected]
S. Schochet
School of Mathematical Sciences, Tel-Aviv University, Tel Aviv 69978, Israel
E-mail: [email protected] )
Abstract.
Singular limits of a class of evolutionary systems of partial differential equations having two small parameters and hence three time scales are considered. Under appropriate conditions solutions are shown to exist and remain uniformly bounded for a fixed time as the two parameters tend to zero at different rates. A simple example shows the necessity of those conditions in order for uniform bounds to hold. Under further conditions the solutions of the original system tend to solutions of a limit equation as the parameters tend to zero.
Key words and phrases:
Keywords: singular limit multiple time scales. 35B25, 35L45.
1. Introduction
Many physical systems contain several small parameters, such as the Mach number, Alfvén number, Froude number, Rossby number, etc. When these parameters are considered to have fixed ratios to one another then the system has two time scales, one induced by the terms containing the small parameters and the other coming from the order-one terms in the equation. The classical theory of singular limits for evolutionary partial differential equations (PDEs) ([KM81, BK82, Maj84, Sch88, Sch94, Gre97, Gal98] and numerous papers on particular systems, e.g. [MS01]) was developed to treat this case. In order to determine the behavior of solutions when two physical parameters tend to zero in a different manner it is necessary to develop an analogous theory for systems with three time scales. The systems to be considered here have the form
[TABLE]
where and are small parameters. As in the theory of two-scale singular limits, the system without the large terms is assumed to be symmetric hyperbolic, and and are assumed to be antisymmetric constant-coefficient differential or pseudodifferential operators of order at most one. As for two-scale singular limits [KM81, Gal98], parabolic terms of size could be added to the right side of (1.1), although the complications such terms induce would be greater in the three-scale case.
The fundamental discovery of Klainerman and Majda [KM81, KM82] for two-scale singular limits was that the presence of the small parameter in the matrix , which occurs naturally in the normalized equations for low Mach number fluid flow, induces a delicate balance. As they showed, this ensures that solutions of (1.1) with , having fixed initial data belonging to a Sobolev space of sufficiently high index, exist for a time independent of the small parameter and satisfy bounds independent of that parameter, without the need for additional conditions on the large terms or the initial data, such as those assumed in [BK82, Sch88] to treat the case when depends on rather than . Whenever the small parameter in (1.1) is not asymptotically smaller than , i.e., when for some arbitrarily small positive constant , then the Klainerman-Majda balance is essentially preserved and their uniform existence result remains valid and requires only cosmetic changes to the proof. Similarly, when is a constant matrix, as in the rotating shallow water equations ([MK03, Equation (2.2)]) then the Klainerman-Majda uniform existence result remains valid for arbitrary and .
Hence we will be concerned here with the case when does depend nontrivially on , and
[TABLE]
Our first main result is a uniform existence theorem under two additional assumptions: The first condition is that
[TABLE]
for some positive constant , where
[TABLE]
is the Sobolev embedding exponent in dimension . The second condition is that the initial data are uniformly bounded in the Sobolev space and are “well-prepared” in the usual sense that the initial time derivative
[TABLE]
is uniformly bounded in , with the domain being either the whole space or the torus . Examples of initial data satisfying this condition are given in (3.10) below. For convenience, we shall henceforth omit the spatial domain in integrals and function spaces throughout the paper. Although (1.3) limits how small can be compared to , it is consistent with the scaling (1.2) that violates the Klainerman-Majda balance. Moreover, both conditions are necessary, at least for obtaining uniform bounds on solutions of general systems without Klainerman-Majda balance, as will be shown via a simple explicit example.
Our other main result is a convergence theorem showing, under the additional assumptions described below, that as and both tend to zero solutions of (1.1) whose initial data converge in tend to the solution of a certain limiting equation. The framework of the convergence theorem is the same as for two-scale singular limits: The bounds of the existence result yield compactness, which implies that every sequence of and tending to zero while obeying (1.3) has a subsequence for which the solution converges, and convergence without restricting to such subsequences is obtained by showing that the limit satisfies a limit equation for which solutions of initial-value problems are unique. However, both the form and the derivation of the limit equation are more complicated for three-scale singular limits. For the two-scale singular limit obtained when , the limit equation is obtained by decomposing (1.1) into the projections onto the null space of and onto its orthogonal complement, multiplying the latter by and taking the limits of the results. However, in order to obtain the limit equation for the three-scale singular limit in which (1.2)–(1.3) holds it is necessary to use perturbation theory to compute some number of terms of the power series in the small parameter for the eigenvalues and eigenspaces of in Fourier space. The number of terms required and the resulting limit equation depend on the relationship between and as they both tend to zero. In order to obtain convergence without restricting to subsequences it is necessary to restrict the relationship between and so as to obtain a specific limit equation. This requires the additional assumption that for some integer either
[TABLE]
or
[TABLE]
either of which implies (1.2)–(1.3) hold. Note that if for some that is not an integer then (1.7) holds with . The limit equation is different for different values of and even for different values of in (1.6), but is the same for all in . The reason why the limit equation depends on is that when (1.6) holds then the limit equation contains a term arising from the power series expansion in of . Moreover, although both and are both bounded operators from to it turns out that may not be, as will be explained in Definition 4.4 and Remark 4.5 below. Such terms do not occur in two-scale singular limits. As a result, the second time derivative of the limit solution may not belong to , although the limit process ensures that its first time derivative does belong to .
After presenting the example showing the necessity of our conditions for obtaining uniform bounds in §2, the uniform existence theorem will be formulated precisely and proven in §3, and the convergence theorem will be formulated precisely and proven in §4. Some simple examples of the perturbation procedure and the limit equations will also be presented in that section. In forthcoming work the results here will be applied to the problem that motivated this research, namely the simultaneous zero Alfvén number and zero Mach number limit of the scaled compressible inviscid MHD equations
[TABLE]
where the small parameters and are respectively the Mach number and Alfvén number, the fluid density is , its velocity is , the magnetic field is with being the unit vector in the -direction, and the coefficient functions and depend on the constitutive relation giving the fluid pressure as a function of its density.
2. Example
Consider the system
[TABLE]
which has the form (1.1), together with the initial data
[TABLE]
that satisfy the condition that the initial time derivative be uniformly bounded. The system could be turned into one in which the large terms involve derivatives with respect to an additional spatial variable by replacing the terms and by and , respectively, and changing to . A term containing could also be added.
It will be convenient to write the solution to (2.1)–(2.2) in terms of
[TABLE]
which satisfies
[TABLE]
since . The solution to (2.4) is
[TABLE]
Differentiating (2.5) or its derivatives with respect to produces a term containing a factor , while differentiating with respect to produces a term containing a factor since the -dependence in the exponent of (2.5) lies inside . Taking into account the factor of in (2.5) that comes from the initial condition, this shows that
[TABLE]
for some function that is not identically zero provided that both and genuinely depend on their argument.
The standard existence theory for symmetric hyperbolic systems in spatial dimension requires obtaining a bound on the norm of solutions. The system (2.1) can be considered to be a system in any dimension, and estimate (2.6) implies that the solution of (2.1)–(2.2) will be uniformly bounded in only when is bounded, which requires that (1.3) must hold.
Moreover, if the condition that the initial data must be well prepared is dropped then the initial value of in (2.2) can be rather than , which makes (2.6) more singular by one power of . The condition that the norm of the solution be uniformly bounded then requires that be bounded, i.e., that , so no general result beyond the Klainerman-Majda balance is then possible.
3. Uniform Existence Result
3.1. Scaling
Estimate (2.6) implies that the derivatives of solutions of (2.1) satisfy corresponding estimates, except that itself and its pure spatial derivatives are no smaller than because that is the size of the component . These estimates suggest that the appropriate norm of solutions of (1.1) to estimate would be
[TABLE]
where as usual denotes the spatial derivative of order . Although our method indeed allows us to estimate the weighted norm (3.1) of solutions, doing so requires keeping an exact count of the spatial derivatives appearing in instances of the Gagliardo-Nirenberg inequalities (3.16) below. In order to avoid the need to count spatial derivatives we will instead perform a simplified estimate by using
weights that depend only on the number of time derivatives, with the weight of the term and its spatial derivatives equal to . These weights equal their counterparts in (3.1) for the highest spatial derivative of , under the assumption that equality holds in (1.3). For lower-order spatial derivatives when that equality holds, or for all cases when strict inequality holds in (1.3), the weights we use are smaller than their counterparts in (3.1). Hence the simplified estimate will be somewhat weaker than the estimate that would be obtained using (3.1). With one exception this difference is of little importance, because estimates of norms of time derivatives of solutions weighted by small constants are simply a means of obtaining an unweighted estimate for the spatial norms of solutions. The one exception is that the norm of in (3.1) has weight one and so yields a uniform bound, while the norm of in the simplified scheme has weight and so does not yield a uniform bound. Obtaining a uniform bound for some norm of is important for the convergence theory, and it will turn out that the time evolution of the unweighted norm of can be estimated in terms of the norms appearing in the simplified estimates, so we will simply adjoin the unweighted norm of to the simplified scheme of estimates.
However, as is common in the theory of hyperbolic systems, we must modify the standard and norms to include the coefficient matrix of the time-derivative term in the PDE (1.1), with the argument of taken from some solution to (1.1). We therefore define
[TABLE]
The corresponding quantities with the subscript omitted will denote the standard inner product and norms in which is replaced by the identity matrix. Assumption 3.3 together with the estimates to be obtained will ensure that the two are equivalent for the time intervals considered here. The definitions in (3.2) are a slight abuse of notation, both since the argument of is usually not given explicitly but must be understood from the context, and because the value of and at a given time does not depend solely on the value of at that time on account of the inclusion of time derivatives.
Remark 3.1**.**
The standard existence theorem for symmetric hyperbolic systems ([Maj84, Ch. 2, Theorem 2.1]) shows that there exists a unique solution, for some time that may depend on and , to the initial-value problem consisting of (1.1) together with an initial condition . Moreover ([Maj84, Ch. 2, Theorem 2.2]), that solution continues to exist as long as its norm remains finite. Hence in order to prove that the time of existence can be taken to be independent of and it suffices to obtain a uniform bound on the norm of the solution. The proof of the existence theorem uses estimates in which the function appearing inside in the norm from (3.2) differs from the solution being estimated. However, since in this paper the solutions being estimated are already known to exist the function appearing inside in the norms (3.2) will simply be the solution that is being estimated.
Remark 3.2**.**
The standard energy estimates for both symmetric hyperbolic systems without large terms and for singular limits obeying Klainerman-Majda balance involve only spatial derivatives of the solution. The reasons why time derivatives are also needed here and why an unweighted estimate for the time derivative is only obtained in the norm will be explained after the proof of Theorem 3.6.
3.2. Assumptions and initial data
The following standard conditions on the terms appearing in system (1.1) will be assumed, where is defined in (1.4):
Assumption 3.3**.**
- (1)
The matrices and the are symmetric and are functions of their arguments. 2. (2)
The matrix is positive definite; more specifically there are positive constants and such that
[TABLE] 3. (3)
The function is bounded in uniformly in , and for the norm of is bounded uniformly in t. In addition, the function belongs to . 4. (4)
The operators and are anti-symmetric constant-coefficient differential or pseudodifferential operators of order at most one.
Remark 3.4**.**
The identity
[TABLE]
together with the definitions in Assumption 3.3 show that
[TABLE]
As noted in the introduction, the initial data will be required to be chosen so that from (1.5) is uniformly bounded in . From the PDE (1.1) we see that this well-preparedness condition is equivalent to the condition that
[TABLE]
Under the above conditions the following lemma shows that the norm of will be uniformly bounded at time zero. In the statements of both this result and the main theorem we will use the Sobolev embedding constant, i.e., the constant such that
[TABLE]
Lemma 3.5**.**
Assume that initial data satisfy
[TABLE]
for all
[TABLE]
that Assumption 3.3 holds, and that , where is defined in Assumption 3.3. Let be any function such that , equals the right side of (1.5) obtained by solving (1.1) for and setting equal to zero, and the higher time derivatives of at time zero through order are determined recursively in similar fashion by solving of the PDE (1.1) for , setting equal to zero, and substituting in the values of lower time derivatives of at time zero already so determined.
Then there is a constant depending only on the spatial dimension , the constants from (3.3), and from (3.7), and and from (3.8), the norms of and as operators from to , the norms of , , and over the domain , and the norm of , such that at time zero
[TABLE]
for all and satisfying the above conditions.
Proof.
Roughly speaking, the result of the lemma follows from the fact that when is then using the PDE (1.1) plus induction shows that at time zero is for , which is on account of the assumption that , and hence yields the uniform boundedness of the norm of at time zero.
More specifically, by repeated applications of the PDE (1.1) to express higher time derivatives in terms of , and their spatial derivatives, and applications of and to them, we obtain that, for , the leading-order term of is (\tfrac{1}{\delta}\mathcal{L}+\tfrac{1}{\varepsilon}\mathcal{M})^{k-1}\partial_{t}u{\big{|}_{t=0}}, which yields the estimate \varepsilon^{k}\|\partial_{t}^{k}u{\big{|}_{t=0}}\|_{{s_{0}+1-k}}\leq c\frac{\varepsilon^{k}}{\delta^{k-1}}. To see this note first that the assumptions on the initial data ensure that . Applying to (1.1), using the invertibility of to solve the result for , taking up to spatial derivatives of the result, and summing the norms of the results yields a formula for the norm of in terms of norms of products of spatial derivatives of lower-order time derivatives of . Note that coefficients such as can be estimated in the maximum norm in terms of and so may be pulled out of those norms.
For the case this yields an estimate of in terms of norms of products of the factors , , and and their spatial derivatives of order at most , with coefficients of size at most coming from the presence of in the time derivative of (1.1). Since all those factors are bounded in at time zero, and is an algebra, this yields the estimate \|u_{tt}{\big{|}_{t=0}}\|_{{s_{0}-1}}\leq\frac{c}{\delta}.
The analogous expressions for with include factors of and possibly higher time derivatives, plus their spatial derivatives. Although and higher-order time derivatives of do not belong to at time zero, the resulting expressions could be estimated by the method used in the proof of Theorem 3.6 to estimate similar expressions. However, it is simpler to use finite induction to express higher-order time derivatives in terms of and . Since the time derivative in (1.1) is expressed in terms of expressions involving at most one spatial derivative, this again yields an estimate in terms of norms of products of the factors , , and their spatial derivatives of order at most , plus time and spatial derivatives of of order at most , this time with coefficients of size at most since equation (1.1) is used at most times to express time derivatives in terms of spatial derivatives. Note that wherever and its spatial derivatives occur the time derivative is left unaltered rather than using (1.1) to express in terms of , because is at time zero but the individual terms on the right side of (1.1) may not be. This yields the estimates \|\partial_{t}^{k}u{\big{|}_{t=0}}\|_{{s_{0}+1-k}}\leq\frac{c}{\delta^{k-1}}. As indicated at the beginning of the proof, these estimates together with assumption (3.8) show that the norm of is uniformly bounded at time zero. ∎∎
The well-preparedness condition (3.5) can be achieved, for example, by using initial data of the form
[TABLE]
for some nonnegative integer , with the belonging to and bounded in that space uniformly in and . In fact, since
[TABLE]
in view of the scaling assumption (1.2), condition (3.5) will hold provided that
[TABLE]
For example, the well-preparedness condition holds when and , or when , equality holds in (1.3), , and
[TABLE]
When the ranges of and overlap, the condition (3.11) allows more general initial data than would be obtained by requiring that each side of those equations vanish separately.
3.3. Theorem and proof
Theorem 3.6**.**
Under the assumptions of Lemma 3.5, there exists a constant depending only on the quantities that in that lemma depends on, such that for all and satisfying (3.8) the solution of the initial-value problem (1.1), exists on and satisfies .
Proof.
The local existence and continuation theorems ([Maj84, Ch 2., Theorems 2.1–2.2]) mentioned in Remark 3.1 ensure that the solution of the initial-value problem exists on some time interval that might depend on and , and will continue to exist for a time independent of those small parameters provided that it satisfies an estimate independent of them. Hence it suffices to prove such an estimate. Moreover, although the norm used in the estimates below depends on the solution being estimated, condition (3.3) ensures that the resulting estimate will indeed be uniform.
The estimates that will be derived are similar to standard energy estimates for solutions of symmetric hyperbolic systems but require keeping track of the powers of and that appear in those estimates for the system (1.1).
Applying with and to (1.1), taking the inner product with , integrating over the spatial variables, integrating by parts in the terms that involve undifferentiated, noting that the terms involving or drop out on account of the anti-symmetry of those operators, summing over all satisfying the above-mentioned condition, and multiplying the result by the weight yields
[TABLE]
where the inequality is obtained by pulling out from the first integral in maximum norm, and breaking the second integral into several parts and using the Cauchy-Schwartz inequality in each of them.
Since will be differentiated at least once when it appears in any commutator term on the right side of the inequality in (3.12), which yields at least one power of , the power of in every term appearing on the right side of the inequality in (3.12) is at least as large as the total number of time derivatives in that term. By the definition of the norm plus the smoothness assumption on , this implies that in order to bound the right side of the inequality in (3.12) by a continuous function of it suffices to bound all the terms there by a continuous function of after replacing by and replacing by the identity matrix.
The condition on ensures that and are bounded by a constant times and , respectively, and those norms are each bounded by . By the smoothness of the , for some continuous function . This yields the desired estimate for the entire first term on the right side of the inequality in (3.12). The terms on the right side of the inequality in (3.12) in which and the norm of appear are also so bounded in view of the assumptions of those functions.
There remains to estimate only the terms on the right side of the inequality in (3.12) that involve commutators. Since the factor multiplying the norms of the commutators is one of the terms in , only the norms of the commutator terms themselves must be estimated. We can pull out in the norm any factor such as that depends only on , and without derivatives, and the assumptions on the various coefficients ensure that each factor so pulled out is bounded by a continuous function of and hence by a continuous function of . Since the presence of the commutator ensures that at least one derivative will be applied to the function appearing in the commutator, the terms arising from the commutators that remain inside the norms all take the form
[TABLE]
where , , and . If for some then only one derivative is applied to the other factor, so that factor can be pulled out in norm and estimated by or , both of which appear in . After pulling out that factor the integral becomes , which is bounded by , which also appears in . Otherwise for all , and by using the multiple-factor version of Hölder’s inequality we will bound the integral in (3.13) by
[TABLE]
where the exponents , which will be chosen later, must satisfy
[TABLE]
The integrals in (3.14) will then be bounded via the Gagliardo-Nirenberg inequality (e.g., [Fri76, p. 24])
[TABLE]
in which the parameters must satisfy , , and , where as usual is the spatial dimension. Although is actually allowed to equal the endpoint value for many values of the other parameters, we will avoid that value in order to obtain a unified proof. The inequality constraint on will hold provided that . In order to estimate the integrals in (3.14) we apply (3.16) with , so we will let , since that is the highest Sobolev norm of that is bounded by . Since we only use (3.14) when for all , the condition will indeed hold. Since the norm of appearing in (3.14) is the norm, in (3.16) equals . Substituting in these values and multiplying everywhere by two turns the inequality constraint on into the inequality constraint
[TABLE]
on . We now show that it is possible to choose the such that both (3.15) and (3.17) hold.
Since , the interval to which is restricted by (3.17) is nonempty. Since and , the lower limit in (3.17) is negative iff . The inequality in (3.15) is equivalent to
[TABLE]
Combining (3.18) with (3.17) yields
[TABLE]
where for simplicity we ignore the possibility , which will not be needed. Every value of satisfying (3.19) is allowed by both (3.17) and (3.18), so it suffices to show that we can choose values in the intervals in (3.19) that sum to one. That is possible iff the sum of the lower values there is less than one and the sum of the upper values is at least one. The latter condition holds trivially, and as noted above the second expression inside the in (3.19) is negative iff , so it suffices to show that
[TABLE]
Let denote the number of values of for which . If then the sum on the right side of (3.20) vanishes, so that condition indeed holds. When then (3.20) can be written more explicitly as
[TABLE]
Condition (3.21) can be rewritten as
[TABLE]
Since by assumption, , and , the left side of (3.22) is non-negative, and the right side there is non-positive. Moreover, since , if then exists an for which , and in that case the fact that implies that either either or . This shows that either the left side of (3.22) is strictly positive or the right side there is strictly negative, and hence that inequality indeed holds.
Summing over the estimates that we have obtained shows that
[TABLE]
for some continuous function .
Finally, differentiating (1.1) with respect to , taking the inner product of the result with , integrating over the spatial variables, integrating by parts in the terms that involve undifferentiated, and noting that the terms involving or drop out on account of the anti-symmetry of those operators yields
[TABLE]
where the first inequality follows in similar fashion to (3.12) and the second from Assumption 3.3 plus the definition of the norm. Adding (3.24) to (3.23) yields the uniform estimate
[TABLE]
for some continuous function . By Lemma 3.5, is bounded uniformly in and by at time zero, so the differential inequality (3.24) shows that there is a fixed positive constant such that . ∎∎
Remark 3.7**.**
- (1)
In the standard energy estimates for spatial derivatives of solutions of systems without large terms and of systems satisfying Klainerman-Majda balance, integrals of the form (3.14) not containing time derivatives are estimated using the Gagliardo-Nirenberg inequality
[TABLE]
with , , , and instead of (3.16). However, it is not possible to use (3.26) to estimate integrals involving second and higher time derivatives, because the boundedness of does not imply even an -dependent bound for when . 2. (2)
The special case of (3.14) and (3.16) in which , , , , , and is either two or three so that was used previously in **[Che14, §4.1 and Appendix]**. 3. (3)
The expression appears in the estimates for a purely spatial derivative of , arising from the commutator term . When the spatial derivative terms in the PDE are at most then substituting for from the PDE yields a spatial derivative term of order one. Making this substitution allows spatial derivatives to be estimated without requiring estimates of time derivatives, both for systems without large terms and in the Klainerman-Majda theory. However, for the PDE (1.1) with the scaling (1.2) this procedure cannot be used because it yields terms of order , which is large. It is therefore necessary to leave the term on the right side of the energy estimates for spatial derivatives of , and this necessitates estimating time derivatives as well. Similarly, two-scale systems for which depends on rather than also require estimates of time derivatives **[BK82, Sch88]**. 4. (4)
In similar fashion, a term containing appears in estimates for a spatial derivative of on account of the commutator term . Assuming that is bounded initially but is large at time zero, this prevents us from obtaining an unweighted estimate for spatial derivatives of . The reason we do obtain an unweighted estimate for itself is that the commutator term does not yield any second time derivative. 5. (5)
The bound (3.8) on how fast can tend to zero compared to is only needed to ensure that the norm of the solution is uniformly bounded at time zero. The proof of Theorem 3.6 therefore also yields uniform bounds for a uniform time in the case when the time derivatives of the solution through order are uniformly bounded at time zero, without the need for assumption (3.8) and without using weights of powers of in the norms. In particular, taking and letting yields a proof for arbitrary dimensions of the uniform existence theorem stated in **[BK82]** but only proven there in the case , for which no Gagliardo-Nirenberg estimates are needed.
4. Convergence
4.1. A finite-dimensional perturbation result
We begin with a result on perturbations of self-adjoint matrices , where is a small parameter, and is a positive integer. The result will be used in the proof of the convergence theorem in Subsection 4.2 , where and will stand for the Fourier symbols of operators and respectively. The result says that there is an orthogonal projection that commutes with , on whose range is bounded uniformly and has a limit as , and on whose null space is bounded from below by a constant times and has a finite expansion in inverse powers of .
Lemma 4.1**.**
Define , where and are operators on a finite dimensional inner-product space that are either both self-adjoint or both skew-adjoint, is a small parameter, and is a positive integer. Then
- (1)
There exists an orthogonal projection operator that commutes with for , is analytic in for real , and satisfies
[TABLE]
for , where is the norm on the space and and the are positive constants. 2. (2)
For there exist commuting orthogonal projection operators such that the ranges of the complementary projections are mutually orthogonal subspaces, and
[TABLE] 3. (3)
The are the orthogonal projection operators onto the null spaces of operators , which are are determined from via the reduction process of **[Kat82, §II.2.3]**: Specifically, after modifying the notation to facilitate repeated reductions, the are the first terms in the expansions
[TABLE]
Here for , is the orthogonal projection onto the direct sum of the eigenspaces of of all eigenvalues of order , and . The are all self-adjoint when and are self-adjoint, and are all skew-adjoint when and are skew-adjoint. 4. (4)
[TABLE] 5. (5)
The operator is given, and for ,
[TABLE]
where , and denotes the pseudo-inverse of the matrix , defined by .
Proof.
If both and are skew-adjoint then multiplying both of them by makes them self-adjoint without affecting the projections, so we may assume that they are self-adjoint. Moreover, as noted in [Kat82, §II.6.1], the reduction process preserves self-adjointness and so may be continued without limitation, since the nilpotent factors of the general case are absent. In particular, by [Kat82, Theorem 6.1 in §II.6.1] the eigenvalues of and the projection operators are all analytic for real . As in [Kat82, §II.1.3] let denote the resolvent wherever is not an eigenvalue of . In view of the analyticity of the eigenvalues, [Kat82, (1.16) in §II.1.4] shows that for sufficiently small the operator
[TABLE]
is the orthogonal projection onto the direct sum of the eigenspaces of the eigenvalues of of size at most , and is the orthogonal projection onto the direct sum of the eigenspaces of the eigenvalues of of size at least . These estimates show that (4.1)–(4.2) hold.
We carry out the reduction process of [Kat82, §II.2.3] while choosing the unperturbed eigenvalue zero at every stage. However, we do not want to include the range of when considering the zero eigenspace of since that subspace has already been accounted for at previous stages of the reduction process. For this reason we replace the factor , which would appear in (4.4) if the corresponding formula [Kat82, (2.37) in §II.2.3] were simply rewritten in our notation, with . This corresponds to the suggestion in [Kat82, §II.2.3] to add a constant multiple of to but without the need to modify that operator. This procedure yields (4.4).
Since , after multiplication by if necessary, is self-adjoint for all , formula (4.4) implies that all the are then also self-adjoint after that multiplication has been done if necessary. If at any stage of the reduction process the new unperturbed operator does not have any zero eigenspace except for the range of , then and hence also is identically zero, and if then is simply the zero operator for .
By the construction of the reduction process, is the orthogonal projection onto the direct sum of the eigenspaces of whose eigenvalues are of size . Since the eigenvalues for different values of are distinct for small enough , and is self-adjoint, the ranges of for different values of are orthogonal to each other. This implies that the for different commute with each other, and hence so do the . Since is the orthogonal projection onto the union over of the eigenspaces of whose eigenvalues are of size , and those eigenspaces are orthogonal for distinct ,
[TABLE]
In addition, since the project onto mutually orthogonal subspaces, for , which implies that
[TABLE]
Since the eigenvalues of size of are the perturbations of the nonzero eigenvalues of , the continuity of the projections shows that as tends to zero the orthogonal projection onto the direct sum of the eigenspaces of eigenvalues of that are tends to the orthogonal projection onto the direct sum of the eigenspaces of eigenvalues of that are nonzero. This shows that
[TABLE]
in the strong (finite-dimensional) operator topology, which is isometric to a suitably normed matrix space. Therefore, taking the limit of (4.8)–(4.9) and rearranging yields (4.3). Taking the limit of the identities yields , and the orthogonality of the ranges of imply the orthogonality of the ranges of . This also shows that is the orthogonal projection onto the null space of , where the are the first terms in the expansions (4.4).
Since is the orthogonal projection onto the direct sum of the eigenspaces of that are or , continuing the reduction process one more step yields (4.5).
The formulas for the are obtained by using recursively formula [Kat82, (2.18) in §II.2.2], which in our notation becomes, for the case here in which there are no nilpotents,
[TABLE]
In particular, for and only the term with is present in the outer sum in (4.11), and the inner sum then contains only the case where and . Using (4.12), this yields (4.6). An analogous but longer calculation yields (4.7). ∎∎
Remark 4.2**.**
- (1)
Although for , may be nonzero for arbitrarily large values of when . 2. (2)
Formula (4.11) shows that in order to calculate it is necessary to first calculate and , while in order to calculate it would be necessary to first calculate for and then for .
Example 4.3**.**
- (1)
In the application of Lemma 4.1 to the convergence theorem the operators and will be individual Fourier modes of the operators and from (1.1). For example, if and then and for some fixed values of and . When the projection onto the null space of is , and formulas (4.6) and (4.7) yield and since so . If then and each have one nonzero eigenvalue so the fact that the matrices are of size implies that for , while if then for . When but is nonzero then , , , , and for , while when both and vanish then, for all , and . 2. (2)
The operators and in (1.1) are allowed to have order zero, i.e., to be simply multiplication by fixed matrices, and then the operators in the lemma are simply the same operators. For example,
[TABLE]
For these operators,
[TABLE]
When then all eigenvalues of have been accounted for, so for . On the other hand, when then zero is an eigenvalue of with multiplicity two for all (with the eigenvectors being and when ), so for .
4.2. Theorem and proof
The following projections and operator will appear in the statement and proof of the convergence theorem. We assume that either (1.6) or (1.7) holds for some integer .
Let and be operators satisfying the conditions of Assumption 3.3. Let denote the Fourier transform of on or and let denote the corresponding inverse Fourier transform of . Since and are constant-coefficient operators there exist functions and such that and .
Definition 4.4**.**
For any , let and denote the projections and , respectively, from Lemma 4.1, where when (1.6) holds or when (1.7) holds, , and . Define the projection by , and the projection by . In addition, when (1.6) holds then let , where is the constant from (1.6) and is from Lemma 4.1 with , , and as mentioned above, and define the operator by . However, when (1.7) holds then define .
Remark 4.5**.**
Since is an orthogonal projection for each and hence bounded by one, is an orthogonal projection on and a bounded operator on for all . In contrast, although is a bounded operator for each the operator may be unbounded. By Lemma 4.1, is skew-adjoint for each so is anti-symmetric.
Theorem 4.6**.**
Assume that the conditions of Theorem 3.6 hold, that and tend to zero while obeying either (1.6) or (1.7) for some integer , and that converges in to in that limit.
Then the solution of the initial-value problem (1.1), converges to the unique solution belonging to of
[TABLE]
where is the orthogonal projection operator from Definition 4.4 and is the operator defined there.
Proof.
The uniform bound for the norm of the solution of (1.1), proven in Theorem 3.6 shows that , where and are as in Theorem 3.6. By Ascoli’s theorem plus the weak- compactness of , for every sequence of values of and tending to zero while satisfying (3.8) there is a subsequence converging weak- in and strongly in to a limit in . In particular, this convergence together with the assumption on the convergence of the initial data show that (4.16) holds.
By interpolation between Sobolev spaces, the convergence and bounds obtained so far imply that the subsequence also converges to in for any , and hence also in . This yields the convergence in at least of to .
Now apply the projection from Definition 4.4 with to the PDE (1.1), which yields
[TABLE]
As noted above, the expression in brackets on the right side of (4.17) converges in as and tends to zero in the manner stated in the theorem. Since tends to zero in that limit, the projection converges in the strong operator topology to in that limit since the Fourier transform of the former is uniformly bounded and converges pointwise to the Fourier transform of the latter, so for any , (or that expression with the integral replaced by a sum if the spatial domain is periodic) tends to zero by (4.10), the Dominated Convergence Theorem and the fact that orthogonal projection operators do not increase vector length. Hence the entire right side of (4.17) converges in the above limit to . This implies that the left side of (4.17) also converges.
When (1.6) holds then that relation plus the definition from Definition 4.4 imply that . Hence Lemma 4.1 shows that
[TABLE]
Although the Fourier transform of may be unbounded as a function of the Fourier transform variable, (4.18) together with the convergence of to shows that the Fourier transform of the left side of (4.17) converges pointwise to the Fourier transform of . The fact that that left side is known to converge in implies that its Fourier transform also converges in . Since the pointwise and limits of a sequence of functions must coincide when both exist, the Fourier transform of the left side of (4.17) tends in to the Fourier transform of , and hence that left side tends to . The reduction process also shows that the Fourier transform of is in the image of for each , so rearranging the limit of (4.17) yields (4.14). When (1.7) holds instead of (1.6) then that relation plus the definition from Definition 4.4 imply that , so (4.18) holds with replaced by zero, and again leads to (4.14) but with .
Now define . From the Fourier transform of (1.1), (1.6) or (1.7), and Lemma 4.1,
[TABLE]
where by (4.2) when (1.6) holds, and by the definition of plus (4.2) and the definition of in terms of when (1.7) holds. In the former case clearly tends to zero with , and in the latter case tends to zero by (1.7), i.e., in either case as and tend to zero. Since
[TABLE]
taking the limit of (4.19) as and tend to zero while satisfying (1.6) or (1.7) shows that , which implies (4.15).
To show that a solution of the given smoothness of (4.14)–(4.16) is unique, let be the projection onto the Fourier modes for which is bounded by . Since is finite for each , the limit as of is the identity operator. Since is a projection onto Fourier modes it commutes with , and . Hence, for any ,
[TABLE]
by (4.15) plus the antisymmetry of . Taking the difference of (4.14) for two solutions and , defining , taking the inner product of the result with and letting tend to infinity therefore yields
[TABLE]
where
[TABLE]
Since the final expression in (4.20) looks like the estimate for a symmetric hyperbolic system, we obtain
[TABLE]
for some and depending on the norms of and and the constant from (3.3). Estimate (4.21) plus the initial condition (4.16) imply that
[TABLE]
which implies that , i.e., , yielding uniqueness.
As usual, the uniqueness of the limit implies that convergence holds as and tend to zero while satisfying (1.6) or (1.7) without restricting to a subsequence. ∎∎
Example 4.7**.**
- (1)
Consider the PDE
[TABLE]
The relationship does not satisfy (1.3) in dimension two. Nevertheless, as noted in the introduction, the fact that the coefficient matrix of the time derivatives does not depend on or implies that solutions of (4.22) satisfy uniform bounds. Let be a function whose gradient belongs to , and take the initial data to be and . Then and , i.e., the initial time derivative is bounded. Since the PDE is linear with constant coefficients, it is convenient to express the limit equation in Fourier space. By part 1 of Example 4.3, when then the limit is , , while for but the limit is and for the limit is . The initial data for the limit are and . When and are both nonzero the solution of the limit equation is and
[TABLE]
while when then the limit is . When the spatial domain is then is unbounded but when the domain is then it is bounded since on the set where it is nonzero. Even when the spatial domain is , the fact that contains a factor of ensures that is bounded, but will be unbounded if . The limit solution (4.23), which implies the limit equation satisfied by , can be verified by solving the equation for and exactly for . This yields
[TABLE]
whose limit as indeed yields (4.23). 2. (2)
Adding the term to (4.22) changes the limit equation for to . If is irrational but well-approximated by rationals then the term may not be bounded by as that expression tends to infinity, even in the periodic case, so may not be a bounded operator from to . 3. (3)
Consider the PDE , where and and are the matrices discussed in Part 2 of Example 4.3. Since the choice of the relationship between and makes in (1.6) equal two and hence in Definition 4.4 equal three, the projection is orthogonal to the non-zero eigenspace of as well as those of and . The formula for in Part 2 of Example 4.3 therefore shows that when and then the limit equation is simply while when but then the limit equation is that the first three components of vanish and of its last two components equal zero. This shows that even the number of nonzero components of the limit cannot be determined simply by looking at the number of components that do not contain large terms nor even by first eliminating all components having terms of order and then eliminating those remaining components having terms of order not coming from components already eliminated, which works for the system (4.22).
5. Acknowledgements
Ju is supported by the NSFC (Grants No.11571046, 11471028, 11671225). Cheng and Ju are supported by the UK Royal Society “International Exchanges” scheme (Award No. IE150886). Ju and Schochet are supported by the ISF-NSFC joint research program (NSFC Grant No. 11761141008 and ISF Grant No. 2519/17). The authors thank an anonymous referee for comments that lead to improvements in the exposition of the paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BK 82] G. Browning and H.-O. Kreiss. Problems with different time scales for nonlinear partial differential equations. SIAM J. Appl. Math. , 42(4):704–718, 1982.
- 2[Che 14] B. Cheng. Improved accuracy of incompressible approximation of compressible Euler equations. SIAM J. Math. Anal. , 46(6):3838–3864, 2014.
- 3[Fri 76] A. Friedman. Partial Differential Equations . Krieger, Huntington, NY, 1976.
- 4[Gal 98] I. Gallagher. Applications of Schochet’s methods to parabolic equations. J. Math. Pures Appl. (9) , 77(10):989–1054, 1998.
- 5[Gre 97] E. Grenier. Pseudo-differential energy estimates of singular perturbations. Comm. Pure Appl. Math. , 50(9):821–865, 1997.
- 6[Kat 82] T. Kato. A short introduction to perturbation theory for linear operators . Springer-Verlag, New York, 1982.
- 7[KM 81] S. Klainerman and A. Majda. Singular limits of quasilinear hyperbolic systems with large parameters and the incompressible limit of compressible fluids. Commun. Pure Appl. Math , 34:481–524, 1981.
- 8[KM 82] S. Klainerman and A. Majda. Compressible and incompressible fluids. Commun. Pure Appl. Math. , 35:629–653, 1982.
