Well-posedness and Critical Index Set of the Cauchy Problem for the Coupled KdV-KdV Systems on $\mathbb{T}$
Xin Yang, Bing-Yu Zhang

TL;DR
This paper investigates the well-posedness of coupled KdV-KdV systems on the torus, identifying critical index sets for different function spaces using number theory, revealing differences from the real line case.
Contribution
It determines the critical index sets for the well-posedness of coupled KdV systems on the torus, linking them to Diophantine approximation and contrasting with the real line case.
Findings
Critical index sets are explicitly characterized for various spaces.
Number theory techniques are used to identify these sets.
Differences from the real line case are highlighted.
Abstract
Studied in this paper is the well-posedness of the Cauchy problem for the coupled KdV-KdV systems \[ u_t+a_1u_{xxx} = c_{11}uu_x+c_{12}vv_x+d_{11}u_{x}v+d_{12}uv_{x}, \quad u(x,0)= u_0(x) \] \[ v_t+a_2v_{xxx}= c_{21}uu_x+c_{22}vv_x +d_{21}u_{x}v+d_{22}uv_{x}, \quad v(x,0)=v_0(x)\] posed on the torus in the spaces \[ {\cal H}^s_1:=H^s_0 (\mathbb{T})\times H^s_0 (\mathbb{T}), \quad {\cal H}^s_2:=H^s_0 (\mathbb{T})\times H^s(\mathbb{T}), \quad {\cal H}^s_3:=H^s (\mathbb{T})\times H^s_0 (\mathbb{T}), \quad {\cal H}^s_4:=H^s (\mathbb{T})\times H^s (\mathbb{T}).\] For , it is shown that for given , , and , there exists a unique , called the critical index, such that the system is analytically well-posed in for while the bilinear estimate, the key for the proof of the analytical…
| , | |||||
|---|---|---|---|---|---|
| (D1) | |||||
| (D2) | |||||
| (ND1) | |||||
| (ND2) |
| , | () | ||
|---|---|---|---|
| (D1) | |||
| (D2) with |
| , | |||
|---|---|---|---|
| (ND1) () | |||
| (ND2) | () |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Physics Problems · advanced mathematical theories · Advanced Differential Equations and Dynamical Systems
\floatsetup
[table]capposition=bottom \newfloatcommandcapbtabboxtable[][\FBwidth]
Well-posedness and Critical Index Set of the Cauchy Problem for the Coupled KdV-KdV Systems on
Xin Yang and Bing-Yu Zhang
Abstract
Studied in this paper is the well-posedness of the Cauchy problem for the coupled KdV-KdV systems
[TABLE]
posed on the periodic domain in the following four spaces
[TABLE]
The coefficients are assumed to satisfy and .
Fix . Then for any coefficients , , and , it is shown that there exists a critical index such that the system (0.1) is analytically locally well-posed in if but weakly analytically ill-posed if . Viewing as a function of the coefficients, its range is defined to be the critical index set for the analytical well-posedness of (0.1) in .
By investigating some properties of the irrationality exponents of the real numbers and by establishing some sharp bilinear estimates in non-divergence form, we manage to identify {\cal C}_{1}=\left\{-\frac{1}{2},\infty\right\}\bigcup\big{[}\frac{1}{2},1\big{]} and {\cal C}_{q}=\left\{-\frac{1}{2},-\frac{1}{4},\infty\right\}\bigcup\big{[}\frac{1}{2},1\big{]} for . In particular, these sets contain an open interval . This is in sharp contrast to the case in which the critical index set for the analytical well-posedness of (0.1) in the space consists of exactly four numbers:
††footnotetext: 2010 Mathematics Subject Classification. 35Q53; 35G55; 35L56; 35D30; 11J72; 11J04.††footnotetext: Key words and phrases. Well-posedness, KdV-KdV systems, Fourier restriction spaces, Bilinear estimates, Diophantine approximation, Irrationality exponents.
1 Introduction
1.1 cKdV systems and well-posedness
This paper studies the Cauchy problem for the coupled KdV-KdV (cKdV) systems posed on the periodic domain in the following general form:
[TABLE]
where are real constant matrices, and are real-valued functions of the variables and . It is assumed that there exists an invertible real matrix such that with . By regarding as the new unknown functions (still denoted by and ), it reduces to
[TABLE]
or equivalently,
[TABLE]
which is called in divergence form if and , and in non-divergence form otherwise.
Listed below are a few specializations of (1.1) appeared in the literature.
- •
Majda-Biello system:
[TABLE]
where . This system was proposed by Majda and Biello in [36].
- •
Hirota-Satsuma system:
[TABLE]
where . This system is in non-divergence form and was proposed by Hirota-Satsuma in [18].
- •
Gear-Grimshaw system:
[TABLE]
where and . When , this system is a special case of (1.1) with
[TABLE]
Note that in (1.7) is diagonalizable over for any and . Moreover, the eigenvalues of are nonzero unless . So (1.6) can be reduced to the form (1.3) as long as . This system was derived by Gear-Grimshaw in [16] (also see [6] for the physical explanation).
We are mainly concerned with the well-posedness of the Cauchy problem for the system (1.3) in some scaled Banach spaces .
Definition 1.1**.**
For any integer , the Cauchy problem of the system (1.3) is said to be -locally well-posed (LWP) in the space if for any , there exists such that
- (a)
for any with , (1.3) admits a unique solution in the space satisfying the auxiliary condition , where is an auxiliary metric space;
- (b)
the corresponding solution map from the initial data to the solution : , is smooth from to .
Similarly, if the solution map is real analytic (or uniformly continuous), then the system (1.3) is said to be analytically (or uniformly continuously) LWP in .
In the literature, the well-posedness in Definition 1.1 is called conditional (cf. [26, 8] and the references therein). If the uniqueness holds in the space without the auxiliary space , then the well-posedness is said to be unconditional. On the other hand, the well-posedness in Definition 1.1 is called local, but if the lifespan can be specified independently of , then the system (1.3) is said to be globally well-posed (GWP). On the other hand, if the local well-posedness fails, then it is called ill-posed.
Definition 1.2**.**
For any integer , the Cauchy problem of the systems (1.3) is said to be ill-posed (IP) in the space if there exists such that for any , there does not exist a smooth solution map as defined in Definition 1.1. Similarly, if there does not exist a real analytic (or uniformly continuous) solution map, then (1.3) is said to be analytically (or uniformly continuously) IP in .
In the rest of the paper, for the convenience of notations, the analytically LWP, GWP and IP will be written as A-LWP, A-GWP and A-IP respectively. Similarly, the uniformly continuously LWP, GWP and IP will be written as U-LWP, U-GWP and U-IP respectively.
1.2 Literature review
For the single KdV equation
[TABLE]
posed on either or , the study of the its well-posedness began in the late 1960s with the work of Sjöberg [44, 45] and has culminated in the work of Killip and Visan[33]. Many significant breakthroughs were made in this study of more than fifty years (see e.g. Bona-Smith [7], Kato [22, 23, 24, 25], Constatin-Saut [15], Kenig-Ponce-Vega [27, 28, 29, 30, 31], Bourgain [9], Christ-Colliander-Tao [12], Colliander-Keel-Staffilani-Takaoka-Tao [13, 14], Kappeler -Topalov [21], Molinet [37, 38], Tao [46], Killip-Visan [33], to name a few).
Define and as the standard Sobolev spaces on and respectively. In addition, denote to be the collection of the functions in whose mean value is 0:
[TABLE]
The following theorem summarizes the known results on the well-posedness of (1.8).
Theorem 1.3** ([21, 33, 37, 38, 31, 17, 34, 13, 12, 9]).**
The Cauchy problem (1.8) is
- (1)
-GWP in both spaces and for any , and -IP for any ;
- (2)
A-GWP in the space for any , and U-IP for any ;
- (3)
A-GWP in the space for any , and U-IP for any ;
- (4)
U-IP in the space for any .
For the cKdV systems (1.3) posed on , it has also been very well studied (see e.g. [1, 6, 40, 49] and the references therein). Of particular interest is a special class of (1.3) in the following form
[TABLE]
It was shown in [49] that it is A-LWP in the space for any if . This is rather surprising since even the single KdV equation (1.8) is -IP in for any (see [37]). Similar phenomenon was also found in the coupled mKdV systems [11].
For the cKdV (1.3) posed on in divergence form, its A-LWP has been thoroughly investigated by Oh [40]. As a specific application, [40] obtained sharp A-LWP results for the Majda-Biello system (1.4) for any . The most interesting case is when . In this case, (1.4) is A-LWP in for any , where is some constant only depending on . On the other hand, (1.4) is -IP if . So the regularity threshold for the A-LWP of (1.4) in the space is .
By contrast, the study for the cKdV (1.3) posed on in non-divergence form is far from complete. Angulo [2, 3] studied the Hirota-Satsuma system (1.5) and obtained the following results. When , (1.5) is A-LWP in for any and when , (1.5) is A-LWP in . ***Actually, the mean-zero condition on is not needed, although it is needed on . In fact, the mean-zero condition on is not applicable since the Hirota-Satsuma system (1.5) does not preserve the mean value of . As a result, it makes more sense to adjust the spaces and in [2, 3] to be and respectively. Comparing to the known results on the cKdV systems in divergence form, there still exist a lot of room to explore and to optimize the regularity thresholds for the non-divergence cases.
1.3 Bilinear estimates
In Theorem 1.3, the optimal regularity index -1 for either the case or the case was obtained by the inverse scattering method [21, 33]. This method relies on the complete integrability of the single KdV equation and usually leads to well-posedness. Another method, called the bilinear estimate approach, does not require the complete integrability property and usually gives the sharp analytical well-posedness. For example, by using this method, [31] discovered the best regularity index for the A-LWP in the case (and the case resp.) to be (and resp.).
Since most cKdV systems do not possess the complete integrability property, the bilinear estimate approach seems to be the most powerful tool to use. This method is based on the Fourier restriction spaces which was first introduced by Bourgain in [9].
Definition 1.4** ([9, 31]).**
Let (or ) and denote (or ) to be the dual group of . For any with , the Fourier restriction space is defined to be the completion of the Schwartz space with respect to the norm
[TABLE]
where and refers to the space-time Fourier transform of .
Then the following is the so-called bilinear estimate for the KdV equation (1.8).
[TABLE]
where in the case, and are also required to have zero means, i.e. for any .
Bourgain [9] first proved (1.11) for and in both and cases, which lead to the A-LWP of (1.8) in or in for any . Later, Kenig, Ponce and Vega[31] optimized this estimate.
Lemma 1.5** ([31]).**
- •
In the case, (1.11) holds for any with some , but fails for any and .
- •
In the case, (1.11) holds for any with , but fails if or .
For the cKdV systems (1.3), the situations are more complicated since four types of bilinear estimates, (1.12)–(1.15), need to be studied.
Divergence forms:
[TABLE]
Non-divergence forms:
[TABLE]
where (or ) refers to or , and (or ) represents or in (1.3). ††† Strictly speaking, in the case, the above bilinear estimates need to be considered in more complicated forms and some zero-mean conditions should be imposed on and . But in order to illustrate the idea more clearly, we drop these technical details and just take the forms which are consistent with the case. For precise bilinear estimates in the case, please see Corollary 1.16, Theorem 1.18 and Section 5.
There are various ways to write out these estimates, what we adopted here is to fix the first term on the right hand side to be .
In the case, all the sharp regularity indices have been found for the bilinear estimates (1.12)–(1.15).
Lemma 1.6** ([1, 40, 49]).**
Let and denote . Then for any bilinear estimate among (1.12)–(1.15), there exists a critical index , as shown in Table 1, such that this bilinear estimate holds for any with some , but fails for any and .
From the above table, we can see there are four critical indices \big{\{}-\frac{13}{12},\,-\frac{3}{4},\,0,\,\frac{3}{4}\big{\}} for the bilinear estimates associated to the cKdV systems posed on .
In the case, Oh[40] studied the Majda-Biello system[36] whose associated bilinear estimates are (1.16) and (1.17).
[TABLE]
Note that the above two estimates are special (essentially equivalent) cases of (1.12) and (1.13). In fact,
- •
(1.16) corresponds to (1.12) by setting , and .
- •
(1.17) corresponds to (1.13) by setting , , , and .
In both cases, the ratio between and is . The most interesting case is when , i.e. when r\in\big{[}\frac{1}{4},\infty\big{)}\setminus\{1\}. In this case, the key technical issue is to know how close the rational numbers can approximate a given real number. Oh[40] adopted the so-called minimal type index for any real number (see Definition 1.14) to capture this approximation. Let be the roots of the quadratic equation and let be the roots of the quadratic equation . More specifically, let
[TABLE]
Denote
[TABLE]
Lemma 1.7** ([40]).**
Let . Define and as in (1.19). Then for the bilinear estimate (1.16) or (1.17), there exists a critical index , as shown in Table 2, such that this bilinear estimate holds for any with , but fails for any and .
Based on this discovery, Oh further concludes the sharp A-LWP for the Majda-Biello system (1.4).
Theorem 1.8** ([40]).**
Let . Define
[TABLE]
Then (1.4) is A-LWP for any , but -IP for any , in the space .
1.4 Weakly analytical ill-posedness
For the single KdV equation (1.8), it follows from Theorem 1.3 and Lemma 1.5 that the critical indices for the bilinear estimate (1.11) match those for the A-LWP of (1.8) in both and cases. For Majda-Biello system (1.4), we can also see from Lemma 1.7 and Theorem 1.8 that (1.4) is A-LWP if and only if both the corresponding bilinear estimates (1.16) and (1.17) hold.
Inspired by these observations, it is conjectured that for any cKdV system (1.3), its regularity threshold for the A-LWP is equivalent to that for the associating bilinear estimates. In fact, by the standard argument in [9, 31, 13], once the associated bilinear estimates are justified for some index , then the A-LWP can also be established for the same . In other words, if a cKdV system is A-IP, then at least one of the corresponding bilinear estimates must fail. But whether the failure of the bilinear estimates implies the A-IP is not known in general. This motivates the following definition.
Definition 1.9**.**
The Cauchy problem of the single KdV equation (1.8) is said to be weakly A-IP if the bilinear estimate (1.11) fails. Similarly, the cKdV systems (1.3) are said to be weakly A-IP if at least one of the corresponding bilinear estimates fails.
Based on the above definition, the aforementioned conjecture is translated to the equivalence between A-IP and weakly A-IP.
Conjecture 1.10**.**
For any cKdV system (1.3), it is A-IP if and only if it is weakly A-IP.
Since A-IP always implies weakly A-IP, the conjecture further reduces to “weakly A-IP impies A-IP”. For the single KdV and the Majda-Biello system, this conjecture has been confirmed. But for some special cKdV, say (1.9) posed on , although the critical index for the corresponding bilinear estimate has been found to be in Lemma 1.6, it is still unknown if it is A-IP in for any .
1.5 Critical index set
Consider the cKdV systems (1.3) posed on . Denote .
Definition 1.11**.**
For any cKdV system (1.3) posed on with and either or does not vanish.
- •
If there exists , depending on , , and , such that (1.3) is A-LWP in the space for any , but (weakly) A-IP for any in , then this is called the (weakly) analytically critical index. If there does not exist such a critical index, then we define .
- •
Regarding the above critical index as a map: , then the range of this map, denoted as , is called the (weakly) analytically critical index set of (1.3) in .
In the following, we will write ”analytically critical index“ to be ”A-critical index“. Based on Definition 1.11, it follows from Lemma 1.6 that the weakly A-critical index set of (1.3) in is
[TABLE]
Now we consider the following (1.20) which is a variant of the single KdV equation.
[TABLE]
Similar to Definition 1.11, we can define the A-critical index set of (1.20) posed on or . For any , we can also define the -critical index set analogously. By some simple scaling computation, one can see that the values of and in (1.20) do not affect its well-posedness. So it follows from Theorem 1.3 that
- •
The -critical index set of (1.20) in or is .
- •
The A-critical index set of (1.20) is \big{\{}-\frac{3}{4}\big{\}} in the space , \big{\{}-\frac{1}{2}\big{\}} in the space , and in the space .
Two observations can be drawn from the above results. Firstly, unlike (1.20), the coefficients of the cKdV systems (1.3) do have an effect on the A-critical index. Secondly, if the problem is posed on , then the critical index can be different if the underlying spaces are different (e.g. cf. ).
The main purpose of this paper is to investigate the weakly A-critical index set of the cKdV systems (1.3) posed on . We will consider the following four spaces:
[TABLE]
Definition 1.12**.**
Let . We define to be the weakly A-critical index set of the cKdV systems (1.3) in the space .
From the results in Oh[40] on the Majda-Biello system (1.4), the weakly A-critical index in the space is
[TABLE]
where is some constant only depending on . So the weakly A-critical index set for (1.4) in the space is
[TABLE]
But what is the precise range of over the region ? This question was not answered in [40], we will find an answer in this paper.
1.6 Irrationality exponent
In order to find out the critical index set for (1.3) posed on , as we mentioned before, it is crucial to estimate how well the real number can be approximated by rational functions. In the number theory, such estimate is called the Diophantine approximation. ‡‡‡The interested readers are referred to S. Lang [35] and Y. Bugeaud [10] for a nice introduction to this subject.
One standard characterization is via the irrationality exponent.
Definition 1.13** (see e.g. [5]).**
A real number is said to be approximable with power if the inequality
[TABLE]
holds for infinitely many , and
[TABLE]
is called the irrationality exponent of .
For any rational number , . For any irrational number , , but exact value of is difficult to find in general. The basic properties of are collected in Proposition 3.1. Recalling the minimal type index in Oh’s paper [40], it is closely related to but is defined slightly different.
Definition 1.14** (see e.g. [4, 40]).**
A real number is said to be of type if there exists a positive constant such that the inequality
[TABLE]
holds for any , and
[TABLE]
is called the minimal type index of , where the infimum is understood as if the set \{\nu\in\mathbb{R}:\text{\rho\nu}\} is empty.
If , then it is easy to see and . If , then it will be shown in Proposition 3.5 that .
1.7 Main results
Recall the irrationality exponent in Definition 1.13. For , define
[TABLE]
The basic properties of are collected in Proposition 3.6.
For the Majda-Biello system (1.4) with , Oh[40] showed the critical regularity indexes for the associated bilinear estimates to be \min\big{\{}1,\frac{1}{2}+\frac{1}{2}\nu_{c}\big{\}} and \min\big{\{}1,\frac{1}{2}+\frac{1}{2}\nu_{d}\big{\}}, see Table 2. The first main result of this paper is to demonstrate these two indexes are actually the same.
Theorem 1.15**.**
Let and define and as in (1.19). Then and
[TABLE]
where is defined as in (1.26).
The key observation in the proof of this theorem is the invariance of the irrationality exponent under the reciprocal operation, that is , see Proposition 3.4.
Combining Theorem 1.15 with Lemma 1.7 (also see Oh[40]), we are able to write out the sharp regularity indexes for the bilinear estimates (1.12) and (1.13) in a more unified way.
Corollary 1.16**.**
Let and denote . Define as in (1.26) for . Then for the bilinear estimate (1.12) or (1.13), there exists a critical index , as shown in Table 3, such that this bilinear estimate holds for any with , but fails for any and .
Let be the range of : . Then it will be shown in Proposition 3.6 that \mathcal{U}=\big{[}\frac{1}{2},1\big{]}. As a result, we obtain the following conclusion (Recall that in (1.21)).
Corollary 1.17**.**
For the Majda-Biello systems (1.4) posed on with , its weakly A-critical index set in the space is \big{\{}-\frac{1}{2},\infty\big{\}}\bigcup\big{[}\frac{1}{2},1\big{]}.
Next, we study the cKdV systems (1.3) in non-divergence form. The main result is the following sharp bilinear estimates.
Theorem 1.18**.**
Let and denote . Define as in (1.26) for . Then for the bilinear estimate (1.14) or (1.15), there exists a critical index , as shown in Table 4, such that this bilinear estimate holds for any with , but fails for any and .
As a corollary, consider the Hirota-Satusma systems (1.5).
Corollary 1.19**.**
For the Hirota-Satsuma systems (1.5) posed on with and , its weakly A-critical index set in the space is \big{\{}-\frac{1}{4},\infty\big{\}}\bigcup\big{[}\frac{1}{2},1\big{]}.
The more detailed well-posedness results on the Hirota-Satsuma system are shown in the appendix. Based on the results in Table 3 and 4, we can actually study the well-posedness of cKdV systems (1.3) in the space for any . The following is a summary of their weakly A-critical index sets.
Theorem 1.20**.**
The weakly A-critical index sets in Definition 1.12 are
[TABLE]
Fix any . The above result enables us to provide a complete classification of the cKdV systems (1.3) in .
Theorem 1.21** (Classification of the systems (1.3)).**
Assume and either or does not vanish. Fix and define as in (1.21). Then the systems (1.3) are completely classified into a family of classes, each of which corresponds to a unique index such that any system in this class is A-LWP in the space if while it is weakly A-IP if .
1.8 Organization of the paper
The organization of the rest of the paper is as follows. In Section 2, we will present the definitions of the Fourier restriction spaces and the resonance functions. Section 3 is devoted to explore properties of the irrationality exponents and prove Theorem 1.15. Some linear estimates will be introduced in Section 4. Theorem 1.18 will be broken into Lemma 5.1, and 5.2 and Proposition 5.3 and 5.4 in Section 5. We will justify Lemma 5.1 and 5.2 in Section 6 and prove Proposition 5.3 and 5.4 in Section 7. Finally, Appendix A includes the analytical well-posedness results about the Hirota-Satsuma systems (1.5) which can be obtained as a corollary of Theorem 1.18.
2 Fourier restriction spaces on
To study the LWP of (1.3), we adopt the similar treatment as in [13] to deal with more general periodic problem posed on for and thus consider the system
[TABLE]
We use , and to denote the spatial, temporal and space-time Fourier transform respectively. However, when there is no confusion, we simply use to denote any of these three types of Fourier transforms. On the other hand, , and represent the corresponding inverse Fourier transforms.
The temporal Fourier transform and its inverse are defined standardly. The definitions of the spatial Fourier transform and its inverse are more complicated. Denote the frequency space corresponding to by \mathbb{Z}_{\lambda}=\big{\{}k\,\big{|}\,k=n/\lambda\,\,\,\text{for some n\in\mathbb{Z}}\big{\}}. The normalized counting measure on is defined by
[TABLE]
For any function on ,
[TABLE]
On the other hand, for any functon on ,
[TABLE]
Consequently, the norm for the Sobolev space is defined as
[TABLE]
where . The homogeneous subspace is defined as . Finally, the space-time Fourier transform and its inverse are defined as and .
For any and , consider
[TABLE]
The solution to (2.6) is given explicitly by
[TABLE]
with
[TABLE]
The following is a generalized version of (1.4) for the definition of the Fourier restriction spaces on .
Definition 2.1** ([9, 31, 13]).**
For any with and , the Fourier restriction space is defined to be the completion of the Schwartz space with respect to the norm
[TABLE]
where refers to the space-time Fourier transform of .
It has been pointed out in [31] that one needs to take for the periodic case. However, this space barely fails to be in C\big{(}\mathbb{R}_{t};H^{s}_{x}\big{)}. To ensure the continuity of the time flow of the solution, a smaller space will be used via the norm
[TABLE]
Since the second term has already dominated the norm of , it follows that Y^{\alpha}_{s,\lambda}\subseteq C\big{(}\mathbb{R}_{t};H^{s}_{x}\big{)}. The companion spaces via the norm (2.11) is then introduced to control the norm of the integral term from the Duhamel principle (see Lemma 4.1).
[TABLE]
For convenience, we will drop when it equals . That is, , and . Throughout this paper,
[TABLE]
Definition 2.2** ([46]).**
Let be a triple in . Define the resonance function associated to this triple by
[TABLE]
where is as defined in (2.8). The resonance set of is defined to be the zero set of , that is
[TABLE]
3 Proof of Theorem 1.15
We first collect some classical results about the irrationality exponent .
Proposition 3.1**.**
- (a)
If , then ; 2. (b)
If , then ; 3. (c)
If is an irrational algebraic number, then ; 4. (d)
For almost every , ; 5. (e)
The function maps onto .
Proof.
Part (a) and (b) are standard. Part (c) is the famous Thue-Siegel-Roth theorem [48, 43, 42]. Part (d) is the Khintchine theorem [32]. Part (e) was proved by Jarnik[19, 20] using the theory of continued fractions. ∎
Next, we present two technical lemmas concerning the irrationality exponent function .
Lemma 3.2**.**
If a real number is approximable with power , then there exist with the following two properties.
- (1)
* is an increasing positive sequence and ;* 2. (2)
For each , satisfies
[TABLE]
Proof.
The conclusion follows from Definition 1.13 and the observation that for any fixed , there are at most finitely many such that
[TABLE]
∎
Lemma 3.3**.**
Let with . Then for any , there exists a constant such that the inequality
[TABLE]
holds for any , where .
Proof.
Since , it follows from Definition 1.13 that there exist at most finitely many such that
[TABLE]
In addition, since is an irrational number, \big{|}\rho-\frac{m}{n}\big{|} is never zero. Therefore, by choosing a sufficiently small constant , (3.1) holds for any . ∎
Now some invariant properties of the irrationality exponent will be justified.
Proposition 3.4**.**
- (a)
For any and , . 2. (b)
For any and , . 3. (c)
For any , \mu\big{(}\frac{1}{\rho}\big{)}=\mu(\rho).
Proof.
As part (a) and (b) are obvious, we will only prove part (c). First, by taking advantage of (a), we may assume . Then due to symmetry, it reduces to prove
[TABLE]
If , then it is trivial. So we further assume is an irrational number, which implies \mu\big{(}\frac{1}{\rho}\big{)}\geq 2. Then it suffices to show that if is approximable with some power , then is approximable with the power for any .
Let be approximable with some power and fix any . It follows from Lemma 3.2 that there exists a sequence such that , and
[TABLE]
Since and , it immediately yields that . In addition, for sufficiently large , we have . Hence, for any such , noticing
[TABLE]
we obtain
[TABLE]
Finally, since , when is large enough, . Therefore,
[TABLE]
Since the above inequality is valid for any large , is approximable with the power . ∎
Finally, we discuss the relation between the irrationality exponent in Definition 1.13 and the minimal type index in Definition 1.14.
Proposition 3.5**.**
- (a)
If , then and ; 2. (b)
If , then .
Proof.
Part (a) is obvious, so we will only prove part (b). Let . We will first show . If is approximable with power , then it follows from Lemma 3.2 that there exists a sequence such that , and
[TABLE]
As a result, for any , there does not exist such that
[TABLE]
holds for all . In other words, is not of type according to Definition 1.14. Therefore, . Sending and then taking supremum with respect to leads to .
Now if , then it follows from that . So in the following, we just assume and intend to show . According to Lemma 3.3, for any , there exists a constant such that
[TABLE]
holds for any . Hence, is of type , which implies . Sending yields . ∎
For , denote and as in (1.26). That is,
[TABLE]
The properties of can be derived from Proposition 3.1.
Proposition 3.6**.**
Let be given. Then
- (a)
* if ;* 2. (b)
* for any ;* 3. (c)
* if is an algebraic number and .* 4. (d)
* for almost every ;* 5. (e)
The range of over is .
Proof.
- (a)
When , it follows from Proposition 3.1(a) that . Therefore, . 2. (b)
This part is obviously due to the definition (1.26) for . 3. (c)
When is an algebraic number, is also an algebraic number. Now if , then it follows from Proposition 3.1(c) that . Hence, . 4. (d)
By Proposition 3.1(d), for almost every . Thus, for almost every . 5. (e)
Combining Proposition 3.1(e) with Proposition 3.4(a), we conclude the range of is . As a result, the range of is .
∎
Now we are ready to prove Theorem 1.15.
Proof of Theorem 1.15.
Let . Recall , where
[TABLE]
and recall , where
[TABLE]
Denote and . Then and it follows from (3.2) that . In addition, based on Proposition 3.1,
[TABLE]
Similarly, . On the other hand, noticing that and , so it follows from Proposition 3.4 that and . Hence,
[TABLE]
Then according to Proposition 3.5, . In particular, .
Next, we will justify (1.27). Denote
[TABLE]
Then it suffices to prove .
- •
Case 1: . In this case, , so it follows from Proposition 3.5 that . Therefore, . On the other hand, by Proposition 3.6(a), we also have .
- •
Case 2: . In this case, , so it follows from Proposition 3.5 that , which implies . Putting this into (3.4) yields
[TABLE]
Since , then Proposition 3.1(b) implies . Thus, we conclude from (3.2) that
[TABLE]
∎
4 Linear estimates
Let be a bump function supported on and on . We first present two linear estimates, one is for the solution to the homogeneous linear KdV equation (2.6), and another one is for the solution to the forced linear KdV equation (2.6) with the right hand side being instead of 0. Recall the notation as defined in (2.7).
Lemma 4.1**.**
There exists a constant which only depends on the bump function such that for any with and ,
[TABLE]
and
[TABLE]
The proof of the above lemma is almost the same as those for Lemma 7.1 and Lemma 7.2 in [13], so we omit it. Next, we provide two well-known embedding results.
Lemma 4.2**.**
There exists a universal constant such that for any , and for any function on ,
[TABLE]
The proof of this estimate can be found in ([39], Lemma 2.3.2).
Lemma 4.3**.**
Let . Then there exists a constant such that for any and for any function on ,
[TABLE]
When and , Lemma 4.3 was first proved by Bourgain in [9] for a version when the left hand side of (4.4) is localized in time. Then Tao removed such a restriction in ([46], Proposition 6.4). Later, a more elementary proof was provided by Oh in his online note [41]. Actually, similar method had been applied earlier to the Schrödinger equation (see e.g. [47], Proposition 2.13).
5 Bilinear estimates in non-divergence form
This section will present the rigorous version of the bilinear estimates (1.14) and (1.15) in non-divergence form. Meanwhile, Theorem 1.18 will be broken into Lemma 5.1, 5.2 and Proposition 5.3, 5.4 in more general settings. We denote as in (1.26) for any .
Lemma 5.1**.**
Let and with . Assume one of the conditions below is satisfied.
- (a)
, and ; 2. (b)
, and ; 3. (c)
, and ; 4. (d)
* with , and .*
Then there exist constants and such that
[TABLE]
for any and with .
Lemma 5.2**.**
Let and with . Assume one of the conditions below is satisfied.
- (a)
* and and ;* 2. (b)
, , and ; 3. (c)
, and ; 4. (d)
* with , and .*
Then there exist constants and such that
[TABLE]
for any and .
Next, we will address the sharpness of Lemma 5.1 and Lemma 5.2. Without loss of generality, we take .
Proposition 5.3**.**
The bilinear estimate
[TABLE]
fails for any (and hence (5.1) fails by taking ) under any of the following conditions.
- (a)
* and ;*
- (b)
* and ;*
- (c)
* and ;*
- (d)
, , but without the restriction in (5.3).
Proposition 5.4**.**
The bilinear estimate
[TABLE]
fails for any (and hence (5.2) fails by taking ) under any of the following conditions.
- (a)
* and ;*
- (b)
, and ;
- (c)
, , but without the restriction ;
- (d)
* and .*
6 Proofs of the bilinear estimates
The goal of this section is to prove Lemma 5.1 and Lemma 5.2. Since their proofs are similar, we will only justify Lemma 5.2.
6.1 Idea of the proof
Without loss of generality, we consider the following simpler version of (5.2) with ,
[TABLE]
By duality and Plancherel identity, in order to verify (6.1), it suffices to prove (see [46] or Lemma 6.1)
[TABLE]
where , A=\Big{\{}(k_{1},k_{2},k_{3},\tau_{1},\tau_{2},\tau_{3})\in\mathbb{Z}^{3}\times\mathbb{R}^{3}:\sum\limits_{i=1}^{3}k_{i}=\sum\limits_{i=1}^{3}\tau_{i}=0\Big{\}} and
[TABLE]
where is as defined in (2.8).
In (6.2), the loss of the spatial derivative in the bilinear estimate (6.1) is reflected via the term and the gain of the time derivative is reflected via the term . How to compensate the loss of the spatial derivative from the gain of the time derivative is the key point. Denote
[TABLE]
Then we need to control by . Since , then . As a result, is decreasing in , which means the larger is, the more likely the bilinear estimate will hold. So the interest lies in the search for the smallest such that the bilinear estimate holds. Noticing that contains the time variable , a single alone can barely have any contributions to control . On the other hand, as ,
[TABLE]
is a function of only. Because of this, we define the resonance function as in Definition 2.2:
[TABLE]
Since , then it is easier to control by in the region where is large. The situation becomes more complicated near the region where vanishes. The zero set of is called the resonance set as in Definition 2.2.
By writing in (6.3) and simplifying,
[TABLE]
If , then . If , then can be rewritten as
[TABLE]
where
[TABLE]
The following is the classification of the roots of depending on the values of .
- (1)
: does not have real roots; 2. (2)
: has a unique root ; 3. (3)
but : has two roots, neither of which equals -1 or 0; 4. (4)
: has two roots -1 and 0.
Among the above four situations, Case (3) is the most interesting one, so we will first focus on this case. Assume r\in\big{(}\frac{1}{4},\infty\big{)}\setminus\{1\} and denote the two roots of as and , i.e.
[TABLE]
Define and as in (1.26). Then it follows from Proposition 3.4 that
[TABLE]
In addition, the resonance function can be written as
[TABLE]
As a result, the resonance set consists of three lines: , and . The most difficult estimate is near the line or . Without loss of generality, let us consider the region near the line . In this situation, \big{|}\frac{k_{1}}{k_{2}}-x_{2r}\big{|}\approx|x_{1r}-x_{2r}| which is a positive constant. Hence,
[TABLE]
In addition, when is very close to , we have . Consequently,
[TABLE]
On the other hand, noticing that both and are integers, the estimate of \big{|}x_{1r}-\frac{k_{1}}{k_{2}}\big{|} reduces to the problem of the Diophantine approximation of .
- •
If or if its irrationality exponent , then and there exist infinitely many such that
[TABLE]
So due to (6.9). Then in order to have , it follows from (6.10) that .
- •
If , then and it follows from Lemma 3.3 that for any and for any integers and ,
[TABLE]
So it follows from (6.9) that . Then in order to have , we need , that is
[TABLE]
The above argument explains why the critical index is when . Next, we will briefly discuss the rest cases (1), (2) and (4).
- •
When , the argument is similar to the above. The two roots of are the same: . So . Meanwhile, H_{2}(k_{1},k_{2},k_{3})=-3\alpha_{1}k_{2}^{3}\big{(}\frac{k_{1}}{k_{2}}+\frac{1}{2}\big{)}^{2}. So there exist infinitely many integer pairs such that , which implies . In order to have , it follows from (6.10) that .
- •
When , , so it follows from (6.5) that
[TABLE]
For , the worst situation is when and . In this situation, and . So in order to ensure , needs to be at least .
- •
When , . The worst situation occurs when and , which implies and . So in order to have , should be at least .
6.2 Auxiliary results
For any vector , we denote it as . For any ,
[TABLE]
and for any given ,
[TABLE]
Lemma 6.1**.**
Let , and for . The bilinear estimate
[TABLE]
holds if and only if the following two estimates hold,
[TABLE]
and
[TABLE]
where .
This lemma is essentially proved in [46, 13] by using duality, Plancherel identity and definition (2.11), so we omit the details here.
Lemma 6.2**.**
Let , and for . Then there exists a constant such that for any functions on ,
[TABLE]
where and M:=\max\big{\{}\langle L_{1}\rangle,\langle L_{2}\rangle,\langle L_{3}\rangle\big{\}}. In particular, if denotes the resonance function as defined in Definition 2.2, then
[TABLE]
This lemma is also standard (e.g. see [9, 13, 40]), but the statement is a little bit different from that in the literature, so we will briefly include a proof here for the convenience of the readers.
Proof of Lemma 6.2.
Since , it is obvious that , so it suffices to prove (6.17).
- •
Let’s first focus on the region where . In this region, . Define g_{1}=\mathcal{F}^{-1}\big{(}f_{1}(k_{1},\tau_{1})\big{)} and
[TABLE]
Then
[TABLE]
As a result, it follows from Lemma 4.3 and that
[TABLE]
- •
In the regions where or , the argument is similar, so (6.17) is justified.
∎
6.3 Proof of Lemma 5.2
As we have seen in Section 6.1, the main ideas in the proofs for part (a)–part (d) in Lemma 5.2 are analogous. So we will only carry out the detailed proof for part (d) which is the most technical case. The framework of the following proof is similar to that in [40].
Proof of Part (d).
First, we recall from Proposition 3.6(b) that . In addition, it follows from the assumption and Proposition 3.6(a) that . The case of is analogous to the case of , so we will just assume in the rest of the proof.
Fix and , according to Lemma 6.1, it remains to prove
[TABLE]
and
[TABLE]
where
[TABLE]
Since , it suffices to consider the case when is sufficiently close to . So we can just assume
[TABLE]
Next, we will prove (6.18) first and then briefly mention the proof for (6.19).
Proof of (6.18). First, in the region where , the integrand on the LHS of (6.18) vanishes, so we only need to focus on the region where . Recall the resonance function in (6.5):
[TABLE]
where is defined in (6.6). When , it follows from (6.7) that the two roots and of satisfy . Hence, there exists a constant d_{r}\in\big{(}0,\frac{1}{8}\big{)}, which only depends on , such that (see Figure 1)
[TABLE]
On the other hand, there exists a constant , which also only depends on , such that
[TABLE]
In the following, we denote
[TABLE]
Since |H_{2}(k_{1},k_{2},k_{3})|=\big{|}\sum\limits_{i=1}^{3}L_{i}\big{|}, then .
Case 1. \Big{|}\dfrac{k_{1}}{k_{2}}-x_{1r}\Big{|}\geq d_{r} and \Big{|}\dfrac{k_{1}}{k_{2}}-x_{2r}\Big{|}\geq d_{r}. In this case, it follows from (6.21) and (6.23) that
[TABLE]
So . In addition, implies that . Hence, . Recalling , so
[TABLE]
where the last inequality is due to Lemma 6.2. This verifies (6.18) since .
Case 2. \Big{|}\dfrac{k_{1}}{k_{2}}-x_{1r}\Big{|}<d_{r} or \Big{|}\dfrac{k_{1}}{k_{2}}-x_{2r}\Big{|}<d_{r}. Without loss of generality, it is assumed that \Big{|}\dfrac{k_{1}}{k_{2}}-x_{1r}\Big{|}<d_{r}. Then it can be seen from (6.22) or Figure 1 that and \Big{|}\dfrac{k_{1}}{k_{2}}-x_{2r}\Big{|}>3d_{r}. Meanwhile, if , then (6.18) is obviously true. So we will just assume . Thus, (6.18) reduces to
[TABLE]
Denote
[TABLE]
then . Next, we will split the range of \big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|} further.
Case 2.1. \frac{d_{r}}{\lambda|k_{2}|}\leq\big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|}<d_{r}. In this case,
[TABLE]
Then it follows from and that . Hence,
[TABLE]
Case 2.2. \big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|}<\frac{d_{r}}{\lambda|k_{2}|}.
Since , it follows from (1.26) that . In addition, by taking advantage of (6.8) and Lemma 3.3, there exists a constant such that
[TABLE]
Combining and (6.25) yields . Therefore,
[TABLE]
which implies
[TABLE]
Then we obtain
[TABLE]
So (6.24) reduces to
[TABLE]
- •
Case 2.2.1. . Then (6.28) reduces to
[TABLE]
For fixed , it follows from \big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|}<\frac{d_{r}}{\lambda|k_{2}|} that which is defined as follows.
[TABLE]
It is easily seen that the size is at most . Define
[TABLE]
Since , the LHS of (6.29) is bounded above by
[TABLE]
By the change of variable and the definition (2.2) for the measure , the above quantity is seen to be dominated by
[TABLE]
Since the size of the set is bounded by the constant , then by applying the Plancherel identity, we obtain
[TABLE]
Noticing , then it follows from the definition of and Holder’s inequality that
[TABLE]
In addition, it is not difficult to show that for any fixed ,
[TABLE]
Then plugging (6.34) and (6.35) into (6.33) yields
[TABLE]
Finally, by Holder’s inequality and Lemma 4.2,
[TABLE]
where the last inequality is due to (6.31).
- •
Case 2.2.2. or . The argument is similar to that in Case 2.2.1, therefore omitted.
Proof of (6.19). We basically reduce the proof of (6.19) to the following two cases. In different regions, we have the flexibility to choose which case to estimate.
- •
First, by Cauchy-Schwartz inequality and duality, the proof of (6.19) reduces to establish
[TABLE]
- •
Secondly, for fixed , if is restricted to some set , then it follows from Holder’s inequality in that
[TABLE]
So if we can prove
[TABLE]
then (6.19) will follow from (6.37), duality and (6.18).
If or or with , then we can apply the similar argument as that in the proof of (6.18) to justify (6.36).
If with , then we will justify (6.37). Similar to the proof of (6.18), the most delicate situation is when is very near the root or . Without loss of generality, we will only investigate the region where \big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|}<\frac{d_{r}}{\lambda|k_{2}|} in the rest argument (see the above Case 2.2).
Firstly, it follows from \big{|}\frac{k_{1}}{k_{2}}-x_{1r}\big{|}<\frac{d_{r}}{\lambda|k_{2}|} that
[TABLE]
So when is fixed, the choice of is at most . Secondly,
[TABLE]
As a result, and . So there exists a small constant \delta\in\big{(}0,\frac{1}{10}\big{)} such that . Fix this . For any , define
[TABLE]
Then . For any , define . For any , the choices of , as mentioned above, is at most . On the other hand, it follows from
[TABLE]
that . So \big{\langle}\eta+H_{2}(k_{1},k_{2},k_{3})\big{\rangle}\leq\delta M^{\frac{1}{100}}. As a result, \big{|}\Omega_{M}^{\delta}(k_{3})\big{|}\lesssim M^{\frac{1}{100}}. By the change of variable ,
[TABLE]
which verifies (6.37). ∎
7 Sharpness of the bilinear estimates
In this section we will prove Propositions 5.3 and 5.4 which justify the sharpness of the bilinear estimates in Lemmas 5.1 and 5.2. Since their proofs are similar, we will only prove Proposition 5.4. Throughout this section,
[TABLE]
The following lemma will be frequently used in this section.
Lemma 7.1**.**
Let be bounded regions such that , i.e.,
[TABLE]
Then
[TABLE]
Proof.
Rewriting the left hand side as
[TABLE]
then the conclusion follows from (7.1). ∎
7.1 Proof of Proposition 5.4
Similar to Lemma 6.1, (5.4) is equivalent to
[TABLE]
where
[TABLE]
The corresponding resonance function is given by (6.4):
[TABLE]
where . If , then can be rewritten as in (6.5):
[TABLE]
where .
Proof of Part (a).
Suppose there exist , and such that (7.2) holds.
- •
For large , define for , where
[TABLE]
Then . In addition, for any , , , and it follows from (7.4) that . Therefore, and we conclude from (7.2) that
[TABLE]
Noticing , applying Lemma 7.1 leads to . In other words,
[TABLE]
- •
Similarly, define for , where
[TABLE]
Then and by similar argument, we conclude
[TABLE]
(7.5) and (7.6) together yields , which contradicts to the assumption . In addition, when , has to be exactly . ∎
Proof of Part (b).
Under the additional assumption , (5.4) is equivalent to (7.2) with the additional restriction . When , writing , then . Define for , where
[TABLE]
Then . In addition, for any , , , and , which implies . Then it follows from (7.2) that
[TABLE]
Noticing , applying Lemma 7.1 leads to . ∎
Proof of Part (c).
Analogous to part (b) but without the restriction , so is not required to be zero anymore. So it is valid to define , where
[TABLE]
Then . In addition, for any , , , and , which implies . Then it follows from (7.2) that
[TABLE]
which implies due to Lemma 7.1. But this is impossible since can be arbitrarily large. ∎
Proof of Part (d).
The following proof is in the similar spirit as that in ([40], Proposition 3.9).
The proofs for the case and the case are analogous, so we will just assume . Suppose there exist and such that (5.4) holds. Then (7.2) holds for some constant . Since , the function has two roots
[TABLE]
In addition, and . When , the resonance function can be written as
[TABLE]
In the following, we will first show that and then derive a contradiction to .
For large positive , define for , where
[TABLE]
Then . In addition, for any , , , and
[TABLE]
which implies . Then it follows from (7.2) that
[TABLE]
Noticing , applying Lemma 7.1 leads to . On the other hand, define for , where
[TABLE]
Then and by applying similar arguments, we conclude that . Thus, .
Next, we discuss two situations depending on whether is a rational number or not.
- •
Case 1. .
In this case, , and . In addition, since , we can assume for some with , and . For any large positive , define for , where
[TABLE]
Then . In addition, for any , , , and . As a result,
[TABLE]
which implies . Then it follows from (7.2) that
[TABLE]
Noticing , applying Lemma 7.1 leads to , which contradicts to .
- •
Case 2. .
Since by the assumption, there exists such that . Recalling , so is approximable with power . Hence, it follows from Lemma 3.2 that there exists a sequence such that , and
[TABLE]
When is large enough, will be very close to . Since , it implies , and . In addition, which is a positive constant only depending on . As a result,
[TABLE]
Denote
[TABLE]
Then no matter or , it always holds that
[TABLE]
- –
For any large as in the above discussion, define for , where
[TABLE]
Then . In addition, for any , , , and
[TABLE]
which implies . Then it follows from (7.2) and that
[TABLE]
Noticing , applying Lemma 7.1 leads to
[TABLE]
- –
On the other hand, define for , where
[TABLE]
Then . In addition, by applying similar arguments, we conclude that
[TABLE]
Adding (7.9) and (7.10) together yields . Recalling , so
[TABLE]
If , then and , which contradicts to (7.11). If , then and , which also contradicts to (7.11).
∎
Appendix A Well-posedness for the Hirota-Satsuma systems (1.5)
In this appendix, we summarize the analytical well-posedness results on the Hirota-Satsuma systems (1.5).
Theorem A.1**.**
The Hirota-Satsuma system (1.5) is A-LWP in if one of the following conditions is satisfied.
- (1)
* and ;* 2. (2)
, and ; 3. (3)
, and or (equivalently ).
Due to the following conserved energies for (1.5), the A-GWP follows directly from Theorem A.1.
[TABLE]
Theorem A.2**.**
Let . Then the Hirota-Satsuma system (1.5) is A-GWP in if one of the following conditions is satisfied.
- (1)
* and ;* 2. (2)
* and .*
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. Alvarez and X. Carvajal. On the local well-posedness for some systems of coupled Kd V equations. Nonlinear Anal. , 69(2):692–715, 2008.
- 2[2] J. Angulo. Stability of cnoidal waves to Hirota-Satsuma systems. Mat. Contemp. , 27:189–223, 2004.
- 3[3] J. Angulo. Stability of dnoidal waves to Hirota-Satsuma system. Differential Integral Equations , 18(6):611–645, 2005.
- 4[4] V. I. Arnold. Geometrical Methods in the Theory of Ordinary Differential Equations . Fundamental Principles of Mathematical Sciences, vol.250. Springer-Verlag, New York, second edition, 1988.
- 5[5] V. Becher, Y. Bugeaud, and T. A. Slaman. The irrationality exponents of computable numbers. Proc. Amer. Math. Soc. , 144(4):1509–1521, 2016.
- 6[6] J. L. Bona, G. Ponce, J.-C. Saut, and M. M. Tom. A model system for strong interaction between internal solitary waves. Comm. Math. Phys. , 143(2):287–313, 1992.
- 7[7] J. L. Bona and R. Smith. The initial-value problem for the Korteweg-de Vries equation. Philos. Trans. Roy. Soc. London Ser. A , 278(1287):555–601, 1975.
- 8[8] J. L. Bona, S.-M. Sun, and B.-Y. Zhang. Conditional and unconditional well-posedness for nonlinear evolution equations. Adv. Differential Equations , 9(3-4):241–265, 2004.
