Temporal oscillations in Becker-Doering equations with atomization
Robert L. Pego, Juan J. L. Vel\'azquez

TL;DR
This paper demonstrates that a modified Becker-Doering coagulation-fragmentation system with atomization exhibits time-periodic solutions arising from Hopf bifurcation, modeling oscillatory behavior in physical chemistry gas evolution processes.
Contribution
It introduces a novel variant of the Becker-Doering equations incorporating atomization and proves the existence of oscillatory solutions via bifurcation analysis.
Findings
Time-periodic solutions are proven to exist.
Oscillations arise through Hopf bifurcation.
Model captures physical chemistry gas evolution oscillations.
Abstract
We prove that time-periodic solutions arise via Hopf bifurcation in a finite closed system of coagulation-fragmentation equations. The system we treat is a variant of the Becker-Doering equations, in which clusters grow or shrink by addition or deletion of monomers. To this is added a linear atomization reaction for clusters of maximum size. The structure of the system is motivated by models of gas evolution oscillators in physical chemistry, which exhibit temporal oscillations under certain input/output conditions.
| 0.39349 | 3.9349 | 0.021740 | |
| 0.075016 | 2.3722 | 3.6176e-4 | |
| 0.020376 | 2.0376 | 9.3596e-6 | |
| 0.0061392 | 1.9414 | 2.7777e-7 | |
| 0.0019118 | 1.9118 | 8.6091e-9 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\usdate
Temporal oscillations in Becker-Döring equations
with atomization
Robert L. Pego*(1), Juan J. L. Velázquez(2)*
Abstract
We prove that time-periodic solutions arise via Hopf bifurcation in a finite closed system of coagulation-fragmentation equations. The system we treat is a variant of the Becker-Döring equations, in which clusters grow or shrink by addition or deletion of monomers. To this is added a linear atomization reaction for clusters of maximum size. The structure of the system is motivated by models of gas evolution oscillators in physical chemistry, which exhibit temporal oscillations under certain input/output conditions.
1-Department of Mathematics and Center for Nonlinear Analysis
Carnegie Mellon University, Pittsburgh, Pennsylvania, PA 12513, USA
email: [email protected]
2- Institut für Angewandte Mathematik
Universität Bonn
Endenicher Allee 60
53115 Bonn, Germany
email: [email protected]
*Keywords. *bubbling oscillator, shattering, bubbelator, time periodic solution
*Mathematics Subject Classification. * 34C23, 34A34, 82C99
1 Introduction
Coagulation-fragmentation equations are commonly used to model particle size distributions in a wide range of scientific and technological applications. These equations model binary reactions of clusters of size with clusters of size as indicated schematically by
[TABLE]
With rate coefficients for aggregation and for breakup, the net rate of this binary reaction is modeled by the law of mass action to be
[TABLE]
The coagulation-fragmentation equations accounting for the gain and loss rates for the number density of groups of size then take the form
[TABLE]
To date, mathematical investigations of the dynamic behavior of solutions have largely focused on questions of convergence to equilibrium and the phenomenon of gelation, in which mass conservation fails (either in finite or infinite time) due to a flux to infinite size. We refer to classic work of Aizenman and Bak [1] who established an -theorem for perhaps the simplest coagulation-fragmentation model with constant rate coefficients, and Ball, Carr and Penrose [2] for the first analysis of (infinite-time) gelation in the Becker-Döring equations. If fragmentation is weak, finite-time gelation can occur [12, 13, 19, 31] as it does for the case of pure coagulation about which there is now an extensive literature.
Regarding convergence to equilibrium, entropy methods have been effectively used to study general classes of coagulation-fragmentation equations that admit equilibria in detailed balance, meaning that for each individual reaction in the system, so the forward and backward reaction rates match. See work of Laurençot and Mischler [21] for the continuous-size case and Cañizo [7] for the discrete-size case. More recent studies of equilibration have examined rates of convergence and their relation to entropy-dissipation relations [18, 6, 27, 28, 5].
In the absence of detailed balance, however, one does not expect that an -theorem always holds, and it is not clear whether the structure of coagulation-fragmentation reaction networks means that solutions necessarily always converge to some equilibrium. Sometimes it is indeed the case, as in cases when coagulation is weak [14] or for special systems that can be studied globally using transform methods, as in [9]. In [22], Laurençot and van Roessel analyzed a model with a critical balance of coagulation and fragmentation rates, and used transform methods to show that infinite-time gelation emerges through self-similar growth.
On the other hand, in studies of pure coagulation without fragmentation, the usual expectation of self-similar growth has sometimes been shown not to occur. For special rate kernels, solutions with fat tails are known to be capable of periodic and even chaotic behavior after rescaling [26]. Temporal oscillations can persist after rescaling without fat tails for Smoluchowski equations with diagonal rate kernel [20].
Our goal in the present work is to demonstrate that persistent oscillations in time are possible in a simple discrete-size coagulation-fragmentation model, by proving that Hopf bifurcations occur.
The particular system that we study is a modified system of Becker-Döring equations. (For a nice historical review of mathematical developments concerning the Becker-Döring equations, see [17].) As usual for Becker-Döring equations, we suppose that the coagulation of clusters of size with monomers proceeds at the rate , and clusters of size lose monomers at the rate . We take these rates to apply only for a finite range of sizes , however, and consider only the simplest case, always taking . Thus the net flux of clusters from size to is , as given by
[TABLE]
We suppose further that is the size of the largest clusters in the system, and these are also subject to a linear atomization reaction that converts an -cluster into monomers and proceeds at rate . Thus the governing equations take the following form:
[TABLE]
All solutions of the system (2)–(4) conserve mass, since
[TABLE]
A formal continuum analog of this system will be studied for illustrative purposes in Section 4.
2 Background and motivation
Model with nonlinear atomization. In the physical literature, recent work of Matveev et al. [25] and Brilliantov et al. [4] has identified a coagulation-fragmentation model with a different, nonlinear atomization mechanism that exhibits persistent temporal oscillations in numerical simulations. In this model, aggregation of clusters of size and proceeds at rate where
[TABLE]
and pairs of such clusters atomize upon collision into monomers at rate . In total, the rate equations in [25] take the form
[TABLE]
This system has the feature that interactions between large clusters of similar size appear to be dominated by interactions between large clusters and small ones (for which either or is large). Oscillations are found for and small . Though the numerics is convincing, to our knowledge there is no proof yet that temporal oscillations persist in this system.
Bubbling oscillators. Our motivation for studying the system (2)–(4) comes from literature in physical chemistry concerning bubbling oscillators (often called ‘gas evolution oscillators’ in much of the literature). In these systems, dissolved gas (such as CO or CO2) is added slowly to a liquid solution, producing a super-saturated mixture. At some time, nucleation of gas bubbles occurs spontaneously and the bubbles grow rapidly and carry most of the dissolved gas out of the system. The first system of this kind was reported by J. S. Morgan in 1916, who found that a small concentration of formic acid mixed in sulphuric acid produced periodic bursts of carbon monoxide. Such systems were the subject of part of an extensive series of quantitative studies by R. M. Noyes and collaborators concerning chemical oscillators, including some of the original studies of chemical oscillators such as the BZ reaction and the Oregonator. Regarding gas evolution oscillators, we especially refer to [30, 32, 3]. The phenomenon of sudden outgassing of CO2 after slow buildup of supersaturation was responsible for the 1986 Lake Nyos disaster in Cameroon, which killed more than 1700 people.
In the work of Yuan, Ruoff and Noyes [32], this process was simulated numerically by grouping bubble sizes into a finite set corresponding to exponentially spaced radii , and writing rate equations to model the number density of bubbles of size . A key equation when greater than a critical value is
[TABLE]
where the coefficients are proportional to bubble growth rate and the are rate constants for escape. This resembles a linearized Becker-Döring equation or a discretized advection equation, and models the process of free bubble growth and escape. With size classes, numerical simulations in [32] exhibit temporal oscillations for a range of parameters designed to model experimental conditions.
Bar-Eli and Noyes [3] later devised a simplified, qualitative model for bubbling oscillators that involves a nonlinear differential-delay equation for the concentration of dissolved gas. When linearized about a constant steady-state, one obtains a constant-coefficient linear DDE of the form
[TABLE]
where the parameters , and the delay time are positive constants. Whenever , one finds there is an oscillatory transition from stability to instability as increases.
We sketch loosely how one can see this mathematically. (A detailed analysis of (8) can be found in work of Hadeler and Tomiuk [16].) Equation (8) has solution provided
[TABLE]
For , naturally , and moreover is never possible for any . But for and sufficiently large, there are solutions with . To show this is so, one can consider the winding number around 0 of a curve , where is a concatenation of a path for and a path in the right half plane along the semicircle where . Along the semicircle, can never cross the negative real axis . Along the imaginary axis, however,
[TABLE]
and this does cross for between [math] and , if is large enough. Moreover, can only ever cross going from the second quadrant to the third, since whenever ,
[TABLE]
and this has negative imaginary part. Consequently, the winding number of around 0 is positive if is large enough, and this implies (9) has a root inside .
Becker-Döring with linear atomization (our model). Now, the rough idea behind our model (2)–(4) is that the Becker-Döring equations involve a well-known advection mechanism that transports mass from small cluster sizes to large ones when the monomer concentration is supercritical. The atomization reaction added in (3) couples the advected wave back to the monomer concentration after a time delay that depends on the size of the system. Luckily enough, we find that for large there indeed is an oscillatory transition to instability as the parameter varies, in a certain parameter range where is small but remains large. See Figure 1, where we plot the monomer concentration vs. time for a numerically computed solution of (2)–(4) with parameters and initial values given by
[TABLE]
Other models with linear atomization. Finally, we mention two other kinds of merging-splitting models involving a linear atomization reaction that have appeared in the literature. Alongside discussion of Niwa’s model [29] for animal group size, Ma et al. [24] described a “preferential attachment” model, which takes the form
[TABLE]
This model admits a simple logarithmic distribution in equilibrium, of the form
[TABLE]
(This is roughly similar to the distribution Niwa found to be a good description of empirical data on school size for pelagic fish.) A model of herd behavior by networks of colluding agents in financial markets introduced by Eguiluz & Zimmermann [11] takes the form
[TABLE]
D’Hulst and Rodgers [10] found a formula for equilibrium solutions of this model by use of generating functions. But as far as we are aware, no analysis of dynamics has been carried out for either of these models.
3 Equilibria, linearization, and main result
In this section, we find the general equilibrium solutions of the model (2)–(4), describe the special family of constant equilibria, and state our main rigorous result on the existence of Hopf bifurcations from this family, occurring at particular values of , for large enough .
3.1 General equilibria
We find the general equilibria as follows. In equilibrium, due to (2) the fluxes are all equal to the same value for , so the equilibrium number densities satisfy the difference equation , when we require . For , the solution takes the form
[TABLE]
To obtain an equilibrium it remains to require that (3) hold, i.e.,
[TABLE]
Then it follows (recall )
[TABLE]
and
[TABLE]
Note that (4) then holds also. For every , and , such an equilibrium exists and is positive. In case , one finds directly that
[TABLE]
The total mass as a function of and is now
[TABLE]
where
[TABLE]
3.2 Linearization at constant equilibria
Particularly convenient for our analysis is the special family of equilibria that have constant densities, corresponding to . By (14) these take the form
[TABLE]
Corresponding to we require . We will study the linearization of the system (2)–(4) about this equilibrium. We write:
[TABLE]
The linearized fluxes take the form
[TABLE]
and the linearized evolution equations are written as follows:
[TABLE]
Equivalently, after some computations, the system takes the more explicit form
[TABLE]
Equation (21) yields a combination of diffusion and transport. It is not able to yield oscillatory behavior by itself, but this will be generated through the ‘boundary conditions,’ or more precisely the equations with and
Looking for solutions of this system with the form
[TABLE]
leads to the eigenvalue problem (recalling and )
[TABLE]
This system takes the form of an eigenvalue problem for a vector , with matrix having the structure
[TABLE]
Our goal is to understand the spectrum of in some detail, and eventually show that in a certain parameter range, some pair of complex eigenvalues of crosses the imaginary axis, and this forces the system (2)–(4) to undergo a Hopf bifurcation.
To begin, one can check that for any eigenvector corresponding to a nonzero eigenvalue , the mass conservation condition holds:
[TABLE]
This is due to the fact that is a left null vector of . Thus is an eigenvalue of . If we represent the equations of equilibrium from (2)–(4) in vector form as , then the matrix evaluated at with constant components . Thus by differentiation it naturally follows from (15) that a right null vector satisfying is given by
[TABLE]
In fact, we have the following.
Lemma 3.1**.**
For all and , is a simple eigenvalue of .
Proof.
First, we show the null space of is one-dimensional. Whenever , the fluxes defined by
[TABLE]
must all take the same value due to (18), and for this value is due to (19). If , then we can replace by a linear combination with to make all fluxes . But by (30) it follows and, by induction, with for . Since , the only vector making all the fluxes vanish is . It follows that spans the null space of .
Next, we claim there is no generalized eigenvector satisfying . The reason is that, because for all and is a left null vector of , we would obtain a contradiction, via
[TABLE]
Thus the eigenvalue has algebraic multiplicity one, so it is simple. ∎
3.3 Main results
If is an eigenvalue of , we say is unstable if . We find that we can show the matrix has unstable eigenvalues when is sufficiently large and is small but not too small, in a range proportional to . These eigenvalues are characterized as follows. It is convenient to state our results in terms of the parameter
[TABLE]
in place of .
Theorem 3.2**.**
For each and , there exists , and positive constants , , such that for each , the following hold:
If , then any unstable eigenvalue of is non-real and simple, and satisfies
[TABLE] 2. 2.
There are numbers for , satisfying
[TABLE]
such that:
- (a)
If (), then has exactly complex-conjugate pairs of unstable eigenvalues. 2. (b)
There are analytic curves , , such that is an eigenvalue of that satisfies
[TABLE]
along with
[TABLE]
By the properties stated in part 2 of this theorem, the matrix has a unique pair of nonzero, purely imaginary eigenvalues when is large and , and these cross transversely into the right half plane as increases.
The simple eigenvalue at zero, described in Lemma 3.1, is nominally an obstruction to applying the standard Hopf bifurcation theorem at this point. This eigenvalue is easily removed, however, by considering the dynamics of the nonlinear system (2)–(4) restricted to the invariant affine hyperplane determined by conservation of mass, i.e., the hyperplane where
[TABLE]
with being the constant equilibrium state from (17). Within this hyperplane, the linearization of the system (2)–(4) is restricted to orthogonal complement of the left null vector of . In this subspace, the zero eigenvalue is removed, and the standard Hopf bifurcation theorem can be applied to yield the following result. (For a proof of the Hopf bifurcation theorem see [8, pp. 98–99]. For further discussion also see [15, pp. 150–152].)
Theorem 3.3**.**
Let and suppose as given by Theorem 3.2. Then for each , the system (2)–(4) admits a Hopf bifurcation as the bifurcation parameter passes through . Thus a time-periodic solution exists for some value of with small.
We have not managed to determine analytically whether the bifurcating solutions are stable (the supercritical case) or not. Many of our numerical computations, as in Fig. 1, are consistent with the presence of stable periodic solutions, however.
In Figure 2 we illustrate the location of the complex eigenvalues of computed numerically for the parameter values and . The unstable eigenvalues shown correspond to values
[TABLE]
Besides the real eigenvalue this matrix also has a large negative eigenvalue . There are 49 complex-conjugate pairs of eigenvalues that lie close to an ellipse that we will describe formally in Remark 5.1 and Section 6. The presence of eigenvalues spaced closely along a smooth curve and a large isolated eigenvalue is reminiscent of the structure of the spectrum of differential-delay systems with large delay, see [23].
The real parts of eigenvectors for the first 3 complex eigenvalues closest to are plotted in Figure 3. They appear to have a “smooth” structure except in a boundary layer near .
In Table 1 we tabulate for various values of numerically computed critical values of that correspond to , the value at which the first pair of complex-conjugate eigenvalues crosses the imaginary axis. Eigenvalues were obtained by solving the equation (77) in Proposition 5.3 below using an iteration method. The first two rows were computed also by finding all eigenvalues of using the julia function eigen. The values of in the third column can be compared to the value described below in (92). This value is proved in Section 10 to be the limiting value of as , see (152).
4 A formal continuum approximation
Here we describe and heuristically analyze a formal continuum approximation of the system (2)–(4) that we are studying. This will serve to preview how the proof of Theorem 3.2 will proceed, and also provides an approximate understanding of the origin of the oscillatory instability in the system and the smooth structure of the eigenvectors as shown in Figure 3.
We introduce a scaled atomization rate and scaled space and time variables via
[TABLE]
With these relations, we write a continuum approximation of (2)–(4) in terms of variables
[TABLE]
as follows: The evolution equations for , in (2)–(3) are approximated by the PDE
[TABLE]
where the number flux relation from (1) is approximated by
[TABLE]
Taking (2) to hold also for with , the following boundary condition replaces (3):
[TABLE]
We replace the evolution equation for by an equation equivalent to mass conservation, which says
[TABLE]
After integrating by parts, we require
[TABLE]
The system of equations (36)–(39) has the constant equilibrium just like the discrete system. Using a superposed dot to denote differentiation with respect to a variational parameter leads to the linearized system
[TABLE]
Then looking for solutions of the form leads to the eigenvalue problem
[TABLE]
The steps that we now take to analyze this continuum eigenvalue problem parallel the steps that we will take to analyze the discrete eigenvalue problem (24)–(26). First, we describe the solutions of (44) as linear combinations of solutions of the form where
[TABLE]
It is convenient to write a general solution of (44) in terms of the two solutions of (47) as
[TABLE]
Then a nonzero solution can satisfy the boundary conditions (45) and (46) if and only if
[TABLE]
[TABLE]
So far this is an exact treatment of the eigenvalue problem (44)–(46). But now we approximate, noting that the two solutions of (47) satisfy
[TABLE]
for small . For and complex both of order , we neglect the exponentially small term and keep only the leading order terms in the other entries of the matrix in (48), expressing in terms of using the relations in (50). Thus, writing we make the approximations
[TABLE]
Multiplying the second row of (48) by and dividing the second column by , we find that the equation determining eigenvalues approximately takes the form , where
[TABLE]
It is exactly this function that we will find responsible for the appearance of unstable eigenvalues for the discrete problem (24)–(26) in the limit of large . We analyze the complex roots of in depth in Section 7. It turns out that a complex-conjugate pair of roots emerges into the right half plane as increases past each value of an infinite sequence. The values will be seen to be the limiting values of as appear in the statement of Theorem 3.2, in the limit .
In the remainder of this paper, we carry out the proof of Theorem 3.2 by performing an analogous analysis for the discrete eigenvalue problem, including rigorous estimates for all the error terms. For brevity’s sake, we forgo the formulation and rigorous demonstration of results analogous to Theorems 3.2 and 3.3 for the (parabolic) continuum model (36)–(39) described in this section. It should be evident from our analysis, though, that Hopf bifurcation occurs for this model in a similar way.
5 Reformulation of the eigenvalue equation
5.1 The difference equation
The eigenvalue equations (25) for comprise a family of second order difference equations. These difference equations have solutions of the form
[TABLE]
whenever
[TABLE]
which we can rewrite using as
[TABLE]
or as
[TABLE]
We take decreasing powers in (54) for reasons of scaling explained below.
We can then “connect” the values of and by means of a transition matrix depending on two constants (for each value of ). More precisely, any solution of (25) takes the form
[TABLE]
whenever and are distinct roots of (57). Evidently the two roots are always related by , and for the roots are and .
The roots are distinct except when , which corresponds to
[TABLE]
For small , we note that this becomes
[TABLE]
The roots , are naturally functions of . However, it will be more convenient to recast the eigenvalue equations in terms of the variable and regard as a function of , given by the following equation equivalent to (56):
[TABLE]
Except when (which will generate spurious roots below), corresponding to eigenvalues there should exist roots of the relevant equations, which occur in pairs , that produce the same .
Remark 5.1*.*
We note that by (56), values of on the unit circle, with for real, produce values of on an ellipse with
[TABLE]
This ellipse lies in the left half plane and passes through . In numerical computations such as those reported in Fig. 2, almost all the eigenvalues lie near this ellipse. By consequence we will expect to find most roots satisfying and , with extremely small. (This is the basic reason for the form we took in (54).) The possibility of transition to instability will depend upon the deviation of roots from this ellipse in the vicinity where .
5.2 Reduction to a determinant
We now use the expression (58) to write the “boundary conditions” for , that correspond to the equations for and in (26) and (24) respectively. Using the fact that (56) holds for both and , after some computation we find that these equations take the following form:
[TABLE]
Except in the degenerate cases when and (59) holds, the eigenvalue problem in (24)–(26) is therefore equivalent to the vanishing of a determinant:
[TABLE]
where and , and the functions , are given by
[TABLE]
The function depends on and , but this dependence will not be written explicitly for simplicity. We note the general root-exchange symmetry
[TABLE]
Because is an eigenvalue we also know that has roots at and . Note that due to dependence of the columns, but these roots are spurious, unless double, as we now discuss.
The degenerate case. In the cases of (59) when the two roots of (57) coincide at , one checks that the difference equation (25) has the general solution
[TABLE]
by the expedient of replacing , in (58) with
[TABLE]
and taking . Doing the same with (63)–(64), we see that the eigenvalue condition (65) is replaced by the condition
[TABLE]
This is equivalent to the condition because one finds at these points.
Remark 5.2*.*
In order to characterize Hopf bifurcation, we will use the fact that when , 1 or , is a simple root of if and only if is a simple eigenvalue of . See Lemma 5.4 and its proof in Section 10.
5.3 Sorting terms and removing singularities
For convenience in analysis, we sort the terms in (65) according to th powers of and . Note that
[TABLE]
In order to remove singularities, we multiply (65) by . Define
[TABLE]
where
[TABLE]
with the definitions
[TABLE]
By consequence we have the sorted representation
[TABLE]
where
[TABLE]
Observe that has a pole at of order , with , because at the origin. And for we find that
[TABLE]
Consequently must have exactly zeros, counting multiplicities.
We may summarize the situation as follows.
Proposition 5.3**.**
A complex number is an eigenvalue of if and only if (61) holds for some pair , satisfying
[TABLE]
except in the two cases . In these cases, is an eigenvalue if and only if
[TABLE]
Of the roots of , four are spurious, counting , and once each, coming from the dependence of the columns in (65) and the factors used to remove singularities from .
The polynomial of degree is divisible by the factor
[TABLE]
and the remaining roots of correspond in pairs , to the eigenvalues of . The values and , are (at least) double roots of because they were already roots of , and correspond to the simple eigenvalue . Concerning other roots of , we have the following result whose proof we defer to Section 10.
Lemma 5.4**.**
Suppose and . Then is a simple root of if and only if is a simple eigenvalue of .
6 Formal approximation
Before we begin a rigorous analysis of the zeros of , we treat the problem approximately in the limit of large to gain insight. Numerical experimentation suggests that we can expect to find most solutions of (65) to satisfy , and , with extremely small.
Thus we neglect the terms containing in (76) and study the zeros of
[TABLE]
For any such zero, evidently
[TABLE]
unless both numerator and denominator vanish. The right-hand side is a ratio of polynomials of low degree, while for large , the function expands a small region about any th root of unity to cover a large part of the complex plane. Roughly, then, we can expect (81) to have a solution near each th root of unity. These should then provide eigenvalues spread out around the ellipse in (62).
We focus next on looking for imaginary roots . We change variables from to , noting that
[TABLE]
With these relations we have
[TABLE]
and we find from (72)–(74) the exact expressions
[TABLE]
It turns out to be appropriate to require is small while is large. Somewhat more precisely, we ask that
[TABLE]
Then we get the approximate relations
[TABLE]
By consequence, we find that
[TABLE]
If we suppose as , then
[TABLE]
where
[TABLE]
The complex roots of provide an approximation for roots of when is large. These approximate eigenvalues of (24)–(26) through (55), which may be written directly in terms of as
[TABLE]
Thus purely imaginary roots of approximate eigenvalues near the imaginary axis, and roots of in the right half plane should approximate eigenvalues satisfying .
7 Analysis of roots of
In this section we establish basic properties of the roots of as defined in (87). This will serve as the foundation to analyze the roots of and ultimately those of , in subsequent sections.
Purely imaginary roots of occur whenever
[TABLE]
Matching real parts demands that if then hence . Matching also the ratio of imaginary to real parts, one finds that (89) holds if and only if
[TABLE]
together with
[TABLE]
Each positive root of (90) provides a complex conjugate pair of imaginary roots of . Let denote the increasing sequence of all these positive roots of (90). The smallest occurs for (less than ). This corresponds to a critical value of given by
[TABLE]
The roots approach from below as . As increases, they correspond to larger values of , hence larger values of for a fixed .
In the rest of this section, we shall prove that non-real roots of are always simple, and purely imaginary roots must move into the right half plane as increases, where they must remain in a bounded region. By this result and (88), when we can expect that for large enough with , there will be some eigenvalue of (24)–(26) in the right half plane, and when we can expect there will not.
Lemma 7.1**.**
For any , has a double root . All other complex roots are non-real and simple.
Proof.
Clearly , and for real we have by the convexity of . In general we compute
[TABLE]
The root is double because at 0. At a complex double root, on the other hand, necessarily . This implies where and is an odd integer. Then, however, it follows
[TABLE]
so and we infer
[TABLE]
a contradiction. Hence the nonzero roots of are all non-real and simple. ∎
For the next result, let and recall that denotes the sequence of positive roots of (90).
Lemma 7.2**.**
The function has exactly complex-conjugate pairs of roots in the right half plane if with .
Proof.
First, we claim that the imaginary roots of always cross into the right half plane as increases. To see this, regard as a complex variable and note that if and only if
[TABLE]
Because , we compute
[TABLE]
by using the identity to eliminate . Multiplying by , we find
[TABLE]
For in the first quadrant, the imaginary part of this expression is negative, which implies
[TABLE]
Furthermore, provided (which must be the case if by (91)), the real part of (95) is larger than , hence
[TABLE]
It follows from these computations that the roots of on the imaginary axis always pass into the right half plane as increases, with derivative remaining in the first quadrant. They can never escape to infinity, because any roots of in the right half plane must lie in the bounded region where
[TABLE]
To finish the proof, we show that if is small enough, then has no roots with . If , any such root must satisfy
[TABLE]
and this implies . Now it follows
[TABLE]
Therefore, for small enough , does not vanish when . ∎
Labeling the roots. Due to the results of the previous two lemmas, we may label all the non-real roots of that cross the imaginary axis and lie in the upper half plane by analytic functions , , defined for all according to the property that
[TABLE]
Thus we can summarize as follows.
Lemma 7.3**.**
There are analytic curves , , satisfying (98) and for all , such that when , the numbers comprise all the roots of in the first quadrant. Moreover, for all , and
[TABLE]
Proof.
To show the curves are well defined and nondegenerate for all , we note that according to standard continuation theory for the ODE (94), a solution exists for real in a maximal interval for which remains bounded. It is not possible that , however, because the right-hand side of (94) cannot approach zero at the same time as (93) holds with , for the following reason: If (94) vanishes, then , hence with . But then (93) implies
[TABLE]
This implies , so necessarily but also , and this is impossible. ∎
8 Analysis of roots of
In this section we locate all the roots of the polynomial in (80) of degree , to a rough approximation, provide bounds on roots that may correspond to unstable eigenvalues, and establish the convergence in (86) in a precise sense. Let denote the closed disk with center and radius . We fix a constant . (Actually, suffices.) Depending on some large (to be chosen in the proof of Theorem 3.2), we presume throughout that
[TABLE]
8.1 Rough locations of all roots
Locations of the roots of will be identified as follows. We recall that the four values , , , which comprise the roots of the polynomial
[TABLE]
from (79), are already known to be roots of the function that approximates. Note that the three roots of with satisfy
[TABLE]
Proposition 8.1**.**
Fix . Then for any there exists and such that whenever and (100) holds, the polynomial has exactly:
- (i)
one double root at .
- (ii)
one simple root in each of the following disks of radius :
[TABLE]
- (iii)
one simple root in .
- (iv)
* roots in the punctured annulus*
[TABLE]
Proof.
Recall . where we can write
[TABLE]
with as in (79), and
[TABLE]
Step 1. First we establish (i). Note that , since
[TABLE]
Furthermore, since
[TABLE]
Hence is at least a double root. But one also checks
[TABLE]
(e.g., by computer algebra) so when .
Step 2. Next we claim that the only roots of in the disk are three as described in (ii). We can write
[TABLE]
where
[TABLE]
It suffices to show that for all in outside the balls listed in (ii),
[TABLE]
for large enough. For then our claim follows from Rouché’s theorem, since each of the balls in (ii) contains one simple root of .
Observe that for , therefore
[TABLE]
(Here and below denotes a generic constant which may depend on and but is independent of and , whose value may change from instance to instance.) To complete the proof of (107), we consider three sub-cases:
[TABLE]
In case (a), for each (i.e., for each positive root of ), we have , therefore
[TABLE]
Because and it follows that for , with large enough we have
[TABLE]
(We could replace by say here, but we have no need.)
In case (b), each positive root of satisfies , hence
[TABLE]
Consequently
[TABLE]
and (for )
[TABLE]
Therefore as in (110) we get
[TABLE]
for large enough depending on and .
In case (c), we have
[TABLE]
with min and max taken over positive roots of . Therefore for large, when (chosen to separate the roots) we find
[TABLE]
for some depending on .
This finishes the proof of (107). The conclusion in (ii) now follows, and also the fact that has no other roots in .
Step 3. Next we show that has no roots satisfying
[TABLE]
for large enough depending on , and deduce (iii) and (iv). The estimates in (113) imply
[TABLE]
Observe
[TABLE]
where
[TABLE]
(To get this last, expand and cancel a term.)
We now show the ratios and are uniformly small for satisfying (113), by estimating six terms as follows:
(a) The first term of the ratio is bounded using (114) as follows:
[TABLE]
(b) To bound the next term in , observe
[TABLE]
For , since , for we have
[TABLE]
while for we have and infer from (114) that
[TABLE]
(c) The last term in the ratio satisfies the bound
[TABLE]
(d) The terms in are estimated as follows. By (114),
[TABLE]
Further, and
[TABLE]
Therefore, since and recalling ,
[TABLE]
Hence, since , the last term in is bounded by
[TABLE]
(e) For the next term in , we have the bound
[TABLE]
(f) Lastly we have the bound
[TABLE]
Assembling the estimates in (a)-(f), we conclude that if and , then
[TABLE]
for all satisfying (113). Part (iii) now follows by Rouché’s theorem since has only one simple zero at inside . Part (iv) follows since we have shown that has exactly 6 roots (counting multiplicity) in the complement of the punctured annulus . ∎
We record here several estimates that follow from the proof above.
Corollary 8.2**.**
Under the conditions of Proposition 8.1, we have the following estimates, for some depending on :
- (i)
* if .*
- (ii)
* if or .*
- (iii)
* if .*
- (iv)
* if .*
- (v)
* if .*
Proof.
Part (i) follows from (110) in case (a) of Step 2, because . Similarly, part (ii) follows from (112) in case (c) of Step 2, and part (iii) follows from all cases of Step 2. To infer part (iv), note that (127) of Step 3 implies that for we have
[TABLE]
because . Part (v) follows similarly, since and therefore . ∎
8.2 Bounds for roots relevant to instability
Next we focus on roots of that may be related to eigenvalues of the matrix having non-negative real part. It turns out these are roots in the punctured annulus of Proposition 8.1 that are near . Recall the relation (61) between eigenvalues of the matrix and roots of , namely
[TABLE]
Lemma 8.3**.**
Under the conditions of Proposition 8.1, if is large enough, then whenever (61) holds with , then implies
[TABLE]
Proof.
By (61), , hence if then . Writing
[TABLE]
we then have and
[TABLE]
For and large, we infer , then
[TABLE]
Now because , we deduce from (130) that
[TABLE]
This entails , due to (131). Since implies , we have established the desired bounds on .
Now since and , we deduce from (130) that
[TABLE]
Since we may presume , therefore as claimed. ∎
Any roots of in the region where (129) holds actually satisfy a tighter bound, namely , as we now show.
Proposition 8.4**.**
Under the conditions of Proposition 8.1, there exist positive constants and such that whenever , any zeros of that satisfy the bounds in (129) must satisfy . Moreover,
[TABLE]
for all that satisfy
[TABLE]
Proof.
In the expression we seek to show that the first term dominates, provided (133) holds for some . Writing for convenience, we have , so by (8.1),
[TABLE]
By (133) we have and , so
[TABLE]
Because
[TABLE]
for and large enough we infer that
[TABLE]
On the other hand, due to (122) we have
[TABLE]
therefore from (8.1) we obtain the upper bound
[TABLE]
if , say, and is large enough. Since if (133) holds, the result follows. ∎
8.3 Convergence of
After the results of the previous subsection, to study unstable eigenvalues of we are motivated to make the change of variables
[TABLE]
as in Section 6. According to Proposition 8.4, for any zeros of that correspond to , the quantity must satisfy
[TABLE]
As in (31), let us now define and . Then the formal approximations in Section 6 are rigorous, with errors that are uniform over the values of such that
[TABLE]
where is an arbitrary constant (to be chosen later), and
[TABLE]
for some small . (We allow to be complex with small argument here, to simplify derivative estimates later.) By consequence, the convergence in (86) holds, in the following sense.
Proposition 8.5**.**
Uniformly for satisfying (137)–(138), with and we have that
[TABLE]
9 Analysis of roots of
Recall from (76) we have
[TABLE]
where , are low-degree polynomials that may be written in the form
[TABLE]
For large , is exponentially small, with the bound
[TABLE]
We now roughly characterize the location of the roots of .
Proposition 9.1**.**
Under the conditions of Proposition 8.1, there exists such that whenever , has (counting multiplicities):
- (i)
one double root at , and one double root at .
- (ii)
* roots in the punctured annulus , and roots with which satisfy .*
- (iii)
one simple real root in , and one with .
- (iv)
one simple real root at and one at .
Proof.
We note that due the root symmetry (66), the multiplicity of each root of is the same as the multiplicity of , unless . Also, all non-real roots of come in complex-conjugate pairs when is real.
For we then have and it follows is exponentially small.
Combining the lower bounds in parts (iii)–(v) of Corollary 8.2 with the count of roots of in parts (i), (iii) and (iv) of Proposition 8.1, we conclude from Rouché’s theorem that has a simple root inside the ball , and roots inside the closed annulus , the same as .
By examining (140)–(141), we find has at least a double root at , due to the fact that the expression
[TABLE]
has a double root at . Then, because is exponentially small, it follows from (104) that . This proves (i).
Now (iii) follows and also (ii), due to the fact that for and large,
[TABLE]
To infer (iv) we can simply recall that we know due to the root symmetry relation (66). These roots must be simple, since we have accounted for all roots of . ∎
Next, we can characterize zeros of that may correspond to unstable eigenvalues of as follows.
Proposition 9.2**.**
Under the conditions of Propositions 9.1 and 8.4, there exists such that whenever and is an eigenvalue of with , then for some root of that satisfies
[TABLE]
Proof.
Under the correspondence between and in (61), the zeros of described in parts (iii) and (iv) of Proposition 9.1 correspond to negative real values of , and the roots in part (i) correspond to . So, given is large enough, for any nonzero eigenvalue satisfying , necessarily (61) holds for some . This must satisfy the bounds in (129), due to Lemma 8.3. For these values of , we have , so is exponentially small. Then we can conclude from Proposition 8.4 that
[TABLE]
for all that satisfy (133). The conclusion follows. ∎
Further, the convergence in Proposition 8.5 holds with in place of :
Proposition 9.3**.**
Let , and let be small. Uniformly for satisfying (137)–(138), with and we have that
[TABLE]
Furthermore, for each pair of integers , the derivatives
[TABLE]
uniformly for all and satisfying
[TABLE]
Proof.
For and , the factor is bounded by . Hence again is exponentially small, and the convergence of follows from Proposition 8.5.
The convergence of derivatives follows from the Cauchy integral formula representation for such derivatives, since is analytic for satisfying (137) and satisfying (138). ∎
Curves of roots. Recall that the non-real roots of are simple and those that may satisfy lay on the curves described by Lemma 7.3. Moreover, due to (98) and (99), only a finite number of these curves provide values that can satisfy (136), corresponding to values of that satisfy (142). In particular, we note the following.
Corollary 9.4**.**
For , if and is given by Proposition 8.4, then
[TABLE]
Proof.
Suppose for some . Recall satisfies , . Then for large enough, satisfies (133), and
[TABLE]
due to (143). But this contradicts the convergence result in Proposition 9.3. ∎
Any finite number of the curves of simple zeros of perturb to curves of simple zeros of as a consequence of the implicit function theorem, as follows.
Proposition 9.5**.**
For , suppose . Let be given by Proposition 8.4, and suppose
[TABLE]
Then for sufficiently small , there is a curve that is real analytic, with the following properties:
- (i)
For each , is a simple root of .
- (ii)
* as , uniformly for , together with any finite number of derivatives in .*
- (iii)
There exists satisfying as , such that if and only if , and
[TABLE]
Proof.
The existence of the curve, its analyticity in , and properties (i), (ii) and (iii), follow from standard implicit function theorem arguments using the simplicity of the roots of , the convergence in Proposition 9.3, and Lemma 7.3. ∎
10 Analysis of eigenvalues of
The eigenvalues of are generated via the relation (61) by: one of the roots of at , the one near , and the roots in . The roots , one root at , and one root at are spurious, as discussed earlier. We have not characterized the multiplicity of all the eigenvalues or all the roots, but each eigenvalue must correspond to some root of , and vice versa.
10.1 Curves of unstable eigenvalues
Recall that zeros of correspond to eigenvalues of the matrix via the relation (88). We rescale this relation by defining
[TABLE]
Clearly as , together with derivatives, uniformly for , satisfying (146).
When , of course we have if and only if , for any . By stardard implicit function theorem arguments, for small enough there is a real analytic function such that for and ,
[TABLE]
Let denote the surface on which this holds, i.e., where . When , the imaginary axis meets each curve transversely due to the computation in (97). Therefore, for sufficiently small , the surface meets each curve provided by Proposition 9.5 transversely. By consequence, each curve given by
[TABLE]
provides a curve of eigenvalues of that must cross the imaginary axis transversely as increases, exactly once for .
10.2 Proof of Theorem 3.2
Let and . Recalling that the curves and numbers were defined in (98), we fix , and note
[TABLE]
Next, choose , let be determined by Proposition 8.4, and choose such that
[TABLE]
If is sufficiently large (i.e., for some depending on ) then analytic curves are defined by Proposition 9.5 and by (150). Let
[TABLE]
Due to Propositions 9.2 and 9.5 and the discussion above, each curve crosses the imaginary axis transversely at some point that satisfies
[TABLE]
By consequence, for small enough we have for , where we set . Also we have the monotonicity relations in (33).
Since , the eigenvalues of given by , satisfy the bound
[TABLE]
for large. Furthermore, due to Lemma 5.4 (proved below), every such eigenvalue is a simple eigenvalue of , since the roots of are simple.
It remains to prove that for , if is an eigenvalue of with , and , then necessarily for some with . According to Proposition 9.2, necessarily such an eigenvalue must satisfy
[TABLE]
where , and .
Now, for any sufficiently small, note that the balls do not overlap or contain [math] for any , and each must contain a simple root of . Fix some such , and let be the set of such that
[TABLE]
and . Because , for sufficiently small we have
[TABLE]
From the convergence in Proposition 9.3 it follows
[TABLE]
if is sufficiently small. Then it follows that for some , whence necessarily . And since .
This completes the proof of Theorem 3.2.
10.3 Simplicity of eigenvalues
It remains to prove Lemma 5.4, which shows in particular that simple roots of provide simple eigenvalues of .
Proof of Lemma 5.4.
First, we show that the kernel of is one-dimensional. Recall from Section 5 that whenever , then the components have the form (58) for some constants , . More generally, if has the form (58) with , , and if , then equations (63)–(64) are equivalent to the equation
[TABLE]
where denotes the th standard basis vector, and
[TABLE]
The value is an eigenvalue if and only if is singular. The matrix does not vanish in this case, however, for the following reason. Since and , necessarily and are distinct and have the same sign. But the function is convex and cannot have two distinct roots with the same sign. Hence it is not possible that for both and .
It follows that the kernel of is one dimensional, and the eigenspace is spanned by , taking
[TABLE]
Next, we determine when is simple, i.e., when it has algebraic multiplicity one. Since , this is the case if and only if the equation
[TABLE]
has no solution. Letting ′ denote differentiation with respect to , it follows by differentiating (154) (while keeping , fixed), that
[TABLE]
Now, whenever , so it follows that a solution to (156) exists if and only if where is a solution to
[TABLE]
As in Section 5, necessarily for some constants , that satisfy
[TABLE]
Writing , and similarly for , , the fact that is singular means
[TABLE]
and a left null vector is given by or (since ). Supposing , applying the left null vector to (157) we find that a solution of (157) exists if and only if
[TABLE]
where we used (158) to replace by . If similarly the criterion is . Thus an eigenvalue is simple if and only if , and this is equivalent to . ∎
Acknowledgements
The authors acknowledge support from the Hausdorff Center for Mathematics and the CRC 1060 on Mathematics of emergent effects, Universität Bonn. This material is based upon work supported by the National Science Foundation under grants DMS 1515400 and 1812609, partially supported by the Simons Foundation under grant 395796, and by the NSF Research Network Grant no. RNMS11-07444 (KI-Net).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Aizenman and T. A. Bak , Convergence to equilibrium in a system of reacting polymers , Comm. Math. Phys., 65 (1979), pp. 203–230.
- 2[2] J. M. Ball, J. Carr, and O. Penrose , The Becker-Döring cluster equations: basic properties and asymptotic behaviour of solutions , Comm. Math. Phys., 104 (1986), pp. 657–692.
- 3[3] K. Bar-Eli and R. M. Noyes , Gas-evolution oscillators. 10. a model based on a delay equation , J. Phys. Chem., 96 (1992), pp. 7664–7670.
- 4[4] N. Brilliantov, W. Otieno, S. Matveev, A. Smirnov, E. Tyrtyshnikov, and P. Krapivsky , Steady oscillations in aggregation-fragmentation processes , Physical Review E, 98 (2018), p. 012109.
- 5[5] J. A. Cañizo, A. Einav, and B. Lods , Trend to equilibrium for the Becker-Döring equations: an analogue of Cercignani’s conjecture , Anal. PDE, 10 (2017), pp. 1663–1708.
- 6[6] J. A. Cañizo and B. Lods , Exponential convergence to equilibrium for subcritical solutions of the Becker-Döring equations , J. Differential Equations, 255 (2013), pp. 905–950.
- 7[7] J. A. Cañizo , Convergence to equilibrium for the discrete coagulation-fragmentation equations with detailed balance , J. Stat. Phys., 129 (2007), pp. 1–26.
- 8[8] S. N. Chow and J. K. Hale , Methods of bifurcation theory , vol. 251 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Science], Springer-Verlag, New York-Berlin, 1982.
