Approximations of Cumulants of the Stochastic Power Law Logistic Model
Ingemar N\r{a}sell

TL;DR
This paper derives explicit asymptotic approximations for the first three cumulants of the quasi-stationary distribution in a stochastic power law logistic model, improving upon previous methods by avoiding system closure.
Contribution
It introduces a novel approach that approximates cumulants without closing the system of equations, providing explicit formulas and error bounds.
Findings
Explicit asymptotic formulas for cumulants derived
Conditions for approximation validity established
Errors and spurious solutions identified and addressed
Abstract
Asymptotic approximations of the first three cumulants of the quasi-stationary distribution of the stochastic power law logistic model are derived. The results are based on a system of ODEs for the first three cumulants. We deviate from the classical moment closure approach by determining approximations without closing the system of equations. The approximations are explicit in the model's parameters, conditions for validity of the approximations are given, magnitudes of approximation errors are known, and spurious solutions are easily detected and eliminated. In these ways, we provide improvements on previous results for this model.
| A | A | A | |
|---|---|---|---|
| 1 | |||
| 2 | |||
| 3 | |||
| 4 |
| Cumulant | N=100 | N=200 | N=400 | |
|---|---|---|---|---|
| 1 | 81.6 | 163 | 327 | |
| 1 | 16.7 | 33.3 | 66.3 | |
| 1 | -13.8 | -27.3 | -54.3 | |
| 1 | 8.61 | 16.9 | 33.6 | |
| 1 | -0.532 | -0.620 | -0.833 | |
| 1 | -10.0 | -21.0 | -42.8 | |
| 1 | 17.8 | 38.3 | 79.0 | |
| 4 | 95.0 | 190 | 380 | |
| 4 | 4.93 | 9.74 | 19.3 | |
| 4 | -4.80 | -9.45 | -18.8 | |
| 4 | 4.63 | 9.09 | 18.0 | |
| 4 | -4.61 | -8.98 | -17.7 | |
| 4 | 5.52 | 10.5 | 20.5 | |
| 4 | -10.2 | -18.9 | -36.6 |
| 1 | 1 | 1 | 1 | 1 | 1 |
|---|---|---|---|---|---|
| 2 | 1/2 | 3/4 | 7/8 | 13/32 | 1/4 |
| 3 | 1/3 | 2/3 | 8/9 | 7/27 | 1/9 |
| 4 | 1/4 | 5/8 | 15/16 | 25/128 | 1/16 |
| 5 | 1/5 | 3/5 | 1 | 4/25 | 1/25 |
| 6 | 1/6 | 7/12 | 77/72 | 119/864 | 1/36 |
| 7 | 1/7 | 4/7 | 8/7 | 6/49 | 1/49 |
| 8 | 1/8 | 9/16 | 39/32 | 57/512 | 1/64 |
| 9 | 1/9 | 5/9 | 35/27 | 25/243 | 1/81 |
| 10 | 1/10 | 11/20 | 11/8 | 77/800 | 1/100 |
| Cumulant | N=100 ab | N=200 ab | N=400 ab | |
|---|---|---|---|---|
| 0.5 | ||||
| 0.5 | ||||
| 0.5 | ||||
| 3.5 | ||||
| 3.5 | ||||
| 3.5 |
| Approx | Cumulant | N=100 | N=200 | N=400 |
|---|---|---|---|---|
| N | ||||
| N | ||||
| N | ||||
| BR1 | ||||
| BR1 | ||||
| BR1 | -21 | -41 | -82 | |
| BB | ||||
| BB | ||||
| BB |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Approximations of Cumulants of the Stochastic Power Law Logistic Model
Ingemar Nåsell
Department of Mathematics
The Royal Institute of Technology
S-100 44 Stockholm, Sweden
Abstract.
Asymptotic approximations of the first three cumulants of the quasi-stationary distribution of the stochastic power law logistic model are derived. The results are based on a system of ODEs for the first three cumulants. We deviate from the classical moment closure approach by determining approximations without closing the system of equations. The approximations are explicit in the model’s parameters, conditions for validity of the approximations are given, magnitudes of approximation errors are given, and spurious solutions are easily detected and eliminated. In these ways, we provide improvements on previous results for this model.
1. Introduction
We study a stochastic version of the power law logistic model. Logistic models are used as models for the size of a population with density dependent growth. This means that the net growth rate per individual is a decreasing function of the population size. The simplest decreasing function is the linear one. It leads to the classical logistic model, whose deterministic version was studied by Verhulst already in 1838. The stochastic version of this model has been studied extensively, as shown by Nåsell (2017). More general deterministic power law logistic models have been discussed by Banks (1994) and by Tsoularis and Wallace (2002), while stochastic versions of such models are treated by Matis, Kiffe, and Parthasarathy (1998), by Renshaw (2011), and by Bhowmick, Bandyopadhyay, Rana, and Bhattacharya (2016).
The model that we deal with is a Markov Chain with finite state space and continuous time, and with an absorbing state at the origin. It can also be described as a finite-state birth-death process. Two model formulations are given in Section 2. They are mathematically equivalent, but with different parameters. The maximum population size is an important parameter in the second one of the two formulations. This formulation is basic for our work, since serves the role of being the large parameter for which asymptotic approximations of various quantities can be derived.
The state variable of either of the two formulations is interpreted as the number of individuals in a population. Ultimate extinction of this variable is a fact for this model. This means in particular that the stationary distribution of is degenerate with probability one at the origin. A related and important random variable is defined by conditoning on non-extinction. The stationary distribution of this conditioned random variable is the so-called quasi-stationary distribution (QSD). The QSD is important for the information it gives about the long-term behavior of a surviving population. Our main goal is to derive approximations of the first three cumulants of the QSD.
Our approach toward this goal resembles the well-known moment closure method in the sense that both are based on a system of ODEs for the first few (three) cumulants. This system of ODEs is not closed: The number of unknown functions (cumulants) is larger than the number of equations. This fact has caused all practitioners of moment closure to introduce an approximating assumption that leads to closure of the system of ODEs. The method is used widely, as shown by the recent review paper by Kuehn (2016). However, it is also known to possess several weaknesses. The first of these weaknesses is that no condition for validity of the resulting approximation is available, a second one is that the magnitude of the approximation error can not be evaluated, a third one is that it often leads to spurious solutions that require large efforts to eliminate, and a fourth one is that the dependence of the approximating expressions on the model’s parameters is not known. An early basic paper in the area was written by Whittle (1957). The desirability of closing the system of ODEs for the cumulants cannot be denied. It is indeed a necessity if the goal is to derive exact expressions for the cumulants. However, it is important to realize that closure is not necessary if one is satisfied with approximate results, as we are in this paper.
We do not use moment closure methods at all. Instead, we apply the alternative mathematical method introduced in Nåsell (2017). Thus, we base our work on a system of ODEs for cumulants, just as is done by practitioners of moment closure. However, we avoid the step that leads to closure of the system of ODEs, since the ad-hoc nature of this step is the cause of several weaknesses of the moment closure method. Our alternative is to search for asymptotic approximations. This requires the reparametrization that accompanies the second one of the two model formulations of Section 2. To proceed with our approach, we also need information about the orders of magnitude (in terms of ) of the cumulants that appear in the system of ODEs. We motivate our assumptions in this regard with numerical evaluations. The method that we use avoids the weaknesses associated with moment closure methods.
Methods for numerical evaluations of the QSD and of the associated cumulants are discussed in Section 3. Our approach is different from what has been used by previous workers on this model. Results of numerical evaluations are used to motivate forms of basic assumptions that are made for determining asymptotic approximations of the first few cumulants of the QSD, and also for checking the orders of magnitude of the errors of the approximations that we derive.
Sections 4 and 5 are used to study the two random variables and , respectively. We give a system of first-order ODEs for the first 3 cumulants of the unconditioned random variable in Section 4. The same system of equations is shown in Section 5 to provide an approximation for the system of ODEs of the first 3 cumulants of the conditioned random variable in a particular parameter region. This means that approximations of the cumulants of the QSD in this region are found as critical points of the system of ODEs. Asymptotic approximations of the coordinates of the critical points of this system of ODEs are derived in Section 6. They serve to give asymptotic approximations of the first 3 cumulants of the QSD. These results hold for small positive integer values of the parameter that describes the power of the population size that gives the decreasing function of the population size that reflects the density dependence of the net birth rate of the model. An extension of these results to arbitrary positive values of , both integer and non-integer, is proposed in Section 7. Comparisons are made in Section 8 with published results that also deal with the cumulants of the QSD of the same model. The paper ends with some concluding comments in Section 9.
2. Model Formulation
The stochastic power law logistic model is a model for the size of a population with density-dependent growth. It is formulated as a birth-death process . The hypotheses of the model are summarized in the descriptions of the population birth-rate and the population death-rate as functions of the state of the process. The formulation given by Matis, Kiffe, and Parthasarathy (1998) takes the following form: The population birth-rate equals
[TABLE]
and the population death-rate is
[TABLE]
The state space appears to be unbounded and equal to . The parameters are all assumed to be positive. This model formulation takes its form from the similar model introduced by Bartlett (1957) for the special case of the Verhulst logistic model where .
We shall use a different formulation, where
[TABLE]
The state space of the process in this formulation is finite and equal to . The parameter space contains the five parameters , and . Among these, is a large positive integer that represents the maximum population size, is a positive dimensionless threshold parameter, is a nonnegative dimensionless parameter, is a positive death rate with dimension inverse time, and is a positive dimensionless number. This model formulation is an extension from the formulation of the logistic Verhulst model, corresponding to , given by Nåsell (2017). Under the assumption that the initial distribution is supported on the state space, the process remains there for all time, since and . The origin is seen to be an absorbing state, since both and are equal to zero. Absorption at the origin corresponds to extinction of the population. To study a surviving population, we introduce the conditioned random variable by conditioning on the event . Thus,
[TABLE]
The state space of the conditioned random variable differs from that of in one respect: The origin can be reached by , but not by the conditioned random variable . Thus, the state space of the latter one of these random variables is equal to . Stationary distributions of the two random variables are vastly different. The stationary distribution of is degenerate with probability one at the origin, while the stationary distribution of the conditioned random variable is the important quasi-stationary distribution (QSD), supported on the state space .
We shall use the second of the two model formulations in this paper. An important reason for this is that the second formulation contains a parameter that can take large values. The presence of such a parameter is essential for the formulation of asymptotic approximations. The second formulation also gives knowledge about the parameter dimensions. Dimensions of parameters and state variables need to be known when different terms are compared. It is obvious that only terms of the same dimension are comparable.
The two formulations of the stochastic power law logistic model are essentially equivalent if we require in the first formulation that is an integer, that the state space is finite and equal to , and that . We assume in what follows in this paper that these three requirements are met. The second model formulation can then be seen as a reparametrization of the first one. We find then that the 4 parameters of the first formulation can be expressed in terms of the 5 parameters of the second formulation as follows:
[TABLE]
Obviously, takes the same value in both formulations. We note also that some results that we shall refer to below use the following notations for the sums and differences of the parameters and :
[TABLE]
As already mentioned, the logistic Verhulst model can be seen as a special case of the power law logistic model, corresponding to . Asymptotic approximations of the first 3 cumulants of the QSD for the Verhulst model are derived in Nåsell (2017). The goal of the present paper is to derive corresponding approximations of the first 3 cumulants of the QSD of the power law logistic model for positive values of . We proceed by first deriving asymptotic approximations of the first 3 cumulants of the QSD for the -values 2, 3, and 4. These results are then extended in Section 7 to both integer and non-integer positive values of .
3. Numerical Evaluations
Numerical evaluations have been used in earlier work on this model, and we shall also use them here. However, there are differences between our approach and those of earlier workers both in the methods used for deriving numerical results, and in the type of evaluations that are carried out.
Bartlett et al. (1960) argue incorrectly that a stationary distribution cannot exist for the model that they are concerned with, since , while we claim that the process in this case has a degenerate stationary distribution with probability one at the origin. Bartlett et al. claim furthermore that ”the probability distribution for the stationary (or quasi-stationary) distribution must satisfy the recurrence relation”
[TABLE]
We find it disturbing that they refer to both stationary and quasi-stationary distributions here, without clarifying which of these two distributions that they mean. The statement can certainly not hold for both of them. The fact is that it is incorrect for both of them. Instead, the relation (3.1) holds for the stationary distribution of a related auxiliary process that is useful for studying the quasi-stationary distribution. The transition rates of this auxiliary process are equal to those of the process , with the one exception that the rate for transition from the state 1 to the absorbing state 0 is replaced by zero. The stationary distribution is non-degenerate and can be evaluated explicitly. It has been used to study the QSD for a SIS model, which is the special case of the model that we are concerned with here that corresponds to the parameter values and , in Nåsell (2011). It follows from this study that is a good approximation of the QSD in its body in the parameter region where . But it follows also that is not an acceptable approximation of the QSD if , nor in the parameter region near . Similar relations between and the QSD are expected to hold for and for integer values of larger than 1. Both Bartlett et al. (1960) and Matis et al. (1998) use the recurrence relation (3.1) as a basis for numerical evaluations of the QSD. Our approach is different, as shown below. However, we expect the cumulants of the stationary distribution to be close to the cumulants of the QSD in the parameter region where , since the tail probablities have a minor influence on the cumulant values. We note that Renshaw (2011) uses as a ”working definition” of quasi-stationarity. We do not follow this approach. Our results require a sharp line to be drawn between exact and approximate relations.
To describe our method for numerical evaluations, we need a second auxiliary process . The birth rates of this process are equal to those of the original process , while the death rates are slightly smaller than those of , and equal to . This means that the second auxiliary process can be interpreted as the original process with one surviving and immortal individual. Like the first of the two auxiliary processes, the second one lacks absorbing state, and its stationary distribution is non-degenerate and can be evaluated explicitly.
Our numerical evaluations of the QSD make use of the restart map analyzed by Ferrari, Kesten, Martinez, and Picco (1995), and also discussed by Nåsell (2011). This restart map is defined as follows: Let a probability vector be given. Define a process related to the process that we are studying by the requirement that whenever the original process reaches the state zero, it is immediately restarted at some state with probability . The restarted process has the state space , and a unique stationary distribution . The map is then defined by . The quasi-stationary distribution is a fixed point of the mapping , since . Furthermore, Ferrari et al. (1995) show that an iteration scheme can be established by repeated applications of the map to an arbitrary initial probability vector, and that it converges to the quasi-stationary distribution . Suitable initial probability vectors are given by the stationary distributions and of the two auxiliary processes and . It turns out that that the map can be described explicitly with the aid of these two stationary distributions. A derivation is given by Nåsell (2011). By denoting the components of the vector by , we find that
[TABLE]
where
[TABLE]
and
[TABLE]
with
[TABLE]
and
[TABLE]
The stationary distributions and of the two auxiliary processes are determined from tte sequencies and as follows:
[TABLE]
and
[TABLE]
Our numerical method for determining the quasi-stationary distribution consists in applying the restart map to a suitable initial distribution and continue iterations until successive iterates are sufficiently close. In case , which is the parameter region of main interest for the study of cumulants in this paper, the stationary distribution is recommended over the distribution as initial distribution.
Numerical evaluations are used by Matis et al. (1998) for showing that the errors that their approximations lead to are small. With our different parametrization we have access to a parameter that can take large values. We can then use numerical evaluations to extract additional information both about the model and about the approximations that we derive. One goal is to derive information about the orders of magnitude (in terms of ) of various cumulants, and another one concerns the probability of taking the value 1 in the quasi-stationary distribution. Information of these two types is important for the formulation of hypotheses as a basis for finding asymptotic approximations of the first few cumulants for large values of , as will be shown below. Another use of numerical evaluations is to derive information about the orders of magnitude (in terms of ) of the errors that are committed in using our approximations. Such results will be used below to support the forms of asymptotic approximations that we derive.
4. ODEs for Cumulants of the Unconditioned Random Variable
The starting point for our derivations of asymptotic approximations of the first 3 cumulants of the QSD are ODEs for the first 3 cumulants of the conditioned random variable . It turns out that these ODEs are closely related to the ODEs for the corresponding cumulants of the unconditioned random variable in one important region of parameter space, namely where . We start therefore by giving these latter ODEs.
The derivation of these ODEs makes use of the moment generating function and the cumulant generating function of the random variable . They are defined by
[TABLE]
and
[TABLE]
where the state probabilities satisfy the Kolmogorov forward equations
[TABLE]
(Here, we agree to put so that (4.3) makes sense formally for all n-values indicated.)
The definitions of the cumulant generating function and of the transition rates and can be used to derive a partial differential equation (PDE) for . It can be written as follows:
[TABLE]
The same result follows by applying the reparametrization in (2.6)–(2.9) to the expression for the PDE given by Renshaw (2011), and numbered (3.4.11).
The cumulants are obtained from a power series expansion of the cumulant generating function as follows:
[TABLE]
We use this result for determining the ODEs of the first few cumulants of the random variable . They are found by determining the first few terms of the expansion of the PDE of the cumulant generating function in terms of , and then identifying terms with equal powers of .
To express our results, we use the capital letters , , and to denote the time derivatives of the first three cumulants of the random variable , with superscripts indicating the -values, as follows:
[TABLE]
Expressions for the functions for the -values 2, 3 and 4 can then be written as follows:
[TABLE]
Derivations of these results using Maple are given in Nåsell (2018).
We note that the 3 derivatives in (4.9), (4.12), (4.15) of the first cumulant for the 3 -values 2, 3, and 4 are found from the expressions (27), (28), (29) in Matis, Kiffe, Parthasarathy (1998) by reparametrization using (2.10)–(2.13) after introducing the correction that the first term in the right-hand side of (29) is written . Similarly, the 2 derivatives and in (4.10) and (4.13) of the second cumulant for the 2 -values 2 and 3 are found from the expressions (33) and (34) in Matis et al. by the same reparametrization after noting that the minus sign of the term in the right-hand side of their formula (34) is incorrect and should be changed to a plus sign.
5. ODEs for Cumulants of the Conditioned Random Variable
The goal of our study is to determine approximations of the first three cumulants of the QSD for the stochastic variable . This leads us to a study of the stationary values of the system of ODEs for the first three cumulants of the conditioned random variable . These ODEs turn out to be closely related to the ODEs for the cumulants of the unconditioned random variable in one important parameter region, namely where . Identificaton of this parameter region is of high importance, since it gives the validity condition for the approximations that we derive.
In similarity to the case in the previous section we use the cumulant generating function of the random variable that is of concern here to derive the ODEs of the first few cumulants of this random variable. To derive the PDE of this cumulant generating function we proceed as in the previous section. The main difference from the case in the previous section is that the expression for the time derivative of the state probability is different from the counterpart of (4.3). By using the relation in (2.5) we find that
[TABLE]
Differentiation and use of the Kolmogorov forward equations in (4.3) gives
[TABLE]
The PDE for the cumulant generating function of the conditioned random variable turns out to be quite similar to the PDE for . It can be written as follows:
[TABLE]
As in the previous section we determine the ODEs of the first few cumulants by expanding this PDE in terms of and identifying terms of equal powers of . We find then that the last term of the right-hand side of every such ODE contains as a factor. The stationary values of the probabilities are equal to the quasi-stationary probabilities . We claim that the probability is exponentially small in for any and any positive integer value of when . Arguments that support this are given below. The last term can therefore be ignored when we are searching for asymptotic approximations of the critical points. An important consequence of this is that asymptotic approximations of the first few cumulants of the QSD for are found as stationary solutions of the ODEs in Section 4 for the corresponding cumulants of the unconditioned random variable , with obvious rules for exclusion of spurious solutions. Derivations of these results are given below in Section 6.
To show that is exponentially small in when for any and any positive integer we note first from Nåsell (2011) that this result is true for the SIS model, where and . Next we consider the Verhulst model, with and . We then make use of the result that , where is the stationary distribution of the auxiliary process , and where it is shown by Nåsell (2001b) that
[TABLE]
where
[TABLE]
and
[TABLE]
Since the normal density function satisfies , we conclude that , and therefore also , are exponentially small in under the condition , and .
Finally, we consider the cases where is an arbitrary positive integer, , and . Here, we have not found an analytical proof that is exponentially small. Instead, we use numerical evaluations to support our claim. The results in case , , the four -values 1,2,3,4, and the three -values 100, 200, and 400, are given in Table 1. They show that decreases strongly with for the indicated values of , , and . Since is exponentially small in for , we conclude that is exponentially small in for the indicated parameter values. The results in Table 1 also support our claim in the sense that a doubling of the -value for fixed leads to an approximate squaring of the value of . This is what is expected when is exponentially small in .
The condition that we have found under which is exponentially small in , namely , turns out to be the condition for the validity of the approximations of the first three cumulants of the QSD that we derive later. We note that a corresponding condition of validity has been missing from earlier results for the same model, based on cumulant closure.
6. Derivations of Asymptotic Approximations of Cumulants of the QSD
This section is used to derive asymptotic approximations of the first 3 cumulants of the quasi-stationary distribution of the stochastic power law logistic model for the -values 2, 3, and 4 and large -values in the parameter region where . The method that we use is similar to the one introduced in the study of the cumulants of the QSD of the Verhulst logistic model in Nåsell (2017). This latter study actually represents the special case with of the model that we study here. Our results are based on assumptions about the forms of asymptotic approximations of the first few cumulants for large values of . For the Verhulst model with and we give arguments in Nåsell (2017) for assuming that the first 4 cumulants of the QSD are O(N). The results make it easy to conjecture that all finite order cumulants of the QSD for are in the parameter region . It is tempting to guess that this holds also for integer values of larger than 1.
In what follows in this section we shall assume that the first cumulants of the QSD are O(N) when , , and . As a basis for this we refer to the numerical results in Table 2. It shows numerically determined values of the first 7 cumulants of the QSD for the parameter values and , with the 2 -values 1 and 4, and with the three -values 100, 200, and 400. It is easy to verify from the table that all cumulants experience an approximate doubling both when is doubled from 100 to 200 and also when is doubled from 200 to 400. We take this as a strong indication that all of the cumulants considered are . The results in Table 2 are derived using Maple in Nåsell (2018), where also similar results are given for the -values 2 and 3.
We study first the case . The assumptions that the first 5 cumulants are in the parameter region for are basic for our further assumptions that the first 5 cumulants have the following asymptotic behaviors for and :
[TABLE]
The reason for including different numbers of terms in these asymptotic approximations will become apparent shortly. Thus, we have introduced 8 unknowns, namely . We proceed to determine the first 6 of them. By inserting the above asymptotic approximations of the first 5 cumulants into the expressions (4.4)–(4.6) for the functions , we find that asymptotic approximations for them can be written as follows:
[TABLE]
where
[TABLE]
The reason for including different numbers of terms in the assumed asymptotic approximations of the cumulants can be understood with reference to the resulting asymptotic approximations of the quantities , and . We note for example from the expression (4.4) for that an additional term in the assumed asymptotic approximation of would contribute a term of the order of to the asymptotic approximation of . Inclusion of such an additional term would therefore be absorbed in the error term for the asymptotic approximation of . We note from the expressions in (6.9)–(6.14) that only the first 6 of the 8 coefficients introduced above appear in these expressions.
Our assumptions for that the first 5 cumulants have the asymptotic approximations given in (6.1)–(6.5) lead to the asymptotic approximations of , , in (6.6)–(6.8). These expressions are equal to the derivatives with respect to time of the first 3 cumulants. Setting them equal to zero gives conditions for the stationary points of the corresponding system of ODEs. Since we are working with approximations instead of exact results, we transform these conditions into the 3 requirements that , , . These requirements are satisfied by the basic conditions that the 6 expressions in (6.9)–(6.14) are equal to zero. We note that each of these expressions is a polynomial of degree 3 in the 6 unknown coefficients . Our basic problem is to solve the 6 equations that are formed by setting the corresponding expressions equal to zero for the 6 unknown coefficients. Each equation is a polynomial of degree on the 6 unknown coefficients. Solution appears possible, since the number of equations equals the number of unknowns. Further inspection of the 6 equations reveals that considerable simplification can be achieved in the solution by determining the 6 unknowns in a definite order, as described below. The first advantage is that each of the equations to be solved contains only one unknown coefficient, and the second one is that all equations except the first one are linear in the coefficient to be solved for. The very first equation is of degree in the case we are considering here, with . All of the roots of the first equation except one are spurious solutions that turn out to be easy to identify and to delete. Whenever a solution is found for one of the coefficients, its value is immediately inserted into the remaining unsolved equations. The order of solution that leads to these pleasant results is as follows: First, the equation is solved for . This equation has in this case, with , solutions. Among them, we exclude and as the only spurious solutions that appear in this method. The solution found for is then inserted into the remaining 5 unsolved equations. After finding , we proceed to solve the equation for , the equation for , the equation for , the equation for , and the equation for . It is elementary to use these rules to solve for the 6 unknown coefficients. Alternative derivations using Maple are given in Nåsell (2018). The solutions are as follows:
[TABLE]
The method used here for derivation of asymptotic approximations of the first 3 cumulants of the QSD for can in principle be followed for higher integer values of . We proceed to consider the case when . We begin by asssuming that the first 6 cumulants have the following asymptotic behaviors for and :
[TABLE]
Insertions of these asymptotic approximations of the first 6 cumulants into the expressions (4.7)-(4.9) for the functions lead to the following asymptotic approximations for them:
[TABLE]
where
[TABLE]
These 6 expressions are all polynomials of degree 4 in the 6 unknowns , , , , , . We solve the 6 equations formed by putting each of these expressions equal to zero for the 6 unknowns . First, the equation is solved for . Among the 4 roots we exclude the one that is equal to zero, and the two that are complex conjugate as the only spurious solutions that appear in this method. The remaining 5 equations are then solved for the remaining 5 unknowns, using an order of solution similar to what is described for the case above. Derivations using Maple are given in Nåsell (2018). The results are as follows:
[TABLE]
The last problem addressed in this section is the derivation of asymptotic approximations of the first 3 cumulants of the QSD for . We begin by assuming that the first 7 cumulants have the following asymptotic behaviors for and :
[TABLE]
Insertions of these asymptotic approximations of the first 7 cumulants into the expressions (4.10)–(4.12) for the functions lead to the following asymptotic approximations for these 3 functions:
[TABLE]
where
[TABLE]
These 6 expressions are all polynomials of degree 5 in the 6 unknown quantities . As above, we solve the 6 equations formed by putting each of these expressions equal to zero for the 6 unknowns . First, the equation is solved for . Among the 5 roots we exclude 4 of them as spurious solutions, namely one that is equal to zero, one that is negative, and 2 that are imaginary. The remaining 5 equations are then solved for the remaining 5 unknowns, using the same order as above for the cases and . Derivations using Maple are given by Nåsell (2018). The results are as follows:
[TABLE]
7. Extensions to Positive Values of
The results derived in the previous section give approximations of the first 3 cumulants of the QSD for the integer -values 2, 3, and 4. The present section is used to extend these approximations to positive values of . The extension is made in four steps. In the very first step we combine the results of the previous section with the results in Nåsell (2017), which are valid for , to give results that hold for the -values 1, 2, 3, and 4. In the second step we extend these results to the integer -values from 1 to 10. The third step extends these results to all positive integer values of . The final step makes an extension from this to all positive values of .
We emphasize that the method that we use for deriving approximations of the first few cumulants for integer values of cannot be used for deriving results for non-integer values of . The reason is that the derivations are based on ODEs for the first few cumulants of integer order. Our last step of extension does not require us to consider ODEs for cumulants of non-integer order. Instead we use a continuity argument for extending the derived approximations of the first few cumulants to non-integer positive values of .
By using obvious definitions of the 6 coefficients , we find from Nåsell (2017) for the case that they depend on the model parameters and as follows:
[TABLE]
The dependence on for all positive values of is claimed to be as follows for the 6 coefficients :
[TABLE]
where the functions are defined as follows:
[TABLE]
To show that the functions are determined by these expressions for , we determine first the values that they take for the integer -values 1 - 10, and given in Table 3. The values taken by these 5 functions for the integer -values 1-4 are readily found from the expressions found for the 6 coefficients given in the previous and present sections. The further results for the -values from 5 to 10 are based on asymptotic approximations of the first 3 cumulants of the QSD of the power law logistic model that have been derived using Maple, and are reported in Nåsell (2018).
It is straightforward to use the entries in Table 3 for the functions , , and to confirm that the expressions (7.13), (7.14), (7.17) for these functions are valid for all integer values of from 1 to 10.
We turn now to deal with the functions and . To find expressions for them, we assume that each of them can be written as a quotient of 2 polynomials in of degree 2. To be specific, we assume that
[TABLE]
To determine the values of the 6 coefficients , we establish 6 equations that are found by equating the values of for the integer -values from 1 to 6 according to (7.18) to the corresponding values in Table 3. By solving these equations using Maple, as seen in Nåsell (2018), we find that equals the expression in (7.15). An entirely similar treatment of the function leads to the expression (7.16) for . So far, we can conclude that the expressions (7.15) and (7.16) for and are valid for the integer -values in the interval from 1 to 6. Extensions of the validities of the 2 expressions for and to all integer values of from 1 to 10 are easily seen to hold by the simple expedient of verifying equalities between the values of the expressions in (7.15) and (7.16) and the corresponding values in Table 3. We take this as a strong indication that the domain of validity of the expressions (7.15) and (7.16) can be further extended to all positive integer values of .
As a last step of extension, we conjecture that the domain of validity for the 5 functions can be extended from all positive integers to all positive values of . We illustrate the result for non-integer -values by giving the numerical values of the error terms for the 3 cumulants in case and , for the 2 -values 0.5 and 3.5, and the 3 -values 100, 200, and 400 in Table 4. The table indicates that the error term for is divided by approximately 4 for each doubling of , the error term for is divided by approximately 2 for each doubling of , and the error term for is approximately constant when is doubled. This is consistent with the claims that the error terms of are for the -values 1, 2, and 3. Evaluations of the error terms in Table 4 are derived using Maple in Nåsell (2018).
The results in (7.7)–(7.12) show the way in which our asymptotic approximations of the first 3 cumulants of the QSD of the stochastic power law logistic model depend on the 4 parameters , , , and . The one-term asymptotic approximations of the first 3 cumulants , , and are useful for the information that they give about the behaviors of these cumulants. They can be written as follows:
[TABLE]
where expressions for the 3 coefficients are given in (7.7), (7.10), and (7.12), respectively. We use these expressions to derive some properties of the first 3 cumulants, valid for sufficiently large values of .
We note first that the expectation is an increasing function of the power . This follows from the following expression for the derivative of with respect to :
[TABLE]
Our second observation concerns the variance . For sufficiently large values of , we find that it has a maximum as a function of at , where
[TABLE]
This follows from the following expression for the derivative of with respect to :
[TABLE]
We conclude in particular that is increasing in for and decreasing in for , provided is large enugh.
Our third observation concerns the third cumulant . It is useful for determining the skewness of the QSD. Actually, the QSD has negative skewness if is negative, and positive skewness if is positive. It follows from the expression (7.12) that is positive if and negative if , where
[TABLE]
It is easy too see that the same intervals of lead to positive respectively negative skewness of the QSD if is sufficiently large.
Our brief study of the -dependence of the first 3 cumulants shows that one can identify 2 cases with rather different behaviors. The first case occurs when , and the second one when . In the first case we conclude that both and are increasing functions of , and that the QSD has positive skewness. In the second case we conclude as in the first case that is increasing in , while now is a decreasing function of and the skewness of the QSD has changed from positive to negative.
We illustrate the -dependence of the first three cumulants by plotting the QSDs for 5 different -values with the constant parameter values , , and . In these cases we find and . Numerically determined QSDs are shown in Figure 1 for the 5 -values 0.2, 0.5, 1, 3, and 10. The figure shows that the first cumulant (the expectation) increases as a function of , that the second cumulant (the variance) decreases both when approaches small positive values, and when grows towards large values, and finally that the third cumulant (a measure of skewness) is positive for small -values, and negative for large s-values. (For the interpretations of the plots in Figure 1, it is useful to note that the probabilities are positive for all in the state space . Recall that the QSDs are discrete, and that the individual probabilities are determined from the plotted curves by reading off the values at each abscissa .)
The asymptotic approximations that we have given of the first three cumulants for large values of are valid for sufficiently large -values. It is seen from the plot in Figure 1 that the requirement that is exponentially small is not satisfied for when , and . We leave it as an open problem to determine how large must be as a function of , and to assure that is exponentially small in .
8. Discussion of Other Approaches
In this section we discuss five published results that all deal with the problem in this paper, namely to determine approximations of the first three cumulants of the QSD of the stochastic power law logistic model. All these results are formally different from each other, and also different from the results that we have derived and presented in Section 7 of this paper. We use the powerful method of asymptotic approximations to analyze these results. In this discussion we are mainly limited to the case .
The first published results are those due to Bartlett (1957) and Bartlett, Gower, ansd Leslie (1960). The latter results are referred to as the BGL-approximations. They give approximations of the first three cumulants of the QSD of the stochastic logistic model with . The model they analyze is the first one formulated in Section 2. Their derivation is based on a study of the change of the stochastic variable , and of its seond and third powers, during an infinitesimal time interval. The resulting approximations of the first three cumulants of the QSD, which are equal to the mean , the variance , and the third central moment , are given as follows in Bartlett et al (1960):
[TABLE]
In the quoted paper, these three approximations are denoted by the sign instead of . This would indicate that the approximations have the desirable property of being asymptotic under the condition that some identified variable becomes large. However, no such variable appears in the derivations. Because of this, the derived approximations can not be claimed to be asymptotic. It is, however, easy to show that the BGL-approximations are closely related to asymptotic approximations of the first three cumulants. All one has to do is to use the reparametrization in (2.6)–(2.9). This leads to the following results:
[TABLE]
where , , , are given in (7.1), (7.2), (7.4), (7.6). This shows that the approximation (8.1) of the first cumulant of the QSD is equal to the sum of the first two terms of its asymptotic approximation for large -values, while the approximations (8.2) and (8.3) of the second and third cumulants of he QSD are equal to the first terms of the corresponding asymptotic approximations. All three approximations are therefore asymptotic for large . We note, however, that the BGL-method does not by itself allow this important conclusion. Additional information about properties of the method will be uncovered in our study below of published results number three and four, referred to as BR1 and BR2.
The second published result that we comment on was presented by Matis and Kiffe (1996) in case , and by Matis, Kiffe, and Parthasarathy (1998) when is a positive integer. A valuable contribution of these papers is that they show that a system of ordinary differential equations (ODEs) for the first few cumulants of the unconditioned random variable can be derived from the PDE for the cumulant generating function of . As already mentioned, we use a slight variation of this approach for deriving ODEs of the first few cumulants of the conditioned random variable . It turns out that the first system of ODEs (for the cumulants of ) serves as an approximation of the second system of ODEs (for the cumulants of ) under the condition that . The critical points of the two systems of ODEs correspond to stationary distributions of the two random variables and , respectively. The two systems of ODEs for the cumulants are not closed, in the sense that the number of cumulants is larger than the number of equations. This may appear as an undesirable property of the problem. Closure is clearly necessary if one wants or needs exact solutions. However, for our purposes it is important to realize that closure is not needed to find approximate solutions. To achieve closure, Matis et al. (1996), (1998) assume that all cumulants of sufficiently large order are equal to zero. This assumption is clearly at odds with our finding that all cumulants of the QSD are of order . It turns out, however, that it leads to good numerical approximations of the cumulants. This behavior can be understood from our results. We note for example from equations (4.15)-(4.17) that the right-hand sides of the ODEs for the first three cumulants when depend upon all seven cumulants . But we note also in this case from (6.52)-(6.57) that our assumption that the cumulants are all leads us to conclude that they have no influence on the first three terms of the asymptotic appoximation of , nor on the first two terms of the asymptotic approximation of , or on the first term of the asymptotic approximation of . The assumption that made by Matis and Kiffe is clearly fulfilled if . This argument shows that cumulant approximations based on cumulant closure can lead to asymptotic approximations if they are acceptable at all, as shown in several cases by Matis et al. (1998). But the assumption that some cumulant equals zero is incorrrect and can lead to errors if one studies e.g. the QSD instead of its cumulants.
The remaining three published results that we discuss here are all based on the BGL-method. Renshaw (2011) has used the BGL-method in an effort to derive additional terms in the approximations of the first three cumulants of the QSD. He proceeds to formulate two approaches that we refer to as BR1 and BR2, respectively, where the letters B and R are used to refer to Bartlett and Renshaw. In each of these two approaches he proceeds to derive three relations between cumulants that turn out to be similar to relations between critical points of the system of ODEs for the first three cumulants. To describe his findings, we quote first from Renshaw (2011) the ODEs for the first three cumulants in the case . They are derived from the PDE for the cumulant generating function, and do not involve any approximations. For convenience in notation we denote the time derivatives of the first three cumulants by , , and , respectively. We get
[TABLE]
The same equations are also found in Matis and Kiffe (1996), and, after using the reparametrization in (2.10)-(2.13), in Nåsell (2017).
The two approaches disussed by Renshaw (2011) are the third and fourth of the published results that we discuss. It turns out that the first two of the cumulant relations derived by Renshaw are in each of his two approaches equal to the expressions found by setting the cumulant derivatives and quoted above equal to zero. However, the third cumulant relation that he derives is different from what is found by setting equal to zero in each of the 2 approaches. In the approach BR1 the third cumulant relation is given by relation (3.5.21) in Renshaw’s book, while it is given by the expression following (3.5.38) in the same book in the approach BR2. We use the notations and to refer to these relations. For the first one of these we find that is written as follows after using the reparametrization in (2.10)-(2.13):
[TABLE]
We proceed to derive asymptotic approximations of the first three cumulants of the QSD for this case. As in Section 6, our basic assumption is that the first four cumulants have the following asymptotic behaviors:
[TABLE]
By inserting these asymptotic approximations of the first four cumulants into the expressions for in (8.7), (8.8), (8.10) we find that asymptotic approximations for them can be written as follows:
[TABLE]
where
[TABLE]
The basic mathematical problem at this point is to determine the six coefficients so that the following three conditions are satisfied:
[TABLE]
These conditions are satisfied by setting the eight expressions (8.18) - (8.25) equal to zero. We proceed to solve the resulting eight equations for the six unknown coefficients , , , , , . At this point it appears that there are more equations than unknowns, but the apparent problem that this causes will readily be solved. The equation is first solved for , and the spurious solution is excluded. As above, as soon as the solution is found for any of the coefficients , , , , , , it is inserted into the remaining unsolved equations. The equation is then solved for , and the equation is solved for . After this we find that the two expressions and can both be determined from these values, and that they both are equal to zero. Thus, the number of equations is reduced to be equal to the number of unknown coefficients. In the last steps we use the equation to express as a function of , the equation to solve for as a function of , and finally the equation to solve for . The result of these evaluations is that the six coefficients , , , , , can be expressed as follows as functions of the model parameters and :
[TABLE]
Comparisons with (7.1)-(7.6) show that the expressions for the 3 coefficients , , and are equal to the corresponding expressions derived by Nåsell (2017) using our preferred method where asymptotic approximations of the cumulants are derived from the ODEs for the cumulants without attempting to close the equations. The comparisons also show that the expressions for the remaining three coefficients , , and disagree between our approach and BR1. This means that the two approaches lead to different approximations of all three cumulants. Since both of them cannot be right, and since we claim that our derivations are correct, we conclude that the BR1 results are incorrect. The reason for the discrepancy is that one or more of the various approximation steps that are taken in using the BGL-method brings in errors. Renshaw (2011) appears to share this view in his brief discussion of this issue after his formula (3.5.21). We conclude that the BGL-method brings in errors of unknown magnitude, and therefore is unsuitable for deriving approximations of the cumulants of the QSD beyond those derived by Bartlett et al. (1960). Support for this claim is given by the numerical evaluations reported in Table 5. They indicate that the error terms of the approximations of the cumulants , derived in Nåsell (2017), are of the orders O(1/), O(1/), O(1), respectively, just as expected. But we find also that the error terms of the approximations derived via the BR1 approach are larger than this by an order of magnitude. The BR1 approximation of in this case is useless, since the error terms actually grow with .
We turn now to a consideration of the second version BR2 of the Bartlett-Renshaw approach. Renshaw (2011) uses it to derive a new expression for the third cumulant relation. It is given after relation (3.5.38) in his book. After writing and using the reparametrization (2.10)-(2.13) it can be written , where is as follows:
[TABLE]
We proceed to derive asymptotic approximations of the first three cumulants of the QSD for this case, using the three cumulant relations , , and . As above, we assume that the first four cumulants have the following asymptotic behaviors:
[TABLE]
By inserting these asymptotic approximations of the first four cumlants into the expressions for , , and , we find that asymptotic approximations for them can be written as follows:
[TABLE]
Expressions for the five quantities are given in (8.18)-(8.22), while and are equal to
[TABLE]
We set these 7 expressions equal to zero, and solve the resulting seven equations for the six unknown coefficients . The value of is found by solving the equation and excluding the spurious solution . The values of and are then found by first solving the equation for , and then solving the equation for . The values of , , and are the same as the ones found in (8.29), (8.30), (8.32). Furthermore, we insert these values of into (8.43) and (8.44). This shows that and that therefore the number of equations available for solving for the six coefficients is reduced from seven to six. We also get
[TABLE]
To determine the three coefficients , , we solve the equation for , the equation for , and the equation for . The rather surprising result is that the values of the six coefficients , , , , , are found to be the same as in (7.1)-(7.6). This means that even though the explicit results from the BR2 method are different from the results from our preferred method, the asymptotic approximations agree.
In his development of BR2, Renshaw (2011) uses the equation to derive the relation
[TABLE]
By using the reparametrization in (2.11) and (2.13) we find that this relation was shown to hold already by Bartlett et al. (1960). It is seen from (7.4) and (7.6) that this relation is asymptotic for large . However, the arguments used by Renshaw in his derivation can be criticized. As also pointed out by Bhowmick et al. (2016), it does not make sense to assume that is large in comparison with , since these four parameters are rates whose values depend on the unit of time, which is arbitrary, while is independent of the time unit. It is also unrealistic to assume that is small compared with , since it contradicts our finding that all cumulants are .
Renshaw’s two efforts to extend the BGL-approximation beyond what was derived by Bartlett et al. (1960) leads to two different conclusions. In one case (BR2) the results are correct asymptotically, while they are not in the other case (BR1). This is enough to cnclude that the BGL-method cannot be trusted to give correct results wthout further investigations.
We turn now to consider the results reported by Bhowmick et al. (2016). They work with the parameter space that is associated with the second of the model formulations in Section 2. Actually, they study a more general model than the one that we are concerned with here. The population birth rate in their model is given by
[TABLE]
while their population death rate equals
[TABLE]
They do not require to be an integer. We study their model in the special case where . To agree with our model formulation we furthermore put and .
By using results derived by Bhowmick et al. (2016) and reported in Section 4 of their paper, we find that the mean equals
[TABLE]
where
[TABLE]
By including 3 terms in the asymptotic approximation of in (8.49), we find that
[TABLE]
From relation (25) in Bhowmick et al. we find that the variance equals
[TABLE]
By including two terms in the asymptotic approximation of , we find that
[TABLE]
Bhowmick et al. use four quantities , , , and in a formula that determines the third cumulant as follows:
[TABLE]
These four quantities are defined in terms of the parameters that are used to describe the first of the two model formulations in Section 2. After reparametrization they can be expressed as follows:
[TABLE]
Thus, all four of these quantities are determined as functions of . We use the asymptotic approximation of in (8.51) to determine one-term asymptotic approximations of each of . Two terms of the asymptotic approximation of is needed for , while one term suffices for the remaining 3 quantities. The results are
[TABLE]
It follows from this that the one-term asymptotic approximation of equals
[TABLE]
By inserting this result and the one-term asymptotic approximation of from (8.53) into (8.54), we get
[TABLE]
The resulting asymptotic approximations of the first 3 cumulants are found in (8.51), (8.53), and (8.64). Comparisons with the results derived by using our preferred method and found from (7.1)-(7.6) show that the two approaches lead to different results. We claim that our results are correct, and that the results given by Bhowmick et al. are incorrect. An independent suppport for this conclusion is found from the numerical evaluations of the error terms of the results presented by Bhowmick et al. and given in Table 5. We see here that the error term for the BB approximation of is reduced by the factor 2 for each doubling of . This is typical of an error that is of magnitude of . In addition, the error term for the BB approximation of is found to be approximately independent of , which indicates that it is of the magnitude of . The error term for the BB approximation of is, finally, found to be approximately proportional to , which makes the corresponding approximation useless. The reason for the incorrect results arrived at by Bhowmick et al. is that one or more of the approximation steps taken in the application of BGL method brings in errors. This is similar to the reason for incorrectness of the BR1 result, commented on above.
9. Concluding Comments
The method that we have used here to derive asymptotic approximations of the first few cumulants of the QSD of the stochastic power law logistic model emphasizes the importance of the second model formulation in Section 2. It gives access to two parameters that are basic for our study, namely the maximum population size and the threshold parameter . We have shown that the condition for large is required for the results that we have presented here, while both the moment closure method and the 3 approaches BR1, BR2, and BB based on the BGL method have the serious weaknesses that they can not produce conditions for validity of the approximations that are derived. In addition we note that magnitudes of approximation errors are easy to establish in our method, as they are for any asymptotic approximation, while they can not be produced in the moment closure method, nor in approaches based on the BGL method. We note furthermore that spurious solutions that could require large efforts to eliminate appeared in early studies based on moment closure, while the spurious solutions that appear in the method that we use here are easy to identify and eliminate. Our results show that the number of spurious solutions is equal to the parameter whenever is a positive integer. As a further comment we note that the dependence of the new approximations on the model parameters is explicit in our approach, while they are unknown in results based on moment closure and in BR1 and BR2.
We have found that our method based on the second model formulation of Section 2, and followed by a search for asymptotic approximations, provides a powerful approach for determination of approximations of the first few cumulants of the QSD for the power law logistic model. We have also shown that the method of determining asymptotic approximations can be used to study other approaches to the same problem.
We conclude that the method of determining ODEs for the first few cumulants of the QSD introduced by Matis and Kiffe (1996) is preferred over the BGL-method for deriving relations between the cumulants. We emphasize the obvious fact that whenever exact solutions of a mathematical problem are difficult to establish, then one should search for approximations. Furthermore, approximation methods are then definitely preferred that give both condiitions for validity of the approximations and magnitudes of approximation errors. Because of this, we conclude that our method for determining asymptotic approximations of the first few cumulants is preferred over any method that is based on moment closure.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R.B. Banks, Growth and Diffusion Phenomena, Springer, Berlin, Heidelberg (1994).
- 2[2] M.S. Bartlett, On theoretical models for competitive and predatory biological systems, Bimetrika, 44 , 27–42, 1957.
- 3[3] M.S. Bartlett, J.C. Gower, and P.H. Leslie, A comparison of theoretical and empirical results for some stochastic population models, Biometrika, 47 , 1–11 (1960).
- 4[4] A. R. Bhowmick, S. Bandyopadhyay, S. Rana, and S. Bhattacharya, A simple approximation of moments of the quasi-equilibrium distribution of an extended theta-logistic model with non-integer powers, Math. Biosci., 271 , 96–112 (2016).
- 5[5] P. Ferrari, H. Kesten, S. Martínez, and P. Pico, Existence of quasi-stationary distributions. A renewal dynamic approach, Ann. Probab. 23 , 501–521 (1995).
- 6[6] C. Kuehn, Moment closure - A brief review, In Control of Self-Organizing Nonlinear Systems, Springer, 2016, ar Xiv:1505.02190
- 7[7] J.H. Matis and T.R. Kiffe, On approximating the moments of the equilibrium distribution of a stochastic logistic model, Biometrics, 52 , 980–991 (1996).
- 8[8] J.H. Matis, T.R. Kiffe, and P.R. Parthasarathy, On the cumulants of population size for the stochastic power law logistic model, Theor. Pop. Biol., 53 , 16–29 (1998).
