Phenomenology of stochastic exponential growth
Dan Pirjol, Farshid Jafarpour, Srividya Iyer-Biswas

TL;DR
This paper explores a broader class of models for stochastic exponential growth beyond the traditional Geometric Brownian Motion, emphasizing power law noise to better match experimental observations of biological and physical systems.
Contribution
It introduces a generalized phenomenological model with power law multiplicative noise, providing analytical solutions and a method to infer parameters from experimental data.
Findings
Mean-rescaled distributions are approximately stationary at long times.
Power law noise models better fit experimental data than GBM.
Analytical solutions and parameter inference methods are developed.
Abstract
Stochastic exponential growth is observed in a variety of contexts, including molecular autocatalysis, nuclear fission, population growth, inflation of the universe, viral social media posts, and financial markets. Yet literature on modeling the phenomenology of these stochastic dynamics has predominantly focused on one model, Geometric Brownian Motion (GBM), which can be described as the solution of a Langevin equation with linear drift and linear multiplicative noise. Using recent experimental results on stochastic exponential growth of individual bacterial cell sizes, we motivate the need for a more general class of phenomenological models of stochastic exponential growth, which are consistent with the observation that the mean-rescaled distributions are approximately stationary at long times. We show that this behavior is not consistent with GBM, instead it is consistent with power…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Phenomenology of stochastic exponential growth
Dan Pirjol
National Institute of Physics and Nuclear Engineering, Bucharest, Romania
Farshid Jafarpour
Department of Physics and Astronomy, Purdue University, West Lafayette, IN 47907
Srividya Iyer-Biswas
Department of Physics and Astronomy, Purdue University, West Lafayette, IN 47907
Abstract
Stochastic exponential growth is observed in a variety of contexts, including molecular autocatalysis, nuclear fission, population growth, inflation of the universe, viral social media posts, and financial markets. Yet literature on modeling the phenomenology of these stochastic dynamics has predominantly focused on one model, Geometric Brownian Motion (GBM), which can be described as the solution of a Langevin equation with linear drift and linear multiplicative noise. Using recent experimental results on stochastic exponential growth of individual bacterial cell sizes, we motivate the need for a more general class of phenomenological models of stochastic exponential growth, which are consistent with the observation that the mean-rescaled distributions are approximately stationary at long times. We show that this behavior is not consistent with GBM, instead it is consistent with power law multiplicative noise with positive fractional powers. Therefore, we consider this general class of phenomenological models for stochastic exponential growth, provide analytical solutions, and identify the important dimensionless combination of model parameters which determines the shape of the mean-rescaled distribution. We also provide a prescription for robustly inferring model parameters from experimentally observed stochastic growth trajectories.
pacs:
???,???,…
I Introduction
Stochastic exponential growth occurs when a quantity of interest multiplies geometrically, with a random multiplier and/or with stochastic waiting times between growth spurts. Such processes have been observed in various contexts, across a range of length and timescales. Familiar examples include microbial population growth Ginzburg et al. (1982); Keiding (1975); Lewontin and Cohen (1969); Iyer-Biswas et al. (2016), nuclear fission Everett and Ulam (1948); Birkhoff and Varga (1958), copy number dynamics of key biochemicals Iyer-Biswas et al. (2014a); Tsuru et al. (2009); Friedman et al. (2006); Cai et al. (2006), increase of individual cell sizes during interdivision periods Iyer-Biswas et al. (2014b, a); Pfeuty and Kaneko (2007); Iyer-Biswas and Zilman (2016), tissue growth in developmental and cancer biology Eden (1961); Furusawa and Kaneko (2000); Kohandel et al. (2007); Halpin-Healy and Zhang (1995), spreading of epidemics Girvan et al. (2002), growth of internet and propagation of viral social media posts Huberman and Adamic (1999); Kumar et al. (2000), financial markets Samuelson (1965); Black and Scholes (1973), stochastic divergence of neighboring phase space trajectories Stoddard and Ford (1973), and cosmological expansion of the universe Diamandis et al. (1999).
Despite the variety of contexts in which the phenomenon occurs, literature on modeling stochastic exponential growth has predominantly focused on one model, Geometric Brownian Motion (GBM). Such processes appear naturally in the context of proportionate growth processes; for a general introduction to these processes and their applications see Mitzenmacher (2004). A stochastic process, , is said to undergo GBM if its time evolution obeys a Langevin equation with linear drift and linear multiplicative noise van Kampen (2007):
[TABLE]
in which is Gaussian white noise, traditionally interpreted in the Itô sense van Kampen (2007), with
[TABLE]
It is well-known that this process yields a log-normal distribution for , and the ratio of its standard deviation to mean diverges with time van Kampen (2007).
Recent studies have revealed a scaling law governing fluctuations during stochastic exponential growth of individual microbial cell sizes in between divisions: the mean-rescaled distribution is stationary and has finite variance at long times Iyer-Biswas et al. (2014a, b). This observation is inconsistent with the behavior predicted from GBM, in which the coefficient of variation (ratio of standard deviation to mean) diverges with time, and so the mean-rescaled distribution is not stationary van Kampen (2007).
Motivated by these observations, we seek a general phenomenological description of stochastic exponential growth, consistent with the time independence of the mean-rescaled distribution at long times. To this end we consider a stochastic Markovian growth process which is a generalization of GBM, in which the multiplicative noise term is given by a power-law with exponent :
[TABLE]
This model is well-studied in finance literature in the context of applications in option pricing Cox (1996); Lo et al. (2000). We find constraints on for which these dynamics are consistent with stationarity of the asymptotic mean-rescaled distribution. We show that the probability density of the mean-rescaled variable, , asymptotically approaches a time-independent distribution with finite variance provided . Thus we focus on the phenomenology of stochastic exponential growth for the class of models given by Eq. (3), with . For this class of models we provide analytical solutions for the time dependent dynamics, and use them to motivate a robust prescription for inferring model parameters (, , and ) from observed stochastic growth trajectories.
In Section II we comment on generic properties of a more general class of models than Eq. (3), namely, ones with a general form of the multiplicative noise term. Remarkably, a stable distribution for the mean-rescaled variable, , is found to exist in the long time limit for this general class of models. We derive a sufficient condition for finiteness of the variance of this asymptotic distribution, and use this constraint to motivate why we focus on the phenomenology of the class of models in Eq. (3). In Section III, we discuss analytical solutions and general properties of Eq. (3). We also examine in detail the limiting cases corresponding to and , and a special case, , which arises naturally from a microscopic model for stochastic exponential growth Iyer-Biswas et al. (2014a). Using the analytical solutions for Eq. (3), we find that a specific combination of the second and third central moments of the distributions is particularly sensitive to the exponent , and not to other model parameters. In Section IV we utilize this idea to provide a robust prescription for inferring the exponent from time series data, and test the prescription using synthetic data from numerical simulations of Eq. (3). Once has been determined, estimation of the parameters and is a straightforward statistical inference problem. Here we use the Maximization of Likelihood Estimation (MLE) approach of parametric statistical regression, and present its application to the models considered.
II Condition for Finite Variance of the Mean-Rescaled Distribution
To begin with, we consider properties of the mean-rescaled distribution for a general class of stochastic exponential growth models with multiplicative noise:
[TABLE]
We will assume the initial condition, , and an absorbing boundary at the origin, , to ensure that remains non-negative. The phenomenological model introduced in Eq. (3) corresponds to the subset of models with multiplicative noise, , of the form .
In the absence of noise , grows exponentially with time: . When fluctuations are present, the ensemble averaged value undergoes exponential growth with growth rate :
[TABLE]
This follows directly from Eq. (4) upon taking the expectation value and integrating over time. We remind the reader that Eq. (4) is to be interpreted in the Itô sense.
What can be said about the spread of stochastic trajectories, , around the expectation value given in Eq. (5), in the presence of the noise term in Eq. (4)? We are especially interested in the mean-rescaled variable , and its behavior in the long time limit, . Under general conditions111Assuming that the noise term satisfies appropriate regularity conditions which are required for the existence and uniqueness of the solution of the stochastic differential equation Eq. (4), it can be shown that is a continuous time, non-negative martingale. It is known that any non-negative supermartingale (martingale) is integrable. By the Doob’s martingale convergence theorem [37], it follows that the limit exists, in an almost sure sense. Furthermore, is also integrable. This implies that converges to a well-defined random variable in long time limit., approaches a limiting distribution in the long time limit. However, this result does not place constraints on the finiteness of the variance of the asymptotic distribution of . Motivated by experimental observations in Iyer-Biswas et al. (2014b, a), we are interested in possible functional forms of the noise term , for which the distribution of has a finite variance as .
To this end, we first find an expression for the variance of . Changing variables in Eq. (4),
[TABLE]
Using Itô’s lemma McKean (1969),
[TABLE]
The variance, , becomes equal to , since is a mean-rescaled variable and its average is . Thus, using Eq. (7), we have
[TABLE]
which implies
[TABLE]
Therefore, the mean rescaled distribution has a finite variance only if
[TABLE]
What are the possible functional forms of which satisfy the condition Eq. (10)? Evidently, any bounded function will satisfy this condition. Thus, any model of multiplicative noise for which the amplitude of the noise saturates for large will produce a finite variance distribution for in the large time limit.
The small-noise limit. In this limit, , we can approximate the expectation with the deterministic value,
[TABLE]
Therefore the condition on , for finiteness of variance of the asymptotic mean-rescaled distribution, can be simply stated as must fall off faster than in the limit , such that the integral in Eq. (10) converges as . This analysis suggests a more explicit condition on :
[TABLE]
Thus if the leading behavior of in the large limit is a power law, i.e., , then the mean-rescaled distribution can become stationary only for . Hence our focus on the phenomenological models of stochastic exponential growth given by Eq. (3).
In the following Section, before proceeding to give a general solution for models of the form in Eq. (3), we will contextualize the results by carefully considering the limiting cases, (constant additive noise) and (linear multiplicative noise, corresponding to GBM). In addition, we will also consider the (square-root multiplicative noise) case in detail, since it arises naturally in discrete stochastic models (for e.g., the stochastic Hinshelwood cycle Iyer-Biswas et al. (2014a), birth-death and Galton-Watson branching processes Ethier and Kurtz (1986)).
In relation to Eq. (12), the expected behavior of the asymptotic mean-rescaled distributions for these cases are summarized below.
- •
Constant additive noise, : since the noise has bounded intensity for this case, the asymptotic distribution of has finite variance. This distribution can be found exactly and is Gaussian (see Section III).
- •
Linear multiplicative noise, : this corresponds to geometric Brownian motion (GBM). Here the condition Eq. (12) is not satisfied, and thus, as explicitly derived in the following Section, the variance of increases without limit.
- •
Power-law multiplicative noise, , with : since the condition in Eq. (12) is satisfied for this choice of , the variance of the asymptotic mean-rescaled distribution is finite. We will show this explicitly in Section III.
III Power-law multiplicative noise: solutions and properties
As motivated in the preceding section, we now focus on the phenomenology and analytical solutions of stochastic exponential growth process with power-law multiplicative noise:
[TABLE]
To contextualize the general results, we first review the properties of the familiar special cases of this equation, i.e., in Section IIIA — C, before proceeding to the general solution in Section IIID. The constant additive noise case, , corresponds to a simple stochastic differential equation which is known to have a Gaussian solution Uhlenbeck and Ornstein (1930); van Kampen (2007). Linear multiplicative noise, , corresponds to geometric Brownian motion (GBM). This process was proposed as a model for risky asset prices by Samuelson Samuelson (1965), and was used by Black and Scholes in their pricing theory for equity derivatives Black and Scholes (1973). Square-root multiplicative noise, , corresponds to the square-root process Feller (1951); this is obtained as the continuous limit of discrete stochastic models such as the the stochastic Hinshelwood cycle Iyer-Biswas et al. (2014a). More generally, square-root processes arise in the context of birth-death and Galton-Watson branching processes Ethier and Kurtz (1986); Iyer-Biswas et al. (2009); Iyer-Biswas and Jayaprakash (2014); Iyer-Biswas (2009).
III.1 Constant additive noise,
The simplest model of stochastic exponential growth has constant amplitude noise, , and the corresponding stochastic differential equation has a simple Gaussian noise term:
[TABLE]
This immediately suggests that the dimensionless ratio,
[TABLE]
which can be identified as the variance of the mean-rescaled distribution, should be a key determinant of its shape (the subscript [math] for indicates that this corresponds to ). This dimensionless variable is the inverse Péclet number of the process, and as we will see, we will need to generalize the Péclet number (or its inverse) for different values of .
In the absence of a boundary condition at the origin, this process is a mean-repelling Ornstein-Uhlenbeck process Uhlenbeck and Ornstein (1930). We remind the reader that the Ornstein-Uhlenbeck process has the form , where corresponds to the mean-reverting case, and is the mean-repelling case. For both cases this is known to yield a Gaussian distribution. Therefore, the mean-rescaled distribution, , is also Gaussian, with variance given by
[TABLE]
with
[TABLE]
identifies the characteristic timescale of approaching stationary-state. Thus, the mean-rescaled distribution, , asymptotically approaches a time-independent Gaussian distribution with variance , for . By definition, the mean of the distribution is unity at all times.
In proximity of the origin this stochastic process can become negative. However, in real-world applications of the stochastic exponential growth process (e.g., the growth dynamics of bacterial cell sizes), the dynamical variable (e.g., the cell’s size) may be constrained to always remain positive. This can be addressed in the model by imposing an absorbing boundary condition at the origin, , to avoid negative values for the dynamical variable. We can compute the solution for , the probability density of at time given the initial condition at time , by using the method of images (see Appendix A for details of derivation). We obtain
[TABLE]
where
[TABLE]
Since the modified time, , approaches a finite limit () in the long time limit, it follows that approaches a stationary distribution asymptotically; this distribution is given by using in Eq. (18).
The probability of absorption at origin is
[TABLE]
The mean of is equal to , as expected, and the second moment is given by
[TABLE]
where . For , one has . This variance approaches a limiting value in the infinite time limit: .
III.2 Linear multiplicative noise (geometric Brownian Motion),
The case of corresponds to another simple model. Here the noise amplitude is proportional to the magnitude of the process, and so
[TABLE]
This process has been well studied and is known as geometric Brownian motion (GBM) van Kampen (2007). The solution for is
[TABLE]
where denotes standard Brownian motion.
Using Itô’s lemma, we find that the mean-rescaled process also follows geometric Brownian motion:
[TABLE]
The probability density of the mean-rescaled variable, , is given by
[TABLE]
The variance of is given by
[TABLE]
while the mean is always unity.
In the small time limit, , the variance increases linearly as , while in long time limit, , it increases exponentially as . Thus, for , the variance of does not approach a finite asymptotic value in the infinite time limit. The distribution of becomes increasingly concentrated near zero, as can be seen by computing the probability that is larger than an arbitrary value, .
[TABLE]
Here is the normal cumulative distribution function. This probability goes to zero as , for any . At the same time, the variance of increases without limit. The seemingly paradoxical conclusion that the distribution of becomes more and more concentrated near the origin while its variance grows without limit can be understood using the argument provided by Lewontin and Cohen Lewontin and Cohen (1969), who formulated it in the general context of the proportionate growth models Mitzenmacher (2004).
III.3 Square-root multiplicative noise,
Next we consider the special case, , which arises naturally in the context of phenomenological models corresponding to discrete stochastic exponential growth process Feller (2008). Rewriting Eq. (3) with a square-root multiplicative noise term, one has
[TABLE]
Processes with multiplicative square-root noise often appear in models of population genetics and in the Langevin description of birth-death processes Feller (2008). The properties of the stochastic differential equations with multiplicative square-root noise have been studied in detail by Feller in Ref. Feller (1951).
In the absence of the noise term this process describes exponential growth with a constant growth rate, . The noise term introduces fluctuations which have variance proportional to . The transition density can be read off from Ref. Feller (1951) (also see Equation (1.6) in Ref. Mazzon (2012)), and is given by
[TABLE]
where is the modified Bessel function of the first kind.
One can check by direct calculation that the transition density satisfies the following relations.
[TABLE]
The relation Eq. (29) encodes normalization of the probability while accounting for the non-zero probability of absorption at the boundary. As expected, in Eq. (30) we find that the mean grows exponentially with time, with growth rate, .
Using Itô’s lemma, we find that the second moment, , satisfies the following time evolution equation:
[TABLE]
Solving it with initial condition, , one has
[TABLE]
Thus the variance of is
[TABLE]
Once again, we can identify a dimensionless variance, a combination of parameters which determines the shape of the mean rescaled distribution, from the preceding equation:
[TABLE]
We can find the density of the mean rescaled variable directly from the distribution of given in Eq. (28). We present next an alternative approach, based on the method of the time change. This has the advantage that it is applicable also to the case of general exponent , which will be discussed next.
An application of Itô’s lemma yields the following stochastic differential equation for :
[TABLE]
with the initial condition . The time-dependent factor can be absorbed into a deterministic rescaling of time. To this end, we define the modified dimensionless time, , by
[TABLE]
which implies . In terms of , Eq. (35) can be written as
[TABLE]
The probability density of can be read off from Eq. (28):
[TABLE]
where
[TABLE]
The variance of is given by . As limit, approaches , thus the mean-rescaled distribution tends to a stationary distribution with variance equal to .
In the small noise limit, i.e., , the asymptotic probability density, , is sharply peaked around , and is well approximated as
[TABLE]
This relation follows from the general result, Eq. (38), upon using the asymptotic expansion of the Bessel function, : in the limit , it approaches .
III.4 General exponent
We have seen that for and , there are characteristic timescales equal to and , respectively, after which the mean-rescaled distributions become stationary. After this time, the shape of the distributions only depends on the dimensionless variables and . In this section, we first find the corresponding timescale for a general . Then, we find a dimensionless variable, , that determines the shape of the mean rescaled distribution at steady state. This dimensionless variable is a generalization of the inverse Péclet number, which is often used in transport literature Patankar (1980).
Let us start with the case of a power-law multiplicative noise of the form , with a general exponent, .
[TABLE]
This process appears in financial literature in the context of models known as Constant Elasticity of Variance (CEV) Cox (1996). We will show that its properties can be understood by relating it to a generalization of the square-root process () that we considered in the preceding section. The small-noise approximation result, Eq. (12), suggests that must be restricted to the range , in order to have a finite variance for in the large time limit. We will prove this constraint rigorously.
We would like to derive the probability density, , for the process defined by Eq. (3). It will prove more convenient to first derive the probability density of the mean-rescaled variable, , and subsequently obtain using
[TABLE]
Note that the properties of the mean rescaled distribution should depend on the two dimensionless parameters and . It would however simplify the algebra if we define the dimensionless variance as the following combination of these parameters:
[TABLE]
To obtain the probability density, , of the mean-rescaled variable, , we start by finding an SDE for the variable using Itô’s lemma:
[TABLE]
with the initial condition . Following the same approach as before, we absorb the time dependent factor into a deterministic time change. We define the modified dimensionless time, , such that . Thus,
[TABLE]
approaches as , provided that . The time scale for the mean rescaled distribution to approach its stationary solution is set by the exponent . This timescale itself approaches infinity as , hence a stationary mean-rescaled distribution is never attained in the GBM case.
With this change of time variable Eq. (43) reads:
[TABLE]
Equation (45) can be reduced to a square root process with constant drift by the following change of variable:
[TABLE]
Using Itô’s lemma, one has the process that follows:
[TABLE]
with the initial condition . Stochastic differential equations with square-root multiplicative noise have been studied in detail by Feller in Feller (1951); this paper considered the general case of SDEs of the form
[TABLE]
This is a generalization of the square-root process with an additional constant drift term. The process in Eq. (47) is obtained by the following substitutions:
[TABLE]
Feller Feller (1951) showed that the stochastic differential equation Eq. (48) has different qualitative behaviors depending on the relative strength and sign of the coefficients. The following three cases are important to distinguish between: (i) , (ii) , and (iii) .
For the problem we are interested in, only two of the cases in the Feller classification of solutions are relevant:
(i) : this corresponds to . The fundamental solution of the forward Kolmogorov equation corresponding to Eq. (47) is not unique. There are two independent fundamental solutions, and the problem is well-posed only if we add an additional boundary condition at , for example absorbing or reflecting boundary. With an absorbing boundary condition, the transition density is given by Eq. (50) below.
(ii) : this corresponds to . There is only one fundamental solution, given by
[TABLE]
This can be extracted from equation (1.6) of Mazzon (2012) by taking the limit. Substituting for and from Eq. (49), we obtain
[TABLE]
The probability of absorption at origin is found to be
[TABLE]
where is the upper incomplete Gamma function, defined as . The moments of the can be computed in closed form using the probability distribution function given in Eq. (51). As expected, the mean, , evaluates to unity. The next two moments are
[TABLE]
Here is the confluent hypergeometric function, is the function given in Eq. (44) and . It is straightforward to show that these equations reduce to the results derived in the previous section for the case. The moment relations Eq. (III.4) and Eq. (III.4) also hold for , provided that an absorbing boundary condition is imposed at .
Figure 1 compares the qualitative behavior (on a log-linear scale) of trajectories of the stochastic exponential growth processes described by Eq. (3), for different values of the two dimensionless variables which determine the dynamics, namely, and . The variance of the process is set by , while the exponent determines the relative stochasticity at short and long times. Small values correspond to more stochastic/deterministic behavior at short/long times, while for values around , the log-scale trajectories stay stochastic over a longer time.
The limit of small noise
In the large Péclet number limit, or small noise regime, i.e., when , the distribution of is sharply peaked around 1. The probability density, , given by Eq. (51), can be approximated using the approximation of the Bessel function for large argument:
[TABLE]
in which
[TABLE]
The exact result for the density Eq. (51) depends on the model parameters through the same combination which approximate the variance of the distribution:
[TABLE]
This is obtained from Eq. (III.4) using the asymptotic expansion of the confluent hypergeometric function of large positive argument
[TABLE]
For the small noise regime, the limit of the variance is simply our dimensionless variance, :
[TABLE]
which is finite for .
Itô vs. Stratonovich interpretations
For many practical applications, the asymptotic variance of the mean-rescaled distribution, which can also be interpreted as the square of coefficient of variation of the original variable, is expected to be small Iyer-Biswas et al. (2014b). Here, we show that in this region, for , there is no difference between the behavior of Eq. (3), and a similar equation where the noise is interpreted in the Stratonovich sense, i.e. ,
[TABLE]
We start by writing the Itô equivalent of Eq. (59):
[TABLE]
Let us first examine the limiting cases: For , the Itô and Stratonovich equations are exactly the same. For , the Stratonovich equation is the same as the Itô equation with . For any in between, the extra term added to the deterministic part of the equation decays as grows, and therefore, does not affect the dynamics of at long times. However, the short time dynamics can affect the long time distribution. This is where the small noise limit plays a role.
From the behavior of the two limiting cases, we know that the extra term is more problematic as we get closer to . But for , the additional term starts with a value strictly smaller than , and its expected value decays exponentially over time. Therefore, for , the dynamics of Eq. (59) and Eq. (3) are the same. In summary, one does not need to worry about the choice of Itô vs. Stratonovich schemes when phenomenologically modeling a stochastic exponential growth process.
Let us further quantify this statement by defining to be the relative error of choosing the incorrect prescription. Then we have
[TABLE]
In particular, the relative error is a bounded function of time and is proportional to .
When the system is not in the small noise limit, Itô and Stratonovich schemes are no longer equivalent. In this regime, we can still solve the Stratonovich problem by mapping it to an Itô problem. The general model in Stratonovich interpretation can be reduced by a change of variable to the case in Itô formalism, discussed in Section III.A. If we change of variables to , then we have
[TABLE]
Now the noise is additive and there is no distinction between Itô and Stratonovich interpretations, and so we can use the previously found solution to this model.
Possible generalizations of the models
We comment briefly on the assumptions of our model, and possible generalizations. The assumption of Markovian dynamics is a standard assumption in models of stochastic kinetics van Kampen (2007), and simplifies the analytical treatment. We also assume that the noise is modulated by the amplitude of the process being modeled . A more sophisticated treatment could take into account the stochasticity of the growth rate, , which could represent a stochastic environmental variable. A similar approach is taken in population dynamics modeling, where the analog of is an environmental variable, which might represent factors such as food supply or temperature Cohen (1976); Tuljapurkar and Orzack (1980); Cohen (2014). One model of this type in Ref. Pirjol (2015) considered a proportionate growth process where the growth multipliers are modulated by GBM. Interestingly, in this model the growth rate of the mean behaves discontinuously as model parameters cross a critical curve, similar to what is observed in a phase transition.
IV Statistical Inference of the Model Parameters
Our goal in this section is to use the analytical results provided in Section III to find a robust prescription for the determining the exponent and the other model parameters, in our case and from experimentally observed stochastic growth trajectories. We first, in Section IV.1, show that there is a ratio of the third moment and the second moment of the mean rescaled distribution that is only sensitive to the exponent and can be used to determine the . Then in Section IV.2, we discuss the use of maximum likelihood estimation to find the other parameters of the stochastic dynamics.
IV.1 Determination of
The results for the second and third moments of in Eq. (III.4) and Eq. (III.4) depend only on the two combinations of variables
[TABLE]
Experimental measurements of the two moments and the central third moment can be thus translated into a constraint for the two parameters in Eq. (63). This offers a possibility of determining the noise exponent parameter from the shape of the distribution.
As shown in Eq. (56), in the small-noise limit the variance of this distribution gives the parameter
[TABLE]
If the time (much larger than the characteristic growth time), then the variance determines .
Using Eq. (51) we computed the shape of the distribution for several values of , for several values of the variance . The results are shown in Fig. 2, where for each case we plot for (black), 0.75 (blue) and 0.9 (red). These plots show that the shape of the distribution is sensitive to .
In order to determine we propose the measurement of the ratio
[TABLE]
This is defined such that it is equal to 1 for , and its deviation from 1 is a measure of the difference of from .
In Figure 3 we show the theoretical prediction for for three values of the variance . This ratio can be computed explicitly as function of and as
[TABLE]
where the second and third moments are given explicitly in Eq. (III.4) and Eq. (III.4). The results are shown in Figure 3, which shows good discriminating value in , and little sensitivity to the variance of . Although the theoretical results in Eq. (III.4) and Eq. (III.4) are derived only for , we used them also for , in order to show that the ratio varies monotonically with through . As mentioned above, the theoretical results of Eq. (III.4) and Eq. (III.4) hold for all , provided that for , an absorbing boundary condition is imposed at
The insensitivity of the ratio to at low noise limit can be understood by expanding Eq. (III.4) and Eq. (III.4) using Eq. (57) in and calculating :
[TABLE]
showing why a linear relationship is observed in the small limit. Note that the exact results from Eq. (66) should be used to infer from distributions with non-zero variance, instead of the approximate linear relationship.
IV.2 Maximum-likelihood estimation
The method for determining described in the previous section uses only the distributional properties of the variable at a fixed time . It is also possible to constrain and determine the parameters of the stochastic process, namely, and , by using time-series data for the process.
The simplest estimation method for the parameters of a stochastic diffusion is the Maximum-Likelihood Estimation (MLE) method Bartlett (1955); Billingsley (1961). This has been also applied to the estimation of the parameters for a stochastic kinetic model Erdi and Lente (2014).
Here we provide a brief summary of the method, applied to the model at hand. Assume that we observe the stochastic process on a sequence of times . We would like to use the time series of observations to determine or constrain the parameters of the continuous-time diffusion which is assumed to have generated the observed data
[TABLE]
The coefficients of the process depend on certain parameters .
Define the log-likelihood function
[TABLE]
Here is the conditional density of given , obtained from the diffusion Eq. (68). The log-likelihood function depends on the parameters .
The MLE method determines the parameters by minimizing the log-likelihood function . This method is easy to use in cases where the conditional density appearing in Eq. (69) is known in closed form. This is the case for many popular one-dimensional diffusions, like geometric Brownian motion, the square root model Feller (1951), and the Ornstein-Uhlenbeck process Uhlenbeck and Ornstein (1930).
Explicit results in Eq. (28) and Eq. (41) for the transition density in the processes Eq. (27) and Eq. (3), respectively, given in the previous section, can be used to determine the parameters of these processes from time series of observations of the stochastic variable . We propose combining the MLE method with the approach based on a measurement of , defined in Eq. (65). Once has been determined according to the prescription in Sec. IV.1, the MLE will be used to infer the remaining parameters, . As an illustrative example, Appendix B gives a detailed application of this method to the estimation of the parameters of the exponential growth process with square-root noise.
V Summary
To characterize the phenomenology of stochastic exponential growth, we have considered a general class of Markovian stochastic models with power-law multiplicative noise described by Eq. (3). In this model trajectories start at at time and undergo exponential growth with rate constant subject to power-law multiplicative noise with strength and exponent . Since the expected value of this process is a simple exponential growth, , to understand the dynamics of the distribution, we only need to understand the dynamics of the distribution of the mean rescale variable . This paper provides an exhaustive study of the dynamics of this mean-rescaled distribution.
Starting with Eq. (3), we have shown that the dynamics of the mean-rescaled distributions depend only on the (dimensionless) exponent , and a time-dependent dimensionless factor (see Eq. (39)):
[TABLE]
The latter combines the effect of all the other variables, namely , , , and, . In the small noise regime, is approximately the time-dependent variance of the distribution. The timescale given by sets the time for the distribution to approach steady state. This timescale approaches infinity as , thus stationarity of the mean-rescaled distribution is never attained in the geometric Brownian motion (GBM) case (the case of ); this is a unique case for which the mean-rescaled distribution becomes independent of the initial condition . On the other hand, for , the mean-rescaled distributions approach stationary distributions with a finite variances at long times.
For , both the deterministic and the stochastic terms in Eq. (3) go to zero as which seems to suggest the presence of an absorbing boundary at . However, due to singularity of the multiplicative noise at the origin, for , the trajectories can leak through the origin, and therefore, one needs to artificially impose an additional boundary condition at the origin. The choice of boundary condition (absorbing or reflecting) affects the dynamics of the distribution in region. This is not the case for , for which is a natural absorbing boundary.
For , the dimensionless combination of parameters, , defined by , sets the asymptotic variance of the mean rescaled distribution and is the analogue of the inverse Péclet number. When , the dynamics are almost deterministic, and the distributions for all ’s are well-approximated with a narrow Gaussian around the mean. However, when or larger, the exponent determines the skewness of the distribution. We have identified a particular metric of skewness, namely, the ratio of the third central moment to the square of variance, which is sensitive to and very insensitive to . Thus, this measure can be used to infer the value of from experimentally measured distributions. Additionally, we have provided analytical expressions for transition probabilities (the probability density of at time , given ), and a prescription that incorporates these analytical expressions to infer the remaining model parameters from observed growth trajectories by using MLE methods.
VI Acknowledgments
We thank Rudro Biswas and Sasha Grosberg for insightful discussions. We acknowledge financial support from Purdue University Startup Funds and Purdue Research Foundation.
VII Author Contributions
SI-B conceived of and designed research; DP, FJ and SI-B performed calculations and simulations, and wrote the paper.
Appendix A Constant Noise with Absorbing Boundary: Method of Images
We start with Eq. (13). Recalling that , the application of Itô’s lemma gives that follows the diffusion:
[TABLE]
The time dependent factor can be absorbed into the time change
[TABLE]
simplifying Eq. (70) to
[TABLE]
In the absence of the boundary condition at , the solution of the SDE in Eq. (72) is a time-changed Brownian motion, shifted by a constant amount. Its distribution is a Gaussian which is centered at and has variance
[TABLE]
The absorbing condition requires that the probability density of at , for all . The solution of the Eq. (70) can be found by the method of images and can be constructed by superposition of two Gaussian distributions, centered around and respectively:
[TABLE]
where
[TABLE]
It is straightforward to show that the solution, Eq. (74), satisfies the absorbing boundary condition at for all . Figure 4 shows a plot of the density , and of its construction by the method of images as the difference of two Gaussian distributions. For an alternative derivation of this result, see Feller (1951).
Appendix B Details of application of MLE
We illustrate here an example of the application of the MLE method for determining the parameters and of the process
[TABLE]
from a time series of observations of the process on the sequence of times .
The log-likelihood function is defined in Eq. (69) and requires the conditional transition density function . For the square root noise process Eq. (76) this is given in Eq. (28), and for the general model it is given in (41).
The MLE method determines the model parameters by minimization of the log-likelihood function, , over the values of the model parameters . A judicious choice of the initial starting point for these parameters can speed up the minimization.
We start by considering the simpler case of the square root process . For this case we propose to choose the initial values such that the first two conditional moments are matched exactly. They are given by Eq. (30) and Eq. (32)
[TABLE]
This gives
[TABLE]
For general , the MLE method can be reduced to the estimation of the parameters of a square-root model by working with the time series for . We assume that has been determined from a measurement of the ratio , as explained in Sec. IV.1. The MLE method will be applied to determine the growth rate and noise parameter , for known .
For this case it is convenient to write the log-likelihood function in terms of the variables, for which the transition density is known from Eq. (50) and has a simpler form.
A first estimate for can be obtained for general again from matching the first conditional moment (77), which gives the estimator in (79). The analog of the relation (78) is given by
[TABLE]
with given by (III.4) with the replacement . This gives a complicated non-linear equation for , which has to be solved to obtain .
A simpler approach starts with the relation
[TABLE]
with . This follows from the diffusion (47) for . From this equation we obtain the conditional moment
[TABLE]
If , we have to a good approximation . We obtain the following simple estimator for
[TABLE]
This estimator is expected to be useful when is not too close to . For this case the numerator and denominator are small, and the estimator may be unstable numerically.
In order to illustrate the application of this method, we generated sample data by simulating the SDE of the square-root process using an Euler discretization in time. For this simulation we generated values on a time grid with time step . The model parameters assumed are and starting point . Figure 5 shows the sample path for this data.
For the example data set considered here the initial estimators Eq. (79), Eq. (80) are and .
Minimization of the log-likelihood function over and with the starting point gives and . These values are close to the actual parameter values and . The error of the determination decreases with , the number of the data points. See Bartlett (1955); Billingsley (1961) for error estimates, and the asymptotics of the MLE errors.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ginzburg et al. (1982) L. R. Ginzburg, L. B. Slobodkin, K. Johnson, and A. G. Bindman, Risk Anal. 2 , 171 (1982) . · doi ↗
- 2Keiding (1975) N. Keiding, Theor. Popul. Biol. 8 , 49 (1975) . · doi ↗
- 3Lewontin and Cohen (1969) R. Lewontin and D. Cohen, Proc. Nat. Acad. Sci. 62 , 1056 (1969) .
- 4Iyer-Biswas et al. (2016) S. Iyer-Biswas, H. Gudjonson, C. S. Wright, J. Riebling, E. Dawson, K. Lo, A. Fiebig, S. Crosson, and A. R. Dinner, “Bridging the time scales of single-cell and population dynamics,” (2016), ar Xiv:1611.05149 .
- 5Everett and Ulam (1948) C. Everett and S. Ulam, Proc. Nat. Acad. Sci. 34 , 403 (1948) .
- 6Birkhoff and Varga (1958) G. Birkhoff and R. Varga, J. Soc. Ind. Appl. Math. 6 , 354 (1958) . · doi ↗
- 7Iyer-Biswas et al. (2014 a) S. Iyer-Biswas, G. E. Crooks, N. F. Scherer, and A. R. Dinner, Phys. Rev. Lett. 113 , 028101 (2014 a) . · doi ↗
- 8Tsuru et al. (2009) S. Tsuru, J. Ichinose, A. Kashiwagi, B.-W. Ying, K. Kaneko, and T. Yomo, Phys. Biol. 6 , 036015 (2009) . · doi ↗
