Timing and Shape of Stochastic Autocatalytic Burst Formation
Alastair Jamieson-Lane, Eric N. Cytrynbaum

TL;DR
This paper investigates how stochastic noise influences the timing and shape of burst formation in autocatalytic systems near a saddle-node bifurcation, revealing pattern nucleation driven by microscopic fluctuations.
Contribution
It applies Large Deviation Theory to characterize burst shape and timing in spatially extended stochastic autocatalytic systems near bifurcation points.
Findings
Burst shape and timing scale with noise amplitude
Spatial nucleation of peaks occurs randomly across the domain
Microscopic noise induces macroscopic pattern formation
Abstract
Chemical, physical and ecological systems passing through a saddle-node bifurcation will, momentarily, find themselves balanced at a semi-stable steady state. If perturbed by noise, such systems will escape from the zero-steady state, with escape time sensitive to noise. When the model is extended to include space, this leads to different points in space "escaping from zero" at different times, and uniform initial conditions nucleate into sharp peaks spread randomly across a nearly uniform background, a phenomena closely resembling nucleation during phase transition. We use Large Deviation Theory to determine burst shape and temporal scaling with respect to noise amplitude. These results give a prototype for a particular form of patternless symmetry breaking in the vicinity of a stability boundary, and demonstrates how microscopic noise can lead to macroscopic effects in such a region.
| Equation | ||
|---|---|---|
| 0 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Timing and Shape of Stochastic Autocatalytic Burst Formation
Alastair Jamieson-Lane
Department of Mathematics,
University of British Columbia
&Eric N. Cytrynbaum
Department of Mathematics,
University of British Columbia.
Abstract
Chemical, physical and ecological systems passing through a saddle-node bifurcation will, momentarily, find themselves balanced at a semi-stable steady state. If perturbed by noise, such systems will escape from the zero-steady state, with escape time sensitive to noise. When the model is extended to include space, this leads to different points in space “escaping from zero” at different times, and uniform initial conditions nucleate into sharp peaks spread randomly across a nearly uniform background, a phenomena closely resembling nucleation during phase transition. We use Large Deviation Theory to determine burst shape and temporal scaling with respect to noise amplitude. These results give a prototype for a particular form of patternless symmetry breaking in the vicinity of a stability boundary, and demonstrates how microscopic noise can lead to macroscopic effects in such a region.
1 Introduction
The universe we live in does not consist of a perfectly uniform cloud of hydrogen gas, nor is the earth a perfect sphere. Clouds condense into droplets of rain [McGraw and Liu(2003)], a single zygote divides and differentiates into a multitude of different cell types, a flipped coin lands either heads, or tails. All around us, symmetry is broken every day.
Depending on the system of interest, symmetry can be broken in either a patterned or unpatterned manner. Pattern formation is ubiquitous in nature, from a Zebra’s strips [Turing(1952)] to Tiger bush [Lefever and Lejeune(1997)]. Patterned symmetry breaking in reaction diffusion systems is well understood, and was first studied by Alan Turing in the 1950s [Turing(1952)].
In nature we also observe patternless symmetry breaking, in for example the nucleation of crystals[Erdemir et al.(2009)Erdemir, Lee, and Myerson] or rain drops[McGraw and Liu(2003)]. This patternless symmetry breaking is less studied in a mathematical context, and is the focus of this article.
Our present work was motivated by intriguing experimental results in a recent study of the Min cell division system in E.coli [Vecchiarelli et al.(2016)Vecchiarelli, Li, Mizuuchi, Hwang, Seol, Neuman, and Mizuuchi]. In their study, Vecchiarelli et al. observed that Min proteins demonstrate either patterned or unpatterned symmetry breaking, depending on the concentration conditions. For high concentrations of the relevant Min proteins, spirals and linear waves are observed, with each of these patterns possessing their own well studied mathematical frameworks [Murray(2008)]. For lower concentrations, Vecchiarelli et al’s experiment also demonstrates “burst” behavior, whereby the local concentration of Min proteins increases for spatially localized patches of membrane, and then reduces again shortly thereafter (see figure 1). These bursts appear to arise from a relatively homogeneous background, and in contrast to the ordered high concentration patterns described above, they are positioned randomly across the membrane, and appear governed by low level particle scale noise.
In this paper, we demonstrate how dynamical systems in the vicinity of a semi-stable steady state are sensitive to low level white noise, giving rise to similar burst formation behavior. We consider a “canonical” stochastic PDE model of such a system
[TABLE]
and use Large Deviation Theory to predict both the time of burst formation, and the shape of the burst formed, in the limit of low amplitude noise.
We consider a particular reaction-diffusion system from the literature in Sect. 2, and use simulations to demonstrate the occurrence of burst formation in appropriate parameter regimes. In Sec. 3, we use Large Deviation Theory to predict the time taken for the stochastic ODE to evolve from to some fixed value , and determine the most probable path for this occurrence. Sect. 4 extends this work to systems with one spatial dimension via the addition of a diffusion term (as is 1). We use a mixture of numeric and analytic techniques to determine both the shape and time taken for bursts to form. In Sec. 5, we compare our analytic results both to one another, and to the corresponding simulations, and in Sec. 6 we discuss the conceptual difficulties preventing our analysis from being extended to higher dimensions. Finally, in Sec. 7 we summarize the results, and discuss limitations, physical relevance, and future directions for research. The Appendices contain both an introduction to Large Deviation Theory for those unfamiliar with the topic, along with more detailed calculations excluded from the main discussion.
2 Computational Exploration
Before delving into analytical results, we take a brief computational detour to demonstrate the general behaviour of the class of neutrally stable systems that form the focus of this paper.
Consider the following equation due to Mori et al. [Mori et al.(2008)Mori, Jilkine, and Edelstein-Keshet]
[TABLE]
over the unit square in with periodic boundary conditions. For the sake of these simulations, we use , and diffusion coefficient .
Equation (2) was designed to model a system of cell signaling proteins called Rho GTPases. The model was originally used to demonstrate wave pinning, a behavior reminiscent of the “cell polarization” behavior associated with Rho proteins.
In this equation , , , and are reaction and diffusion parameters, is the concentration of membrane bound Rho protein and is the concentration of Rho diffusing quickly through the bulk of the cell. As is varied, the above system gains and loses steady states via a pair of saddle-node bifurcations (see figure 3). Let and denote the values of and at the second of these saddle-node bifurcations, in which the lower stable steady state is annihilated. In this region .
Simulating with fixed , and additive noise of the form , leads to a long nearly homogeneous period where for all , followed by the abrupt formation of several “spikes”, which then saturate and spread across the domain. (See figure 3 A)
By contrast, when we select some fixed greater than the system remains approximately homogeneous as it tend towards this -saturated state. For less than , the system instead approaches some homogeneous low state. In either of these latter two cases, no significant patterning or heterogeneity is observed. (See figure 3 B & C)
Similar results may be observed for a variety of spatially distributed systems in the vicinity of saddle-node bifurcations. In order to consider this behavior in more generality, we consider a variety of “cannonical” equations, each demonstrating different aspects of the behavior of interest in the vicinity of a generic saddle-node.
3 Escape time from steady states; spaceless case
Let us begin our discussion by considering first the spaceless stochastic differential equation
[TABLE]
Here is our stochastic process of interest, is some small noise amplitude parameter, and is assumed to be Gaussian white noise. We assume noise as defined by Walsh [Walsh(1986)], such that the integral over any time interval is given by , and integrals over non-intersecting time intervals are independent. The extension to spatio-temporal white noise (also discussed by Walsh) makes use of higher dimensional integration. Equation 4 provides a canonical example of a spaceless noisy system balanced at a saddle node, and provides a toy model on which to build our understanding before moving to the spatial case.
Deterministically, trajectories starting at remain stationary, and never reach positive . However, any noise added to the system is enough to disturb this equilibrium and allow to escape. We would like to determine the amount of time it takes our stochastic process to travel from the semi-stable steady state to some positive constant . The appropriate tool for this task is Large Deviation Theory.
Large Deviation Theory (LDT) is a theory used to study unlikely noise driven events in stochastic systems (often, but not always SDEs). For the interested reader, we give a brief introduction to LDT in Appendix A. For those who wish for more detail, we recommend Freidln and Wentzell’s “Random Perturbations of Dynamical Systems” [Freidlin and Wentzell(2012)], and Rassoul-agha and Seppalainen’s “A Course on Large Deviations With an Introduction to Gibbs Measures”[Rassoul-agha and Seppalainen(2015)]. Here we present only the key results of LDT - those concepts and theorems critical to our present interest.
The central premise of LDT is that when unlikely events occur, they are overwhelmingly likely to occur via a path “close to” the most probable path. The most probable path, , is found by minimizing the “normalized action functional”
[TABLE]
For any given SPDE, can be re-written as a function of . For example, when studying eq. 4 we can rearrange to isolate and find
[TABLE]
Minimizing allows us to determine the most probably path for a given event, and having identified this part we are able to approximate the probability of an event occurring by a given time using either
[TABLE]
to estimate the expected time or
[TABLE]
to estiamte the probability distribution.
These formula are given as theorem 4.1 and theorem 1.2 (respectively) in chapter 4 of Freidln and Wentzell[Freidlin and Wentzell(2012)].
We will make use of both of these theorems throughout this paper, however, because for many of the examples discussed, eq. 8 will prove to be the more helpful of the two. This is because we are in the slightly unusual position of studying escape from an energy plateau, as opposed to an energy well, as might be more usually studied.
Suppose we wish to study the probability of passing from to by time , that is to say the “escape from zero” problem for equation 4. In order to use either of the above formula, we must find the minimum of . By the calculus of variations, any minimizer of must satisfy
[TABLE]
and hence
[TABLE]
For we recover the deterministic solution . This solution indicates infinite travel time to in the case where (our case of interest). For , the integral can be solved using wolfram|alpha [Wolfram|Alpha(2018a)], determining in terms of . Given that the function itself provides limited illumination, and is defined in terms of the ellipticF function (itself defined in terms of an integral), we will refer to solutions of 10 simple as , the inverse of .
In order to match our assumed boundary conditions, we must pick such that . This is not generically an easy problem to solve exactly, but for large it can be well approximated. In order to determine , in this limit, we first note (as given by wolfram|alpha [Wolfram|Alpha(2018b)]). Here is the Gamma function [Artin(2015)], and takes the value . The deterministic time taken to get from to is , thus . Rearranging gives
[TABLE]
Numerical experimentation indicates that, for this choice of , whenever , indicating that our approximation of is very good.
Having determined , we are able to plot for arbitrary and (see figure 4, Left).
In order to invoke either eq. 7 or eq. 8, we must must determine along this action minimizing path. Starting from the definition of (equation 6), it is possible to show that in our case
[TABLE]
Details are given in appendix B. Remembering from equation (11) that we find .
This analytic result agrees with numeric results found by approximating directly for a variety of different values.
Because as , we see that eq. 7 gives no information. This is a reflection of the fact that need not overcome any energy barrier in its path from [math] to , but merely an energy plateau.
We can however apply equation 8 to find
[TABLE]
for small .
[TABLE]
Hence we find that time taken for a solution to to escape from zero and approach infinity is predicted to scale like . The majority of this time is spent close to , and the escape time behavior of the system is dominated by the behavior of the system in this region; the exact value of has negligible impact, so long as is selected such that . Simulation of equation 4 for a variety of agrees with these asymptotic results (see figure 4, Right).
3.1 Systems near a saddle-node bifurcation
The above work describes the behaviour of the system precisely balanced at a saddle-node bifurcation. For the sake of completeness, we might also consider systems in the vicinity of such a bifurcation.
The system
[TABLE]
is a canonical example of a system which, for small , has just lost its steady states. In the deterministic limit , this system admits solutions of the form , which gives travel times of order .
The system
[TABLE]
is a canonical example of a system with a stable/unstable pair of steady states. Assuming in this case that (the stable steady state), and , it is possible to show that the action minimizing path obeys . In the limit of large , we can select , and find .
By eq. 7 we thus have .
In addition to the above static results, the behavior of systems passing through saddle node bifurcations (either with or without noise), has also been studied. A review of this “delayed bifurcation theory” is given by Christian Kuehn [Kuehn(2011)]. For a more detailed introduction, see Berglund and Gentz [Berglund and Gentz(2006)]. We now move on to consider the previously unexplored spatially distributed problem.
4 The Spatially Distributed Problem
Now that we have built up our understanding using the spaceless model, let us turn our attention to the spatially distributed model. Our primary interest in the spatially distributed case is considering the behavior of systems at a saddle-node bifurcation. Spatial systems of this form can be represented via the canonical equation
[TABLE]
Here, both and are functions of and . We assume that starts at the steady state; . Here represents our white noise term, although it may be thought of as a forcing function, with an associated energy .
We wish to describe the behavior of the system as it ‘escapes’ from the steady state at zero. We assume (without loss of generality) that takes its maximum at , and ask “for a given time , what is the most probable path such that , where for all ?”
For the sake of comparison, we will also consider escape from a linear unstable stead state,
[TABLE]
While less mathematically interesting, this linear escape problem gives us an analytically tractable case to explore, along with something to compare our non-linear results to. We begin with this simpler case.
4.1 Linearly Unstable Case
In what follows we present only the most important results, and the conceptually important steps leading to these results. Details can be found in Appendix C. In order to understand spike formation in equation (21), we must minimize the associated “action functional”:
[TABLE]
subject to the conditions and . As per standard functional analysis techniques, this minima can be found when the functional derivative . This condition can be written as a single differential equation with double the number of derivatives with respect to each variable, but is more conveniently written as a coupled pair of PDEs:
[TABLE]
This system admits the explicit solution:
[TABLE]
where and .
The associated action functional can be shown to be .
Further analysis and discussion of these results we postpone until section 5.1.
4.2 Semi-stable Case
We now consider the non-linear case, (equation 20). As previously, in order to determine the dynamics of spike formation, we concern ourselves primarily with determining the most probable path to spike formation. We present here only the conceptually important milestones in our calculations. Details are similar to those given in Appendix C.
[TABLE]
subject to the constraints and , we find that the minimizer satisfies
[TABLE]
Unlike the linear case, where could be solved independently of , and then used to determine explicitly, here no such analytic solution is available. and are inextricably coupled. Solving 27 numerically in either direction in time requires that we solve the ill posed backwards heat equation, either for or , depending on which direction we solve in. Creating a full spatial-temporal mesh and solving for both and simultaneously is possible, but quickly becomes computationally expensive for finer meshes. We can, however, solve iteratively. This is done by initially assuming , and solving
[TABLE]
with the boundary condition , to determine . We use as a forcing term to find , and to determine and so on, until . (See figure 5). Using this iteration scheme, we are able to determine and for the action minimizing profile. The value of in each iteration step is found using a bisection method in which is reduced if and increased if .
5 Comparisons of results
5.1 Comparison Between Results for Linear and Non-Linear Case
Now that we have determined the action minimizing profile in both the linear and non-linear case (equations (21) and (20), respectively), let us now compare these two models.
To begin, we would like to determine the time till burst formation in each model. For the sake of notation, let us define . While under normal circumstances we would find the expected escape time by invoking equation 7, in all cases studied here, as . Hence equation 7 provides us with no information, and we instead rely on equation 8.
Regardless of the particular system under study, when equation 8 implies and hence . This in turn implies that the probability density of burst formation for such values of is low. Similarly, when we have and hence , and once again the probability of escape is negligible, as escape has almost certainly already occurred. Escape can only have non-negligible probability density when is neither too big nor too small, namely when and are similar orders of magnitude.
This implies that for the linear model escape is predicted when , or equivalently when . The time of escape changes little, even as is varied over several orders of magnitude. In the nonlinear case we observe that as (see figure 6). By the above arguments, we predict ; the time till burst formation is sensitive to the amplitude of the noise driving the system. Table 1 summarises these timing results, and compares to the previously discussed spaceless models.
Far more interesting than the difference in escape times for the linear and quadratic case is the substantial difference in the profile of our action-minimizing . In the linear case, was found explicitly (equation 25). In this case is well approximated by a corresponding normal distribution , and becomes wider over time (see figure 7, upper). In contrast, in the quadratic case, the final width of increases at most very slowly, and appears to approach a limiting distribution as (see figure 7, lower).
As a final point of comparison, we consider the reaction of the two systems to changes in . As might be expected, the shape of the action-minimising profile is unaffected by changes in for the linear system. Doubling for a given doubles both and , but does not change the shape of either. In contrast, in the non-linear case, the profile of is sensitive to changes in , with larger leading to a sharper “spike” solution, and smaller leading to soft “bumps” (see figure 8).
5.2 Comparison of Analytic Results to Direct Simulations
As a final check on the above analytic results, we compare to simulations of the corresponding systems.
In the linear case (equation 21), we find that the time until scales like , as predicted by theory (see figure 9, left). In the non-linear case (equation 20), burst time is reasonably approximated by as predicted by theory, but the agreement is weaker than might be hoped (see figure 9, right). It is unclear if this disagreement stems from the limitations of our simulations, or the numeric-analytic arguments proposed earlier.
Comparison of shape in the linear case finds reasonable agreement in the vicinity of the primary burst itself, however because a large number of bursts form simultaneously, the final profile of ends up being an overlapping combination of many different such bursts (see figure 10,left).
In contrast, comparison between the predicted and observed shape of bursts shows strong agreement in the non-linear case, even for relatively large values of (see figure 10,right). Bursts spend a greater portion of their development time close to zero in the non-linear case, and hence, although many bursts may begin growing at roughly the same time, the second and third place bursts will have only a minor effect on the overall profile of .
The code for all simulations can be found in the supplementary materials.
6 Difficulties in 2D, and higher dimensions
Given that the universe we occupy is not one dimensional, it would be beneficial to extend the above results to higher dimensions, preferably three dimensions, although for certain contexts, such as the binding of proteins to a cell membrane[Vecchiarelli et al.(2016)Vecchiarelli, Li, Mizuuchi, Hwang, Seol, Neuman, and Mizuuchi], two dimensions would suffice.
Unfortunately, the properties of white noise preclude this avenue of investigation; as noted by Ryser et al [Ryser et al.(2012)Ryser, Nigam, and Tupper], equations of the form do not remain well posed in dimension two or higher. The smoothing effects of diffusion are insufficient to restrain the irregularity produced by space-time white noise.
This result can be seen through a variety of lenses, depending on ones outlook. Ryser et al, describe in terms of a sum of Fourier modes, and show
[TABLE]
In one dimension, when this sum converges, however for two or more dimensions, and the sum is unbounded for all . This indicates that is not well defined. Simulations in 2d, such as those presented in section 5.1 function for fixed , but invariably break down as .
In terms of the methodology used in this paper, the degeneracy of higher dimensions manifests as a break down of various integrals. Issues arise regardless of what equation is used, but the problem is most easily illustrated by considering the linear case. Consider the solution of eq. 21 given in eq. 25. The parameter of eq. 25 is determined by considering the boundary condition ,
[TABLE]
where .
In 1D , and hence . Because the integral converges to a finite number, we can rearrange to determine and go on to find explicitly for any .
By contrast, in 2D we have , and hence, . The integral is unbounded, and thus we are able to select arbitrarily small. This in turn allows both and to be selected arbitrarily close to zero, regardless of . Physically, we are picking to be a spike that is sufficiently narrow so as to ensure arbitrarily small, yet also sufficiently high so as to force from zero to in time . This sleight of hand is impossible in one dimensional systems, but for all higher dimensions it is the “action minimizing” strategy.
While here we have described the difficulties of integration in the linear case, the problems described extends to all equations that include noise terms in 2D, and become progressively worse for higher dimensions. They are a symptom, rather than the root cause of the problems of noise in higher dimensions.
7 Conclusions, Discussion and Future work
In this article we have presented a number of prototypical models exploring the behavior of systems starting at unstable and semi-stable steady states.
We have demonstrated both analytically and through simulations that under the influence of spatio-temporal white noise, systems of the form develop well defined “spikes” in finite time. This phenomena can be seen as delayed bifurcation [Kuehn(2011)] in a spatially distributed system, and also as a compliment to non-linear blow up phenomena, in which we consider instead the system’s “escape from zero”, as opposed to the more typically studied “approach to infinity” [Vázquez and Galaktionov(2002)].
The symmetry breaking observed here does not rely on multiple chemical species, nor on contrasts in diffusion rate, as might be expected for the more classical Turing type pattern formation.
We were able to determine both the “energy cost” of spike formation, along with the shape of the most probable spike, and showed that in the limit of large time, spike formation requires infinitesimally small energy. The sensitivity of the system in the vicinity of its semi-stable steady state allows noise of amplitude to have macroscopic effects in O() time. We also explored similar results both in the spaceless case, and in the case where our state is linearly unstable as opposed to semi-stable.
In terms of physical relevance, the systems studied can be best thought of as a prototype for chemical reaction-diffusion systems in the vicinity of a saddle-node bifurcation. Our study demonstrates the breakdown of a homogeneous state, and can be seen as a first step towards understanding systems such as the Min system studied by Vecchiarelli et al [Vecchiarelli et al.(2016)Vecchiarelli, Li, Mizuuchi, Hwang, Seol, Neuman, and Mizuuchi], in which changes in the bulk concentration of a protein pushes the system through a bifurcation boundary and leads to the formation of membrane bound “bursts”.
There are a number of further questions which must be answered before the work here can be applied to any experimental context. First and foremost, the work here considers a system perfectly balanced at the saddle-node; . No physical system however is ever so perfectly balanced, and so determining the robustness of these results for a system that is passing through such a saddle node bifurcation is critical to our understanding of the relevance of these results. Further in order for the results here to prove useful for experimental scientists, time must be invested in investigating the mapping between experimental observations, and the associated reaction and diffusion parameters implied; for example, does the relationship between spike width and height give a reliable signature that can be used to determine system parameters?
Finally, our work here alludes (in passing) to potentially deeper questions in chemistry; namely the difficulties in standard representations of noise when in higher dimensions. Such questions must be answered, or at the very least sidestepped, before the phenomena observed here can be sensibly applied to the multi dimensional systems ubiquitous in the real world.
8 Code
The Code used in this project is available on github at alastair-JL/StochasticBurst.
9 Acknowledgements
We wish to acknowledge Anthony Vecchiarelli, for stimulating discussion on the topic of Min proteins, and sharing his data. This research was funded by the four year fellowship from the university of British Columbia, and NSERC.
Appendix A Introduction to Large Deviation Theory
Here we give a very brief introduction to the principle ideas and techniques used in Large Deviation Theory (LDT), and used throughout this paper. In this appendix we present LDT as it applies to discrete time Stochastic processes. The continuous time case is conceptually similarly. Our goal here is to build intuition rather than mathematical rigor.
As a concrete example with which to frame our discussion, consider the Discrete Orenstien-Ulembeck process[Larralde(2004)]
[TABLE]
Here is assumed to be a normal random variable with mean zero and variance one, such that each is independent. We assume .
Suppose we wish to know the first time that . At first glance, given noise, and the decay term of our OU process, seems unlikely to occur. That said, as we take , unlikely, even exceedingly unlikely events should occur eventually. LDT concerns itself with such questions as “how long will it take for X to occur?” and “given that X occurs, what path is the system most likely to take in order to get there?”. In our particular case X is “”.
The total probability of our event is equal to the integral over the probability density of all paths leading to that event. This integral can be formulated either in terms of the paths of the stochastic process , or in terms of the underlying noise .
[TABLE]
The probability density function for a noise vector of length is
[TABLE]
Unfortunately, even with this well defined probability density, the boundary of the integral is generically complicated enough so as to prevent us from evaluating directly (the OU process being a notable exception).
In order to avoid this complex integration boundary, it is useful to transform our integral back into a form dependent on . To achieve this we rearrange our recurrence relation (equation 29), and find
[TABLE]
Using this one-to-one correspondence between and along with the change of random variables formula, we find
[TABLE]
Here is said to be the “normalized action functional” of our problem. In the particular case discussed here . In general depends both on the equation governing a stochastic process, and the particular form of the noise generating it. can be thought of as a measure of the “total improbability” associated with a given path, and is associated with the amount of “energy” that noise must pour into the system in order to cause a particular path to occur. When dealing with continuous systems, is defined as an integral over the square of noise, rather than a sum.
At this stage, in order to determine the probability of our event, we need to evaluate . Typically, this integral can not be evaluated exactly, but it can be well approximated via Laplace’s Principle [Laplace(1986), Olivieri and Vares(2005)].
Laplace’s Principle states that:
[TABLE]
where here we minimize over all . Laplace’s principle is based on the idea that for integrals of the from , the overwhelming majority of all probability mass is concentrated in the vicinity of whenever , (see figure 11). From the point of view of stochastic processes, what Laplace’s Principle is effectively stating is that if an improbable event does occur, the observed path is overwhelmingly likely to be ‘close’ to the most probable path.
Let us return to the original question proposed at the start of this section: how long does it take before the Discrete Orenstien-Ulembeck process defined in equation 29 exceeds one for the first time?
Suppose we wish to determine the probability that , for some particular , assumed to be large. By Laplace’s Principle, we must thus minimize . Because any will only increase we can assume . In order to minimize, we require that and for all . As , solutions to the above can be well approximated by . Substituting back into the definition gives , and hence . If this is the probability of success for any particular large , then we infer that the expected time until will scale such that .
Freidln and Wentzell[Freidlin and Wentzell(2012)] provide the general formulation for the above two results. They state (chapter 4, theorem 1.2) that for small epsilon the probability of a particular rare event occurring by a given time is governed by:
[TABLE]
Here, as is customary, denotes a particular trajectory of and indicates that we are minimising over all trajectories such that our rare event occurs by time . This theorem is referred to as equation (8) in the main text.
Appendix B Derivation of analytic expression for in the spaceless case.
Here we present the derivation of equations 13. We begin by taking the definition of and expanding. From eq. 9 we can show and hence:
[TABLE]
The problematic term here is the first integral. Applying integration by parts gives
[TABLE]
By eq. 10 this last term is . Substituting this result back in gives.
[TABLE]
hence recovering equation 13.
At no point in this derivation have we used the approximation of given in eq 11, hence this is an analytic result.
Appendix C The linearly unstable case; full calculation
Here we provide the detailed algebra suppressed in section 4.1. Consider the minimization problem:
[TABLE]
This is equivelent to equation (26) of the main text.
If we consider as a form of forcing function, then the above asks for the lowest energy forcing required to lift to height . Thinking of as a form of noise we are asking for the most probable noise. For the rest of this appendix we will think of in terms of energy input, although we will keep the white noise formulation in mind. Please note that while both and are stochastic processes, in a slight abuse of notation we also use the symbols to indicate the optimal path subject to the above constraints.
In order to find our minima, we first re-write in terms of , and then take the functional derivative:
[TABLE]
[TABLE]
Now using integration by parts on each term, as appropriate, we find
[TABLE]
Because we seek to minimise , we can safely demand that as and thus . We are left with the potentially problematic term, however, since is fixed at zero for and at for we know (our perturbation to ) is zero in these location. Further, we can assume that for , as any non-zero forcing at these locations would increase , but have no effect on . Hence , and so
[TABLE]
The above can be stated as a fourth order differential equation purely in terms of , but this provides little illumination. Instead we leave it as a backwards heat equation, ready to be coupled to C.
As argued previously, for our optimal solution whenever , as non-zero values of in these locations increase but have no effect on . In order for to have any effect on it must have non-zero total mass, and since we know that it must have non-zero total mass at time . Hence for some .
Combining the above, the minimal noise carrying from [math] to must solve:
[TABLE]
This recovers equation (27) of the main text.
Solving directly gives
[TABLE]
Where
[TABLE]
The noise scaling can be found using our boundary condition:
[TABLE]
In order to find the normalized action functional we find:
[TABLE]
Escape is predicted to occur when . For all but the smallest values, this can be considered to take place in time.
Conceptually similar calculations can be used to get from equation (26) to (27) in the non-linear case.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Artin(2015)] E. Artin. The Gamma Function . Courier Dover Publications, Jan. 2015. ISBN 978-0-486-80300-5. Google-Books-ID: c 3R 2Bg AAQBAJ.
- 2[Berglund and Gentz(2006)] N. Berglund and B. Gentz. Noise-Induced Phenomena in Slow-Fast Dynamical Systems: A Sample-Paths Approach . Probability and Its Applications. Springer-Verlag, London, 2006. ISBN 978-1-84628-038-2. URL https://www.springer.com/gp/book/9781846280382 .
- 3[Erdemir et al.(2009)Erdemir, Lee, and Myerson] D. Erdemir, A. Y. Lee, and A. S. Myerson. Nucleation of Crystals from Solution: Classical and Two-Step Models. Accounts of Chemical Research , 42(5):621–629, May 2009. ISSN 0001-4842. doi: 10.1021/ar 800217 x . URL https://doi.org/10.1021/ar 800217 x . · doi ↗
- 4[Freidlin and Wentzell(2012)] M. I. Freidlin and A. D. Wentzell. Random Perturbations of Dynamical Systems . Grundlehren der mathematischen Wissenschaften. Springer-Verlag, Berlin Heidelberg, 3 edition, 2012. ISBN 978-3-642-25846-6. URL //www.springer.com/gp/book/9783642258466 .
- 5[Kuehn(2011)] C. Kuehn. A mathematical framework for critical transitions: Bifurcations, fast–slow systems and stochastic dynamics. Physica D: Nonlinear Phenomena , 240(12):1020–1035, June 2011. ISSN 0167-2789. doi: 10.1016/j.physd.2011.02.012 . URL http://www.sciencedirect.com/science/article/pii/S 0167278911000443 .
- 6[Laplace(1986)] P. S. Laplace. Memoir on the Probability of the Causes of Events. Statistical Science , 1(3):364–378, 1986. ISSN 0883-4237. URL http://www.jstor.org/stable/2245476 .
- 7[Larralde(2004)] H. Larralde. A first passage time distribution for a discrete version of the Ornstein–Uhlenbeck process. Journal of Physics A: Mathematical and General , 37(12):3759, 2004. ISSN 0305-4470. doi: 10.1088/0305-4470/37/12/003 . URL http://stacks.iop.org/0305-4470/37/i=12/a=003 .
- 8[Lefever and Lejeune(1997)] R. Lefever and O. Lejeune. On the Origin of Tiger Bush. Bulletin of Mathematical Biology , 59:263–294, Mar. 1997. doi: 10.1007/BF 02462004 .
