Dynamic disorder in simple enzymatic reactions induces stochastic amplification of substrate
Ankit Gupta, Andreas Milias-Argeitis, Mustafa Khammash

TL;DR
This paper investigates how fluctuations in enzyme activity, known as dynamic disorder, can cause stochastic amplification of substrate levels in simple enzymatic reactions, with implications for understanding cellular processes.
Contribution
The study derives an explicit formula linking enzymatic fluctuation speed to steady-state substrate levels, revealing how dynamic disorder amplifies substrate concentration.
Findings
Fluctuation speed significantly affects mean substrate levels.
Large deviations from deterministic predictions occur due to enzyme activity fluctuations.
The connection between fluctuation speed and Markov process mixing properties is established.
Abstract
A growing amount of evidence points to the fact that many enzymes exhibit fluctuations in their catalytic activity, which are associated with conformational changes on a broad range of timescales. The experimental study of this phenomenon, termed dynamic disorder, has become possible due to advances in single-molecule enzymology measurement techniques, through which the catalytic activity of individual enzyme molecules can be tracked in time. The biological role and importance of these fluctuations in a system with a small number of enzymes such as a living cell have only recently started being explored. In this work, we examine a simple stochastic reaction system consisting of an inflowing substrate and an enzyme with a randomly fluctuating catalytic reaction rate that converts the substrate into an outflowing product. To describe analytically the effect of rate fluctuations on the…
| 1 | 0.1703 | 2.1988 | 7.2903 |
|---|---|---|---|
| 5 | 0.1543 | 1.6907 | 8.5349 |
| 10 | 0.0930 | 0.6782 | 12.2383 |
| 20 | 0.0494 | 0.2550 | 15.4510 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Dynamic disorder in simple enzymatic reactions induces stochastic amplification of substrate
Ankit Gupta
Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland.
Andreas Milias-Argeitis
Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland.
Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747 AG Groningen, the Netherlands.
Mustafa Khammash [email protected] Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058 Basel, Switzerland.
Abstract
A growing amount of evidence over the last two decades points to the fact that many enzymes exhibit fluctuations in their catalytic activity, which are associated with conformational changes on a broad range of timescales. The experimental study of this phenomenon, termed dynamic disorder, has become possible thanks to advances in single-molecule enzymology measurement techniques, through which the catalytic activity of individual enzyme molecules can be tracked in time. The biological role and importance of these fluctuations in a system with a small number of enzymes such as a living cell, have only recently started being explored.
In this work, we examine a simple stochastic reaction system consisting of an inflowing substrate and an enzyme with a randomly fluctuating catalytic reaction rate that converts the substrate into an outflowing product. To describe analytically the effect of rate fluctuations on the average substrate abundance at steady-state, we derive an explicit formula that connects the relative speed of enzymatic fluctuations with the mean substrate level. Under fairly general modeling assumptions, we demonstrate that the relative speed of rate fluctuations can have a dramatic effect on the mean substrate, and lead to large positive deviations from predictions based on the assumption of deterministic enzyme activity. Our results also establish an interesting connection between the amplification effect and the mixing properties of the Markov process describing the enzymatic activity fluctuations, which can be used to easily predict the fluctuation speed above which such deviations become negligible. As the techniques of single-molecule enzymology continuously evolve, it may soon be possible to study the stochastic phenomena due to enzymatic activity fluctuations within living cells. Our work can be used to formulate experimentally testable hypotheses regarding the nature and magnitude of these fluctuations, as well as their phenotypic consequences.
Keywords: stochastic amplification; enzymatic fluctuations; dynamic disorder; Markov models
Mathematical Subject Classification (2010): 92C42; 92C45; 60J22; 60J28; 65C40.
1 Introduction
First made almost two decades ago, observations of enzymatic turnovers for single enzyme molecules have allowed scientists to probe enzyme behavior beyond the regime of high-copy numbers and ensemble averages [27]. Thanks to advances brought by experimental techniques such as single-molecule fluorescence spectroscopy [36, 26], the field of single-molecule enzymology developed rapidly in the subsequent years. The key observation made possible by single-molecule assays is that the catalytic rates of single enzyme molecules often display very large dynamic fluctuations over timescales much longer than the typical reaction cycle times, most likely driven by slow (spontaneous or induced) transitions in conformation [27, 25, 7, 42].
Around the time when the first single enzyme molecules were observed in action, the mathematical theory of dynamic disorder was introduced by Zwanzig [43], motivated by several observations of different physico-chemical processes with a seemingly common underlying cause that boiled down to random fluctuations of key process properties. The phenomenon of dynamic disorder refers to fluctuations in enzymatic reaction rates that occur at a timescale that is either slower or comparable to the reaction timescale [4]. These fluctuations are often caused by slow transitions in the conformational state of enzymes. The most simple example of dynamic disorder first considered by Zwanzig [43] involved a so-called “rate process controlled by passage through a fluctuating bottleneck” [44]. In the language of chemical kinetics, it describes the removal of a substrate, , from a system at rate , where the time-varying rate is a (typically Markovian) stochastic process (see Figure 1). As the speed of fluctuations tends to zero, the reaction rate becomes a random variable which does not change with time, and we transition to the regime of static disorder [43]. On the other hand, as the speed of fluctuations tends to infinity, the dynamic disorder vanishes on the timescale of substrate kinetics, and we recover the classical case where the reaction rate becomes a deterministic constant. Our goal in this paper is to investigate the effects of fluctuations between these two extremities, where most realistic systems are likely to lie.
Thanks to the mathematical theory of dynamic disorder, stochastically fluctuating enzyme activities can be understood and studied within a consistent mathematical framework that can also generate testable experimental predictions [37, 20, 34, 4, 7, 27]. Already in [43] it was observed that dynamically disordered systems can give rise to macroscopic observations that differ from those expected in the absence of disorder. In subsequent years, a large body of theoretical and computational work has examined various alternative enzymatic reaction schemes, mostly focusing on the enzyme dynamics itself, e.g. on the autocorrelation of fluctuations and the distribution of waiting times between turnover events [29]. On the other hand, dynamic disorder has been observed for several biologically relevant enzymes [41], suggesting that it is ubiquitous in the cellular context. Early work [7] had already noted that enzymatic fluctuations could play an important biological role in a system containing only a small number of enzyme molecules, as often happens within a living cell, and the recent in vivo observation of fluctuating enzymatic activity confirms this claim [17].
Besides studying the intrinsic mathematical properties of dynamically disordered enzymes, it would be also highly instructive and relevant for biology to examine the consequences of dynamic disorder on substrate statistics (a first example of such a study is given in [17]). Experimental work in this area is still done in vitro using constant and large substrate pools. Here, on the contrary, we provide a mathematical treatment of how dynamic disorder alters the substrate mean abundance in the presence of substrate inflow, a condition closer to biological reality that has also been considered in [13, 40]. To this end, we analytically examine a highly simplified stochastic system with a randomly fluctuating catalytic reaction rate and describe the effect of rate fluctuations on the average substrate abundance. Under fairly general conditions, we demonstrate that the relative speed of rate fluctuations can have a dramatic effect on the mean substrate, and lead to large positive deviations from predictions based on the assumption of deterministic enzyme activity. Using a Markovian model for enzyme kinetics, we mathematically characterize this effect by deriving an explicit formula for the steady-state substrate-mean as a function of the relative speed of enzymatic fluctuations. From this formula we show that for any finite speed-value, the steady-state substrate-mean is sandwiched between the two values obtained in the static and the deterministic regimes. Furthermore, we demonstrate that the mapping between the relative speed of enzyme kinetics and the substrate-mean at steady-state can be well-approximated by a convex, monotonically decreasing function whose key shape parameter depends on the “mixing strength” of the Markov process describing enzyme kinetics. This mixing strength can be measured by computing an appropriate Dirichlet form [35] of the Markov process. Even though we consider a highly simplified situation, our analysis can serve as a guide in the case of more realistic, but analytically intractable enzymatic reaction schemes. Our results only depend on the enzymatic fluctuations, but they do not depend on the fluctuations caused by the low abundance of substate molecules (see [6]), although we account for these fluctuations by modelling the substrate kinetics as a jump Markov chain. Indeed the results we present remain unchanged even if we discard these fluctuations and describe the substrate kinetics as an ordinary differential equation (ODE) with a fluctuating rate constant .
Enzymatic fluctuations can also arise from sources other than dynamic disorder. The abundance or the availability of enzyme molecules may also fluctuate due to gene expression noise [6, 31], and their chances of finding substrate molecules can be diffusion-limited [39]. In this work, we do not distinguish between the various sources of fluctuations and model the aggregate enzymatic activity by a Markovian stochastic process.
The biological significance of our findings is manifold since enzymatic interactions are ubiquitous is cell biology and the effects of enzymatic noise in metabolic networks have only recently started to be explored [31]. Using the relative speed of enzymatic fluctuations as parameter, our results provide a clear way to determine if the deterministic approximation is a faithful representation of reality. Our results can shed light on the timescale disparities that exist between enzyme and substrate kinetics. In particular, we see that enzyme kinetics needs to be “fast” in order to avoid any undesirable amplification of the mean substrate abundance due to inevitable variations in the enzymatic states. On the other hand, one can envisage situations where it would be beneficial for enzymes to be “slow” so that their fluctuations amplify a weak signal and enable its detection by the intracellular machinery (see Section 3.1). Such a signal-detection mechanism was the main motivation behind stochastic focusing, a sensitivity amplification phenomenon introduced in [32]. We illustrate our results on the reaction scheme of [32] in Section 3.2, where we characterize how the substrate-mean changes with the speed of the enzyme abundance dynamics. Note that in situations where enzyme kinetics is “slow”, undesirable amplification effects can be eliminated by feedback mechanisms [28]. In this context, our results can help in postulating the presence of feedback loops using experimental data. We discuss the biological importance of our results in greater detail in Section 4.
It is interesting to note that some of the expressions we derive are related to those obtained in the analysis of various physico-chemical quantum dynamical systems coupled to a randomly fluctuating environment. This theory dates back to the original work of Kubo and Anderson [1, 21] and, more recently, has been generalized to arbitrary quantum systems described by the Liouvile-von Neumann equation with Markovian and non-Markovian parametric noise [10, 12, 11]. The example system of Ref. [11] can be interpreted as a substrate decaying with a stochastically fluctuating rate. While similar to it, the system we consider here includes the inflow of substrate, which results in a non-zero steady-state and requires a different mathematical treatment.
2 Results
2.1 The model
We consider a system into which a substrate, , enters at a constant rate and is degraded or (equivalently) converted into a product that in turn leaves the system. The rate of substrate outflow depends on the activity state or abundance level of an enzyme . In turn, the catalytic activity of , denoted by , is assumed to fluctuate in time according to a continuous-time Markov chain (CTMC) over a finite state-space . Here each is a positive constant denoting the degradation rate constant at the -th enzymatic state or abundance level. Due to fluctuations in the catalytic activity of , the degradation rate of substrate will also fluctuate in time according to a stochastic process whose value at time is given by , where is the molecular count or concentration of the substrate. This model is summarised in Figure 1.
As mentioned before, the degradation reaction can also be viewed as a conversion reaction which is catalyzed by the enzyme. Generally this catalytic step proceeds through the reversible formation of an intermediate complex which is formed when an enzyme molecule binds to a substrate molecule. In other words, the single reaction is an abstraction for the following three reactions:
[TABLE]
If the binding/unbinding rates of and molecules is much higher than the rate of the conversion reaction, then we can apply the quasi-stationary assumption to conclude that the model in Figure 1 is a good approximation to the catalytic conversion dynamics (for more details see the Supplementary Material in [28]).
To describe the CTMC , we need to specify its transition rate matrix (see [30]). For any distinct , denotes the rate at which the process leaves state and enters state . The diagonal entries of are given by . From now on we assume that the rate matrix is irreducible111A matrix is called irreducible if there does not exist a permutation matrix such that the matrix is block upper-triangular. which implies that there exists a unique stationary distribution satisfying
[TABLE]
where and denote the vectors of all zeroes and ones respectively. Since the state-space is finite and the transition rate matrix is irreducible, the CTMC is ergodic which means that the probability distribution of converges to the stationary distribution as . As we are interested in the steady-state limit, without loss of generality we can assume that the initial state is distributed according to , i.e. for each . This ensures that the process is a stationary stochastic process whose various statistical properties do not depend on time222For a more rigorous definition of a stationary stochastic process see the Supplementary Material.. In particular its mean is equal to
[TABLE]
where is a -valued random variable with probability distribution and denotes the expectation w.r.t. this distribution.
From now on, we regard as the baseline process which corresponds to enzymatic dynamics at the natural timescale. In order to study the substrate behavior, we need to view enzymatic dynamics at the timescale of substrate kinetics. For this we define a family of processes parameterised by the “relative speed” parameter as follows:
[TABLE]
Note that one time-unit of process corresponds to time-units of process . In this sense, the parameter sets the speed of the fluctuation dynamics for the enzyme relative to the speed of the substrate kinetics. Like , the process is also a CTMC over state-space with transition rate matrix and initial distribution . Since is stationary, this process is also stationary with the same mean given by for all times . Replacing by in the model depicted in Figure 1, we will study how the steady-state mean of substrate abundance depends on the fluctuation speed .
Given a sample path of the enzyme dynamics with relative speed , we regard the dynamics of substrate molecular counts as a jump Markov chain over the set of nonnegative integers . This Markov chain can be written in the random time change representation [8] as
[TABLE]
where and are independent, unit rate Poisson processes. From this representation it is immediate that the substrate-production rate is constant () in time, but the substrate-degradation rate is time-varying and it is equal to at time . Here the Poisson processes and capture the intermittency in the firing of production and degradation reactions. This intermittency becomes unimportant if the substrate is present in high copy-numbers [22] and in this case one can regard as the dynamics of subtrate concentration333The concentration of any species is its copy-number divided by the system volume., specified by the following ODE
[TABLE]
Note that even if the intermittency in production/degradation reactions is ignored and is described by the ODE (2.4), the process is still stochastic because it is driven by the stochastic process that represents enzymatic fluctuations.
Let for each . We shall soon see that does not depend on whether we use representation (2.3) or (2.4) for the substrate dynamics . Our goal in this paper is to understand the role of fluctuations in the catalytic activity of enzyme in determining the steady-state value of the mean
[TABLE]
In particular, we study how this steady-state mean depends on the relative fluctuation speed and the variability in degradation rates at various enzymatic activity levels.
2.2 Expressions for : The general case
We can approximately find by estimating for a very large , using simulations of the whole system. However this naive approach is highly unsatisfactory because these simulations can be computationally expensive and the approximation error incurred by replacing the steady-state mean by a finite-time mean is generally difficult to quantify. Moreover this approach does not provide us with an explicit formula for that can enable us to study its dependence on the relative speed parameter . In light of these difficulties, we look for alternative ways to compute . In this section we assume that enzymatic kinetics is given by a general stationary stochastic process with an arbitrary state-space , and so we do not rely on the CTMC structure mentioned in Section 2.1. We specialise the results of this section to the CTMC case in Section 2.3.
Using representation (2.3) or (2.4) we can show that is given by the following formula
[TABLE]
From the stationarity of the process we can conclude that
[TABLE]
Substituting this in (2.6) and letting , we obtain our first formula for , which is,
[TABLE]
From (2.2) we obtain
[TABLE]
Since the process is stationary, from Theorem 10.6 in [18] we know that as , the quantity (2.8) converges a.s. to (recall (2.1)). As a consequence and hence we get
[TABLE]
This shows that as the relative speed of enzymatic fluctuations approaches , these fluctuations become equilibrated at the timescale of substrate kinetics, and so they do not affect the mean substrate level. In other words, from the point of view of the substrate, the enzyme kinetics is so fast that it is as if the enzyme state is constant at the equilibrium level . This corresponds to the classical case where there is no dynamic disorder in the enzyme activity and so this activity is well-approximated by a deterministic rate constant for the substrate degradation reaction. As the mapping is convex, Jensen’s inequality tells us that
[TABLE]
where the last relation follows from the fact that is a stationary process with mean for all times . Substituting this in (2.7) we see that for any
[TABLE]
Therefore for a finite relative speed , enzymatic fluctuations always amplify the mean substrate abundance, in comparison to the classical deterministic case. The natural question that now arises is - how large should speed be in order for the deterministic approximation to be acceptable within a certain tolerance level ? We address this question in Section 2.4.
Let us now consider the situation where the relative speed parameter and so at the timescale of substrate kinetics, the enzyme dynamics approaches a static process, i.e. for all . This case corresponds to the situation where the enzyme kinetics is very slow in comparison to the substrate kinetics. Hence from the point of view of the substrate, the kinetics of the enzyme is almost fixed. In this regime, we can replace by in (2.7) to obtain
[TABLE]
where we have used the fact that has probability distribution to write as . Observe that , which can be readily seen by letting in (2.10) or by directly using Jensen’s inequality on the convex map (see Figure 2). The two extremal cases and serve as a guide to the behavior of realistic systems with an intermediate value of . In particular we can expect that for such intermediate -values, the steady-state substrate mean will lie somewhere between and . This is precisely what happens as we shall soon see. We will also discuss how the precise value of can be computed or estimated from any Markovian model of enzymatic fluctuations.
Until now, the conclusions we have drawn regarding rely on the formula (2.7) that holds for any real-valued stationary stochastic process as long as its states are positive and bounded away from [math], i.e. the state-space satisfies
[TABLE]
for some . This makes this formula very general but it is difficult to work with, because it involves an indefinite integral which is generally analytically intractable as the mapping does not have an explicit form. We remedy this problem in the next section by specialising this formula to the case where is a finite state-space CTMC as described in Section 2.1. Before we come to that, we provide a numerical recipe for statistically estimating without the need for evaluating the indefinite integral. This scheme is based on the assumption that we can efficiently generate sample-paths of the stationary process (see [9, 2]).
Define a random variable by
[TABLE]
where is an independent random variable with the uniform distribution on . To sample we can adopt the following strategy. We first sample from the uniform distribution on , draw an initial condition from , and then simulate the sample path , keeping track of the integral . We take to be first time when this integral hits the value . From the samples of the random variable , we can estimate its expectation which gives us an estimate for because it can be shown that
[TABLE]
Note that the estimator for based on formula (2.14) will be unbiased but it will suffer from statistical error due to a finite sample size. However this error can be estimated and managed far more easily than the error one would incur by approximating the steady-state mean by a finite-time mean (recall (2.5)). This makes this formula (2.14) useful in practice (see Example 3.2).
The results from this section are collected in our next proposition which is proved in the Supplementary Material.
Proposition 2.1
Suppose is a real-valued stationary stochastic process with stationary distribution and state-space satisfying (2.12). Let be the speed version of this process given by (2.2) and define the substrate dynamics either by (2.3) or by (2.4). Let and let the steady-state limit be given by (2.5). Then we have the following:
- (A)
The value is well-defined (i.e. the limit in (2.5) exists) and it is given by (2.7).
- (B)
If is the random variable defined by (2.13) then (2.14) holds.
- (C)
The limits (2.9) and (2.11) are satisfied as and respectively.
2.3 Expressions for : The finite CTMC case
In this section we specialize expression (2.7) to the case where is a stationary CTMC with a finite state-space as described in Section 2.1. Define the CTMC by (2.2) and recall that its transition-rate matrix is given by . Let be the diagonal matrix
[TABLE]
whose entries are the degradation rates at different enzymatic states or abundance levels. One of the main results in our paper is to show that can be expressed as
[TABLE]
Two alternative proofs of this result are given in the Supplementary Material. The first proof exploits some ideas from the theory of occupation measures for Markov chains [14] while the second proof is based on the Methods of Conditional Moments (MCM) approach recently developed by Hasenauer et al. [15]. Note that this formula assumes matrix is invertible for any but this can be easily verified from the properties of matrix . Using formula (2.16) we can prove that for any
[TABLE]
Therefore for any relative speed of enzymatic fluctuations, the steady-state mean of the substrate is always sandwiched between the values obtained for the deterministic and the static cases. Moreover since depends continuously on , the limits (2.11) and (2.9) imply that for any value in the open interval there exists a relative speed value such that . Hence the positive deviations caused by enzymatic fluctuations (in the mean substrate abundance) range from [math] to exactly .
To detemine the map we need to evaluate at several values of . This can be difficult with formula (2.16) because each evaluation requires inversion of a potentially large matrix. Fortunately we can resolve this issue using simple ideas from the theory of resolvents for linear operators [19], as we now describe. Let be the transition rate matrix of another CTMC over state-space . The difference between this new CTMC and the original CTMC is that the rates of outflow from state to each state (for ) are divided by the state value . Let denote the field of complex numbers. The resolvent for the Markov semigroup corresponding to this CTMC is the matrix-valued function over defined by
[TABLE]
where is the Identity matrix. This function is well-defined for any which is not an eigenvalue of matrix . Let be the eigenvalues of matrix , repeated according to their algebraic multiplicity. Since is the transition rate matrix of a CTMC, it has a simple444An eigenvalue is said to be simple if its algebraic multiplicity is . eigenvalue (say ) equal to [math], while its other eigenvalues have negative real parts. This implies that the resolvent function is well-defined on the positive real line .
From now on we assume that matrix is diagonalizable555A square matrix is diagonalizable if it can be written as for some diagonal matrix and some invertible matrix . The diagonal entries of are the eigenvalues of matrix . over the field of complex numbers. This assumption is not very restrictive because almost every matrix is diagonalisable (see [16]) and so if is not diagonalisable, we can perturb matrix slightly to make diagonalisable and not affect the enzyme dynamics significantly. The diagonalizability of allows us to write matrix as , where and is an invertible matrix whose columns contain the right eigenvectors for matrix corresponding to the eigenvalues . Similarly the rows of contain the left eigenvectors for matrix corresponding to the eigenvalues . Let and be vectors denoting the -th column and -th row of matrices and respectively. Therefore
[TABLE]
and we can express the resolvent function (see Chapter 5 in [19]) as
[TABLE]
Let denote the standard inner product on . Note that formula (2.16) can be expressed as
[TABLE]
for any . Plugging from (2.20) and defining
[TABLE]
we obtain the following formula for :
[TABLE]
Observe that since -s and -s are independent of , they only need to be computed once to construct this expression and then we can easily compute for several values of without the need of evaluating the matrix inverses in (2.16). Moreover if is large, then using the values of and as a guide, one can derive suitable approximations of the formula (2.22) for . We derive one such approximation in the next section and use it as a tool to further understand the phenomenon of stochastic amplification induced by dynamic disorder in enzymatic activity.
The results from this section are collected in our next theorem which is proved in the Supplementary Material.
Theorem 2.2
Suppose is a stationary CTMC with transition rate matrix , stationary distribution and state-space (see Section 2.1). Let be the speed version of this process given by (2.2) and define the substrate dynamics either by (2.3) or by (2.4). Let the steady-state substrate mean be given by (2.5) and the diagonal matrix be defined by (2.15). Then we have the following:
- (A)
The matrix is invertible and can be expressed as (2.16).
- (B)
Suppose the matrix is diagonalizable and let be its eigenvalues. For each define by (2.21). Then can be expressed as (2.22).
- (C)
The relation (2.17) is satisfied for any .
2.4 Approximate formula for
The goal of this section is to derive an approximate formula for using (2.22) and then use it to obtain some interesting insights. Recall from the previous section that are the eigenvalues of matrix . Among these while the eigenvalues have negative real parts. Define a positive constant by
[TABLE]
where denotes the real part of a complex number . Setting in (2.22) we obtain
[TABLE]
This formula is valid for any in the interval and its form shows that the function is real-analytic666A function is called real analytic at a point if it is infinitely differentiable at that point and it agrees with its Taylor series expansion around that point. at . Therefore all the information about function is contained in the value of this function and its derivatives at .
Using limits (2.9) and (2.11) we can conclude that
[TABLE]
Let denote the following weighted combination of eigenvalues
[TABLE]
We now propose an approximate formula for
[TABLE]
Note this formula is much easier to use than (2.23) because it contains only one rational term. From (2.24) it is immediate that also obeys the limits (2.9) and (2.11). Moreover it is straightforward to check that the first derivatives of and match at . Hence the approximation error is given by a difference of second-order derivatives and we explain in Supplementary Material why this error is likely to be small. We can also view the approximation of as replacing a weighted arithmetic mean of several quantities with the corresponding harmonic mean. To see this note that from (2.23) and (2.24) we can express as
[TABLE]
where
[TABLE]
is the weighted arithmetic mean of quantities with weights 777These weights may not be positive real numbers, as is customary in the definition of arithmetic means. However in our examples we generally find that the most significant weights indeed have a positive real part and a negligible imaginary part.. The corresponding weighted harmonic mean of these quantities is given by
[TABLE]
and observe that can be expressed as the r.h.s. of (2.27) with arithmetic mean replaced by the harmonic mean .
We now illustrate the accuracy of this approximation using a couple of randomly generated , transition rate matrices with and respectively. In both cases we choose the input rate to be and the enzymatic state-values to be for . The exact function along with its approximation are plotted in Figure 3. The accuracy of this approximation can be easily seen. Notice that the exact function is slightly above its approximation. Assuming the significant weights (-s) are positive reals, this can be explained by the fact that arithmetic mean is always higher than the corresponding harmonic mean.
From (2.26) it is immediate that the shape of the function depends crucially on the parameter computed according to (2.25). We now examine more closely and see how it is connected to an existing notion from the theory of Markov processes. Let us denote the numerator of (2.25) by
[TABLE]
Since -s are given by (2.21), using (2.19), and we can express as
[TABLE]
This relation shows that (and hence ) is always real-valued even though some -s or -s may have imaginary parts. Moreover to compute we do not need to compute the eigenvalues of a potentially large matrix . Instead we only need to evaluate the expression which is computationally much easier. Interestingly the definition of coincides with the well-known notion of Dirichlet forms, that is extensively used in the study of mixing properties of Markov processes [35, 23]. We now discuss this connection in more detail.
Consider the CTMC with state-space and transition rate matrix . The generator 888The generator of a Markov process is an operator which specifies the rate of change of the distribution of the process. For more details see Chapter 4 in [8]. of this CTMC maps any real-valued function on to another such real-valued function given by
[TABLE]
Define a function by . Then one can see that (2.36) can be expressed as
[TABLE]
In other words, if a -valued random variable has distribution , then is the expectation of the random variable . Relation (2.38) shows that is a Dirichlet form associated with the Markovian semigroup generated by (see [35]). An important consequence of this connection is that is always positive (see Lemma 2.1.2 in [35]) irrespective of the entries of the rate matrix or the state values . The positivity of implies that is also positive and hence the mapping is convex and monotonically decreasing from at to as . Intuitively the magnitude of Dirichlet form corresponds to the mixing strength of the underlying Markov process. Therefore as increases, also increases and the mapping has a sharper “drop” to the deterministic value . Our next goal is to make this mathematically precise and quantitatively estimate the relative speed-values beyond which the deterministic assumption is acceptable.
In the rest of this section, our object of interest will be the relative stochastic amplification factor defined by
[TABLE]
which measures the difference of steady-state substrate means in the presence and absence of enzymatic fluctuations, normalized by the the steady-state substrate mean in the deterministic case. Note that does not depend on the input rate and using (2.17) we can see that satisfies
[TABLE]
In order to study the dependence of on , we now look at its approximation which is defined analogously to (2.39), with replaced by . Using (2.25), (2.36) and (2.24) we see that is the same as the normalized Dirichlet form defined by
[TABLE]
Substituting by in (2.26) and dividing by , we obtain the following formula after some simple algebraic manipulations
[TABLE]
This formula clearly indicates that as gets larger, the amplification factor decreases more sharply to with the relative speed parameter . One can regard as the “relative error” between the actual substrate mean and the mean computed with deterministic assumption on the enzymatic kinetics. From relation (2.42) it is immediate that in order to test if this error will exceed some tolerance level we just need to check if the relative enzyme speed is smaller than the threshold defined by
[TABLE]
We can expect this test to be rather conservative because as we have argued before, the exact values will usually lie above their approximation .
Note that is inversely proportional to but directly proportional to . The first parameter is the normalized Dirichlet form and it captures the “mixing strength” of the underlying enzymatic dynamics (see Example 3.1), while the second parameter can be viewed as a proxy for the variance of the stationary-distribution 999To see this note that defined by (2.40) represents the “error” in Jensen’s inequality for the convex map . It can be easily shown that this error is proportional to the variance of the distribution (see [3] for instance).(see Example 3.1). Generally both these parameters will increase with higher levels noise in the enzymatic dynamics. However since they affect in opposing ways, it is difficult to ascertain the overall effect of dynamical noise in setting the threshold value . We explore this issue in greater detail in Section 3.2 and numerically show that increasing levels of dynamical noise in the enzymatic kinetics of that reaction network gives rise to decreasing values of . This is surprising and counterintuitive because it suggests that this dynamical noise is actually beneficial in improving the accuracy of the deterministic assumption for the enzyme activity.
Finally we remark that even though most of the analysis in this paper assumes that enzymatic kinetics is described by a finite Markov chain, the formulas we derive can provide insights for a more general class of stationary stochastic processes. This is because finite Markov chains can serve as good approximations of such processes [33]. Moreover if the process is Markov, even with an arbitrary state-space, we can compute expression (2.42) for by sampling its stationary distribution and using this sample to estimate and the normalized Dirichlet form . We illustrate this for the example network in Section 3.2 where the enzyme dynamics follows a Markov process over a countable state-space.
From (2.26) it is immediate that the shape of the function depends crucially on the parameter computed according to (2.25). We now examine more closely and see how it is connected to an existing notion from the theory of Markov processes. Let us denote the numerator of (2.25) by
[TABLE]
Since -s are given by (2.21), using (2.19), and we can express as
[TABLE]
This relation shows that (and hence ) is always real-valued even though some -s or -s may have imaginary parts. Moreover to compute we do not need to compute the eigenvalues of a potentially large matrix . Instead we only need to evaluate the expression which is computationally much easier. Interestingly the definition of coincides with the well-known notion of Dirichlet forms, that is extensively used in the study of mixing properties of Markov processes [35, 23]. We now discuss this connection in more detail.
Consider the CTMC with state-space and transition rate matrix . The generator 101010The generator of a Markov process is an operator which specifies the rate of change of the distribution of the process. For more details see Chapter 4 in [8]. of this CTMC maps any real-valued function on to another such real-valued function given by
[TABLE]
Define a function by . Then one can see that (2.36) can be expressed as
[TABLE]
In other words, if a -valued random variable has distribution , then is the expectation of the random variable . Relation (2.38) shows that is a Dirichlet form associated with the Markovian semigroup generated by (see [35]). An important consequence of this connection is that is always positive (see Lemma 2.1.2 in [35]) irrespective of the entries of the rate matrix or the state values . The positivity of implies that is also positive and hence the mapping is convex and monotonically decreasing from at to as . Intuitively the magnitude of Dirichlet form corresponds to the mixing strength of the underlying Markov process. Therefore as increases, also increases and the mapping has a sharper “drop” to the deterministic value . Our next goal is to make this mathematically precise and quantitatively estimate the relative speed-values beyond which the deterministic assumption is acceptable.
In the rest of this section, our object of interest will be the relative stochastic amplification factor defined by
[TABLE]
which measures the difference of steady-state substrate means in the presence and absence of enzymatic fluctuations, normalized by the the steady-state substrate mean in the deterministic case. Note that does not depend on the input rate and using (2.17) we can see that satisfies
[TABLE]
In order to study the dependence of on , we now look at its approximation which is defined analogously to (2.39), with replaced by . Using (2.25), (2.36) and (2.24) we see that is the same as the normalized Dirichlet form defined by
[TABLE]
Dividing (2.26) by , we obtain the following formula after some simple algebraic manipulations
[TABLE]
This formula clearly indicates that as gets larger, the amplification factor decreases more sharply to with the relative speed parameter . One can regard as the “relative error” between the actual substrate mean and the mean computed with deterministic assumption on the enzymatic kinetics. From relation (2.42) it is immediate that in order to test if this error will exceed some tolerance level we just need to check if the relative enzyme speed is smaller than the threshold defined by
[TABLE]
We can expect this test to be rather conservative because as we have argued before, the exact values will usually lie above their approximation .
Note that is inversely proportional to but directly proportional to . The first parameter is the normalized Dirichlet form and it captures the “mixing strength” of the underlying enzymatic dynamics (see Example 3.1), while the second parameter can be viewed as a proxy for the variance of the stationary-distribution 111111To see this note that defined by (2.40) represents the “error” in Jensen’s inequality for the convex map . It can be easily shown that this error is proportional to the variance of the distribution (see [3] for instance).(see Example 3.1). Generally both these parameters will increase with higher levels noise in the enzymatic dynamics. However since they affect in opposing ways, it is difficult to ascertain the overall effect of dynamical noise in setting the threshold value . We explore this issue in greater detail in Section 3.2 and numerically show that increasing levels of dynamical noise in the enzymatic kinetics of that reaction network gives rise to decreasing values of . This is surprising and counterintuitive because it suggests that this dynamical noise is actually beneficial in improving the accuracy of the deterministic assumption for the enzyme activity.
Finally we remark that even though most of the analysis in this paper assumes that enzymatic kinetics is described by a finite Markov chain, the formulas we derive can provide insights for a more general class of stationary stochastic processes. This is because finite Markov chains can serve as good approximations of such processes [33]. Moreover if the process is Markov, even with an arbitrary state-space, we can compute expression (2.42) for by sampling its stationary distribution and using this sample to estimate and the normalized Dirichlet form . We illustrate this for the example network in Section 3.2 where the enzyme dynamics follows a Markov process over a countable state-space.
3 Examples
In this section we present a couple of examples to illustrate our results. Our first example of a two-state switching enzyme is such that all calculations can be easily done analytically allowing us to clearly understand the stochastic amplification effect. We also see how enzymes can utilize their fluctuations to serve as high-gain amplifiers. Our second example is the reaction network of Paulsson et al. [32] which displays stochastic focusing. We apply our results to this network and demonstrate that in some cases dynamical fluctuations can actually be beneficial in reducing the unwanted stochastic amplification effects.
3.1 Stochastic amplification induced by a two-state switching enzyme
Consider a simple instance of the system in Figure 1, in which a single enzyme molecule is present and can fluctuate between two states of activity: with enzyme in the low-activity (“0”) state, the degradation rate of substrate is assumed to be , while it is equal to when is highly active (“1”). The 0-to-1 and 1-to-0 rates are given by and respectively. A schematic representation of this model is presented in Figure 4.
The time-varying degradation rate induced by this fluctuating enzyme is a CTMC with state-space and transition-rate matrix
[TABLE]
One can check that the unique stationary distribution for this CTMC is simply given by
[TABLE]
For each , we can regard as the steady-state probability of the enzyme being in state . Due to the Ergodic Theorem (see Theorem 10.6 in [18]) we can also view as the proportion of time that the enzyme spends in state in the long-run. Let be a -valued random variable with probability distribution . Then its mean and variance can be computed as
[TABLE]
Suppose that the speed of enzymatic kinetics relative to the substrate is and so the degradation rate is given by the process defined by (2.2). Let (2.5) be the steady-state substrate mean in this case. Using part (A) of Theorem 2.2 we obtain
[TABLE]
This formula involves the inverse of a matrix, which can be easily computed explicitly. Substituting this inverse along with the expressions for and (see (3.44)) we get
[TABLE]
It is interesting to point our the formal similarity of (3.51) with the formula for the mean transfer time of a relaxation process whose rate is modeled by a two-state CTMC [11]. From (3.51) it can be readily seen that the steady-state substrate means in the static and deterministic cases are given by
[TABLE]
Hence we can compute the maximum relative amplification factor (2.40) as
[TABLE]
Recall the formula for from (3.45) and oberve that can be expressed as
[TABLE]
which reinforces the point we made in Section 2.4 that serves as a proxy for the variance of the stationary distribution.
The Dirichlet form (2.38) for this CTMC is given by
[TABLE]
This yields the following formula for the normalized Dirichlet form (2.41)
[TABLE]
which determines the shape of our approximate formula (2.26) for the steady-state substrate mean. It is straightforward to check that for this example, this approximate formula is exact because the expression (3.51) for can be written as
[TABLE]
Note that is a measure of the mixing strength of the enzymatic kinetics and so it is not surprising that it increases linearly with the transition rates and .
Define the relative amplification factor by (2.39). It can be exactly expressed as
[TABLE]
We now consider the situation when the degradation rate induced by the enzyme E in the low-activity state (“0”) is negligible. In this case and simplifies to
[TABLE]
which shows that the relative amplification factor is proportional to and the proportionality constant is simply the product of the proportion of time () the enzyme spends in the low-activity state, the degradation rate () at the high-activity state and the reciprocal of the sum of transition rates and . In particular as approaches [math], the relative amplification factor can be enormous, thereby indicating that such a switching enzyme can exploit its fluctuations to function as a biological amplifier with a very high gain.
3.2 Stochastic focusing Network
In this section we apply our results to the famous stochastic focusing network given in [32]. This network involves three species: substrate S, product P and enzyme E121212In [32], S was called I and E was called S. We have changed the notation to ensure consistency with the notation in this paper.. The molecules of substrate S are produced constitutively at rate and converted into product P through a first-order reaction with rate constant . Both subtrate and product molecules degrade spontaneously at rates and respectively, where denotes the current state or abundance level of enzyme E. The schematic representation of these reactions is as follows
[TABLE]
The enzymatic dynamics in this example is given by the Markovian birth-death process with birth-rate and death-rate :
[TABLE]
This process evolves on state-space , which is the set of all nonnegative integers and its unique stationary distribution is Poisson with mean . We assume that the initial enzymatic state is a random variable with this stationary distribution.
Multiplying the rate constants and by , we obtain enzymatic kinetics whose speed relative to the substrate is . Let and denote the steady-state means of substrate and product respectively when the relative enzyme speed is . From the first-order moment equations for the network (3.58) one can easily show (see Supplementary Material) that for any
[TABLE]
To study the amplification of steady-state means due to enzymatic fluctuations we use the relative stochastic amplification factor defined by (2.39). Due to the linear relationship (3.60) between and , these amplification factors are same for both product and subtrate. Therefore we can understand the amplification phenomenon by replacing network (3.58) with our simplified scheme (see Figure 1), where the degradation rate at time is given by
[TABLE]
and denotes the state at time of enzymatic kinetics with relative speed . Let for each . Note that is a CTMC with state-space
[TABLE]
and stationary distribution131313This stationary distribution is obtained by applying the linear change of variables on the Poisson distribution with mean .
[TABLE]
This CTMC transitions from state to state at rate and from state to state at rate . In other words, the generator for this CTMC is given by
[TABLE]
for any bounded function .
In the rest of this section, we denote the steady-state substrate mean by instead of . Since the state-space is not finite, we cannot use the results from Section 2.3 to compute . However we can easily simulate the paths of process with Gillespie’s Algorithm [9], and obtain samples of the random variable defined by (2.13). The corresponding sample mean then serves as an estimator for (see part (C) of Proposition 2.1). Note that the steady-state substrate mean in the absence of enzymatic fluctations is simply given by
[TABLE]
Dividing by and subtracting , we obtain an estimate for the relative stochastic amplification factor (see (2.39)). This factor only depends on four rate constants and which we now set as
[TABLE]
We estimate for several values of in the interval and plot these estimates in Figure 5. For each value of , was estimated using samples of and the resulting standard error141414The standard error is simply the standard deviation of the distribution of the sample mean. is also displayed in Figure 5. In Section 2.4 we develop an approximate expression (2.42) for the relative amplification factor which is likely to hold even though the state-space is not finite. Using the stationary distribution (3.61) and the generator (with ) we estimate the maximum amplification factor (2.40) and the normalized Dirichlet form (2.41) as and respectively. With these values we evaluate the map and plot it in the interval in Figure 5. The close agreement between the estimated and the approximate values of the relative amplification factor can be easily seen. In Figure 5 we also indicate the threshold speed (2.43) for the threshold level (i.e. ). This threshold speed is which indicates that if then enzymatic fluctuations will amplify the steady-state substrate mean by more than in comparison to the deterministic case. In other words, the relative error in assuming that the enzyme activity is deterministic exceeds if .
We now explore the effects of changing the levels of noise in the enzymatic activity, on the relative amplification factor for the steady-state substrate mean. This noise can be measured using the coefficient of variation (CV)151515The coefficient of variation of a probability distribution is its standard deviation divided by its mean. It measures the dispersion of a distribution relative to its mean. of the stationary distribution for the enzyme abundance. Since this distribution is Poisson with mean , the CV is which shows that for a fixed , we can decrease the relative noise level by simply increasing . With this in mind we repeat the above computations (see Figure 5) for three additional values of : 5, 10 and 20, and the results are provided in Figure 6. For each value of , the corresponding estimates for the maximum relative amplification factor , the normalized Dirichlet form and the threshold speed (for ) are given in Table 1.
Recall the discussion at the end of Section 2.4 on the effects of noise in the enzymatic dynamics. From Table 1 it is clear that as expected, decreasing noise (or increasing ) results in the decline of both and . These parameters influence the threshold speed (see (2.43)) in opposite ways, but their overall effect is to increase , indicating that as the noise levels go down, the relative enzyme speed needs to be higher and higher for the assumption of deterministic enzymatic activity to be acceptable. In other words, even though noise in enzyme activity causes the stochastic amplification effect it also helps in eliminating it.
4 Discussion
We examined the mathematical properties of a system consisting of a substrate that is degraded through an enzyme with stochastically fluctuating activity levels. Our analysis focused on the effect of enzymatic fluctuations on the mean substrate abundance and its deviations from the deterministic model predictions. It should be pointed out that even if the substrate inflow rate is assumed to be an independent stationary stochastic process with mean , our results will not be affected.
Whereas a stochastically varying production rate would leave the mean substrate level unaffected and equal to that of the deterministic model, fluctuations in the removal rate of the substrate result in a system that behaves very differently in the stochastic and deterministic regimes due to the product term in the degradation rate of . Our formulas help quantify this discrepancy and study its behavior as the speed of enzymatic fluctuations varies from zero to infinity. They also provide an interesting connection between the amplification effect and the mixing properties of the Markov process describing the enzymatic activity fluctuations, which allow us to determine the speed above which this amplification becomes negligible for a given system parametrization. Note that the study of such systems through the use of approximate stochastic models such as the Linear Noise Approximation [5] is particularly challenging, since these methods typically fail to capture the very strong negative correlations between enzyme activity and substrate that can arise at slow enzyme fluctuations (see also discussion in [28]). On the contrary, the results presented here are valid under much milder simplifying assumptions, and can thus accurately reveal the magnitude of the discrepancy between stochastic and deterministic descriptions of the system.
Given the prevalence of enzymatic interactions in cell biology, stochastic fluctuations in enzyme activity and/or abundance are expected to play a large role in shaping the mean intracellular abundances of substrates [17], which could potentially also deviate significantly from the deterministically predicted amounts. Since many enzymes are allosterically regulated [24] by their products, substrates or other small signaling molecules, it would be very interesting to also study the effects of this regulation on the statistics of substrates and products, and examine potential noise reduction [28] or signal amplification strategies. As the sensitivity of single-molecule enzymology experimental techniques increases, it may soon be possible to study the phenomena described theoretically in this work within living cells.
Supplementary Material
Appendix S1 The model
In this paper we consider a system where the substrate enters at a constant rate and is degraded at a rate that depends on the activity state or abundance level of an enzyme . This activity state is assumed to fluctuate in time according to a continuous-time Markov chain (CTMC) over a finite state-space . The system can be written as
[TABLE]
The CTMC is described by its transition rate matrix (see [30]). For any distinct , denotes the rate at which the process leaves state and enters state . The diagonal entries of are given by . From now on we assume that the rate matrix is irreducible161616A matrix is called irreducible if there does not exist a permutation matrix such that the matrix is block upper-triangular. which implies that there exists a unique stationary distribution satisfying
[TABLE]
where and denote the vectors of all zeroes and ones respectively. Since the state-space is finite and the transition rate matrix is irreducible, the CTMC is ergodic which means that the probability distribution of converges to the stationary distribution as . As we are interested in the steady-state limit, without loss of generality we can assume that the initial state is distributed according to , i.e. for each . This ensures that the process is a stationary stochastic process whose finite-dimensional distributions are invariant under time-shifts. This means that for any finite collection of time-points the joint distribution of the random vector remains the same for all . This also implies that various statistical properties of this process do not depend on time. In particular its mean is equal to
[TABLE]
where is a -valued random variable with probability distribution and denotes the expectation w.r.t. this distribution.
In what follows, we need to view enzymatic dynamics at the timescale of substrate kinetics. For this we define a family of processes parameterised by the “relative speed” parameter as
[TABLE]
Like , the process is also a CTMC over state-space with transition rate matrix and initial distribution . Since is stationary, this process is also stationary with the same mean given by for all times . Replacing by in (S1), we will study how the steady-state mean of substrate abundance depends on the fluctuation speed .
Given a sample path of the enzyme dynamics with relative speed , we regard the dynamics of substrate molecular counts as a jump Markov chain over the set of nonnegative integers . This Markov chain can be written in the random time change representation [8] as
[TABLE]
where and are independent, unit rate Poisson processes. Here the Poisson processes and capture the intermittency in the firing of production and degradation reactions. This intermittency becomes unimportant if the substrate is present in high copy-numbers [22] and in this case one can regard as the dynamics of substrate concentration171717The concentration of any species is its copy-number divided by the system volume., specified by the following ODE
[TABLE]
Let for each , where evolves according to either (S4) or (S5). Our goal in this paper is to understand the role of fluctuations in the catalytic activity of enzyme in determining the steady-state value of the mean
[TABLE]
Appendix S2 Expressions for : The general case
In this section we prove Proposition 2.1 in the main paper. For convenience, we restate this proposition below.
Proposition S2.1
Suppose is a real-valued stationary stochastic process with stationary distribution and state-space satisfying
[TABLE]
for some . Let be the speed version of this process given by (S3) and define the substrate dynamics either by (S4) or by (S5). Let and let the steady-state limit be given by (S6). Then we have the following:
- (A)
The value is well-defined (i.e. the limit in (S6) exists) and it is given by
[TABLE]
- (B)
Let is the random variable defined by
[TABLE]
where is an independent random variable with the uniform distribution on . Then we have
[TABLE]
- (C)
The limits below are satisfied as and respectively:
[TABLE]
Proof. Let be the filtration generated by the process and let be its limiting value. Given the information , the random path of the process is completely known. Hence we can formulate the ODE for the conditional first-moment as follows
[TABLE]
This equation remains unchanged whether we use representation (S4) or (S5) for the substrate dynamics . Using as the integrating factor we can write (S13) as
[TABLE]
Finally, integrating both sides w.r.t. time we obtain
[TABLE]
Taking expectations we get
[TABLE]
where the last equation follows from the Fubini’s theorem. Since the process is stationary, the distribution of the random variable is same as the distribution of . Hence using a simple change of variables we can write
[TABLE]
which gives us the following formula for
[TABLE]
Using (S7) we see that
[TABLE]
and using the dominated convergence theorem we can conclude that
[TABLE]
Taking the limit in (S15) we obtain the formula (S8) for . This finishes the proof of part (A) of the proposition.
Let be the random variable defined by (S9). Since it is a continuous random variable with range we can write
[TABLE]
Let be the filtration as defined before. As in (S9) is a uniform random variable independent of the process we have
[TABLE]
Taking expectations both sides we get
[TABLE]
Substituting this in (S16) we obtain
[TABLE]
Replacing this integral in (S8) by proves formula (S10). This finishes the proof of part (B) of the proposition. The proof of part (C) is already outlined in the main paper.
Appendix S3 Expressions for : The finite CTMC case
In this section we prove Theorem 2.2 in the main paper. For convenience, we restate this theorem below.
Theorem S3.1
Suppose is a stationary CTMC with transition rate matrix , stationary distribution and state-space (see Section S1). Let be the speed version of this process given by (S3) and define the substrate dynamics either by (S4) or by (S5). Let the steady-state substrate mean be given by (S6) and the diagonal matrix be defined by
[TABLE]
Then we have the following:
- (A)
The matrix is invertible and can be expressed as
[TABLE]
- (B)
Suppose the matrix is diagonalizable and let be its eigenvalues. For each define by
[TABLE]
Then can be expressed as
[TABLE]
- (C)
The following relation is satisfied for any
[TABLE]
Proof. For each and define
[TABLE]
Let denote the vector . Note that
[TABLE]
Note that because for each we have . From Proposition 4.1 in [14] we see that satisfies the following ODE:
[TABLE]
Since , the solution to this ODE is
[TABLE]
Therefore
[TABLE]
On integrating from to we get
[TABLE]
This relation along with (S8) proves (S18). Since the matrix is irreducible and Metzler with non-positive eigenvalues, the matrix is invertible and its inverse is a matrix with only nonnegative real entries (see Theorem 2.6 in [38]).
We now provide another proof of (S18) using the Methods of Conditional Moments (MCM) approach of Hasenauer et al. [15]. We thus define and consider the algebraic equations that describe the system of conditional moments for :
[TABLE]
If we define and consider the limit as in (S22), we get the following system of linear equations:
[TABLE]
Letting , we get
[TABLE]
Therefore,
[TABLE]
The proof of part (B) is outlined in the main paper. For part (C) note that one of the inequalities was already shown in the main paper. Hence it suffices to prove that which is equivalent to
[TABLE]
Let be the diagonal matrix with entries . Define another matrix by
[TABLE]
Since is a positive diagonal matrix and is a componentwise nonnegative matrix, we can conclude that is also a Metzler matrix. In order to prove (S24) we just need to show that
[TABLE]
which is equivalent to proving that
[TABLE]
Note that is a symmetric Metzler matrix. Relation (S26) will hold if this matrix is nonpositive definite, which is same as saying that all its eigenvalues have nonpositive real parts. Theorem 2.6 in [38] shows that this matrix is nonpositive definite if we can find a componentwise positive vector such that
[TABLE]
Let . Note that since we have and so . Therefore
[TABLE]
Similarly since we have and so . Hence
[TABLE]
Combining (S28) and (S29) proves (S27) and shows that is a symmetric nonpositive definite matrix. Therefore (S26) holds and this finishes the proof of part (C) of this theorem.
Appendix S4 Approximate formula for
In this section we examine the approximate formula for and discuss why the approximation error is likely to be small. Recall from Theorem S3.1 that are the eigenvalues of matrix . Among these while the eigenvalues have negative real parts. Using part (B) of Theorem S3.1 gives us this exact formula for :
[TABLE]
From limits (S11) and (S12) we can conclude that
[TABLE]
Let denote the following weighted combination of eigenvalues
[TABLE]
Recall from the main paper that is the normalized Dirichlet form which is always positive. We proposed the following approximate formula for
[TABLE]
We define the relative error between the exact value and its approximation by
[TABLE]
Due to limits (S11) and (S12), we know that this error function is contained between [math] and . Our goal is to argue that this relative error function has a relatively small magnitude. For this purpose we define a change of variables as
[TABLE]
Note that as goes from [math] to , goes from [math] to . Define a function by
[TABLE]
where
[TABLE]
for each . Replacing by in (S30) and using (S31) we get
[TABLE]
Similarly can be written as
[TABLE]
which allows us to express the relative error as
[TABLE]
Note that
[TABLE]
We can compute the first and second order derivatives of function as
[TABLE]
Therefore from (S35) we obtain
[TABLE]
We can see that measures the weighted “spread” of the eigenvalues around , where the weights are given by . Numerical experiments indicate that generally only a few of these weights are significant while the others are negligibly small. This is precisely the situation where this spread is small since . Hence we can safely assume that is small. The function is real-analytic at and its Taylor series expansion is given by
[TABLE]
Using (S34) we can conclude that the relative error behaves like which is small since is small.
Appendix S5 Stochastic Focusing example
Consider the stochastic focusing network given in [32]. It involves three species: substrate S, product P and enzyme E. The molecules of substrate S are produced constitutively at rate and converted into product P with rate constant . Both subtrate and product molecules degrade spontaneously at rates and respectively, where denotes the current state or abundance level of enzyme E. These reactions can be expressed as
[TABLE]
Let and denote the expected abundance level at time of substrate and product molecules respectively. Furthermore assume that the limit
[TABLE]
exists. From the reaction network (S36) it is immediate that
[TABLE]
Solving this ODE we get
[TABLE]
Therefore using limit (S37) we can conclude that
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Anderson, P. W. A Mathematical Model for the Narrowing of Spectral Lines by Exchange or Motion. Journal of the Physical Society of Japan 9 , 3 (1954), 316–339.
- 2[2] Asmussen, S., and Glynn, P. W. Stochastic simulation: algorithms and analysis , vol. 57 of Stochastic Modelling and Applied Probability . Springer, New York, 2007.
- 3[3] Costarelli, D., and Spigler, R. How sharp is the jensen inequality? Journal of Inequalities and Applications , 1 (2015), 1–10.
- 4[4] Dan, N. Understanding dynamic disorder fluctuations in single-molecule enzymatic reactions. Current Opinion in Colloid & Interface Science 12 , 6 (2007), 314–321.
- 5[5] Elf, J., and Ehrenberg, M. Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome research 13 , 11 (2003), 2475–2484.
- 6[6] Elowitz, M. B., Levine, A. J., Siggia, E. D., and Swain, P. S. Stochastic gene expression in a single cell. Science 297 , 5584 (2002), 1183–1186.
- 7[7] English, B. P., Min, W., Van Oijen, A. M., Lee, K. T., Luo, G., Sun, H., Cherayil, B. J., Kou, S., and Xie, X. S. Ever-fluctuating single enzyme molecules: Michaelis-Menten equation revisited. Nature Chemical Biology 2 , 2 (2006), 87–94.
- 8[8] Ethier, S. N., and Kurtz, T. G. Markov processes . Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. John Wiley & Sons Inc., New York, 1986. Characterization and convergence.
