Unifying Thermodynamic Uncertainty Relations
Gianmaria Falasco, Massimiliano Esposito, Jean-Charles Delvenne

TL;DR
This paper presents a new geometric approach to derive and strengthen thermodynamic uncertainty relations, broadening their applicability and achieving optimal bounds without complex probabilistic techniques.
Contribution
It introduces a unifying geometric method to generalize and improve TURs, including a new optimal bound based on entropy production and a novel bound for stationary Markov processes.
Findings
Derived a generalized TUR using Euclidean geometry
Established a new optimal TUR based on entropy production
Proved bounds for stationary Markov processes surpassing previous results
Abstract
We introduce a new technique to bound the fluctuations exhibited by a physical system, based on the Euclidean geometry of the space of observables. Through a simple unifying argument, we derive a sweeping generalization of so-called Thermodynamic Uncertainty Relations (TURs). We not only strengthen the bounds but extend their realm of applicability and in many cases prove their optimality, without resorting to large deviation theory or information-theoretic techniques. In particular, we find the best TUR based on entropy production alone and also derive a novel bound for stationary Markov processes, which surpasses previous known bounds. Our results derive from the non-invariance of the system under a symmetry which can be other than time reversal and thus open a wide new spectrum of applications.
Gianmaria Falasco
Complex Systems and Statistical Mechanics, Physics and Materials Science Research Unit, University of Luxembourg, L-1511 Luxembourg
Massimiliano Esposito
Complex Systems and Statistical Mechanics, Physics and Materials Science Research Unit, University of Luxembourg, L-1511 Luxembourg
Jean-Charles Delvenne
Institute of Information and Communication Technologies, Electronics and Applied Mathematics Université catholique de Louvain, Louvain-La-Neuve, Belgium
pacs:
05.70.Ln, 87.16.Yc
At several levels of complexity, random processes are successfully employed to model natural phenomena, such as open quantum systems bre02 , soft and active matter fre05 , biochemical reactions bre14 , and population ecology ova09 , just to name a few. In recent years, the understanding of their dynamical fluctuations has greatly advanced thanks to exact results of nonequilibrium physics. Most importantly, fluctuation theorems esp09 ; rao18 and response relations bai13 have been derived that, respectively, constrain the distribution of currents and relate the system’s perturbation to its dissipation and dynamical activity. Moreover, stochastic thermodynamics has emerged as a comprehensive framework to rigorously study the energetics and thermodynamics of stochastic processes sei12 ; rao18b .
Recently, uncertainty relations appeared as a new powerful tool to investigate dynamical fluctuations. They denote a set of inequalities in which the square-mean-to-variance ratio, or precision $\mathsf{p}$, of a generic observable $f$ integrated over a time interval $t$ is bounded by an $f$-independent functional $b$:
$$\mathsf{p} := \frac{\langle f\rangle^2}{\langle f^2\rangle-\langle f\rangle^2} \le b. \tag{1}$$
It was first conjectured in bar15 that $\mathsf{p}$ for a time-integrated current-like (i.e. odd under time reversal) observable is bounded by half the expected entropy produced over the interval $t$, i.e. $b=\langle\sigma\rangle/2$, where $\sigma$ denotes the entropy production. This so-called thermodynamic uncertainty relation, originally proved in the linear response regime and under stationary conditions, triggered an intense activity seeking generalizations or improvements for the largest possible class of out-of-equilibrium conditions. Apart from its conceptual importance, i.e. the existence of a universal upper bound set by dissipation on the precision of any current, (1) has major practical consequences. Indeed, (1) allows one to bound functions of the system’s dissipation which are not directly measurable, e.g. the thermodynamic efficiency of molecular motors pie16 , or to reveal the existence of hidden nonequilibrium states how19 . A first proof valid beyond the linear regime but restricted to large time intervals gin16 was soon extended to arbitrary $t$ hor17 . These, and related early results pie16a ; pol16 ; gin17 ; nar17 ; mae17 were obtained within large deviation theory, by progressively refining the bound on the rate function for empirical currents of jump and diffusion processes. Simultaneously, the same formalism was employed to extend (1) to counting observables of jump processes gar17 . In this context it was found that $\mathsf{p}$ is bounded by the mean of the total number of jumps, or activity, occurring in the time span $t$.
A different method to tackle the problem, based on perturbing the generating function of an arbitrary observable $f$, was designed in DS18 . It yields an upper bound for the response of $\langle f\rangle$, which reduces to (1) when the chosen perturbation results in a time rescaling of the dynamics. The entropic dec18 as well as the activity bound dit18 have thus been extended to both current-like and counting observables. This approach, which makes contact with inequalities originally derived by Kullback kul , has sparked much interest in the application of information-theoretic results and concepts.
More recently has19 , the exponential bound $b=(e^{\langle\sigma\rangle}-1)/2$, with $\langle\sigma\rangle$ the mean entropy production, has been derived for Langevin dynamics with feedback, under the condition of validity of the detailed (joint) fluctuation theorem for $f$ and $\sigma$. The same bound had already been derived in pro17 for periodically driven Markovian systems with a time-symmetric protocol, where now $\langle\sigma\rangle$ is computed over one period and $b$ bounds the precision divided by the (asymptotically large) number of periods.
Here, we provide an overarching method, based on elementary observations on the Hilbert space structure of observables, to recover and generalize the various bounds obtained so far in the literature. First, we provide an exact expression for $b$ in the case of arbitrary stochastic processes, possibly non-Markovian, time-varying or non-stationary, and show that the bound can be improved by a factor 2, and no more. In the case of periodic Markovian processes, we show that the precision over a period bounds the precision per period over arbitrary time intervals, which trivializes all the asymptotic bounds obtained so far in the periodic Markovian case. In the case of stationary time-invariant Markov processes, it also allows one to replace them with simple and tighter bounds, valid over all time intervals.
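The Hilbert Uncertainty Relation— We first state the most abstract version of our result. We consider a general real or complex Hilbert space $\mathcal{H}$ with some scalar product $\langle\cdot,\cdot\rangle$. To every $f\in\mathcal{H}$ is associated the so-called mean value $\langle f\rangle$ of $f$, a scalar quantity that is linear and continuous in $f$, i.e. a one-form in the dual of $\mathcal{H}$. By virtue of the Riesz representation theorem, one can find a special element $m$ in $\mathcal{H}$, so that the mean is expressed as $\langle f\rangle=\langle m,f\rangle$. We call $m$ an averaging observable for $\mathcal{H}$. We now consider the following ratio, which we call normalized precision for reasons that will become clear below,
$$\tilde{\mathsf{p}}(f) := \frac{|\langle f\rangle|^2}{\langle f,f\rangle} = \frac{|\langle m,f\rangle|^2}{\langle f,f\rangle}. \tag{2}$$
Through the Cauchy-Schwarz inequality we get $\tilde{\mathsf{p}}(f)\le\langle m,m\rangle$, with equality when $f$ is aligned with $m$. Thus
$$\max_{f\in\mathcal{H}}\tilde{\mathsf{p}}(f)=\langle m,m\rangle=\langle m\rangle. \tag{3}$$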
This constitutes the key observation of this article which we call the Hilbert Uncertainty Relation.
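As a numerical illustration of the Hilbert Uncertainty Relation, the following minimal sketch works on a finite configuration space with the probability-weighted scalar product $\langle f,g\rangle=\sum_\omega P(\omega)f(\omega)g(\omega)$. The 8-point space, the random weights and the 3-dimensional subspace are hypothetical choices made only for the example; the averaging observable is obtained as the orthogonal projection of the constant 1 onto the subspace.

```python
import random

random.seed(0)

def inner(f, g, P):
    """P-weighted scalar product <f, g> = sum_w P(w) f(w) g(w)."""
    return sum(p * a * b for p, a, b in zip(P, f, g))

def gram_schmidt(vectors, P):
    """Orthonormalise `vectors` under the P-weighted scalar product."""
    basis = []
    for v in vectors:
        w = list(v)
        for e in basis:
            c = inner(e, w, P)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        norm = inner(w, w, P) ** 0.5
        if norm > 1e-12:
            basis.append([wi / norm for wi in w])
    return basis

def normalized_precision(f, P):
    """<f>^2 / <f, f>, with the mean written as <f> = <1, f>."""
    one = [1.0] * len(P)
    return inner(one, f, P) ** 2 / inner(f, f, P)

# Hypothetical toy configuration space: 8 points with random weights.
size = 8
raw = [random.random() for _ in range(size)]
P = [r / sum(raw) for r in raw]

# Hypothetical 3-dimensional subspace playing the role of the Hilbert space.
basis = gram_schmidt(
    [[random.gauss(0, 1) for _ in range(size)] for _ in range(3)], P
)

# Averaging observable m: orthogonal projection of the constant 1.
one = [1.0] * size
m = [sum(inner(e, one, P) * e[i] for e in basis) for i in range(size)]
bound = inner(m, m, P)  # right-hand side of the Hilbert Uncertainty Relation

# Normalized precision of many random elements of the subspace.
samples = []
for _ in range(1000):
    coeffs = [random.gauss(0, 1) for _ in basis]
    f = [sum(c * e[i] for c, e in zip(coeffs, basis)) for i in range(size)]
    samples.append(normalized_precision(f, P))

print(max(samples) <= bound + 1e-12)
print(abs(normalized_precision(m, P) - bound) < 1e-12)
```

Random sampling only probes the inequality; the equality case is checked directly by evaluating the ratio at $f=m$.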
To be concrete, we focus on classical physical systems described by a configuration space $\Omega$ whose elements $\omega$ are, for example, trajectories of a random dynamical system. The configuration space is endowed with a probability measure $P$. An obvious Hilbert space of interest is the space of square-summable observables, i.e. functions $f:\Omega\to\mathbb{R}$ such that the mean $\langle f\rangle=\sum_\omega P(\omega)f(\omega)$ and the mean square $\langle f^2\rangle=\sum_\omega P(\omega)f(\omega)^2$ are well-defined and finite, with scalar product $\langle f,g\rangle=\langle fg\rangle$ (even though our considerations also apply to continuous cases, we adopt the discrete summation notation). The normalized precision now ranges between zero and one, and is equivalent to the precision via the relation $\mathsf{p}=\tilde{\mathsf{p}}/(1-\tilde{\mathsf{p}})$. In this situation, the averaging observable is simply the constant observable 1, so that $\max\tilde{\mathsf{p}}=1$, corresponding to zero variance and infinite precision.
However, in many situations we are interested in a (closed) linear subspace $\mathcal{L}$ of those observables, sharing some properties of interest, which we call for the sake of convenience the ‘legitimate observables’. If this subspace, itself a Hilbert space for the same scalar product, does not contain the constant observables, then there is a non-trivial legitimate averaging observable $m$, for which $\langle m\rangle=\langle m,m\rangle<1$ now caps the normalized precision of all legitimate observables. It is also the orthogonal projection of the constant observable 1 onto the space of legitimate observables, as $\langle 1,f\rangle=\langle m,f\rangle$ implies that $1-m$ is orthogonal to all legitimate observables. The corresponding maximum of $\mathsf{p}$ over $\mathcal{L}$ is $\langle m\rangle/(1-\langle m\rangle)$.
Interestingly, this quantity has a geometric interpretation. Assume that we find a zero-mean square-summable observable $n$ (possibly illegitimate, i.e. outside of $\mathcal{L}$) that is still an averaging observable, i.e. verifying $\langle n,f\rangle=\langle f\rangle$ for all legitimate observables $f$. Then $\langle n,f\rangle$ is also the covariance of $n$ with $f$, since $\langle n\rangle=0$, and $\langle n,n\rangle$ is also the variance of $n$. Therefore, the Cauchy-Schwarz inequality applied to the covariance, $\langle f\rangle^2=\mathrm{Cov}(n,f)^2\le\mathrm{Var}(n)\,\mathrm{Var}(f)$, yields that $\langle n,n\rangle$ is an upper bound on the maximum precision of legitimate observables. In fact, if $n$ lies in the span of 1 and $m$ while being orthogonal to 1, namely,
$$n=\frac{m-\langle m\rangle}{1-\langle m\rangle}, \tag{4}$$
we find that $\langle n,n\rangle=\langle m\rangle/(1-\langle m\rangle)$ is exactly the maximum precision reachable over $\mathcal{L}$ (see Fig. 1 for a geometric representation).
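The role of the zero-mean averaging observable can be checked on a small example. The sketch below assumes a hypothetical four-point configuration space and takes as legitimate the observables that vanish on a chosen subset, so that the legitimate averaging observable is the indicator of the complement. The names `m`, `n` and `q` mirror the constant-shifted and rescaled construction described above and are otherwise illustrative.

```python
def mean(f, P):
    """Mean of an observable f under the probability weights P."""
    return sum(p * x for p, x in zip(P, f))

def precision(f, P):
    """Square-mean-to-variance ratio of f."""
    mu = mean(f, P)
    var = mean([x * x for x in f], P) - mu ** 2
    return mu ** 2 / var

# Hypothetical configuration space with four points.
P = [0.4, 0.3, 0.2, 0.1]
vanishing = {0, 1}  # legitimate observables are zero on these points

# Legitimate averaging observable: indicator of the complement.
m = [0.0 if i in vanishing else 1.0 for i in range(len(P))]
q = mean(m, P)  # <m> = <m, m>

# Zero-mean averaging observable in the span of 1 and m.
n = [(mi - q) / (1.0 - q) for mi in m]
n_norm = mean([x * x for x in n], P)  # <n, n>, the variance of n

# An arbitrary legitimate observable (values chosen for illustration).
f = [0.0, 0.0, 2.0, -0.5]
cov_nf = mean([a * b for a, b in zip(n, f)], P)

print(abs(mean(n, P)) < 1e-12)          # n has zero mean
print(abs(cov_nf - mean(f, P)) < 1e-12)  # <n, f> = <f>
print(precision(f, P) <= n_norm)         # <n, n> bounds the precision
print(abs(precision(m, P) - n_norm) < 1e-12)  # bound reached by f = m
```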
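Time anti-symmetric observables— The TURs are obtained by considering $\Omega$ as the set of all possible paths of a random process, endowed with an involution symmetry $\theta$ (i.e., a transformation whose square is the identity) called time-reversal, which maps any path $\omega$ in $\Omega$ to its time-reversed path $\theta\omega$. We consider legitimate the observables that are time-antisymmetric, i.e. satisfying $f(\theta\omega)=-f(\omega)$. The time-reversal induces another probability measure $\tilde{P}(\omega)=P(\theta\omega)$, attributing to an event the $P$-probability of the time-reversed event. Then the scalar product of two time-antisymmetric observables $f$ and $g$ can be written as $\langle f,g\rangle=\sum_\omega\frac{P(\omega)+\tilde{P}(\omega)}{2}f(\omega)g(\omega)$ while the mean of $f$ is written as $\langle f\rangle=\sum_\omega\frac{P(\omega)-\tilde{P}(\omega)}{2}f(\omega)$. From this we deduce that the mean observable $m$ satisfying $\langle m,f\rangle=\langle f\rangle$ is the time-antisymmetric observable:
$$m(\omega)=\frac{P(\omega)-\tilde{P}(\omega)}{P(\omega)+\tilde{P}(\omega)}. \tag{5}$$
The maximum normalised precision (3) over all time-antisymmetric observables is therefore
$$\langle m\rangle=\frac{1}{2}\sum_\omega\frac{\big(P(\omega)-\tilde{P}(\omega)\big)^2}{P(\omega)+\tilde{P}(\omega)}. \tag{6}$$
This exact bound can be written in terms of $\sigma(\omega)=\ln[P(\omega)/\tilde{P}(\omega)]$ as
$$\langle m\rangle=\Big\langle\tanh\frac{\sigma}{2}\Big\rangle. \tag{7}$$
Equation (6) clearly vanishes when the probability measure is time-symmetric, $P=\tilde{P}$, and can be loosened in terms of two different quantities that capture the gap separating $P$ from $\tilde{P}$. First, the total variation distance, ranging between zero and one, $d_{TV}=\frac{1}{2}\sum_\omega|P(\omega)-\tilde{P}(\omega)|$. Second, the Kullback-Leibler (KL) divergence $D=\sum_\omega P(\omega)\ln[P(\omega)/\tilde{P}(\omega)]=\langle\sigma\rangle$. Rewriting (6) as $d_{TV}\sum_\omega\frac{|P-\tilde{P}|}{2d_{TV}}\tanh\frac{|\sigma|}{2}$, a convex combination of positive values of the concave function $\tanh$, and using $\sum_\omega(P-\tilde{P})\sigma=2D$, we obtain the relaxed inequality
$$\max_f\tilde{\mathsf{p}}(f)\le d_{TV}\tanh\frac{D}{2d_{TV}}, \tag{8}$$
the main result of this section. As this expression is increasing in $d_{TV}$, one can use the coarse bound $d_{TV}\le 1$ to obtain $\tilde{\mathsf{p}}\le\tanh(D/2)$, which in terms of the square-mean-to-variance ratio reads
$$\mathsf{p}\le\frac{e^{D}-1}{2}, \tag{9}$$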
a bound recently proposed under the name of General TUR has19 ; pot19 . We underline that this result is valid for arbitrary dynamics, such as non-Markovian and non-autonomous, for all time intervals $t$, and all possible time-antisymmetric observables (not necessarily time-integrated ones). It is even valid for sets of paths of variable length, e.g., defined by a random stopping time. It is also valid for any notion of ‘time-reversal’ that is an involution of $\Omega$. For example, if a path is defined as a discrete or continuous list of ‘states’, then the time-reversed path may be defined as the time-reversed list of the same states, or the time-reversed list of conjugated states. Typically, in a model of an underdamped system we want to include the speed or momentum as part of the state, and flip it as well as reversing the order of states when applying time-reversal. All these choices for the time-reversal involution will yield mathematically valid inequalities, but not all will carry the same physical meaning. For instance, it is only in the circumstances where the fluctuation relation holds rao18 that the log-probability ratio $\sigma$ between a path and its time-reverse is the physical entropy production associated with the process (as required in has19 ). One such circumstance is when the system is driven by a time-symmetric protocol, and respects local detailed balance at all times. Outside these examples, $\sigma$ is to be regarded as an observable of interest, accessible in principle to measurement, bearing no direct connection to thermodynamics, yet useful as a bound on the fluctuations of time-antisymmetric observables such as total displacement, etc. Note that some observables of practical interest, such as work or heat, are dependent on the parameters of the protocol and therefore are time-antisymmetric if the time-varying protocol is itself time-symmetric. A time-symmetric protocol is an assumption required by pro17 ; pot19 .
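The chain of bounds for time-antisymmetric observables can be probed numerically. The sketch below is an illustration under stated assumptions rather than the authors' computation: it uses a hypothetical set of 10 paths paired by the involution `i ^ 1`, a random path measure, and the standard definitions of total variation distance and KL divergence between the measure and its time-reversed image. It compares the exact maximum $\langle\tanh(\sigma/2)\rangle$, the relaxation in terms of the total variation distance and the KL divergence, and the exponential bound.

```python
import math
import random

random.seed(1)

# Hypothetical path space: 5 pairs of mutually time-reversed paths.
pairs = 5
size = 2 * pairs
theta = [i ^ 1 for i in range(size)]  # involution: 0<->1, 2<->3, ...

raw = [random.random() + 0.05 for _ in range(size)]
P = [r / sum(raw) for r in raw]
P_rev = [P[theta[i]] for i in range(size)]  # measure of the reversed path

sigma = [math.log(P[i] / P_rev[i]) for i in range(size)]
exact = sum(P[i] * math.tanh(sigma[i] / 2) for i in range(size))
d_tv = 0.5 * sum(abs(P[i] - P_rev[i]) for i in range(size))
D = sum(P[i] * sigma[i] for i in range(size))  # KL divergence
relaxed = d_tv * math.tanh(D / (2 * d_tv))
coarse = math.tanh(D / 2)
exp_bound = (math.exp(D) - 1) / 2

def precision(f):
    """Square-mean-to-variance ratio under P."""
    mu = sum(p * x for p, x in zip(P, f))
    var = sum(p * x * x for p, x in zip(P, f)) - mu ** 2
    return mu ** 2 / var

def random_antisymmetric():
    """Random observable with f(theta w) = -f(w)."""
    f = [0.0] * size
    for k in range(pairs):
        a = random.gauss(0, 1)
        f[2 * k], f[2 * k + 1] = a, -a
    return f

best_sampled = max(precision(random_antisymmetric()) for _ in range(2000))
m = [(P[i] - P_rev[i]) / (P[i] + P_rev[i]) for i in range(size)]
max_precision = exact / (1 - exact)

print(exact <= relaxed <= coarse)
print(best_sampled <= max_precision <= exp_bound)
print(abs(precision(m) - max_precision) < 1e-9)
```

The random search never exceeds the exact maximum, which is attained by the averaging observable itself.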
In the case of arbitrary time-varying protocols, another notion of time-reversal is needed, which also reverses the protocol, in order to include those observables of interest in the space of legitimate observables. This was first investigated in pro19 with a tailored large deviation argument (see SI for a formal statement and a proof as a direct corollary of (9)).
A slightly tighter bound than (9) (whose implicit expression appears in tim19 ) follows from replacing the total variation distance $d_{TV}$ in (8) by an upper bound given in terms of the KL divergence $D$ (see SI), leading to the novel asymptotic expression
$$\mathsf{p}\lesssim\frac{e^{D}}{4}\qquad(D\to\infty). \tag{10}$$
Remarkably, this is the tightest bound obtainable from the sole knowledge of $D$. This is proved in SI by finding a specific system and a specific observable on $\Omega$ that meets the bound, for every given value of $D$.
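The tighter bound is only implicit, so a short numerical sketch may help. It assumes, following the derivation in the SI, that the upper bound on the total variation distance is the solution of the fixed-point equation $d=\tanh(D/2d)$ and that the resulting bound on the normalized precision is $d^2$. The function names and the bisection solver are illustrative choices.

```python
import math

def d_star(D, iterations=200):
    """Solve d = tanh(D / (2 d)) on (0, 1] by bisection.

    d - tanh(D / (2 d)) is increasing in d, negative near 0 and
    positive at d = 1, so the root is unique.
    """
    lo, hi = 1e-12, 1.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if mid - math.tanh(D / (2 * mid)) < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def tight_bound(D):
    """Precision bound d*^2 / (1 - d*^2) from the fixed point d*."""
    d = d_star(D)
    return d * d / (1 - d * d)

def coarse_bound(D):
    """Exponential bound (e^D - 1) / 2."""
    return (math.exp(D) - 1) / 2

for D in (0.1, 1.0, 5.0, 10.0):
    print(D, tight_bound(D), coarse_bound(D), math.exp(D) / 4)
```

For large values of the divergence the ratio between the tight bound and $e^{D}/4$ approaches one, which is the factor-2 improvement over the exponential bound.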
The Periodic Uncertainty Relation— In many cases it is relevant to decompose a path as the concatenation of paths taking place on time intervals of duration . In this way the space of paths factors as a Cartesian product . Here we study the most common case of interest where every path is a (discrete or continuous) sequence of states and transitions in a Markov process, and where the sequence is stationary (periodicity assumption). This is typically sufficient to model overdamped Markov processes. We also consider the legitimate observables on as those observables that decompose as a sum , where each is time-antisymmetric: . In this case we find that the precision available over any number of periods is bounded above by the precision available over a single period, namely,
[TABLE]
a theorem (proved in SI) that we call the Periodic Uncertainty Relation for time-antisymmetric observables on overdamped Markov processes. In particular, applying (9) to a single period, we find that the precision of periods is bounded by , where is now the Kullback-Leibler divergence over a single interval. This is valid for arbitrary protocols (being understood that is not necessarily the entropy production). This includes in particular the result in pro17 , which was proved originally by large deviation techniques in the limit , for overdamped systems under time-symmetric protocols. Our result is a special case of a more general Periodic Uncertainty Relation, stated and proved in the SI, which holds for more general families of legitimate observables.
In the case of stationary (jump or diffusive) processes over a total time interval , the period is infinitesimal, , and so is . Then (9) combined with (11) reduces to
[TABLE]
so that we recover the entropy bound for arbitrary time intervals, previously proved with information-theoretic means dec18 . Beyond recovering these results with a unified method, we can derive far sharper bounds. In particular, for a stationary continuous-time Markov process, precision and normalized precision over an infinitesimal time interval coincide. So, in view of (11), the precision over a time interval is bounded by
[TABLE]
where is the probability of a transition along a path relating a source state to a target state over an infinitesimal time interval . The current is defined as . In a finite state jump process, is a transition between two different states, and factors as for stationary state probability and jumping rate . We know that (13) can be relaxed to with . We obtain in particular the simple and novel bound,
[TABLE]
which we call the absolute current bound, valid for all stationary Markov processes.
In the case of finite state jump processes, it is evidently tighter than the activity bound,
[TABLE]
This last bound applies to all time-summed observables taking non-zero values only on the transitions (thus zero values on the constant paths), without any request of time-antisymmetry dit18 . The activity bound turns out to be another avatar of the Periodic Uncertainty Relation, where the bound can be derived on an infinitesimal interval and then extended to arbitrary times (see SI).
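The ordering of the stationary bounds can be illustrated on a small jump process. The expressions below are a reading of the bounds under stated assumptions, since they are obtained here by applying (6) to single transitions over an infinitesimal interval: with $a=\pi_x r_{xy}$ and $b=\pi_y r_{yx}$ the stationary fluxes across an edge, the per-unit-time quantities are $\sum(a-b)^2/(a+b)$ for the sharp bound, $\sum|a-b|$ for the absolute current bound, $\sum(a+b)$ for the activity, and $\frac{1}{2}\sum(a-b)\ln(a/b)$ for half the entropy production rate, all summed over unordered pairs of states. The 3-state rate matrix is hypothetical.

```python
import math

# Hypothetical 3-state jump process: rates[x][y] is the rate from x to y.
rates = [
    [0.0, 2.0, 0.5],
    [0.3, 0.0, 1.5],
    [1.0, 0.2, 0.0],
]
size = len(rates)
exit_rate = [sum(row) for row in rates]

# Stationary distribution by power iteration on the uniformised chain.
scale = 2.0 * max(exit_rate)
pi = [1.0 / size] * size
for _ in range(5000):
    pi = [
        pi[y] * (1.0 - exit_rate[y] / scale)
        + sum(pi[x] * rates[x][y] for x in range(size)) / scale
        for y in range(size)
    ]

# Per-unit-time bounds, summed over unordered pairs of states.
sharp = abs_current = activity = half_entropy = 0.0
for x in range(size):
    for y in range(x + 1, size):
        a = pi[x] * rates[x][y]  # stationary flux x -> y
        b = pi[y] * rates[y][x]  # stationary flux y -> x
        sharp += (a - b) ** 2 / (a + b)
        abs_current += abs(a - b)
        activity += a + b
        half_entropy += 0.5 * (a - b) * math.log(a / b)

print(sharp <= abs_current <= activity)
print(sharp <= half_entropy)
```

The first ordering follows from $|a-b|\le a+b$ on every edge, and the second from the inequality $\ln(a/b)\ge 2(a-b)/(a+b)$ for $a\ge b$. The absolute current bound and the entropy bound are not ordered in general.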
Example— We illustrate the different bounds for stationary Markovian dynamics on a benchmark example dit18 which provides a minimal model for the molecular motor kinesin moving under load along a microtubule. Kinesin is either in a low energy state (1) with both heads on the microtubule or in a high energy state (2) with only one head attached. Transitions from state 1 to state 2 happen with or without ATP consumption, and cause both forward and backward motion along the microtubule (with half step size ). Each of these four transitions out of each state (making eight possible transitions) has an associated rate , function of the ATP concentration, [ATP], and of the external loading force (see SI). In Fig. 2 we plot for the displacement, and the various bounds. We see that our simpler novel bound (14) outperforms the activity bound and entropy production bounds.
Discussion—The general approach introduced in this Letter solely exploits the properties of the Hilbert space of observables and the presence of a (broken) involutive symmetry. Therefore, it is not restricted to trajectories of random systems endowed with some notion of time-reversal symmetry. Rather, can be, e.g., the configuration space of a classical or quantum system and the involution may be parity, charge conjugation, spin reversal, etc. (see SI for an example of an Ising system). We leave for the future the application to quantum systems and spontaneously broken symmetries.
Acknowledgments— M. E. thanks the European Research Council (project NanoThermo ERC-2015-COG agreement no. 681456). M. E. and J-C. D. thank the FNR INTER mobility program.
I Supplemental information
I.1 Best bound based on $D$ only
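A slightly tighter bound than (9) follows by bounding the total variation distance $d_{TV}$ in terms of the KL divergence $D$. Indeed, for the legitimate observable $f=\mathrm{sgn}(P-\tilde{P})$, the normalized precision (2) is $d_{TV}^2$ and (8) becomes
$$d_{TV}^2\le d_{TV}\tanh\frac{D}{2d_{TV}}, \tag{16}$$
or
$$d_{TV}\le\tanh\frac{D}{2d_{TV}}, \tag{17}$$
which allows one to find the bound $d_{TV}\le d^*$, where the r.h.s. is defined by
$$d^*=\tanh\frac{D}{2d^*}. \tag{18}$$
Injecting this bound on $d_{TV}$ into (8), we obtain
$$\max_f\tilde{\mathsf{p}}(f)\le d^*\tanh\frac{D}{2d^*}=(d^*)^2, \tag{19}$$
a tighter bound than (9), which was obtained from the trivial bound $d_{TV}\le 1$ (see figure 3 for a comparison of the two bounds).
Remarkably, this is the tightest bound obtainable from the sole knowledge of $D$, as the following argument proves. We split $\Omega$ into $\Omega_+$ and $\Omega_-=\theta\Omega_+$. On both parts we take a uniform probability distribution, so that the total probability of $\Omega_+$ is $(1+d)/2$, with $d$ chosen to satisfy $D=d\ln\frac{1+d}{1-d}$, i.e. $d=\tanh(D/2d)$. One sees that the total variation distance is precisely $d=d^*$, and the bound is matched with equality.
The asymptotic expression (10) is obtained for $D\to\infty$, or equivalently $d^*\to 1$. We expand $d^*=\tanh(D/2d^*)\approx 1-2e^{-D/d^*}\approx 1-2e^{-D}$. The latter step stems from $d^*\to 1$ but requires some care in the error analysis; in particular it requires showing that $D(1-d^*)\to 0$.
Plugging into (19) and using the relation $\mathsf{p}=\tilde{\mathsf{p}}/(1-\tilde{\mathsf{p}})$, we obtain (10), confirming the numerical observation in figure 3.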
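The tightness argument can be reproduced with the smallest possible instance of the construction. The sketch below assumes that each of the two parts of the path space contains a single path (a hypothetical simplification, for which the uniform distribution on each part is trivial) and parametrises the example by the total variation distance `d` instead of the divergence, so that no equation needs to be solved.

```python
import math

def two_block_example(d):
    """Two mutually reversed paths with probabilities (1 + d)/2 and (1 - d)/2.

    Returns the KL divergence, the total variation distance and the
    normalized precision of the sign observable f = (+1, -1).
    """
    q = (1 + d) / 2
    P = [q, 1 - q]
    P_rev = [1 - q, q]  # measure of the time-reversed path
    D = sum(p * math.log(p / r) for p, r in zip(P, P_rev))
    d_tv = 0.5 * sum(abs(p - r) for p, r in zip(P, P_rev))
    f = [1.0, -1.0]
    mean = sum(p * x for p, x in zip(P, f))
    second_moment = sum(p * x * x for p, x in zip(P, f))
    return D, d_tv, mean ** 2 / second_moment

results = {d: two_block_example(d) for d in (0.1, 0.5, 0.9, 0.99)}
for d, (D, d_tv, norm_prec) in results.items():
    # d solves the fixed-point equation for this value of D,
    # and the sign observable reaches the bound d^2.
    print(d, abs(d - math.tanh(D / (2 * d))) < 1e-9, abs(norm_prec - d * d) < 1e-12)
```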
I.2 Bound for arbitrary time-varying protocols
Here, we tackle the case of a random system subject to arbitrary time-varying protocols. In this case some meaningful observables, such as work and heat, are not time-antisymmetric in the naive sense of time-reversal as reading the list of states in reverse order, because work and heat depend on the parameters of the time-varying protocol.
For this reason, we consider the auxiliary configuration space , which is the space of all pairs of paths , endowed with the direct product measure . Here evaluates the probability of in the forward protocol, and is the probability computed in the time-reversed protocol.
On we consider the involution . In other words the involution reverses and swaps the paths. We consider the legitimate observables on as those that take the form and are antisymmetric for the involution, which is equivalent to the identity . One checks that protocol-dependent thermodynamic variables, such as heat are indeed anti-symmetric for this involution, where in this case (resp., ) denotes the heat exchanged along the path as computed from the forward (resp., backward) protocol.
We can now apply the bound (9), only with the linear form occurring in (1) now being the sum of means according to the forward and backward protocol (similarly for the variance). Moreover, turns out to be
[TABLE]
In this way we retrieve the recent result of pro19 , which is there derived with a large deviation argument. We refer to that paper for a discussion on the meaning and importance of this bound.
I.3 Periodic Markovian processes
We now formulate generalities on periodic Markovian processes, introducing progressively the assumptions of Markovianity on the path level, then periodicity, and finally the construction of a state space. This will be useful to state and prove the Periodic Uncertainty Relation in the next section.
We decompose the configuration space as a Cartesian product , so that a global configuration is seen as the concatenation of local configurations . Although the formalism applies in principle to any sort of configurations (for instance spin configurations), for consistency with the main text and application to the Thermodynamic Uncertainty Relations, from now on we refer to and and as global and local ‘paths’.
The global probability measure on naturally projects into marginal probability measures on each , and into marginal pairwise probability measures on each pair . The so-called time-summed observables on are those of the form , where is an observable on . We take the legitimate observables on as those time-summed observables such that each belongs to the space of legitimate observables on . The mean of a time-summed observable is the sum of local means . The mean product of two such observables and can be written in terms of scalar products on each ,
[TABLE]
Here, we decomposed as , with the conditional probability measure on given , and wrote to emphasize that it maps an observable on to an observable on through a linear conditional mean operator . Note that even if is legitimate, i.e. belongs to , the conditional mean observable may be an arbitrary square-integrable observable on , not necessarily legitimate. Observe that , where ∗ denotes the adjunction of linear operators between the Hilbert spaces and equipped with their respective scalar products.
We now introduce the assumption that the sequence is a Markov chain. This implies that for any , and also for any , which is known as Chapman-Kolmogorov’s equation. Moreover, assume that the dynamics is periodic, which means that all can be taken identical with identical marginals , the joint measures only depend on the difference , and the spaces of legitimate observables are identical as well, . Then, it is enough to consider , from which we compute any as if , and if , where is the adjoint of for the scalar product over . Note that is but the usual transition matrix appearing in the master equation associated to the discrete-step Markov chain , as we may write the propagation of transient probability measures as
[TABLE]
Nevertheless, as we assume periodicity, i.e. stationarity of this Markov chain, the master equation is of little use here, except to notice that must be the dominant left-eigenvector of , of eigenvalue 1. The viewpoint made explicit above, and used in (20), sees as describing the propagation of the conditional mean of an observable instead of the transient probability measures: this is the ‘Heisenberg viewpoint’ dual to the master equation.
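The duality between propagating probability measures and propagating conditional means can be made concrete on a finite chain. The sketch below uses a hypothetical 3-state row-stochastic matrix `T`, computes its stationary distribution, and builds the adjoint of the conditional-mean operator with respect to the stationary-weighted scalar product, using the standard elementwise formula $T^*_{xy}=\pi_y T_{yx}/\pi_x$ (an assumption about notation, since the symbols are not reproduced in this text).

```python
# Hypothetical 3-state row-stochastic transition matrix.
T = [
    [0.1, 0.6, 0.3],
    [0.4, 0.2, 0.4],
    [0.5, 0.3, 0.2],
]
size = len(T)

# Master-equation viewpoint: a row vector of probabilities is propagated
# from the left until it reaches the stationary distribution.
pi = [1.0 / size] * size
for _ in range(2000):
    pi = [sum(pi[x] * T[x][y] for x in range(size)) for y in range(size)]

def propagate(M, g):
    """Heisenberg viewpoint: (M g)(x) is the conditional mean of g after one step."""
    return [sum(M[x][y] * g[y] for y in range(size)) for x in range(size)]

def inner(f, g):
    """Scalar product of state observables weighted by the stationary measure."""
    return sum(p * a * b for p, a, b in zip(pi, f, g))

# Adjoint under the stationary-weighted scalar product (time-reversed chain).
T_adj = [[pi[y] * T[y][x] / pi[x] for y in range(size)] for x in range(size)]

# Two arbitrary state observables, chosen for illustration.
f = [1.0, -2.0, 0.5]
g = [0.3, 0.7, -1.1]
lhs = inner(f, propagate(T, g))
rhs = inner(propagate(T_adj, f), g)

print(abs(lhs - rhs) < 1e-12)
print(all(abs(sum(row) - 1.0) < 1e-9 for row in T_adj))
```

The adjoint is itself a stochastic matrix only because the weights are stationary, which is where the periodicity assumption enters.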
From the knowledge of we can compute the mean product of any two time-summed observables and over an arbitrary number of intervals, as given by (20), which now becomes
[TABLE]
or, equivalently:
[TABLE]
To proceed we exploit Markovianity and periodicity further, as they imply the possibility to define a concept of ‘state space’. This state space is such that to a path we can associate a source state and a target state , with the properties that and that , are independent given the state . With the knowledge of the probability measure on the paths, one can always build in principle (albeit in a non-unique way) a notion of state complying with these properties, as being a sufficient statistics of the past for the future and conversely cru89 ; sha01 . In most applications, the reverse situation occurs, where a natural notion of state is given, from which a notion of path is built as a (discrete or continuous) list of successive states. Once a state space is fixed, together with the source map and target map , one may endow a probability measure on as . From this we define a Hilbert space of square-summable real observables on the state space .
We now have natural linear mappings between the path observables and the state observables. In particular given an observable , we denote the mean of knowing the source state. In other words,
[TABLE]
In other terms, we can write the operator as a matrix whose entry is if and 0 otherwise. The adjoint operator is simply the lifting of a state observable to a path observable: if is a state observable, then . If we think of as a matrix, its entry is if and 0 otherwise. This is observed by writing down the identity defining , namely for all state observables and all path observables .
Similar considerations apply for , the target conditional mean operator. A trivial observation is that : the constant unit path-observable is mapped to the constant unit state-observable. Another observation is that , the identity on . Moreover, we have and .
With these tools at hand, we can state and prove the Periodic Uncertainty Relation.
I.4 The Periodic Uncertainty Relation
We state and prove an abstract version of the Periodic Uncertainty Relation, more general than both (11) and the activity bound. Roughly speaking, it states that oftentimes the precision reachable over periods (for any ) is less than times the precision reachable over a single period.
We work under the same assumptions as in the previous section. Namely, a global path is a list of local paths , for which we assume a Markovian and periodic dynamics, and we assume we have chosen a state space . Now suppose that for a certain space of legitimate observables on , we find a zero-mean averaging observable , i.e. checking for every legitimate observable .
We now introduce the crucial assumption that is such that . Then it is easily checked with the identities derived at the end of the previous section that and .
Let us get back to the computation of , for and . Recall from (22) that the total contribution of in this sum is
[TABLE]
Assume that we take and . Then the contribution of to reduces to (using among others the fact that ). Thus is a zero-mean averaging observable over , as for every legitimate . Therefore the precision over the periods is bounded by , which develops as
[TABLE]
using . Therefore, is not only a bound on the precision over one period but also a bound over the precision over periods, if scaled with a factor . In particular, if the legitimate averaging observable satisfies
[TABLE]
then the observable given by (4) also satisfies . Moreover .
We now recapitulate our assumptions and formulate the main result. Assume that over a single period of a periodic Markovian system the legitimate averaging observable satisfies (26). Then the precision that can be achieved over periods, divided by , is less than the precision that can be achieved in a single period:
[TABLE]
This is our most general statement of the Periodic Uncertainty Relation.
I.5 The Periodic Uncertainty Relation for time-antisymmetric observables on overdamped Markov processes
In this section we consider again a periodic Markovian system, and assume that the legitimate observables over one period are the time-antisymmetric observables, for some definition of time-reversal, i.e. any involutive symmetry of . We also assume that the path on a period (which up to now has been defined as an element of some arbitrary abstract space , from which we can derive a source state and a target state) is a discrete or continuous sequence of states of a Markov process, and the time-reversal simply consists in taking this sequence in reverse order. This is the case when the Markov process models an overdamped system.
To prove (26) for the legitimate observable defined by (5), it is enough to show that , since . But , thus evaluated at state reads
[TABLE]
As this expression is symmetric for time-reversal, this is also evaluated at state . Therefore we can apply the Periodic Uncertainty Relation.
I.6 The activity bound as a Periodic Uncertainty Relation
We now prove the activity bound dit18 : the precision of an observable that is the weighted sum of the transitions undergone by a finite-state continuous time Markov chain during an arbitrary time interval is bounded by the mean number of transitions.
In the first step, we identify the correct for a short time interval , and notice that is the expected number of transitions within time . In order to prove this, let us first consider a general setting (absolutely no assumption on ). When the legitimate observables are defined as those that take zero value on a given subset , we find that is the function that takes zero value on and unit value on . Therefore the maximum normalised precision is and the maximum precision is . The zero-mean observable in the span of and is here taking value on and 1 on , for which we can check indeed .
Coming back to the case of stationary finite state Markov chains over a short time interval, we take as the constant paths, i.e. those where the walker waits without jumping to another state. We neglect the possibility of multiple transitions in such a short time, therefore , equal to to first order, is the probability of a proper transition, which is also the mean number of proper transitions, and is proportional to . Thus, the optimal assigns a unit weight to all transitions.
In a second step, we verify that as requested by (26). Indeed evaluated at state is simply the probability to leave the state in the infinitesimal interval, and evaluated at state is the probability of arrival to , which is the same from stationarity. From there the Periodic Uncertainty Relation applies, and the precision available over any time interval is no larger than the expected number of transitions: this is the activity bound.
I.7 Computing the variance of a time-summed observable in a stationary finite-state continuous-time Markov chain
We indicate here how to evaluate numerically the variance and covariances of observables for a stationary ergodic finite-state Markov chain over asymptotically large time intervals. This is useful to evaluate the variance of the displacement observable in the kinesin model, as we show in the next section. A continuous-time Markov chain is often represented by a master equation, or Kolmogorov forward equation, computing the evolution of a transient probability towards stationarity:
[TABLE]
in matrix notation, where is a row vector and is the Laplacian matrix encoding the rates: , the rate at which the Markov chain, in state , transitions to another state . The diagonal entry is picked so that every row sums to zero: , to ensure preservation of total probability. To make an explicit link with the periodic case presented above, we take as the space of paths of some arbitrary duration . It is then useful to write the discrete-time master equation which propagates the state over an interval :
[TABLE]
As we already observed in the last section, the master equation is of little use for a stationary Markov chain, and we prefer the observable viewpoint. Given a state observable , assigning value on each state , then is a state observable assigning to each the mean value of at time knowing that the state at time is . The mean value at time given the state at time is encoded in the state observable . Here denotes the adjoint of under the natural scalar product on states, which is defined elementwise with . In the same way that the path-to-path operator factorizes as , the state-to-state operator factorizes as . Therefore for , we can write .
We now consider in the case where all are zero-mean and identical to one another, and all are zero-mean and identical to one another. In this case, (I.4) provides the scaling of the covariance with :
[TABLE]
We can rewrite which in the limit of short time intervals gives . Note that the inversion of the non-invertible matrix is not problematic for an ergodic Markov chain, because is then invertible on the subspace of zero-mean state observables, such as . In the limit of short times, it is convenient to consider a time horizon , with , and then take , as we are interested in asymptotically large times.
[TABLE]
This expression can be processed further by expressing as the state-space scalar product , and similarly for . Overall, the covariance for asymptotically large times is
[TABLE]
which is the main result of this section.
If we want to evaluate the covariance of non-zero-mean observables, we may first center each observable by subtracting its mean.
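For the special case of a state observable integrated over time, the centering step and the inversion on the zero-mean subspace can be illustrated on a two-state chain. This is a minimal sketch under stated assumptions, not the general path-observable formula of this section: it uses the standard result that the variance of the time integral grows at rate 2<c, u>, where c is the centered observable, u solves L u = -c with zero stationary mean, and the scalar product is weighted by the stationary distribution. The rates `a` and `b` and the function name are hypothetical.

```python
def asymptotic_variance_rate(a, b, g):
    """Long-time variance per unit time of the time integral of g.

    Two-state chain with rate a for 0 -> 1 and rate b for 1 -> 0.
    g is a list [g0, g1] of values of the state observable.
    """
    # Stationary distribution of the two-state chain.
    pi = [b / (a + b), a / (a + b)]

    # Center the observable by subtracting its stationary mean.
    g_mean = pi[0] * g[0] + pi[1] * g[1]
    c = [g[0] - g_mean, g[1] - g_mean]

    # Solve L u = -c with L = [[-a, a], [b, -b]].
    # Row 0 gives -a * (u0 - u1) = -c0, so d = u0 - u1 = c0 / a.
    # Row 1 is consistent precisely because c has zero stationary mean.
    d = c[0] / a

    # Pick the solution with zero stationary mean: pi . u = 0.
    u = [d * pi[1], -d * pi[0]]

    # Variance rate as twice the stationary-weighted scalar product.
    return 2.0 * sum(p * ci * ui for p, ci, ui in zip(pi, c, u))

# Occupation time of state 1 (a telegraph process).
rate = asymptotic_variance_rate(1.0, 2.0, [0.0, 1.0])
print(rate)
```

For the occupation time of state 1 the result can be compared with the known closed form 2ab/(a+b)^3 of the telegraph process, which equals 4/27 for the rates chosen here.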
I.8 The kinesin model
The kinesin model of the main text consists of 2 states connected by 4 reversible transitions. Each of these four transitions out of state has an associated rate given by lau07:
[TABLE]
The control parameters are the ATP concentration, [ATP], and the external loading force , with being its associated dimensionless work along the length . Typical ranges for in vitro experiments are and . Other parameters are chosen as in lau07; dit18, i.e. extracted from fits of experimental velocity data.
We are interested in computing the variance of the displacement for arbitrarily large times. For that purpose we use (29) for encoding the centered displacement on each path over an infinitesimal time . The displacement is on each proper transition, and zero on each constant path ( is short enough to ignore multiple transitions). The mean displacement is proportional to , therefore the centered displacement on each transition can still be taken to up to a negligible transition, and is set to on both constant paths. The mean displacement over conditioned on the initial state is
[TABLE]
where , . The stationary probabilities read and . The mean displacement over conditioned on the final state is
[TABLE]
while the Laplacian is
[TABLE]
where . Therefore the two state-space products are
[TABLE]
while the path-space scalar product is
[TABLE]
Summing together (33) and (34) we obtain the variance of the displacement.
We checked these results using the large deviation approach gar09. The scaled cumulant generating function of any observable is found by ‘tilting’ the stochastic matrix by into
[TABLE]
and looking for its leading eigenvalue , i.e. the one satisfying . In (37), defines the observable . For example, the dynamical activity is obtained for , while the motor displacement for , and with odd (even). The (scaled) mean and the variance of are calculated as and , respectively.
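The large deviation check can be sketched for a two-state chain with two transition channels per state, mirroring the structure of the kinesin model. The rate values below are placeholders, not the fitted parameters of lau07, and the helper names are illustrative. The sketch assumes the standard identification of the scaled mean and variance with the first and second derivatives of the leading eigenvalue at zero tilt, approximated here by central finite differences.

```python
import math

def leading_eigenvalue(s, rates, weights):
    """Leading eigenvalue of the tilted 2x2 generator at tilt s.

    rates["a"] and rates["b"] hold the (forward, backward) rates out of
    states a and b; weights has the same layout and gives the increment
    of the observable on each channel.
    """
    # Off-diagonal entries: each channel rate is multiplied by exp(s * weight).
    m_ab = sum(r * math.exp(s * w) for r, w in zip(rates["a"], weights["a"]))
    m_ba = sum(r * math.exp(s * w) for r, w in zip(rates["b"], weights["b"]))
    # Diagonal entries are the untilted escape rates with a minus sign.
    m_aa = -sum(rates["a"])
    m_bb = -sum(rates["b"])
    # Larger root of the characteristic polynomial of a 2x2 matrix.
    trace = m_aa + m_bb
    disc = (m_aa - m_bb) ** 2 + 4.0 * m_ab * m_ba
    return 0.5 * (trace + math.sqrt(disc))

def scaled_mean_and_variance(rates, weights, h=1e-3):
    """First and second derivative of the eigenvalue at s = 0."""
    lam_plus = leading_eigenvalue(h, rates, weights)
    lam_zero = leading_eigenvalue(0.0, rates, weights)
    lam_minus = leading_eigenvalue(-h, rates, weights)
    mean = (lam_plus - lam_minus) / (2.0 * h)
    variance = (lam_plus - 2.0 * lam_zero + lam_minus) / h ** 2
    return mean, variance

# Placeholder rates (forward, backward) out of each state.
rates = {"a": (2.0, 0.5), "b": (1.0, 0.25)}
displacement = {"a": (1.0, -1.0), "b": (1.0, -1.0)}  # +1 forward, -1 backward
activity = {"a": (1.0, 1.0), "b": (1.0, 1.0)}        # every jump counts once

mean_d, var_d = scaled_mean_and_variance(rates, displacement)
mean_act, _ = scaled_mean_and_variance(rates, activity)

# Precision of the displacement per unit time versus the mean activity.
precision_d = mean_d ** 2 / var_d
print(mean_d, var_d, precision_d, mean_act)
```

With these placeholder rates the displacement precision per unit time stays below the mean number of transitions per unit time, as the activity bound requires. A chain with unit forward rates and no backward jumps reduces to a Poisson process, for which mean and variance coincide and the bound is saturated.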
I.9 An example of involution: spin reversal in an Ising system
Imagine a classical Ising system of spins , in an external magnetic field , equilibrated at inverse temperature . The Gibbs probability measure is thus
[TABLE]
where is the interaction Hamiltonian and is the system magnetization. Considering spin reversal , entailing the legitimate observables , (9) is an upper bound on the precision of, e.g., the magnetization, taking the form (cf. gui16)
[TABLE]
Beyond the classical case, the same Hilbert uncertainty principle evidently applies to the quantum case. The spin system is then characterized by a density matrix rather than a probability measure, and the mean and second moment of an observable (now a Hermitian matrix) are computed as the trace of and respectively. For any subspace of legitimate observables, one may again find an appropriate maximum precision.
References
- (1) H. P. Breuer and F. Petruccione, The Theory of Open Quantum Systems, Oxford University Press (2002).
- (2) E. Frey and K. Kroy, Ann. Phys. 14(1-3), 20-50 (2005).
- (3) P. C. Bressloff, Stochastic Processes in Cell Biology, Springer, New York (2014).
- (4) O. Ovaskainen and B. Meerson, Trends in Ecology and Evolution 25(11), 643-652 (2010).
- (5) M. Esposito, U. Harbola, and S. Mukamel, Rev. Mod. Phys. 81(4), 1665 (2009).
- (6) R. Rao and M. Esposito, Entropy 20(9), 635 (2018).
- (7) M. Baiesi and C. Maes, New J. Phys. 15(1), 013004 (2013).
- (8) U. Seifert, Rep. Prog. Phys. 75(12), 126001 (2012).
