Free energy for non-equilibrium quasi-stationary states
A.E. Allahverdyan, N.H. Martirosyan

TL;DR
This paper explores the concept of free energy in non-equilibrium quasi-stationary states of Markov systems interacting with two thermal baths, identifying conditions for its existence and properties.
Contribution
It introduces conditions under which a free energy can be defined in non-equilibrium systems, extending thermodynamic concepts beyond equilibrium.
Findings
Free energy exists under specific conditions like time-scale separation and partial controllability.
Non-equilibrium free energy can increase with temperature, unlike in equilibrium.
External fields can more efficiently reduce entropy at higher temperatures.
Abstract
We study a class of non-equilibrium quasi-stationary states for a Markov system interacting with two different thermal baths. We show that the work done under a slow, external change of parameters admits a potential, i.e., the free energy. Three conditions are needed for the existence of free energy in this non-equilibrium system: time-scale separation between variables of the system, partial controllability (external fields couple only with the slow variable), and an effective detailed balance. These conditions are facilitated in the continuous limit for the slow variable. In contrast to its equilibrium counterpart, the non-equilibrium free energy can increase with temperature. One example of this is that entropy reduction by means of external fields (cooling) can be easier (in the sense of the work cost) if it starts from a higher temperature.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11institutetext: Yerevan Physics Institute, Alikhanian Brothers Street 2, Yerevan 375036, Armenia
Free energy for non-equilibrium quasi-stationary states
A.E. Allahverdyan and N.H. Martirosyan
Abstract
We study a class of non-equilibrium quasi-stationary states for a Markov system interacting with two different thermal baths. We show that the work done under a slow, external change of parameters admits a potential, i.e., the free energy. Three conditions are needed for the existence of free energy in this non-equilibrium system: time-scale separation between variables of the system, partial controllability (external fields couple only with the slow variable), and an effective detailed balance. These conditions are facilitated in the continuous limit for the slow variable. In contrast to its equilibrium counterpart, the non-equilibrium free energy can increase with temperature. One example of this is that entropy reduction by means of external fields (cooling) can be easier (in the sense of the work cost) if it starts from a higher temperature.
pacs:
05.20.-y
pacs:
05.10.Gg
pacs:
05.70.Ln
statistical mechanics stochastic models in statistical mechanics and non-linear dynamics irreversible thermodynamics
Introduction. One reason for the effectiveness of thermodynamics is that its concepts and ideas apply beyond the domain of equilibrium states, e.g., in real life thermodynamics is applied to systems having different temperatures, even though such states do not belong to equilibrium. There are specific mechanisms for this applicability, e.g., thermodynamics applies to systems that are perturbatively close to equilibrium [1, 2, 3, 4]. Another general mechanism is the time-scale separation. In fact, this mechanism is inherent in the structure of thermodynamics, which is built up via the notion of a quasi-static process [3]. It is also important in statistical physics of open systems, where even low-dimensional systems can play the role of a thermal bath, provided that they are fast [5, 6]. A related point is seen in stochastic thermodynamics, where the notion of white (i.e., fast) noise is relevant [7].
There is already a body of work concerning thermodynamic aspects of systems with time-scale separation [8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26]. In particular, much attention was devoted to the effective temperature of glassy systems, where the time-scale separation emerges from the many-body physics [8, 12, 10, 11]. Recent works studied time-scale separation in stochastic thermodynamics focusing on dissipative features such as entropy production [13, 14, 15, 16, 17, 18].
Time-scale separation is a much wider notion, and it is frequent also in biology and society, where it allows to reduce the complexity of emergent structures [27, 28]. Indeed, components of such systems enjoy a certain autonomy (though in different ways): fast variables “live” under fixed values of the slow ones, while the slow variables “see” stationary distributions of the fast ones.
We aim to look at a class of stationary, non-equilibrium states for a Markov stochastic system that is out of equilibrium due to interaction with two different thermal baths; see [29] for an introduction to such systems. We impose three conditions.
– Time-scale separation: the system under consideration consists of two variables, fast and slow.
– Partial controllability: external fields act on the slow variable only. This assumption can be validated from operational reasons: it is normally difficult to have a precise control on variables that move fast.
Both conditions need not hold for the equilibrium situation (equal temperatures of the thermal baths), where thermodynamics applies under any type of controllability and the ratio of characteristic times.
– Transition rates of the slow variable hold certain constraints—they are given by activation rates, or the slow variable lives in a tree-like structure—that amount to an effective detailed balance that holds after averaging (tracing out) over the fast variable.
Under these conditions the slow (isothermal) work done on this system admits a potential, i.e., there exists the free energy. This definition is unambiguous, because the work is defined for an arbitrary non-equilibrium state [4]; see the discussion after (15) for details of this point. Hence the free energy need not have further equilibrium features. Indeed, we show that its behavior with respect to temperature changes can be different from the features of the equilibrium free energy.
Two-temperature Markov dynamics. Consider the following master equation for two discrete variables and (see, e.g., [2, 7]):
[TABLE]
where is the joint probability of and , and are the transition probabilities for (for a fixed ) and (for a fixed ), respectively. In (1), is a small parameter that makes slower than . We assume that all sums over Latin (Greek) indices run from to (from to ).
The transitions are controlled by different thermal baths at temperatures and respectively. Hence the transition probabilities and hold the detailed balance conditions (see e.g., [2, 7]):
[TABLE]
Without loss of generality we parametrize (3) as
[TABLE]
where accounts for the symmetric part of .
Time-scale separation holds when in (1) is sufficiently small; see fig. 1. This is a reliable approximation, since its predictions are close to the exact stationary probability even for moderately small ; see the left panel of fig. 1
We now work out the stationary state () of (1) for a small following the standard perturbation theory approach; [30] for a rigorous presentation and [31] for a physical discussion. One puts into (1)
[TABLE]
and obtains for successive terms which are of order and , respectively:
[TABLE]
Note that there are solvability conditions found from summing (7) over :
[TABLE]
Now (6) and (8) with are solved as
[TABLE]
where we used (2), the stationary conditional probability of the fast variable has the equilibrium form
[TABLE]
and where the probability of the slow variable is found from (8) with
[TABLE]
The meaning of (9) is that the fast variable relaxes to the conditional equilibrium, which determines via (11) and the effective rates the probability of the slow variable.
Due to , (9) implies normalization condition: for . Thus is found from this condition, (7) with , and (8) with . Expectedly, for a small we get . Now the right panel of fig. 1 confirms this fact with numeric results, but it also shows that the difference between (9) and the true joint probability is a sublinear function of for a larger values of . This is favorable for the approximation
[TABLE]
that we adopt from now on for the stationary probability of (1).
External fields are introduced via time-dependent parameters in the energy of the system. We assume that
[TABLE]
where does not depend on . Eq. (13) means that external fields couple only with the slow variable, e.g., because it is difficult to control fast objects. The stationary probabilities (12) depend on parameters through (13).
Now is slow as compared to the relaxation of both fast and slow variables. The temperatures and are constant. Hence the (quasi-stationary) probabilities of the system are found from (12), where is replaced by . The differential thermodynamic work is [2] [see (13)]
[TABLE]
where is the gradient in the -space. Note that relations between thermodynamic and mechanic work have to be specified in the context of concrete applications [32]. This has to do with the following freedom in the definition (14): (and hence ) will change upon adding to a factor that does not depend on , but depends on (this is akin to the gauge-freedom of the potential energy in mechanics). Hence the energies need to be specified, before defining the work [32].
Each component of can have a separate physical meaning, since components of may be driven by different sources.
The integral work is a line integral in the -space:
[TABLE]
where and are the initial and final values of reached at times and , respectively. We stress that (15) refers to slow changes of parameters , but if we change in (14) to the time-dependent probability found from (1), then the same expression for work applies for arbitrary processes [4].
The work admits a potential (i.e., free energy ) if
[TABLE]
If (16) does not hold, the work extraction (i.e., ) by means of a slow, cyclic () variation of is possible. Indeed, if , then changes its sign when the cycle is passed in the opposite direction. The extracted work is determined by the closed-contour integral.
Below we give pertinent examples of for cyclic processes; see (39). Now we turn to studying cases, in which (16) does hold despite the fact that .
Non-equilibrium free energy for the activation rate. An example of slow dynamics is given by the activation energy rate in (4) [7]
[TABLE]
One interpretation of (17) is that there is a barrier with energy that is larger than all other energies. Choosing , we see that in (17) assumes the standard Arrhenius form.
Now in (11) depends only on : . The probability in (11) is found via the detailed balance condition . This leads to . Thus we get from (10, 11, 13, 17)
[TABLE]
where corresponds to the weight of the energy and expresses via the statistical sum of the fast variable:
[TABLE]
Note that does not depend on due to (20) and (13). Eqs. (13, 16, 18) imply the existence of a free energy:
[TABLE]
Recall that is defined up to a constant,
[TABLE]
that can depend on anything besides [32]. Eq. (21) is obtained under a specific choice of that proves useful below when calculating derivatives of (21).
Temperature-dependence of and entropy reduction (cooling) via external fields. The free energy of an equilibrium system is a decreasing function of the temperature:
[TABLE]
where is the entropy. Since is defined under a specific choice of an additive and temperature-dependent constant (cf. (22)), (23) changes for a different choice of the constant. Hence (23) cannot be interpreted directly. But it can be related to the work cost of a cooling process via externally driven parameters [33].
Normally, cooling means temperature reduction of a macroscopic system that has a single and well-defined temperature. But the needs of NMR-physics [34], atomic and molecular physics [35, 36], quantum computation [37] etc led to generalizing this definition [33, 38, 39]. In these fields one needs to reduce the entropy of a system that is coupled to a fixed-temperature thermal bath. Reducing the bath temperature is not feasible. But it is feasible to reduce the entropy via external fields. For instance, in NMR spin systems, entropy decrease for a spin means its polarization increase, which is necessary for the NNR spectroscopy [34].
Thus, cooling amounts to an isothermal process, where slowly changes from to and achieves a lower final entropy. For the equilibrium situation this means (dynamic aspects of this problem are analyzed in [33]). Now
[TABLE]
compares two setups at different temperatures, but the same values of the external fields. Eq. (24) means that the work cost of cooling increases with the temperature , i.e., cooling from a higher temperature is harder, as expected.
Turning to the non-equilibrium free energy , we characterize its temperature dependence via and , since and are independent parameters. We deduce for the activation energy rate (17, 19):
[TABLE]
where
[TABLE]
is the entropy of the slow variable (cf. (23)), and
[TABLE]
is the conditionally averaged energy of the fast variable; cf. (10). monotonously decays from to when goes from to . Hence we get in (25, 26):
[TABLE]
Now we explore implications of for a cooling process. Denote by
[TABLE]
the entropy of the fast variable conditioned by a fixed value of the slow variable; cf. (10). Recall that amounts to the full entropy .
We denote: . Then we can define a cooling process, where all energies in (13) slowly increase leading to ; see (18). Though the process is realized by external fields holding the partial controllability restriction (13), we still get a cooling of the whole (slow plus fast) system. Indeed, not only the entropy of the slow variable decreases to zero, but also the conditional entropy of the fast variable decreases from its initial value to a smaller value .
We note that the considered cooling process can also decrease the internal energy of the system. Recall that the internal energy is defined as [cf. (12), (13)]
[TABLE]
where is defined after (26). Since and are equilibrium quantities, is an increasing function of . Hence it is possible to choose in addition to . We can also choose . Then means that the internal energy (31) decreases during the cooling.
The cooling process incurs a work cost [cf. (21)]
[TABLE]
where is the initial free energy. Now means that the work is taken from the external source, hence this is indeed a work cost.
The change of with the temperature of the fast variable reads from (26):
[TABLE]
Let us now indicate several scenarios for in (33), and show that they are consistent with condition that defines the cooling process. For example, in (33) one can take , but . Another example is to make small. Then amounts to the heat-capacity , and then can be implied by , which is consistent with .
We conclude from that it can be easier (in terms of the work cost) to cool from higher temperatures than from the lower ones. Note that the result survives also in the near-equilibrium limit . With the same logics, one shows from (25) that when increasing the temperature of the slow variable, one normally has [cf. (24)], and we revert to equilibrium with .
Tree-like topology of the slow variable. Free energy (21) exists for general rates (4), if the topology of connections between the states in (11) is that of a tree (a network without loops or closed cycles): for a fixed , (and hence ) only along branches of a tree; see fig. 2 for examples. Let node be the root of the tree. This root can be chosen arbitrarily, the freedom of choosing it will connect to an arbitrary constant (22) in the free energy. Each node of the tree is related to via a unique path . The stationary probability of is deduced from (11), it is made of the transition probabilities along this unique path:
[TABLE]
where is gotten from normalization. We add to (13):
[TABLE]
where does not depend on . Now cancels out in (34), and (34) reduces to (18), where [cf. (20)]
[TABLE]
Hence the free energy (21) applies with given by (36). For (17), this free energy differs from (21) due to a choice of the constant in (22).
Note that the tree topology supports the detailed balance: . This relates with the Kolmogorov’s criterion for the detailed balance: for all loops of the connection network the product of the transition probabilities calculated in the clock-wise direction should be equal to the product in the anti-clock-wise direction [40]. For a tree-like network there are no loops, hence the criterion holds.
For general rates already one loop (i.e., a three-level system) may suffice for invalidating the existence free energy. But we expect that loops will be less relevant for multi-dimensional models. Indeed, let us return to , , but instead of (13) we assume that only one externally driven parameter (out of two) pertains to the slow variable:
[TABLE]
Eq. (38) is the minimal situation, where the work-extraction via a slow, cyclic process is possible.
We get for the rotor of the work [cf. (14) and (20)]:
[TABLE]
where . Eq. (39) shows how far is the work from having a gradient when the partial controllability (13) does not hold. It also determines the amount of work extracted from a cycle in the -space. Now since (39) is a correlation, the existence of a gradient for can be recovered if fluctuations are negligible.
Quasi-continuous limit for the slow variable. Recall that the index numbers the states of the slow variable. We now consider the case, where these states are arranged over a segment of a line (1D situation), so that only transition from one neighbour state to another are allowed. This is an example of birth-death processes that have many applications [41]. If the segment is finite and the states are homogeneously and densely located in it, then one can pass to the continuous limit, where instead of a discrete index we shall have a continuous variable [42, 43]. The continuous limit is achieved by
[TABLE]
where is a continuous parameter, is a small parameter (the distance between the states and on the segment), and where . We now expand in (4) over a small
[TABLE]
and get in (1) (see [43, 42] for similar derivations):
[TABLE]
Taking in (43) the continuum limit, we get from (1):
[TABLE]
where is the joint probability of and : , and is the probability current related to the slow variable.
Eq. (44) shows that as compared to (1) the slow-fast limit is facilitated due to the additional small factor . This is confirmed by fig. 1 which shows that the slow-fast limit improves when the number of states of the slow variable is large. Thus the above continuous limit is a way to get the time-scale separation naturally.
We treat (44) wih the same time-scale separation argument (5–11) with minor modifications for the continuous case. The stationary conditional probability is still . For the stationary probability density —which holds the zero-current condition —we obtain from (44, 45):
[TABLE]
where by analogy to (13, 35): , . The free energy is given as in (21) by . Note that it goes to the equilibrium expression for .
Non-equilibrium free-energy with equilibrium features. A particular case of (46) is when does not depend on , i.e., . Now the non-equilibrium free energy reads [20]:
[TABLE]
The following features of are deduced directly from (47) [20]. They all are very similar to those of the equilibrium free energy.
– is the potential for the work without restriction (13), i.e., now can also enter :
[TABLE]
– For the temperature-derivatives of we get
[TABLE]
where and are, respectively, the conditional entropy of the fast variable, and the marginal entropy of the slow variable. Hence the anti-thermodynamic cooling effect noted in (33) is impossible here.
Note the analogy between (49) and the equilibrium formula . It leads us to , where is the average (overall) energy. This expression for already appeared in the physics of Brownian motion [19, 20] and glasses [12].
Eq. (47) has an intuitively appealing meaning [19, 20, 21, 22, 23, 24, 25, 26], since it implies that the free energy of the fast variable (evaluated at a fixed value of the slow variable) serves as an effective potential for the (Gibbs) distribution of the slow variable: . And then is the free energy related to that effective Gibbs distribution: . While this heuristic explanation is frequently applied in statistical physics, we should keep in mind from the above derivation that exists due to the continuous limit, e.g., assuming in the discrete case does not recover the above equilibrium features (for ).
Summary. We studied a class of non-equilibrium stationary states generated by two thermal baths at different temperatures. Three conditions were assumed:
(i) Variables of the system have different characteristic times.
(ii) External fields act only on the slow variable.
(iii) The transition rates of the slow variable are such that (under time-scale separation) it holds an effective detailed balance. Examples of the effective detailed balance are the activation transition rate for an arbitrary topology of connections, and arbitrary rates for a tree-like topology.
Conditions (iii) are similar to those governing the no-pumping theorem [44, 45, 46, 47, 48]. It considers a stochastic system coupled to a single thermal bath—but driven by an oscillating external field—and studies conditions under which the time-averaged probability currents vanish. The no-pumping theorem has extensions beyond of the activation rates and loop-less networks [49].
Under conditions (i)–(iii) there exists a non-equilibrium free energy. It describes quasi-stationary isothermal processes, and is defined via the potential of the work done via a slow variation of external parameters. As compared to its equilibrium counterpart, the non-equilibrium free-energy can be an increasing function of the bath temperature. As a physical demonstration of this feature we studied the free-energy cost of the isothermal cooling (i.e., entropy decrease via slow external fields): this cost can decrease if the cooling starts from a higher temperature. It is interesting to compare this finding with the Mpemba phenomenon [50, 51]: given two samples of water that are identical except of their initial temperature, the initially hotter samples cools (and freezes) quicker than the colder one. This can have relatively straightforward physical explanations, e.g., some part of the hotter water can evaporate thus decreasing its amount and making its cooling easier. The Mpemba phenomenon disappears once such scenarios are ruled out [51]. To compare our finding with the Mpemba effect, we first of all note that our cooling is not spontaneous. We focus on an (isothermal) entropy reduction of the system by means of external fields. (Still our cooling process is slow and it leaves the system in its locally stationary state.) Hence we study not the speed of cooling, but its work cost, as determined by the free-energy. The effect we found is impossible in equilibrium, because the equilibrium free energy is a decreasing function of the temperature. However, we show that this effect survives close to equilibrium. Thus, the second difference with the Mpemba phenomenon is that our effect is out of equilibrium, as it relates to two different temperatures.
The existence of the free energy is facilitated in the (one-dimensional) continuous limit of the slow variable. Now time-scale separation comes out naturally, and there is a situation, where a non-equilibrium free energy exists for all driven parameters, i.e., assumption (ii) can be relaxed.
The free energy allows to involve ideas of equilibrium thermal physics for understanding non-equilibrium stationary states; other approaches that aim at a thermodynamical description of such states are studied in [52, 53, 54, 55, 56, 57, 58, 59].
Acknowledgements. This work is partially supported by COST MP1209 and by the ICTP through the OEA-AC-100.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. N. Zubarev, V. Morozov, G. Ropke, Statistical Mechanics of Nonequilibrium Processes (John Wiley & Sons, NY, 1996).
- 2[2] R.L. Stratonovich, Nonlinear Nonequilibrium Thermodynamics I (Springer-Verlag, Berlin, 1992).
- 3[3] J. Meixner, in A Critical Review of Thermodynamics , edited by E.B. Stuart, B. Gal-Or, and A. J. Brainard (Mono Book Corporation, Baltimore, MD, 1970), p. 40.
- 4[4] R. Balian, From Microphysics to Macrophysics , volume I, (Springer, 1992).
- 5[5] C. Jarzynski, Phys. Rev. Lett. 74 , 2937 (1995).
- 6[6] A. Riegert, N. Baba, K. Gelfert, W. Just, and H. Kantz, Phys. Rev. Lett. 94 , 054103 (2005).
- 7[7] K. Sekimoto, Stochastic Energetics (Springer Berlin, 2010).
- 8[8] L.F. Cugliandolo, J. Kurchan, and L. Peliti, Phys. Rev. E 55 , 3898 (1997).
