On the invariant quantities in an entropy balance and their impact on the stability of some physical systems
Andrea Di Vita

TL;DR
This paper explores invariant entropy quantities in physical systems, establishing a universal stability condition based on entropy production minimization, applicable across diverse macroscopic and mesoscopic systems without relying on traditional thermodynamic assumptions.
Contribution
It introduces a general stability criterion based on entropy invariance and minimization, independent of detailed dynamics or equilibrium assumptions, unifying stability analysis across various physical systems.
Findings
Derives a necessary stability condition from entropy invariance.
Retrieves known stability criteria for specific systems like plasmoids and Brownian motors.
Challenges the validity of the maximum entropy production principle in these contexts.
Abstract
In a wide class of physical systems, diffeomorphisms in the state space leave the amount of entropy produced per unit time inside the bulk of the system unaffected [M. Polettini et al., 12th Joint European Thermodynamics Conference, Brescia, Italy, July 1-5, 2013]. This invariance implies that if relaxation towards some final ('relaxed') state occurs, then the necessary condition for the stability of the relaxed state against slowly evolving perturbations is the same for all the systems of this class, regardless of both detailed dynamics of the system, amplitude of fluctuations around mean values and possible occurrence of periodic oscillations in the relaxed state. Our discussion invokes no Onsager symmetry, no detailed model of heat transport and production, and no approximation of local thermodynamic equilibrium. This necessary condition of stability is the constrained minimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Thermodynamics and Statistical Mechanics · Cosmology and Gravitation Theories · Solar and Space Plasma Dynamics
On the invariant quantities in an entropy balance and their impact on the stability of some physical systems
A. Di Vita 111Univ. Genova D.I.C.C.A. Genova, Italy,
Recently, it has been shown [M. Polettini et al., 12th Joint European Thermodynamics Conference, Brescia, Italy, July 1-5, 2013] that diffeomorphisms in the state space of a wide class of physical systems leave the amount of entropy produced per unit time inside the bulk of the system unaffected. Starting from this invariance, we show that if the boundary conditions allow the system to relax towards some final (’relaxed’) state, then the necessary condition for the stability of the relaxed state against slowly evolving perturbations is the same for all the systems of this class, regardless of the detailed dynamics of the system, the amplitude of fluctuations around mean values, and the possible occurrence of periodic oscillations in the relaxed state. We invoke also no Onsager symmetry, no detailed model of heat transport and production, and no approximation of local thermodynamic equilibrium. This necessary condition is the constrained minimization of suitably time- and path-ensemble-averaged [G. E. Crooks, Phys. Rev. E 61, 3, 2361 (2000)], the constraints being provided by the equations of motion. Even if the latter may differ, the minimum property of stable relaxed states remains the same in different systems. Relaxed states satisfy the equations of motion; stable relaxed states satisfies also the minimum condition. (Thermodynamic equilibrium corresponds to and satisfies trivially this condition). Our class of physical systems includes both some mesoscopic systems described by a probability distribution function which obeys a simple Fokker-Planck equation and some macroscopic, classical fluids with no net mass source and no mass flow across the boundaries; the problems of stability of these systems are dual to each other. As particular cases, we retrieve some necessary criteria for stability of relaxed states in both macroscopic and mesoscopic systems. Retrieved criteria include (among others) constrained minimization of dissipated power for plasma filaments in a Dense Plasma Focus [A. Di Vita, Eur. Phys. J. D 54, 451 (2009)], of electrical resistivity in plasmoids [L. Comisso et al., Phys. Plasmas 23, 100702 (2016)], of time- and path-ensemble-averaged dissipated power in flashing ratchets [J. Dolbeault et al., Technical Report 0244, CEREMADE (2002)] and of the so-called ’effective temperature’ in Brownian motors [A. Feigel et al., arXiv:1312.5279v1 cond-mat.stat-mech 18 Dec 2013], as well as the scaling of the average magnetic field to the reciprocal of Rossby number in dynamo-affected, convection-ruled shells of rotating stars [C. Jordan et al., Mon. Not. R. Astr. Soc. 252, Short Communication, 21p–26p (1991)]. Finally, we argue that the validity of the popular "maximum entropy production principle" [L. M. Martyushev, V. D. Seleznev, Phys. Rep. 426, 1 (2006)] is questionable in these systems.
PACS: 05.70.Ln
Keywords: non-equilibrium thermodynamics, entropy production
1 The problem
When considering a physical system which is far from thermodynamic equilibrium and is ’open’, i.e. which exchanges either mass, momentum or energy with the external world, it is customary to write the total time derivative of Boltzmann-Gibbs entropy
[TABLE]
of the system in the form [1]:
[TABLE]
Here and in the following, the word ’entropy’ refers to Boltzmann-Gibbs entropy (1) only. We have adopted the notation of [5], [6] and [7]; and are Boltzmann’s constant and the distribution function respectively; is the probability of finding the system at the time inside the volume element of a -dimensional state space which is centered at and satisfies the normalization condition at all times. For instance, may be coordinates in , concentrations of chemical species, etc. Moreover, and are the contributions of physical processes occurring inside the bulk of the system and at the system boundary respectively (some authors denote and with and respectively; we drop also the dependence on x and unless otherwise specified below, for simplicity).
The quantity may include e.g. Joule heating, viscous heating etc. The quantity may be related to the exchange of mass, energy etc. with the external world; it is the amount of entropy which escapes away from the system across the system boundaries per unit time and may be , depending on the particular problem. It is customary to write and as a volume integral of a suitably defined entropy production density (referred to as ’entropy source strength’ in [9]) on the system volume and as a surface integral of some suitably defined entropy flow across the system boundary respectively. Here at all times everywhere throughout the system because of the second principle of thermodynamics (as for the counterexample provided by [9], see Sec. 10). It follows that at all times -see e.g. Sec. 49 of [12] for a detailed discussion of this issue in the particular case of a viscous, thermally conducting fluid.
In contrast with ’closed’ (i.e., not open) systems where at thermodynamic equilibrium, the relative role of and in open systems far from thermodynamic equilibrium is far from being assessed even in the neighbourhood of steady states - but for a very restricted class of problems [15] [16] [17] [18]. In particular, a crucial issue is the spontaneous evolution (’relaxation’) towards a final (’relaxed’) state.
As for closed systems (where at all times), relaxation is related to a growth of towards its maximum value. Thermodynamic equilibrium is the relaxed state; its extremum property , which implies also , is related to stability of this state against perturbations. Of course, if we decide to apply a transformation of coordinates:
[TABLE]
the validity of the extremum property remains unaffected, just like the equations of motion involved in Liouville theorem underlying it. This conclusion holds even if the actual value of may be not invariant under (3) - see Appendix B. Invariance concerns the extremum property, rather than the actual value of the extremised quantity.
As for open systems (where may ), boundary conditions may keep the relaxed state (if any exists) far from thermodynamic equilibrium, so that . The question if a description of stable relaxed states of an open system involves extremum properties while requiring no detailed knowledge of the microscopic dynamics of the system - in analogy with the familiar equilibrium thermodynamics - is still open. Usually, once the extremum property is proven relaxation is described as the evolution of a system which starts from some initial condition inside a suitable attraction basin and ends at the configuration corresponding to the extremum.
Some researchers have invoked the so-called ’local thermodynamic equilibrium’ approximation (’LTE’) [1]. In a nutshell, LTE means that - although the total system is not at thermodynamic equilibrium - the relationships among thermodynamic quantities within a small mass element are the same as in real equilibrium. LTE leads to the so-called ’general evolution criterion’, an exact result concerning the total time derivatives of thermodynamic quantities [2], which rules relaxation in many physical systems - see Ref. [17] for a review.
Moreover, it has been proven [18] that relaxed states in open systems satisfy extremum properties if some very restrictive assumptions are satisfied, like Onsager symmetry relationships , where is Onsager matrix: these properties are Glansdorff and Prigogine’s ’Minimum Entropy Production principle’ of suitably constrained minimization of , henceforth referred to as ’MinEP’, and its equivalent formulation, Onsager and Machlup’s ’least dissipation principle’. The relevant formulation of MinEP depends on the actual boundary conditions. The proof of MinEP relies on the positive sign of scalar quantities like the quadratic form , with -th and -th thermodynamic force respectively. Scalars are invariant under (3), hence MinEP too is invariant under (3). If such restrictions are dropped, MinEP does not apply [19] [20].
Among the many attempts to go beyond LTE and Onsager, many authors have postulated a ’Maximum Entropy Production Principle’ (’MEPP’) in different forms [21] [24] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] - for a review, see [15]. According to MEPP, a stable relaxed state corresponds to a suitably constrained maximum of . To date, and in spite of its present popularity, attempts to derive MEPP rigorously meet no general consensus [17] [22] [23] [44], as they ’are so far unconvincing since they often require introduction of additional hypotheses, which by themselves are less evident than the proven statement’ [15] - see also the critical review of [16]. A way out is to drop (1) altogether, and to postulate - in the framework of the so-called Extended Irreversbiel Thermodynamics - that entropy depends locally not only on temperature, pressure etc. but also on the heat flux and the viscous stress tensor [45]. However, the results reviewed in [17] cast doubt on the very existence of a MEPP-like, general-purpose extremum property of in far-from-equilibrium, stable, relaxed states; for selected classes of problems, however, validity of variational principles concerning particular contributions to the entropy balance remains possible. For instance, many alleged applications of MEPP seem indeed to be statements concerning -see Appendix A for further discussion.
A different, quite successful approach to non-equilibrium, namely ’stochastic thermodynamics’ (’ST’), deals precisely with selected classes of physical systems, namely those systems where a non-equilibrium process occurs which is coupled to one (or several) heat bath(s) of constant temperature. ST provides a framework for extending the notions of familiar thermodynamics like work, heat and entropy production to the level of individual trajectories across the space state of the system - see the reviews of [3] and [14]. Typically, ST describes systems where a few observable degrees of freedom - like the positions of colloidal particles or the gross conformations of biomolecules - are in non-equilibrium due to the action of possibly time-dependent external forces, fields, flow or unbalanced chemical reactants. In contrast with the approaches described above, which deal with large (’macroscopic’) systems where fluctuations around mean values may be negligible, ST is uniquely suitable for describing smaller (’mesoscopic’) systems where the role of (usually quantum-mechanical) fluctuations is not negligible. In the following, and with a slightly misleading wording, we take the words ’macroscopic’, ’classical’, ’large’ and ’non-degenerate’ as interchangeable; in the spirit of our thermodynamic treatment, a rigorous definition of the words ’macroscopic’ and ’mesoscopic’ is provided in Sec. 7 which invokes no information on the detailed mechanics underlying fluctuations.
The time-scale separation between the observable, typically slow, degrees of freedom of the system and the unobservable fast ones (linked either with thermal baths or with the internal dynamics of the system, like e.g. in the case of biopolymers), together with the fact that temperatures remain well-defined at all times, allows for a consistent thermodynamic description in ST. Internal energy, entropy and free energy are well-defined and, in principle, computable for fixed values of the slow variables. Usually, the time-scale separation implies that the dynamics becomes Markovian, i.e., the future state of the system depends only on the present one with no memory of the past. If the states are made up by continuous variables (like position), the dynamics follows a Langevin equation for an individual system and a Fokker-Planck equation for the whole ensemble. This is another difference from the above described approaches, which rely on no detailed description of dynamics.
Rather than to conditions for stability of relaxed states, ST leads to a large number of exact results - usually referred to as ’fluctuation theorems’ (’FTs’). Basically, FTs are mathematical identities derived from the invariance of the microscopic dynamics under time-reversal - see e.g. assumption (2) in Sec. 2.1 of [4]. They acquire physical meaning by associating their mathematical ingredients with the thermodynamic quantities identified within ST.
The success of invariance-based FTs in the description of some mesoscopic systems is confirmed by experiments. It suggests that even a rigorous non-equilibrium thermodynamics of some selected classes of macroscopic systems should take advantage of some property of invariance - just like in some systems Onsager symmetry follows from invariance of microscopic dynamics under time reversal. Remarkably, a derivation of MEPP from the second principle of thermodynamics actually exists, but only provided that ’*the maximum flow is taken as a zero flow *[…] in practice, this can be realised, e.g. by time/space scaling’ [39]. Generally speaking, therefore, a different choice of ’time/space scaling’ seems to lead to violation of MEPP, i.e. MEPP may not be invariant under (3). In contrast, some authors [43] [44] [46] [47] [54] suggest that invariance under (3) is precisely a requirement for meaningful extremum properties for stable relaxed states of selected, open macroscopic systems. (In the following, by ’invariance’ we mean ’invariance under (3)’).
The issue is far from being purely academic. Generally speaking, when it comes to systems which remain confined at all times with a certain region of the phase space, (3) does not leave the measure of unaffected. As a matter of principle, therefore, should an invariant extremum property for stable relaxed states actually exist, it would provide information concerning stability regardless of the size of the system; in other words, a common thermodynamic description of stability would be available for both macroscopic and mesoscopic systems.
A simple, far-reaching corollary follows straightforwardly. To date, the relaxed states are usually assumed to be steady states in the non-equilibrium thermodynamics of macroscopic systems - for an exception concerning MinEP, see e.g. Chap. XV of Ref. [13]. In contrast, cyclic behaviour plays a central role in ST. In fact, despite the fundamental difference between isothermal engines operating at one temperature and genuine heat engines like thermoelectric devices involving two baths of different temperature, ST provides a common framework based on the representation of entropy production in terms of cycles of the underlying network of states [3]. Accordingly, should an invariant extremum property for stable relaxed states actually exist, it would provide information on the stability of both steady and oscillating relaxed states. A bridge could be established between different problems of stability far from thermodynamic equilibrium.
In this paper we focus our attention on a wide - but well-delimited - set of physical systems and assume that relaxed states exist. Our goal is to find an invariant, necessary condition of stability of these relaxed states. We examine the invariant quantities in the entropy balance (2) of the system in Sec. 2 . We discuss the connection between invariance and stability of steady relaxed states and hint at a possible criterion for stability in Sec. 3 ; when discussing invariance, we shall make use of dimensionless quantities only, in order to simplify the maths. Since we are looking for necessary condition of stability, we are free to select the perturbations stability is to be checked against; this allows us to focus our attention on stability against slowly evolving perturbations, and leads therefore to dramatic simplification, as shown in Sec. 4 . In Sec. 5 we take advantage of the results of the previous Sections and show that the criterion hinted at in Sec. 3 is actually the looked-for necessary condition for stability of steady relaxed states, starting from the detailed analysis of a relevant, ’pivotal’ problem of plasma physics. No Onsager symmetry is assumed. Generalization to oscillating relaxed states is provided in Sec. 6 . We show that invariance of physics allows easy reformulation of the result of Sec. 6 into a form which is feasible for both macroscopic and mesoscopic systems in Sec. 7 . The following Sections deal with further applications in both macroscopic and mesoscopic systems: Rayleigh-Bénard convection (Sec. 8) in a rotating shell (Sec. 9), a flashing ratchet (Sec. 10) and a Brownian motor (Sec. 11). Conclusions are drawn in Sec. 12. Detailed computations are to be found in Appendices B, C, D, E and F. Appendix D contains also a qualitative discussion of our necessary condition of stability starting from ST.
2 Looking for invariants in an entropy balance
The aim of this Section is to review some useful results [7] [47] [53] concerning the behaviour of (2) under (3) for a quite wide set of physical systems with degrees of freedom. These systems are described by the following ’PDE’ (partial differential equation) problem:
[TABLE]
Here J is a probability density current with components, the quantity W too has components, refers to partial derivatives on the ’s (), and is a positive scalar with and , which plays the role of a diffusion coefficient in a space with dimensions, so that (4) acts as a simple balance equation for . We invoke no information concerning the detailed physical nature of W and . The boundary condition corresponds to a system which remains confined at all times inside a region - with measure - of the state space, n being the unit normal vector of the boundary surface of . Here and in the following, we compute all integrals in on and denote with the average of the generic quantity y on .
Any physical system undergoing a Markovian evolution dictated by a Fokker-Planck equation in thermal contact with an environment (say, a thermal bath) provides us [47] with an example of a system described by (4); generalization is discussed in [7]. Another example of physical system described by (4) is a single macromolecule [53] in a fluid, which is composed by atoms and whose configuration is represented at any time by a point x in a space with dimensions (the fundamental equation describing the evolution of the distribution function of the system, namely equation (5) of [53], is a slight generalisation of (4) - the main difference is that is a tensor). Finally, if , the system is made of many particles with the same mass and we identify W with a velocity (so that and J are the mass density and the mass current density respectively), then in the limit of negligible the PDE (4) describes a system where mass is conserved with no net source and is exchanged nowhere across the boundary with the external world.
For the purpose of our discussion, we provide some preliminary information about (3). Quite generally, we assume the ’s to be curvilinear coordinates in a dimensional Riemannian variety (like e.g. ) with state-space volume element . We denote with and the i-th controvariant and covariant component of the generic dimensional quantity a respectively ( here and in the following). Moreover, we denote the Jacobian of (3) with . For simplicity, we assume (3) to be an orientation-preserving diffeomorphism, so that . Note that (3) leaves time unaffected. Finally, we denote with the value of the generic quantity after the transformation.
First of all, it is shown in Appendix B that:
[TABLE]
[TABLE]
As for (5), basically a particular case of equation (19) of [5], it retrieves both equation (12) of [7] and equation (7) of [53] provided that in agreement e.g. with Einstein’s relation ( is a scalar quantity, hence it is invariant). Indeed, both and are supposed to vanish in both [47] and [53], just like in our treatment. This assumption is justified because of the interaction with a thermal bath in [47], and is explicitly stated in [53], where . Finally, in Sec. 4 we are going to discuss the behaviour of in the limit of negligible .
As for (6), it is a particular case of equation (20) of [5]. If W is ’conservative’, i.e. if where the scalar quantity is a differentiable function of x, then our boundary condition makes to vanish after integration by parts in steady state (where and ); (2) implies therefore . (An alternative definition of a conservative W is , where the -th component of is for the generic a). Accordingly, if W is conservative then thermodynamic equilibrium is the only possible steady state, and far-from-equilibrium steady states are possible only for other boundary conditions - think e.g. of a net amount of current flowing across the boundary. Far-from-equilibrium steady states with the boundary condition of (4) are possible only if W is not conservative. Not surprisingly, the latter statement is false when and [6] [47], since in this case; the question is briefly assessed in Sec. 4.
Now, it comes to the discussion of invariance. Physical intuition dictates that (2) and (4) take the same form before and after the transformation (3). As for the quantities involved in (4), it is shown in Appendix B that this requirement of invariance implies:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
As for the quantities involved in (2), it is shown in Appendix B that the values of both , and are not invariant; in contrast, is invariant:
[TABLE]
Accordingly, the only invariant quantities in (2) are and . The roles of and are discussed in Sec. 3 and Sec. 4 respectively. We discuss in Appendix C a further aspect of (11).
In spite of its simplicity, (8) has a far-reaching consequence. For a sufficiently well-behaved , the theorem of the mean allows us to rewrite the invariant normalization condition as where belongs to . Together, invariance of the latter relationship and equation (8) lead to: , a quantity which may differ from 1. This means that is usually not invariant. It follows that if an invariant criterion of stability exists, then it holds regardless of the actual value of ; in particular, it may hold for both macroscopic and mesoscopic systems, as anticipated in 1 . Sec. 7 presents further, in depth-discussion of the issue.
For future reference, here we discuss also the particular case of rescaling. By ’rescaling’ we mean a diffeomorphism (3) where , if and otherwise, and . Equation (4) is invariant under rescaling; indeed, if then (8), (9) and (10) ensure that transforms just like , that J is invariant, and that W transforms like a vector.
3 Invariance and stability
The aim of this Section is to assess the impact of (11) on the stability of steady relaxed states of a system described by (4) against small perturbations. Under mild assumptions, the system evolves - with a typical time-scale - towards a relaxed state where the current J has no sinks and sources, i.e. . The relaxed state coincides with thermodynamic equilibrium when , hence , while non-equilibrium relaxed states are characterized by non-vanishing currents that circulate in the system’s state space. The words ’small perturbation’ refer to a perturbation with small measure in the Riemannian variety of the ’s. The word ’stability’ refers to the fact that if we apply a small perturbation to a steady state then the perturbation relaxes back to the initial state; (which is zero before the perturbation) takes a non-zero value with a well-defined sign (possibly oscillating in time), and after a while relaxes back to zero.
To start with, we observe that the set M of transformations (3), (7), (8), (9) and (10) maps the space onto itself. By construction, M leaves (4) unaffected. Then, M maps solutions of (4) onto solutions of (4). Moreover, M is both reflexive, symmetric and transitive (as it can be shown with the help of both chain rule and the well-known properties of Jacobians after cumbersome but straigthforward algebra), i.e. M is a relationship of equivalence. Consequently, M establishes a partition of the set of the solutions of (4) in equivalence classes; each class is unambiguously labelled by a value of because of (11). Furthermore, M leaves unaffected. Then, M maps steady states onto steady states. Different steady states of the same equivalent class correspond to different choices of and W, which are mapped onto each other by M. Outside steady states, invariance of (4) ensures that the sign of is also invariant. Accordingly, M maps stable steady states onto stable steady states. Finally, different choices of and W may even correspond to different physical systems, as far as that the latter satisfy the same equation (4). For example, if W is a velocity then it is ruled by the equations of motion, which in turn contain the forces acting on the systems. Different systems are subject to different forces, and the behaviour of W changes accordingly. However, the proof of (11) relies on no invariance of the equations of motion. Correspondingly, M may map different physical systems onto each other even if (4) remains valid. It follows that M can map the stable steady state (if any exists) of a physical system which satisfies (4) onto the stable steady state of another physical system which satisfies (4), both steady states having the same value of . The basins of attraction of stable states may be different in different physical systems, as the equations of motion may be different; but as far as these equations include (4), our argument remains unaffected. For example, maps thermodynamic equilibria () onto thermodynamic equilibria ().
Now, in Sec. 5 we describe one particular (’pivotal’) physical system which satisfies (4) and which enjoys the following property: a necessary condition for the stability of a steady state against small, slowly evolving perturbations is that this state satisfies:
[TABLE]
under suitable constraints. We stress the point that (12) differs from both MinEP and MEPP, which involve and not just . Small perturbations of a stable steady state raise above its value in steady state; after a while, relaxes back to the initial value. Minimization in (12) is constrained by the requirement that the solutions of (12) are also solutions of the equations of motion in steady state. While steady states correspond to solutions of the equations of motion in steady state, stable steady state solutions satisfy also (12).
Indeed, the following facts occur: a) when we apply (3) then invariance of (4) forces and W to get transformed under M; b) all steady states obtained from the stable, relaxed state of the pivotal system via M have the same, constant value of , which is a minimum in the pivotal system; c) all unsteady, perturbed states obtained from the unsteady, perturbed states of the pivotal system have the same, time-dependent value of , which is larger than the minimum value of in the pivotal system; d) M leaves the sign of unaffected, so that relaxing states are mapped onto relaxing states.
Together, these facts lead to the conclusion that once the pivotal system is given, the same necessary criterion for stability (12) holds for all physical systems which satisfy (4) and which are obtained from the pivotal system via (3). Indeed, when one of such systems relaxes back to a stable steady state after a perturbation, then the value of in the relaxing state and in the relaxed state respectively are equal to the corresponding values of the pivotal system, where (12) holds.
We conclude that invariance of (2) and (4) implies (12) for steady, stable, relaxed states of systems described by (4) provided that we find a pivotal system. Noteworthy, we invoke no Onsager symmetry and no detailed knowledge of the physical nature of and W. A qualitative argument which invokes no pivotal system is provided in Appendix D .
4 Slow diffusion
The aim of this Section is to assess the impact of (7) on the stability of relaxed states. Since we are looking for a necessary condition of stability, we are free to select the perturbations which stability is to be checked against. We focus our attention on slowly evolving perturbations [17] and denote with the typical time-scale of their relaxation in the following. For large we assume that (’slow diffusion approximation’):
[TABLE]
According to (7), (10) and (11), (13) is invariant.
The impact of (13) on stability is no new topic in the literature. Sec. VII of [53] links (12) and (13) in the particular case of the relaxation of a physical system made of two coupled subsystems at different temperatures, where Joule heating is the only dissipative process in the bulk of each subsystem, so that is just equal to times the Joule heating power ( Joule power density). In each subsystem it is assumed that (4), and hold. It turns out that if (13) holds then the relaxed state minimizes the amount of heat produced per unit time in the bulk by irreversible phenomena. This implies that (12) holds, as in each subsystem. In contrast, if then we are near thermodynamic equilibrium, i.e. in Onsager’s regime, and MinEP holds.
More generally, (13) leads to the consequences listed below.
- •
Even in the extreme case equation (5) gives provided that . Physically, a finite, positive amount of entropy is produced in the bulk per unit time even if provides just a higher-order correction to the equations of motion like (4). Even if seemingly rather artificial, the latter statement applies to many physical systems where dissipative phenomena like the diffusion represented by in (4) allow relaxation, but are not necessarily invoked when it comes to a detailed description of the relaxed states itself. For example, friction damps small oscillations of a pendulum around its position of stable equilibrium until the pendulum stops, but provides no information about the final position itself. This may be true even if no conservative forces occur, like e.g. in the case of stable equilibria of magnetically confined plasmas; indeed, these equilibria are described by Grad-Shafranov equation of dissipation-free magnetohydrodynamics (’MHD’) even if the relaxation processes leading to them is ruled by dissipation [52]. This discussion plays a crucial role in Appendix E below.
- •
Per se, (13) does not imply ; rather, it is an useful simplification for the investigation of slowly evolving perturbations. This issue turns out to be relevant in Sec. 10 below.
- •
Admittedly, this simplification comes at a price. When computing in a relaxed state, we cannot straightforwardly rely on equation (5); we need rather an explicit expression for as provided e.g. by the equation of motions of the particular physical system under investigation. This is e.g. the case of the problems discussed in both Secs. 5 and 8 below.
- •
Furthermore, (13) implies that provides just a higher-order correction to the dynamics of the system implies that the same holds for too. It follows that our results apply even if within a negligible error. The case is of particular interest. In this case, if then both and affect (4) through , and have therefore no impact in the slow diffusion approximation. According to (13), we expect therefore our results below to apply to both systems with and . In particular, such systems may belong to the same equivalence class even if the equations of motion may differ; for example, gravitating fluids with may exhibit convection, which can be absent when . We anticipate here that in the pivotal system in Sec. 5; all the same, according to our discussion we shall be able to apply (12) even to systems in Secs. 8 and 10 .
- •
The quantity describes diffusion; in order to maintain a stable steady state satisfying (13), it is therefore required that the amount of heat produced per unit time within the bulk of the system is also small; hence too is small. For example, the macroscopic fluids dealt with in Secs. 5 and 8 below dissipate only weakly. In the general case, (4), (5) and (13) give and where and . If is small enough, it follows that:
[TABLE]
All the way around, (14) implies (13).
- •
Relationships (4), (10) and (13) make W to transform like a vector and allow us to write:
[TABLE]
where is either zero or the gradient of an harmonic function with vanishing normal derivative at the boundary, so that (14) implies:
[TABLE]
All the way around, if J satisfies (16) then (14) allows J to satisfy (4) up to an additive gradient of an harmonic function.
5 A pivotal system
The aim of this Section is to describe a pivotal system. Here and in the following, we limit ourselves to the case and we take as the state space. We are going to show that the electrons in the pinch of a Dense Plasma Focus (’DPF’) [56] [57] form a pivotal system.
A Dense Plasma Focus (’DPF’) is a kind of discharge in which a high-pulsed voltage is applied to a gas between electrodes, generating a short-duration, high-density plasma region. The pinch observed at the final stage of a well-behaved DPF discharge is made of a dense, hot plasma. This plasma is an example of classical (i.e., non-degenerate), electrically neutral, viscous, resistive, electrically conducting, magnetised fluid, where the electrons carry electric current and the (much heavier) ions ensure charge neutralization, so that the ratio of ion and electron densities is fixed everywhere at all times (we refer to both densities as to ’particle density’ below). The inner structure of DPF pinch spontaneously exhibits typical, complex patterns, both in the form of filaments [58] and of plasmoids [59] - see Fig. 1 .
The life-time of the DPF pinch is s the decay time-scale s predicted by resistive MHD, so we are allowed to speak of a ’steady state’. Accordingly, is the sum of four terms, due to Joule heating, viscous heating, heat transport (due to conduction and radiation) and particle diffusion [66]. The contributions of Joule heating and viscous heating are the volume integrals of times the Joule heating power density and the viscous heating power density respectively. Both radiation and turbulence ensure efficient transport of heat across the pinch, so we may safely assume both and vanishing gradient of particle density; the contributions of heat transport and particle diffusion to are therefore negligible. Correspondingly, (12) is equivalent to simultaneous minimization of Joule heating power (Kirchhoff’s principle [68]) and viscous heating power (Kortweg and Helmholtz’ principle [69]), i.e. to minimization of where . Now, simultaneous validity of Kirchhoff’s and Kortweg and Helmholtz’ principles is precisely a necessary condition of stability of steady, relaxed states of viscous, electrically conducting fluids against small, slowly evolving perturbations [17] [55] [67]. As discussed below, the equations of motion in steady state constrain the minimisation. Then, we may say that (12) is a necessary condition of stability for the relaxed state of this system. Indeed, it has been shown in Sec. 8 of [55] that the above minimization leads to:
[TABLE]
in the DPF pinch. Here is the electric current density, , , is the electric charge/mass ratio of the electrons (in the notation of [55] the quantity is utilised), and is the magnetic field with vector potential . Both and are functions of the position vector inside our fluid, and refers to partial derivatives on the components of . Physically, (17) plays the role of a London equation where London depth is equal to the collisionless skin depth of electrons. It describes successfully [55] the filamentary structure of electric currents flowing across the DPF pinch [58] provided that the Hartmann number (the ratio of electromagnetic force to the viscous force) is lower than a certain threshold (in a turbulent plasma of pure hydrogen and with current , [61]). (Experiments show that the impact of velocity, which depends on, is definitely not negligible in DPF [62] filaments). Ultimately, it is this success which justifies (12) in this system. The case is discussed below and corresponds to plasmoids.
We want to show that we are actually dealing with a pivotal system. To this purpose, we show that the electrons in the DPF pinch satisfy (4) and (13), or, equivalently, (14) and (15). Then, we link (16) - itself a consequence of (14) - and (17).
For the moment, we assume (13) to hold; this assumption is justified below. We apply a rescaling to (4), where is the total mass of DPF electrons in the pinch and . As anticipated in Sec. 1 , we are going to utilize dimensionless quantities. In dimensionless form, for example, is the total number of electrons. Relationships (7) and (13) ensure that the impact of remains negligible. Then, (8) and (9) ensure that and are times the electron mass density and mass density flow respectively. The PDE in (15) is just the mass balance in a system - like the electrons in a DPF pinch - where no net source of mass exists. The boundary condition in (15) is discussed below.
As for (13), the facts that and that mass density, together with equations (5.2) and (6.1) of [55], ensure that the pressure gradient is ; in turn, this justifies (14), hence (13). Finally, agrees [12] with the fact that the contribution of mass transport to is negligible, as anticipated.
Moreover, (17) is just a particular case of (16) provided that (the corresponding M is an identity) and that we take and . Together, the latter relationship and the fact that W transforms like a vector ensure that the vector potential transforms like a vector, as expected. Gauge invariance allows addition of the gradient of an harmonic function to the vector potential to leave physics unaffected in steady state. As expected from Sec. 4, dissipation provides no information on the relaxed state; indeed, (17) contains no information concerning dissipation, and the latter plays a minor role in the momentum balance.
We are left to discuss both the role of the boundary condition in (15) and the constraints on minimization in (12). For simplicity of notation, we drop all superscripts below, so that x is the position vector inside the fluid, etc.
As for the boundary condition , it has been assumed in [55] that the interaction of the system with fields due to external sources is much weaker than the interaction among currents internal to the pinch. In our language, this is equivalent to neglect the impact of the external world represented by non-vanishing net flux of charge carriers across the boundaries of the system, i.e. to .
Admittedly, the virial theorem of MHD forbids the existence of magnetically confined plasmas without interaction with external currents. However, Sec. 6 of [55] shows that the impact of such interaction is comparable to the impact of dissipation, which is considered to be weak in agreement with (13). Even so, and in spite of , if then is not conservative and a steady state far from thermodynamic equilibrium is possible; B is the ’curvature’ in the language of [47]. In contrast with the 1D systems of [72], however, our systems are 3D.
As for the constraints on the minimization [55], they are provided by the mass balance quoted above, Maxwell’s equation of electromagnetism and Navier-Stokes equation with a bulk force density corresponding to the electromagnetic force. For an electrically neutral, magnetized fluid Maxwell’s equation reduce to Gauss’ and Ampère’s law, while the electromagnetic term in the momentum balance corresponds to the density of Lorenz force. The viscous term in Navier-Stokes equation is as small as the impact of interaction with external currents, as stated above. Moreover, the actual value of throughout the system provides minimization with the further constraint throughout the system [55], as .
We stress the point that - even if electrons are the electric charge carriers in a DPF pinch - the analysis of [55] relies on the particular nature of the electric charge carriers nowhere. In other words, validity of (12) does not depend on the actual value of . Accordingly, we allow to take arbitrary real values below. If , (17) makes the electron velocity to be the sum of a potential term and a quantity times a rotational field; in the limit , a potential flow is retrieved, the structure on the small spatial scale of the electron collisionless skin depth is lost and (17) reduces to the identity . Unlike (16), however, (17) follows from Kirchhoff’ and Kortweg and Helmholz’ principles, which in turn agree with (12) for only; accordingly, relaxed states with may have non-potential flows even if - see e.g. Secs. 8 and 11 below.
Admittedly, there is still a weak point in our discussion. In Ref. [55], the proof of Kirchhoff’s and Korweg and Helmholz’ principles in the DPF pinch relies on LTE. Thus, one could think that validity of (12) is limited to systems where LTE holds. In order to show that this is not the case, let us discuss what happens if . According to the same constrained minimization invoked above, it turns out that the relaxed configuration is a plasmoid [59], with spheroidal geometry and as , and is very large ( T) [56]; moreover, a plasmoid is approximately described by Taylor’s principle of minimum magnetic energy with fixed magnetic helicity [63] , and no velocity affects Taylor’s principle - see [55] , [61] and Refs. therein. Correspondingly, and (17) follows from (16) no more, even if the velocity of charge carriers ( the electric current density) may still have non-zero vorticity. Since the electrical resistivity , we expect the -minimizing relaxed state to correspond - all other things being equal - to some minimum of or of some monotonically increasing function of . Indeed, this is precisely what happens with plasmoids, according to the independent analysis leading to equations (1)-(4) of [64]. We may take this fact as a confirmation of our discussion, provided that we take into account the following three facts. Firstly, it starts from investigation of plasmoids in astrophysics, not in DPF. Secondly, it does not invoke LTE; validity of (12) is therefore not limited to systems where LTE holds. Finally, it relies on a postulate in the form of a Fermat-like principle concerning the whole relaxation of a reconnection-affected magnetized fluid towards a plasmoid; as such, the final outcome depends on the initial condition (more precisely, on the initial inverse aspect ratio of the current sheet of interest): as a result, a whole class of relaxed plasmoids are obtain, each of them corresponding e.g. to a given value of embedded magnetic flux, an approximately constant quantity for weakly dissipating plasmas.
In conclusion, electrons in a DPF pinch are an example of a classical fluid with viscous and Joule heating, negligible mass diffusion (i.e., (14) holds), no net mass source and negligible mass flow across the boundary (i.e., (15) holds), and . If a steady, stable state of this fluid exists, then it satisfies also the variational problem (12). Then, we identify this fluid with the pivotal system. We drop the restriction altogether in Sec. 8. In particular, if (the pivotal system is magnetized), and then equation (16) follows from (14) and reduces to the London-like equation (17) describing a filamentary pattern of electric currents. If , and then a plasmoid-like structure arises.
6 Oscillating vs. steady
Until now, we have assumed that fluctuations occur on time-scales much shorter than the time-scale of the perturbation, in order to allow us to get rid of such fluctuations through suitable time averaging. Nothing is said about the detailed frequency spectrum of the fluctuations. In this Section we discuss what happens if the fluctuations oscillate in time with period . We denote -as usual by now- the generic physical quantity by y, write:
[TABLE]
and introduce the time-averaging with , so that and . The same result holds if we replace with a generic function of time evolving on a typical time-scale . Accordingly, time-averaging just erases all contributions of the dynamics on the fast time-scale and allows us to replace y with everywhere while leaving all relationships involving y and its time derivative unaffected -up to a small error at least. To the best of author’s knowledge, this argument has been invoked for the first time in Chap. XV of [13] for systems satisfying MinEP. Up to the same error, we are therefore allowed to say that a steady, stable (unstable) state is a configuration where the value of y undergoes fluctuations with time-scale and where is stable (unstable) against small perturbations evolving on a time-scale .
We retrieve physical examples of such configurations when investigating e.g. systems described by (4) with W periodic function of time. These systems may relax to oscillating states, where both amplitude and period of the oscillations depend on the detailed structure of . We may ask if these oscillating states are stable against perturbations evolving on time-scales much longer than . Now, both (2), (5), (6) and (27) remain valid even if W depends on - as it can be shown with step-by-step repetition of the proof of equation (10) in [7]. Maps still leave unaffected, and (11) still holds. It follows that ; if then
[TABLE]
i.e. too is invariant. Of course, the instantaneous value of in a stable relaxed state coincides at all times with if and only if the stable relaxed state is a steady state, i.e. in the limit . In the general case, the whole argument concerning the pivotal system still holds after replacement of with . Thus, (12) and (18) lead to the following necessary condition for stability:
[TABLE]
If W becomes constant, and we retrieve (12) as expected; we limit therefore ourselves to the discussion of (19) below. After time-averaging of both its sides, (27) gives because as too is periodic in time; may differ from zero even if because of correlations.
7 Mesoscopic vs. macroscopic
Remarkably, the invariance property (11) depends on no actual value of , and the same class of equivalence may contain both macroscopic systems (like e.g. the pivotal one) and mesoscopic systems, i.e. may map macroscopic and mesoscopic systems onto each other. We may think e.g. of a rescaling with a suitably defined value of , so that changes as discussed in Sec. 2 . If we apply to a macroscopic system and, as a result, gets shrinked too much, the linear dimension of the resulting may become of the same order of magnitude of the De Broglie length of the resulting system, and the latter is definitely mesoscopic. All the way around, we could start from a mesoscopic system satisfying (4) (and also (13), as far as stability against slow perturnations is concerned) and obtain a macroscopic fluid with no net mass source, no mass flow across the boundary and negligible particle diffusion (in agreement with (13)). As far as is invariant, the problems of stability of a star and of a molecular motor may therefore be dual to each other, and share a common solution. Of course, a star produces an immensely larger amout of entropy per unit time; but the actual value of is not relevant to the problem of stability, just its minimum property counts. (Analogously, stability of thermodynamic equilibrium requires just maximisation of , and does not depend on the actual value of ).
We may therefore ask ourselves if, given an oscillating relaxed state in a mesoscopic system, there is some reformulation of (19) which applies to the stability of such state. We need this reformulation for two reasons: a) a mesoscopic system which belongs to an equivalence class may still satisfy (14) and (15), but and J are no more to be given the meaning of mass density and mass flow respectively; b) the impact of fluctuations should be explicitly taken into account. If it exists, such reformulation may hold for both macroscopic and mesoscopic systems. This is the topic of this Section.
To start with, we stress the point that the proof of (19) relies on (5). In turn, the proof of (5) relies on the positiveness of . In average, this positiveness follows from the fact that at the trajectory level, realizations with positive entropy production are exponentially more likely than the corresponding realizations with negative entropy production [14]. The probability of observing a ’reverse’ behaviour, i.e. an entropy production opposite to that dictated by the second law of thermodynamics, decreases exponentially as the system size increases. Thus, careful discussion of the averaging procedure is required when discussing mesoscopic systems.
The quantity depends on the path C, i.e. the actual trajectory followed by the system in phase space starting from the configuration at and arriving at the configuration at the time (with no loss of generality, we suppose time averaging is computed on the time interval from [math] to here and in the following.). We define [6] [10] the path ensemble average of the path-dependent, generic quantity where the sum is over the set of all paths connecting all possible initial and final states and the probability of the path C depends on both the detailed dynamics of the system and the probability of the initial state. The smaller the system, the more likely the probability of observing a ’reverse’ behaviour, the more relevant the distinction between and . In contrast, we may safely write in macroscopic systems, where ’reverse’ behaviour is extremely unlikely. Accordingly, and regardless of the detailed mechanics of fluctuations, we may provide the following definitions, just as anticipated in Sec. 1 :
- •
the relationship defines a macroscopic system;
- •
the relationship defines a mesoscopic system.
Now, by definition a system which is initially in a stable state relaxes back to its initial configuration after a small perturbation has been applied. This means that -after a sufficiently long time at least- both the C’s and the corresponding ’s are unaffected. This property holds as far as the system is stable. Since we are looking for necessary conditions of stability, we are free to choose the quantity to be perturbed in order to check stability. With no loss of generality, let us consider a quantity y which is a function of x such that the transformation of coordinates is a diffeomorfism with positive Jacobian . Accordingly, any small perturbation of y on a time-scale is equivalent to a small perturbation of on the same time-scale. The perturbation corresponds to an infinitesimal which maps stable states onto stable states. If we apply it to a stable state, then the ’s remain unaffected and . This relationship remains valid even if we repeat the transformation again and again, so that we may take it to be valid even for a finite . If , then we obtain where the last equality follows from (18). If the system we start from is macroscopic, then . Stability implies (19), hence:
[TABLE]
As anticipated, the crucial point is that -even if the starting system is macrosopic- the final system needs not to be macroscopic; (20) is a generalization of (19), valid for both mesoscopic and macroscopic systems. As for mesoscopic systems, an open question in ST research is whether the probability distributions of work, heat and entropy production can be grouped into "universality classes" characterized, e.g., by the asymptotics of such distributions, and, if yes, which specific features of a system determine this class [3]. By now, it is temptative to identify each "universality class" with the equivalence class labeled by the value of which the mesoscopic system belongs to. As for macroscopic systems, and we retrieve (19) as expected; we limit therefore ourselves to the discussion of (20) below.
It is clear, by now, that a suitable choice of the value of allows a rescaling to map any mesoscopic system which obeys (4) and (13), i.e. (14) and (15), onto a pivotal system for some value of , so that the system belongs to an equivalence class and its stable relaxed states satisfy (20). As for (4), and J are to be given the meaning of mass density and mass flow respectively no more, in contrast with what occurs in macroscopic systems, as anticipated in Sec. 7. As for (13), it is equivalent to take into account slowly evolving perturbations only, as anticipated in Sec. 4.
The proof of (20) relies on the existence of a pivotal system, through (19) and (12). If, furthermore, everywhere and at all times (as it happens e.g. when the system is in contact with just one heat reservoir at temperature ), then an independent, qualitative confirmation of (20) exists - see Appendix D - which invokes no pivotal system and shows its agreement with ST ; see Fig. 2 for an overall sketch of our results.
8 Rayleigh-Bénard convection
Until now we have dealt with the pivotal system, where . Generally speaking, if a macroscopic, system is a fluid with Joule and viscous heating in relaxed, steady state then (20) agrees with Kirchhoff’s and Kortweg and Helmholtz’ principles invoked in Sec. 5. Remarkably, however, the arguments leading to (20) invoke nowhere. In particular, we invoke the argument presented in Sec. 4 : if then affects (4) through only; then, in the slow diffusion approximation leaves the validity of (20) unaffected - and the same holds for . In the following, we are dealing with applications of (20) to systems where . In these systems, and in in contrast with the pivotal system, the transport of energy is not so strong to flatten . For example, different boundaries of the same system may be in thermal contact with heat reservoirs at different temperatures [6] [47] [53]. In such systems, far-from-equilibrium relaxed states are possible even if W is conservative.
The aim of this Section is to discuss a class of macroscopic systems, namely fluids with Joule and viscous heating where Rayleigh-Bénard convection occurs, where and with negligible mass diffusion, no net mass sources and no mass flow across the boundary. Convection is triggered when the Rayleigh number - basically, a dimensionless version of - exceeds a threshold . We discuss mesoscopic systems with in Sec. 10.
Physically, Joule heating and viscous heating are related to the transport of different quantities, namely electric charge and momentum respectively. Correspondingly, they provide with different, additive contributions and different thermodynamic flows. Heat transport provides a further, additive contribution with a further thermodynamic flow. Entropy production in our fluid is therefore a ’compound process’ [16], and validity of MEPP is questionable - see Appendix A .
We show in Appendix E that a steady, relaxed, possibly unmagnetized fluid belongs to the same class of equivalence of a particular pivotal system. More to the point, we can map a Rayleigh-Bénard problem with its own boundary condition (Dirichlet, Neumann or a linear combination of the two) in a steady, relaxed fluid on the verge of convection onto the steady, relaxed filamentary configuration of a DPF, and vice-versa. Differences between the dynamics of the two systems do not matter; what is relevant here is that such mapping is possible. The proof takes into account the fact - hinted at in Sec. 4 - that the description of a stable relaxed state involves no dissipation; the latter affects rather the description of the relaxation process, which is very slow in the slow diffusion approximation and may therefore safely being described as a succession of steady states.
This result, together with the above-quoted fact that the impact of non-zero on stability is negligible in the slow diffusion approximation, allows us to invoke (19), i.e. the version of (20) for macroscopic systems, as a necessary condition of stability for relaxed states in Rayleigh-Bénard convection problems. The problems of the stability of the relaxed steady states of the DPF filaments and the fluid are dual to each other. In the latter fluid W plays a role of a fluid velocity, and as convection cells rotate, say, with period . A far-from-equilibrium relaxed state is possible as .
Rayleigh-Bénard convection is the result of the simultaneous occurrence of gravitation (which, per se, leaves the entropy balance (2) unaffected) and of a temperature jump across a fluid layer. Both are absent in the pivotal fluid of Sec. 5. In contrast, these systems share the following, common features: a) mass balance holds with no net mass source and no mass flow across the boundaries, i.e. (15) holds with mass density, J mass density flow; b) viscous and Joule dissipation are the only heating processes of the bulk (Joule heating plays a role in magnetized fluid), while the impact of particle diffusion on entropy production is negligible - in agreement with (13) and (14) - so that includes a new term due to heat conduction. The entropy balance of a small mass reads:
[TABLE]
where q is the heat flow due to conduction and radiation [17]. If no mass flows across the boundary, Reynolds’ transport theorem [48] and integration of both sides of (21) on the volume of the system gives back (2) with and after invoking Gauss’ theorem of divergence and after integration by parts. Both and are and the latter is an increasing function of the former; then, (19) implies minimization of . We invoke this minization in order to discuss a Rayleigh-Bénard problem below. We are going to retrieve some results, which include the well-known variational principle of Chandrasekhar [77] at . Our argument is a slightly modified version of the argument presented in Sec. VI.B of [17]. Here the novelty is that we invoke LTE no more; we have replaced it with the assumption of no mass flow across the boundary.
It is clear, by now, that our pivotal system acts just as a Trojan horse which allows us to smuggle the stability criterion (20) into the world of the relaxed solutions of (14) and (15). Even in the framework of the slow diffusion approximation, the relevance of (20) does not depend on the detailed physics. Rather, it follows from the very existence of a partition of the set of the solutions of (4) in equivalence classes - in other words, on the invariance of under (3).
A far-reaching corollary follows. Both gravitation and non-zero leave and (29) and (21) unaffected. Then, we may apply our discussion above to the relaxed state of any classical fluid with Joule and viscous heating, negligible mass diffusion, no net mass source and negligible mass flow across the boundary. Regardless of , stability of the relaxed states of this fluid requires (20). The detailed dynamics of the system appears in the constraints of minimization, and depend on the particular problem.
Now, let us come back to Rayleigh-Bénard convection. The contribution of heat conduction to increases with increasing , for not too large at least (the precise meaning of ’not too large’ is given below); it vanishes only if everywhere - see Sec. 49 of [12] and Sec. 66 of [66]. Accordingly, (19) implies minimization of .
As for the constraints on minimization, we want to retrieve the constraints given by the equations of motion and by in the limit of vanishing . It can be shown [17] that the correct choice of constraints for the which satisfy this condition is given by the equations of motion and by a given value of . The equations of motion include the mass balance with no net mass source and the Navier-Stokes equation; in an electrically conducting fluid, Maxwell’s equation of electromagnetism are also included, and the density of Lorenz force appears in the Navier Stokes equation. No mass flows anywhere across the boundaries. After time-averaging, the energy balance obtained from the equations of motion ensures that the time-averaged total heating power is equal to the time-averaged mechanical power supplied by buoyancy (with density ); this relationship acts as a constraint on minimization.
Under such constraint, minimization of retrieves the result derived in Secs. 33 and 43 of [77] from the analysis of the equations of motion near the onset of Rayleigh-Bénard’s convection cells in both unmagnetised and magnetised fluids. Indeed, the author keeps the value of fixed, so that the quantity actually minimized is :
[TABLE]
Since in the problems solved in [77], the words ’not too large’ above mean .
9 Convection in a rotating shell
Some consequences of (22) are discussed in [17]. Here we apply (22) to the description of the relaxed state of a macroscopic, rotating spherical shell
- with inner radius , outer radius , thickness , volume and rotation period P - of a viscous, electrically conducting fluid where a non-vanishing temperature jump drives convective cells with typical linear size rotating with period and typical rotation velocity . We are allowed to invoke (22) as rotation affects (2) unaffected. This is e.g. the case of the subphotospheric, convection-ruled zone of rotating stars [78], where typical values of the Rossby number [79] are [80]. This convection zone is usually supposed to be a spherically symmetric shell where convection and dynamo effect [84] sustain a temperature gradient and a magnetic field B respectively [85]. In the following, we derive a scaling law for B from (22).
The reciprocity principle of variational calculus - see Sec. IX.3 of [86] - makes the solution of (22) to solve also the variational problem:
[TABLE]
Further discussion requires two auxiliary assumptions, to be dropped below. Firstly, we neglect viscosity, i.e. we replace with where E is the electric field and is supposed to be a scalar. Faraday’s law gives where . Secondly, we focus our attention on the case , so that we make a small error if we replace with everywhere inside the shell. For example, according to [85] solar dynamo occurs in a narrow layer between the convection and the radiation zone ( is the solar radius).
As a further, more fundamental hypothesis, we assume small electromagnetic fluctuations, so that we may neglect contributions of dynamo to [87] and write in the relaxed configuration of the shell. Thus, we write with , hence:
[TABLE]
Cowling’s theorem [84] forbids occurrence of dynamo in two-dimensional systems: convection cells in the dynamo-relevant zone are therefore likely to exhibit non-trivial three-dimensional structure. Azimuthal periodicity provides us with an upper bound . Then, maximization in (23) requires that we replace with in (24), and the constraint in (22) gives:
[TABLE]
where . Here we have implicitly assumed that maximization affects just and not directly , which in turn means that we neglect possible saturation of dynamo at very large rotation velocity, i.e., very low . If, furthermore, the centrifugal force is not too strong, then the dependence of F on is weak. In fact, both , , and depend rather - through , , and - on the rotation-independent chemical composition and mass of the star, as well as on its global balances of energy and momentum, i.e. the hydrostatic balance of gravity and pressure gradient, etc. The same holds for the value of in (23). Moreover, is likely to be the same for the stars of the same spectral class, so that (25) leads to:
[TABLE]
In Appendix F we discuss some independent estimates of x which confirm (26) and which do not rely on the auxiliary assumptions referred to above. Indeed, observations [79] for F and G/K dwarf stars are in agreement with , where is the coronal field and . A review [81] which includes both Sun-like and M-type stars shows that (26) fits data as far as , while the above-neglected dynamo saturation occurs at - see Fig. 3 .
10 Flashing ratchet
We have discussed macroscopic systems so far. In contrast, when the number of particles becomes small, like e.g. in molecular machines [93], the environment itself continuously drive the system away out of equilibrium and the fluctuations it generates over the system are very relevant, so that the probability of observing an apparent violation of the second principle becomes significant. Such small machines will spend part of their time actually running in reverse, i.e. now and then it is possible to observe that these small molecular machines are able to generate work by taking heat from the environment, in agreement with Kuramoto’s identity [8].
The generation of mechanical work out of thermal fluctuations requires a) either a time-dependent modulation or another form of energy flux, b) or some spatial asymmetry with a temperature gradient [92]. In ST, cases a) and b) correspnd to systems which are coupled to one (or several) heat bath(s) of constant temperature each respectively. The aim of this Section is to apply (20) in the description of stable operation of such machines in a particular class of problems in the case a). Sec. 11 below is devoted to b).
In case a) and particles diffuse in a potential , which is periodic both in space and time [82] [91] [90]. The particle distribution satisfies a Fokker-Planck equation which is solved by a particle distribution and where , just as in [7], and the relaxed state (if any) may oscillate in time. As for the boundary condition, the system remains confined at all times inside a region of the state space, just like in Sec. 2. The equations of chemical kinetics acts as equations of motion and provide the constraints. Since and both the volume and the chemical composition are fixed (particle diffusion is neglected in agreement with (13)), the free Helmholtz energy F too is constant. Since and the slow diffusion approximation allow us to neglect the amount of entropy produced per unit time by heat transport and particle diffusion respectively, the minimization in (20) reduces to minimization of the path-ensemble-averaged, time-averaged total dissipated power.
Indeed, it has been shown that the evolution of a particular kind (’flashing ratchet’ see Fig. 4) of molecular motor - where the potential oscillates in time between zero and a periodic, asymmetric function of position - follows a path of ’minimum dissipation of the total energy’ with the constraint of given F - see equation (4) of [90]. Admittedly, the treatment of Ref. [90] deals mainly with the 1D case, but in its own words ’many of our results apply when is a bounded domain in ’. The same principle has been applied to more complicated molecular motors - see equation (4.4) of [93]; on a qualitative base, generalization to a Fokker-Planck system with chemical species seems possible - see Appendix D.
Mathematically, the analysis of [90] relies on the results of [83]. Physically, in the flashing ratchet; moreover, the latter is compared to a overdamped mass-string-dashpot system where viscous dissipation only occurs; the crucial point is that ’molecular motors move in a very viscous environment’ in the words of Sec. 2.2 of Ref. [90], so that minimization of the dissipated power resembles Kortweg-Helmholtz’ principle invoked in the DPF pinch - see Sec. 5.
Crucially, never vanishes exactly, even if it is small; smallness of diffusion means just that we are focussing on slowly relaxing perturbations, as anticipated in Sec. 4. Rather, the overall behaviour of the system turns out to be the outcome of the interplay between the tendency of particles tovards concentration at the minima of the potential and diffusion, which tends to spread and dissipate density. If is uniform and constant, and if the ratchet keeps on oscillating in a stable way - so that the average amount of mechanical energy exchanged with the external world is also constant - then the same holds for F - this explains the constraint. Once the ratchet has started oscillating at a given period, physical intuition requires that the external world must do some work per period on the system in order to compensate the effect of thermal fluctuations, which tend to destroy the ordered oscillatory behaviour of the ratchet; ratchet stability against fluctuations is maximum when the required time-averaged amount of power is a minimum.
A final remark. In ST, if a system is in thermal contact with a heat reservoir at temperature , equations (3) and (7) of [9] - together with the fact that is the volume integral of - show that is equal to the time-averaged amount of work done on the system by the external world divided by . The same equations ensure also that even if the discussion of [9] takes explicitly into account the possibility that may not be positive-definite.
11 Brownian motor
Our result (20) provides information on systems too. Again, and in analogy with the discussion of macroscopic systems in Sec. 8 , if then affects (4) through only; then, in the slow diffusion approximation both and leave the validity of (20) unaffected. For example, Feynman describes [95] a ratchet coupled via an axle to a paddle wheel immersed in a fluid. Because of the Brownian motion of molecules, the paddle wheel has an equal probability to be rotated to the right or the left. Owing to the presence of the pawl preventing one rotation direction, say the left, the ratchet will rotate on the right, seemingly violating the second law of thermodynamics. The apparent contradiction is solved and equal rotation probability restored by including thermal fluctuations of the ratchet. However, if its temperature is smaller than that of the fluid, unidirectional rotation becomes possible, enabling conversion of heat into work. Rather than an applied, oscillating potential, it is therefore the non-uniformity of temperature which allows the motor to work. No W is required anymore; for simplicity we are going to take altogether. The aim of this section is to apply (20) to the description of stable, relaxed states of such mesoscopic system, henceforth referred to as ’Brownian motor’. To this purpose, we follow the treatment of [92] and [94].
In a system made by parts, the th part () being in thermal contact with a heat bath at given temperature , we are allowed to invoke (20) as far as the system remains confined at all times inside a region of the state space and obeys a conservation equation like (4) - see equations (3)-(4) of [92]; as for both the self-consistency of slow diffusion approximation and the physical meaning of , see below. Minimization in (20) implies minimization of the contribution of each part to separately. In turn, this implies minimization of the path-ensemble-averaged, time-averaged total dissipated power in the th part for all ’s, because in the bulk of the th part no entropy is produced by heat transport (as across the th part) and the entropy produced by particle diffusion is negligible in the slow diffusion approximation. Finally, and in analogy with our discussion of flashing ratchets above, minimization of the path-ensemble-averaged, time-averaged total dissipated power follows - as the latter is just the sum of the minimized amounts of power dissipated in each part.
To fix the ideas, let us suppose that the th part is immersed in a fluid heat bath and undergoes a viscous drag with friction coefficient , where the th coefficient of proportionality depends on the detailed geometry and orientation in space of the th part - see Fig. 5, as well as equation (12) of Ref. [92] and equation (34) of [94], where the fluids are perfect gases. The th part is free to change its own orientation in space during the operation of the motor. The amount of power dissipated in the th part is an increasing function of ; then, minimization of the path-ensemble-averaged, time-averaged total dissipated power in each part - required by (20) - implies minimization of for all ’s. This minimization occurs as all the parts adjust their own orientation in space. This orientation is unambiguosly described by a set of Euler angles, which are the ’s of the state space of the system, and , the probability distribution of orientation, depends on both the ’s and time. If only 1 part out of is free to change orientation then there are 3 Euler angles; the state space is and can be linked with by a diffeomorphism. It is useful to define the quantity ; it is an increasing function of fluctuation amplitude - see equation (2) of [92]. Since , minimization of implies minimization of .
We have seen that minimization of is a necessary condition for the validity of (20) in our system. We show that is also a sufficient condition. Indeed, the facts that and that make the minimization of to imply the minimization of viscous heating power. Since the latter is just the sum of the amounts of viscous heating power dissipated in each gas and each amount is non-negative, minimization of each amount separately follows. This minimization is achieved through changes in the detailed geometry of each part [92], hence all ’s remain unaffected. As a consequence, minimization of the amount of viscous heating power in each part implies minimization of the amount of entropy produced in each part; after summing on all parts and averaging, (20) follows.
Indeed, it has been shown - in the particular case where all Euler angles but one are blocked at least - that: a) a molecular motor whose parts are in thermal contact with perfect gases acting as heat baths at different temperatures tends to minimize ; b) this minimization implies also the minimization of the diffusion coefficient, in agreement with the slow diffusion approximation; c) the relaxed state may violate MinEP - see equations (5), (7) and the comment to equation (18) of [92]. Reults c) holds in analogy with the results of [53] concerning Joule heating and referred to in Sec. 4 .
12 Conclusions
The second principle of thermodynamics provides information concerning a physical system at thermodynamic equilibrium; such information do not depend on the choice of coordinates with the system is described with. Far from thermodynamic equilibrium, the same holds for the description provided by Glansdorff and Prigogine’s minimum entropy production principle (MinEP) for the stable, relaxed state of a macroscopic system which satisfies Onsager symmetry [18]. (Conservatively, here we stick to the familiar Gibb’s entropy). To date, no proposed extremum property for far-from-equilibrium, stable relaxed states of systems which violate Onsager symmetry seems to enjoy such invariance. For example, a first-principle proof [39] of the so-called maximum entropy production principle (MEPP) [15] seems to rely precisely on violation of this invariance. Physical intuition, in contrast, makes us to look for extremum properties - for stable, far-from-equilibrium relaxed states - which are invariant under change of coordinates. The price to be paid is likely to be the loss of generality. Indeed, the results of [17] suggest that variational principles concerning particular contributions to the entropy balance may hold for selected classes of problems.
Another kind of invariance - namely, invariance of microscopic physics under time reversal - plays a crucial role in Stochastic Thermodynamics (ST), which provides a framework for extending the notions of classical thermodynamics to the level of individual trajectories across the space state of a selected class of (usually mesoscopic) physical systems, namely those systems where a non-equilibrium process occurs which is coupled to one (or several) heat bath(s) of constant temperature. Remarkably, ST is often concerned with oscillating relaxed states, in contrast with MinEP and MEPP, which deal with steady relaxed states. Rather than to conditions of stability for stable relaxed states, ST leads to a large number of exact, experimentally confirmed results - usually referred to as ’fluctuation theorems’ (’FTs’) - starting from the invariance of the microscopic dynamics under time-reversal. FTs allow to quantify the probability of observing occasional ’reverse’ behaviour, i.e. i.e. an entropy production opposite to that dictated by the second law of thermodynamics; it decreases exponentially as the system size increases [4] [3] [14].
The success of FTs suggests to take advantage of the properties of invariance of a system while looking for some property of its stable, far-from-equilibrium relaxed states. In particular, it seems worthwhile to investigate the consequences of the invariance under diffeomorphisms involving the coordinates of the space state. Generally speaking, these diffeomorphisms can map macroscopic systems (where the volume of the system in the space state is large enough and fluctuations can be neglected in some cases at least) and mesoscopic systems (where the volume is small and fluctuations, possibly of quantum origin, cannot be neglected altogether) onto each other. Correspondingly, if such a property actually exists and is invariant under change of cooordinates then it provides information on the stability of both macroscopic and mesoscopic systems. Moreoverm given the fact that relaxed states of mesoscopic systems are often oscillating states, this - so far hypothetical - property should describe oscillating relaxed states - and steady states as a particular, zero-frequency case - in order to cope with both macroscopic and mesoscopic systems. A connection could be established between different problems of stability. Finally, of course, this property should agree with ST.
The aim of this paper is twofold: a) to write down this invariant property for the relaxed states of a selected class of physical systems, and b) to discuss some applications to stability. We limit ourselves to systems described by a PDE problem (4) made of a simple Fokker-Planck equation and a boundary condition (the system remains confined at all times within a given region of the state space). It has been shown [7] [47] [53] that if a generic, differentiable transformation M of both the coordinates which the probability distribution function depend upon and of the quantities involved in (4) leaves both (4) and time unchanged, then the amount (often referred to as in the literature) of entropy produced per unit time by irreversible processes inside the bulk of the system is invariant under M .
We have investigated the consequences of this invariance. To start with, each M is a relationship of equivalence, hence it establishes a partition of the set of the solutions of (4) in equivalence classes; each class is unambiguously labelled by a value of . Secondly, given the possibility of writing (4) in dimensionless form (and of avoiding therefore problems with dimensional analysis), it follows that different elements of the same class of equivalence may represent different physical systems with the same value of , as far as all of them satisfy (4) even if the other details of the dynamics are different. For further discussion, firstly we discuss the necessary condition for the stability of a steady relaxed state of a macroscopic system; generalization follows.
If relaxation occurs towards some final, relaxed, steady state which is stable against small perturbations then unsteady, relaxing states are mapped onto unsteady, relaxing states and relaxed, stable states are relaxed onto stable, relaxed states. For example, thermodynamic equilibria are mapped onto thermodynamic equilibria (). In particular, if at least one (’pivotal’) of the systems which satisfy (4) exists where constrained minimization of (the constraints being provided by the equations of motion) is a necessary condition for the stability of relaxed states against small, slowly evolving perturbations, then the same property is shared by the stable, relaxed states of all physical systems which are obtained from the pivotal system via a map M; even if the detailed equations of motion may be different, the minimized quantity is the same. This corollary follows ultimately from the invariance of , not from validity of Onsager symmetry nor from the detailed nature of the involved systems.
Further discussion takes advantage of the fact that we are looking for necessary conditions for stability, hence we are free to select the perturbation which stability is to be checked against. For simplicity, we assume that the perturbation is slow, i.e. that its typical time scale is longer than other typical time-scales of the dynamics of the system (for example, the oscillation period in a oscillating relaxed state). This is equivalent to assume that the diffusion term in the Fokker-Planck equation is relatively small (diffusion stands for irreversible phenomena which damp perturbations of stable states); this approximation is referred to as ’slow diffusion approximation’ below.
Now, we show that the electrons in the pinch of a Dense Plasma Focus (’DPF’) [57] form a pivotal system. Depending on the relative relevance of electromagnetic and viscous force [61], the spatial structure of the system may either filamentary or plasmoid-like [55]. Stable, steady relaxed configurations minimize . (Experiments [58] [59] suggest that these structures are indeed exceptionally stable against resistive decay as predicted by magnetohydrodynamics (’MHD’), so that the words ’stable’ and ’steady’ seem to be justified).
Generalization from steady to oscillating relaxed states of macroscopic systems requires replacement of with its time-average , the time-average being taken on a time scale which is both much longer than one period of oscillation of the relaxed state and much shorter than the typical relaxation time scale of the perturbation [13]. Analogously, generalization to mesoscopic systems requires further replacement of with its path-ensemble average [10] . These generalizations rely on the fact that the map commute with both time-averaging and path-ensemble-averaging, near a stable relaxed state at least. When it comes to mesoscopic systems, we could say that the equivalence classes referred to above act as "universality classes" which these systems can be grouped in.
A striking consequence of the generalisation to mesoscopic systems is that the problems of stability of macroscopic and mesoscopic systems may be dual to each other, i.e. share a common solution. The problem of stability against slow perturbations of the relaxed state of a mesoscopic system described by (4) which remains confined at all times within a given region of the state space (e.g., a Brownian motor) and the problem of stability of the relaxed state of a weakly dissipative fluid with no net mass source, no mass flow outside the boundary and negligible particle diffusion (e.g., the subphotospheric, convection-ruled, dynamo-affected layer of a G-class rotating star) are dual to each other. Even if the star produces an immensely larger amount of entropy per unit time, it is not the actual, averaged value of which is relevant to the problem of stability, but its minimum property. (Analogously, it is the maximum property of entropy , not its actual value, which is relevant to the stability of thermodynamic equilibrium). Qualitative arguments even suggest that constrained minimization of are indeed in agreement with ST, even if no pivotal system is available - see Appendix D . This minimization is therefore the looked-for invariant property of stable relaxed states. Fig. 2 displays a general overview of the results of our discussion. In a nutshell, invariance bridges the gap between the problems of stability of some selected macroscopic and mesoscopic systems.
In order to discuss applications, we start with the following remark. As far as we investigate their stability against slowly evolving perturbations, a macroscopic, classical fluid - with Joule and viscous heating, negligible mass diffusion, no net mass source and negligible mass flow across the boundary - and a mesoscopic system satisfying (4) - with as the state space - enjoy the same property: both can be mapped onto a pivotal system. In the macroscopic fluid, equation (4) reduces to a simple mass balance; in the mesoscopic system, investigation of stability against slow perturbations allow us to invoke the slow diffusion approximation. Consequently, they share also the same necessary condition of constrained minimization of for the stability of relaxed states. The equations of motion provide the constraints, which depend therefore on the particular system of interest. It does not matter what the detailed dynamics is like; what is relevant here is that such mapping is possible. Examples of the macroscopic fluids quoted above are the filamentary configuration of the electrons in the DPF pinch and a (possibly unmagnetized) fluid near the onset of Rayleigh-Bénard convection; indeed, it is shown in Appendix E that the problems of stability of the two systems are dual to ech other, even if the dynamics is utterly different.
Beyond thermodynamic equilibrium (where of course ), in the following we are going to discuss four particular problems as particular cases of the macroscopic and the mesoscopic systems hinted at above. In each case, we have derived from the constrained minimization of a necessary condition of stability, not necessarily in the form of a variational principle. The criterion of stability we obtain depends on the particular problem. Noteworthy, both MinEP and MEPP apply in no case. In particular, according to the language of Ref. [16] all macroscopic systems we deal with in the following are ’compound’ systems, where MEPP fails. This is not surprising, as both MinEP and MEPP are concerned with the entropy produced per unit time by all irreversible processes in the system, while is the amount produced per unit time by the irreversible processes only which occur in the bulk. In all cases we retrieve results previously available in the literature, independently derived from either observations or dedicated proofs from first principles.
Results are collected in Table 1 . The list is not - and is not meant to be - complete. Each column of Table 1 refers to a different problem far from thermodynamic equilibrium, where each of the quantities on the left may either vanish (’’) or differ from zero (’’). For each problem, results are discussed below; we start from the minimization of and retrieve a corresponding necessary criterion of stability, which finds its confirmation in the Refs. referred to in the first line of the Table. For comparison, is the necessary criterion of stability of thermodynamic equilibrium ().
(A) Electrons in a DPF pinch [57]. Rather than the identically vanishing , it is which keeps the stable, steady relaxed state far from thermodynamic equilibrium in this macroscopic system. Here W is a suitably defined linear combination of velocity and vector potential and mass density. (’Steady’ means here ’with lifetime much longer than predicted by MHD’). The necessary criterion of stability of the relaxed state is with fixed [55], where is the sum of viscous heating power and Joule heating power density, and is the value of the temperature at the boundary. If viscous heating is (not) negligible in comparison with electromagnetic force, then a plasmoid-like configuration (a filamentary structure) arises. In the plasmoid case [59], th enecessary criterion of stability a monotonically increasing function of resistivity achieves a minimum - in agreement with the results of [64]; moreover, Taylor’s principle of minimum magnetic energy for given minimum helicity is approximately satisfied [63]. In the filamentary case [58], current filaments are similar to those of a superconductor [73] ; the relaxed state belongs to the same class of equivalence of a (possibly unmagnetized) fluid near the onset of Rayleigh-Bénard convection, in spite of the different equations of motion and of the different values of .
(B) Rayleigh-Bénard convection in a rotating shell of magnetized fluid affected by dynamo. The subphotospheric, convection-ruled zone of a rotating star [78] provides an example. Here W and are the velocity and the mass density repsectively. Both and keep the oscillating relaxed state of this macroscopic system far from thermodynamic equilibrium; rotation of both the shell and the convective cells rule the oscillation frequencies. It turns out that the absolute value of is minimized, with the constraint that the time-averaged dissipated power is equal to the time-averaged mechanical power provided by buoyancy; this result had been previously found in [17] under the more restrictive assumption of local thermodynamic equilibrium [1], and is in agreement with [77]. As a consequence, the averaged magnetic field is inversely proportional to the Rossby number, in agreement with observations for F and G/K dwarf stars [79] [81] .
(C) Flashing ratchet. An externally applied, oscillating potential keeps this mesoscopic system far from thermodynamic equilibrium, even if and (, where is an externally applied potential, which is periodic both in time and in space, and is asymmetric in space; is a particle distribution). In order to sustain regular ratchet oscillations, the external world must do some work per period on the system in order to compensate the effect of thermal fluctuations, which tend to destroy the ordered oscillatory behaviour of the ratchet; ratchet stability against fluctuations is maximum when the required time-averaged amount of power is a minimum. It turns out that stable relaxed states minimize the dissipated power with the constraint of given Helmholz’ free energy. This is in agreement with the results of Ref. [90].
(D) Brownian motor. This mesoscopic system is made of parts (), where the -th part is immersed in a fluid heat bath at temperature and viscous coefficient . Rather than an applied, oscillating potential, it is therefore the non-uniformity of temperature which allows the motor to work, by enabling conversion of heat into work. The differences between the constant ’s keep the system far from thermodynamic equilibrium. Stable relaxed, oscillating states minimize a a quantity (the ’effective temperature’ ) - which has the dimension of a temperature and which depends on the ’s - under the constraint of given ’s. This is in agreement with the results of Ref. [92], where and refers to the probability of orientations in space; this treatment stems from the seminal analysis of Feynman [95].
We have shown that invariance against changes of coordinates plays a crucial role when it comes to understand many different criteria for stability for dissipative macroscopic and mesoscopic systems. In a nutshell, invariance allows us to see that the problem of the stability of the far-from-thermodynamic-equilibrium relaxed state of a physical system may be dual to the corresponding problem in another system, even if the two systems may differ considerably. Then, once the problem has been solved for one system, the solutions of the problems for a whole class of physical system is at hand. Here we have discussed invariance of suitably averaged for a wide class of systems. There is no fundamental reason to think that invariance of other quantities for other classes of systems is impossible, and that it leads to no further criteria for stability.
Finally, it is clear, by now, that our pivotal system acts just as a Trojan horse which allows us to smuggle the stability criterion (20) into the world of the relaxed solutions of (14) under the slow diffusion approximation. DPF pinch electrons (column (A) in Table Table 1) are by no means special: once the existence of the pivotal system is proven, its detailed dynamics is not relevant, as expected in a thermodynamic approach. We could as well start e.g. from the systems described in columns (B, (C) or (D), where the stability criteria have been speparately and independently established. It is even possible that no ’pivotal’ system is really required - as suggested by the qualitative discussion of Appendix D - and that a deeper connection exists between invariance and stability exists. Thus, further extension of the method outlined here to other problems is conceivable, and will be the matter of future work.
Acknowledgments
Fruitful discussions with Prof. M. Polettini (Univ. Luxembourg) and Dr. E. Cosatto (Ansaldo Energia, Genova) are warmly acknowledged.
Appendix A More about MEPP
MEPP meets no general consensus yet. For example, Ziegler’s version [21] of MEPP has ’its statistical substantiation only if the deviation from equilibrium is small’ [15] and is criticised in [22]. According to Ref. [16], Ziegler’s MEPP can be considered just as a working hypothesis for nonlinear nonequilibrium thermodynamics. Moreover, validity of MEPP is questionable in the so called ’compound processes’. By definition, the entropy production of a compound process is a sum of functions dependent only on some (but not on all) of the thermodynamic fluxes, and the fluxes which one of these functions depends upon do not affect the other functions. Simultaneous chemical and thermal processes can be the simplest example of compound processes: the former are scalar and the latter are vector; and according to Curie’s principle, these processes cannot influence each other. Finally, the MEPP-supporting results of [24], which rely on the so-called ’information thermodynamics’ approach [25], are criticized in [23]. In particular, it has been observed [17] that information thermodynamics provides the fundamental tenet of equiprobability of microstates with no physical basis similar to what is provided e.g. by Liouville’s theorem in equilibrium thermodynamics; in the harsh words of [44], the author of [25] ’swept the dirt under the carpet’.
According to a further, ’steepest entropy ascent’ approach [26] [28], irreversibility in Nature is a fundamental microscopic dynamical feature and as such it must be built into the fundamental laws of time evolution. In this approach, MEPP ultimately arises from a fundamental non-unitary extension of standard Schroedinger unitary dynamics. In the words of [26], ’every trajectory unfolds along a path of steepest entropy ascent compatible with the constraints […]. For an isolated system, the constraints represent constants of the motion. […] the qualifying and unifying feature of this dynamical principle is the direction of maximal entropy increase. […] The challenge with this approach is to ascertain if the intrinsic irreversibility it implies at the single particle (local, microscopic) level is experimentally verifiable, or else its mathematics must only be considered yet another phenomenological tool’. Recent research aims precisely at comparison of predictions based on steepest descent approach and experiments where relaxation is postulated to follow a particular law - we refer to [27] and in particular to its equation (12). When it comes to classical physics, the constraints acting on the relaxation identify a hyper-surface in the phase space of the system. In the words of [28], ’on this surface we can identify contour curves of constant entropy, generated by intersecting it with the constant-entropy surfaces. Every trajectory […] lies’ on this surface ’and is at each point orthogonal to the constant entropy contour passing through that point. In this sense, the trajectory follows a path of steepest entropy ascent compatible with the constraints’. However, this conclusion relies on the postulate that relaxation obeys a particular law -see equation (25) of [28], retrieved also in equation (12) of [27].
Admittedly, general arguments supporting MEPP are discussed in [29]. Remarkably, however, this work seems to identify a property of , rather than of . In fact, equation (7) of [29] suggests maximisation of the amount per unit time of the entropy of the Universe due to exchange between the relaxed system and the external world - in our language, maximisation of rather than of . The same limitation seems to apply to many alleged applications of MEPP, including metallurgy [30], crystallization [31] and crystal growth [32], solidification [33], physics of the atmosphere [34] [35] [36] [37] and plasma physics [38].
For example, the growth of a crystal is obviously affected by what happens at its boundaries. Correspondingly, the selection criterion discussed in [30] is related to the ’conjecture that each lamella must grow in a direction which is perpendicular to the solidification front’. Analogously, the results of [31] are related to the growth velocity via the ’rate of entropy production per unit area’, which is a surface quantity like and unlike . Finally, in [32] a crystal grows and changes its surface so as to select a flow which maximises the entropy production, being the only contribution to the entropy balance which depends on both a flow and the surface of the system. As for solidification, the relevant quantity is the ’entropy generation term which depends on the solid-liquid region’ [33], i.e. on an interfacial boundary. As for the atmosphere, the only contribution to the entropy balance investigated in [34], [35], [36] and [37] is due to energy fluxes across boundaries - see equations (5) and (14) of [34], equations (1) and (2) of [35], Fig. 1 of [36] and equation (1) of [37] - and is therefore to be identified with ; in contrast, the agreement of MEPP with models of a stratified atmosphere including viscosity (so that is not negligible) seems to be poor [42]. As for plasmas, what is actually maximized in equation (7) of [38] is the inhomogeneity of temperature in a boundary layer, i.e. a quantity related to .
In contrast with MEPP, different extremum properties for and seem to hold for relaxed states of different physical systems under different boundary conditions -see e.g. Sec. 3 of [40] for a particular example and [17] for a more general discussion, which includes some of the results quoted above. For instance, the latter discussion involves also an extremum property similar to what is proposed in [29], but concerning rather than ; such property is suggested in [41] for a shock wave in the limit of large Mach number, where the volume integral is as vanishingly small as the volume occupied by the shock front.
Appendix B More about (2) and (4)
We follow the treatment of [7] and [47]. The results listed below have been derived in [47] in the framework of the research concerning the models [49] [50] based on Schnakenberg’s treatment of master equation [51]. Equations (1), (2) and (4) give:
[TABLE]
after integration by parts - for details, see the proof of equation (10) in [7] for a Boltzmann-Gibbs entropy. The first term on the R.H.S. of (27) is the only term which is in all cases, because . Then, comparison of (2) and (27) leads to (5) and (6).
According to [7] and [47], (4) is invariant whenever (7), (8) and (9) hold. Indeed, is a scalar, then (7) holds. Moreover, the definition of allows us to write:
[TABLE]
Now, invariance of the normalization condition and equation (28) imply (8), i.e. transforms like a scalar density. Finally, invariance of and equation (8) imply (9), i.e. J transforms like a vector density.
In turn, equations (7), (8) and (9) have a number of consequences.
Firstly, invariance of our boundary condition follows from (9).
Secondly, (1) and (8) imply , i.e. is not invariant -see equation (8) of [47].
Thirdly, (1) and Reynolds’ transport theorem [48] give . Together with (4), integration by parts and Gauss’ theorem of divergence lead to . In turn, the latter result, together with (28) and (9), leads to:
[TABLE]
i.e. is not invariant - see equation (11) of [47].
Fourth, (4), (7), (8), (9) and (29) lead to (10) - see equation (10) of [47].
Finally, after cumbersome algebra equations (28), (4), (9), (29) and (10) lead to (11). This result can also be obtained straightforwardly by observing that both and the quantity in (5) are invariant according to (7), (8), (9) and (28). In contrast with , the lack of invariance of , together with equations (2) and (11), show that too lacks invariance.
Appendix C More about (11)
We can rewrite (11) in a sligthly different way for systems which exchange only heat with a thermal bath at temperature . In such systems, is equal to the amount of heat flowing per unit time from the system towards the bath divided by and is therefore equal to the time derivative of bath entropy, provided that - as usual - we neglect all irreversible processes occurring inside the bulk of the bath, i.e. we assume in the bath. Then, , so that (2) and (11) give . Integration on time from to a time gives:
[TABLE]
Just like (2) and (11) which is based upon, (30) holds for both steady states and unsteady states. As for steady states, ; moreover, we denote with the value taken by . As for unsteady states, may differ from [math] and we define . Finally, we write (30) for both steady and unsteady state, subtract term by term and obtain
[TABLE]
Now, equation (31) is just equation (11) of [54] (but for a slightly different definition of ); the latter relationship deals with the invariance properties of the entropy balance in systems undergoing Markovian evolution according to an equation which generalises (4) under transformations which include also rescaling of time-scales.
Appendix D More about (20)
After multiplying by and exponentiating, the reationship (18) implies invariance of the dimensionless quantity where . In other words, X does not depend on the Jacobian defined in Sec. 2. After deriving X once and twice on , we obtain therefore:
[TABLE]
and respectively; the latter relationship gives:
[TABLE]
After taking the path ensemble average of both sides of (32) and (33) we obtain
[TABLE]
Until now, we have assumed to be constant. Nothing essential changes, however, if we let it to oscillate mildly and slowly in time, say - think e.g. of a self-similar rescaling with slow, alternating expansion and compression. A relaxed, stable state will be also oscillating with period , at least as far as the external world keeps on driving such slow oscillations.
If no dissipation occurs inside the system, then and at all times. If dissipation occurs, our assumption of constant and uniform leads to , where is the amount of dissipated work from to the time . It follows that and . Let us denote with and the quantities obtained by and via time reversal respectively. Now, we may invoke the identity
[TABLE]
-see equation (15) of [10]- provided that some assumptions are made; this point is discussed below. If then (35) reduces to Jarzynski’s equality . Together, (34) and (35) lead to:
[TABLE]
Since our stable relaxed state is permanently oscillating, however, we may drop the subscript R altogether in (36), as time reversal leaves oscillating quantitites unaffected but for a change of phase. Accordingly, (36) leads to:
[TABLE]
Now, we invoke the argument of Sec. 7. The small perturbation of a stable, relaxed state leaves the ’s remain unaffected, and we are allowed to write all along the perturbation. Accordingly, for a stable, relaxed state relationships (37) and the definition of Y lead to:
[TABLE]
i.e., is a minimum in stable relaxed states with respect to a small perturbation of . The assumption of oscillating is relaxed in the limit (i.e. is constant). It is relaxed too, provided that we write the generic, slowly evolving as a sum of harmonic functions of time with different periods, provided that each term is so small that we may consider the response of to as a sum of responses to each term.
Admittedly, our argument relies on an identity, (35), whose validity is limited by a number of additional assumptions [10]: a) the energy of the system is finite; b) the dynamics are Markovian; c) if the system is already in equilibrium and is unperturbed, then it must remain in equilibrium; d) the system starts in thermal equilibrium and is driven from that equilibrium by an external perturbation. Assumption a) and b) are satisfied in many systems described by (4). Assumption c) abd d) may be satisfied e.g. if the system is in contact with one heat bath at temperature . In particular, assumption d) is satisfied e.g. if the external world raises slowly starting from a zero value at . However, these assumptions are rather artificial, and their validity in many applications is questionable, to say the least. Accordingly, our argument is just a qualitative support, rather than a rigorous proof, of (20).
In conclusion, we discuss a result which seems to agree - qualitatively at least - with the validity of (20) for stable relaxed states of a system which describes cyclic population dynamics of species, which undergoes a Markovian stochastic process and which is described by an approximate Fokker-Planck equation [70]. The space of the concentrations of 3 chemical species is our state space. It turns out that when the external world changes a control parameter and drives the system across a supercritical Hopf bifurcation from large limit-cycle-like oscillations to small erratic oscillations around a fixed point, then peaks very much near the transition, and decreases away from it. In our language, it is the amplitude of the oscillation which is in steady state in both configurations; the limit cycle and the fixed point are equivalent to a stable oscillation with constant amplitude and respectively, once small-scale, fast fluctuations have been averaged out. The words ’approximate Fokker-Planck’ are justified because the diffusion coefficient is not exactly constant; however, once the system has relaxed then fluctuations of diffusion coefficient too can be averaged out. No exchange with the external world is considered, then and (2) gives . As the system is driven from one stable configuration to the other one, firstly the entropy production increases, then decreases: this is consistent with our result that each stable state corresponds to a local minimum of entropy production. Admittedly, this is no rigorous result, as the value of computed when fluctuations are taken into account differs from the value of computed when fluctuations are neglected (but in particular cases). Remarkably, however, fluctuations may even lower [71], so that our arguments are not weakened.
Appendix E More about the pivotal system
A pivotal system satisfies (14), (15) and, as a consequence, (16) (or, equivalently, (17) if ). In order to fix the ideas, we start with the DPF filamentary structure discussed in Sec. 5 . At the end, we shall see that a suitable rescaling maps this structure onto somewhat completely different. To start with, let us introduce a complex-valued function of position where is the (’Bohm’) diffusion coefficient [65]. We neglect ; this assumptions is justified below. We invoke the following two results of Ref. [73]. Firstly, Ampère’s law ensures that both (17) and are satisfied provided that . Secondly, both (15) and the momentum balance (including Lorenz force density) are satisfied, provided that: a) a contribution to the pressure of the fluid (which appears in equation (5) of [73] and Sec. 8 of [55]) is neglected, in agreement with (14); b) viscous terms in the momentum balance are of the same order of the interaction with external currents in Sec. 5 and are neglected, in agreement with equations (3) of [73] and (8.3) of [55]; c) solves the PDE problem:
[TABLE]
where is the speed of sound and , is the value of in the limit , , . A non-zero value for the quantity (which has the dimension of ) takes into account possible dissipation-free dispersion of waves across the system; it leaves therefore our discussion concerning unaffected. As for the equation in (39), it is just equation (9) of [73], and contains no viscous term. As for the boundary condition, we refer to Sec. 45 of [74] which deals with a case and which corresponds precisely to vanishing flow across the boundary; this makes sense as (39) leads to both (17) and to the balance of momentum regardless of the actual value of , thus allowing a complete description of the pivotal system.
In particular, if - like e.g. in [55] - then according to [73] the equation in (39) is formally similar to the Ginzburg-Landau (’GL’) equation for a superconductor [74]. Remarkably, dissipation in a superconductor occurs mainly near the axis of a filament, i.e. where . In contrast, the opposite, ’far-from-the-filament-axis’ limit (14) corresponds to weak dissipation, in agreement with (13) as anticipated. (Even if the main entropy-raising mechanisms in the bulk of of the pivotal system, namely Joule heating and viscous heating, are related to diffusion of magnetic field lines and to diffusion of particles respectively. Both processes lead to dispersion of matter across space. Then, it is self-consistent to neglect dissipation-related terms in the equations of motion when (13) holds). As shown in Sec. 4, this does not prevent from being . In the same limit, terms become small - as anticipated above - and A is ruled by the contribution of the external world, which is negligible in the pivotal system. (Gradients of , and vanish everywhere). Finally, if , then A plays no role and we retrieve an unmagnetized fluid where Euler equation plays the role of momentum balance.
Now, let us further discuss (39) in the ’far-from-filament-axis’ region, i.e. in the slow diffusion approximation. To this purpose, we need some intermediate relationships. Firstly, we introduce the quantities , , , and (so that ). Secondly, the definition of and (16) imply and:
[TABLE]
[TABLE]
The 3 equations (15), (40) and (41) lead to the 3 following relationships:
[TABLE]
[TABLE]
[TABLE]
where (13) and (14) imply and respectively. Equations (39), (41), (42), (43) and (44) lead to:
[TABLE]
Now, we take (Coulomb gauge), hence . We neglect also because of (14). Finally, we note that and that . Then, (13) allows us to write , and (45) reduces to:
[TABLE]
Equation (46) is equivalent to a reduced form of the equation in (39) in the slow diffusion approximation. It enjoys two relevant properties. Firstly, in a pivotal system the fact that depends explicitly on neither A, , and allows the latter quantities to take arbitrary values independently from each other, and (14) allows us to neglect their gradients across the system; by doing so, we make an error . Secondly, if then and the scaling gives , and . Then, (46) reduces to a particular case of nonlinear Schroedinger equation, i.e. . Not surprisingly, equations (6), (53) and (54) of [75] show that this equation leads precisely to the Euler equation for unmagnetized fluids, as a term (the so called ’quantum pressure’) is discarded because of (14).
At last, here it comes the fundamental result of this Section. Let us take with no loss of generality. Then, in steady state ( ) (46) is the steady state version of a GL equation with real coefficients. Now, the latter equation, which we write in the form
[TABLE]
describes also the steady state of a fluid near the onset of Rayleigh-Bénard convection - see equation (5) of [75] for the case and equation (3) of [76] for the general case respectively; here and are positive constants, and is the Laplacian on some rescaled spatial coordinate q. Depending on the values of and , the boundary condition on may be either Dirichlet (), Neumann () or a linear combination of the two (). Together, the fact that and equation (8) allow us to find a rescaling which maps (46) in steady state and (47) onto each other (gradients of are negligible in the slow diffusion approximation, because of (14)). In particular, once both , , c and are known then (46) in steady state reduces to (47) provided that we identify and q with , , , and y respectively.
The reverse is also true. All the way around, once and are known for a fluid described by the quantity in the space near the convection onset and a known boundary condition (Dirichlet, Neumann or a linear combination of the two). Then, it is always possible to map (47) onto a steady state version of (46) of a pivotal system through a rescaling with Jacobian , because we have 5 unknown quantities (, , , c and ) against the 4 = 3 + 1 constraints given by the 3 known values of and and by the 1 boundary condition (which gives the value of ). There is still room for 5 - 4 = 1 degree of freedom, namely .
In other words, a steady state fluid on the verge of Rayleigh-Bénard convection is in the same class of equivalence of a particular pivotal system, namely the steady, relaxed filamentary configuration of a DPF pinch. It is precisely the fact that dissipation is not required for the description of the steady relaxed state - see Sec. 4 - which allows the inviscid equation (39) to provide information on the steady relaxed configurations of fluids with convection. Rather, dissipation describes relaxation; the latter is very slow in the slow diffusion approximation and may therefore safely be described as a slow succession of steady states.
Of course, detailed dynamics is completely different in the two systems: DPF filaments are ’steady’ only when compared with MHD time-decay, the temperature gradient across the fluid is not negligible, radiation transport of heat occurs across the DPF, and is it is not even required that a magnetic field occurs in the Rayleigh-Bénard case. But dynamics provides just constraints on minimization of ; it does not change the fact that if a steady state is stable in one system - hence - then the corresponding steady state is also stable in the other system - i.e., remains a minimum. This is in analogy to what happens at thermodynamic equilibrium, where the fact that attains a maximum does not depend on the choice of the x’s. We may say that the problems of stability of the electric current filaments in a DPF pinch and of a fluid near Rayleigh-Bènard convection are dual to each other.
Appendix F More about (26)
Independent estimates of x confirm (26). The plasma of our shell is made of has three remarkable features [78]: a) inter-particle collsions are far from negligible, as ; b) the pressure is not negligible in comparison with the magnetic energy density ; c) radiative transport of heat is relatively weak. In such plasmas, a rescaling by a dimensionless factor leaves physics unaffected provided that the Collisional Vlasov High Beta Scaling (’CVHBS’) holds [88]:
[TABLE]
According to (48), and , as and have the dimensions of (length)(time) and of time respectively. Together, (26) and (48) imply:
[TABLE]
i.e., the relaxation time of a perturbation of our shell increases with increasing Rossby number. The results of [80], which do not rely on our assumption , confirm (49).
We have neglected viscosity heating so far. Numerical simulations of an extensive set of dynamo models in rotating spherical shells [89] take into account also viscous heating and cover a wide range of control parameters. Generally speaking, these simulations hint at lack of explicit dependence of the magnetic field on either resistivity or rotation period, and suggest rather a strong dependence on buoyancy, in agreement with our discussion. Results are fitted by the following scaling laws:
[TABLE]
where , and are a modified Rossby number, a modified Nusselt number and a modified Rayleigh number representing the cell rotation, the heat transport and the (suitably normalized) amount of heat advected by buoyancy per unit volume and unit time respectively. Together, (50) and the definitions of and give:
[TABLE]
Now, (48) imply
[TABLE]
because has the dimensions of energy (length)(time)2 (or,equivalently, of ). Relationships (48), (51) and (52) give:
[TABLE]
Together, (48), (50), (53) and the definition of give:
[TABLE]
Moreover, we assume that a small mass element follows a random walk among many convective cells and write , so that (48) implies:
[TABLE]
Finally, (26), (48), (50), (55) and (54) and the definition of give , a result which is still compatible with the range reported in [79].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. Glansdorff, I. Prigogine Thermodynamic Theory Of Structure, Stability And Fluctuations Wiley, London (1971)
- 2[2] P. Glansdorff, I. Prigogine, Physica 30 , 351 (1964)
- 3[3] U. Seifert, Rep. Prog. Phys. 75 , 126001 (2012)
- 4[4] D. J. Evans, D. J. Searles, Advances in Physics, 51 , 7, 1529-1585 (2002)
- 5[5] T. Tomé, Braz. J. Phys. 36 , 1285 (2006).
- 6[6] T. Tomé, M. J. de Oliveira, Phys. Rev. E 82 , 021120 (2010)
- 7[7] G. A. Casas, F. D. Nobre, E. M. F. Curado, Phys. Rev. E 86 , 061136 (2012)
- 8[8] D. M. Carberry, S. R. Williams, G. M. Wang, E. M. Sevick, D. J. Evans, J. Chem. Phys. 121 , 17, 8179-8182 (2004)
