Optimization of stochastic thermodynamic machines
Yunxin Zhang

TL;DR
This paper optimizes the performance of stochastic thermodynamic machines by deriving optimal external potentials that minimize irreversible work and entropy production, providing explicit bounds and illustrative examples.
Contribution
It introduces a method to optimize stochastic thermodynamic machines using Fokker-Planck equations, variational techniques, and characteristics, deriving bounds on work, power, and efficiency.
Findings
Optimal external potentials minimize irreversible work and entropy production.
Explicit bounds for work output, power, and efficiency are derived.
Examples demonstrate the application of optimal protocols.
Abstract
The study of stochastic thermodynamic machines is one of the main topics in nonequilibrium thermodynamics. In this study, within the framework of Fokker-Planck equation, and using the method of characteristics of partial differential equation as well as the variational method, performance of stochastic thermodynamic machines is optimized according to the external potential, with the irreversible work , or the total entropy production equivalently, reaching its lower bound. Properties of the optimal thermodynamic machines are discussed, with explicit expressions of upper bounds of work output , power , and energy efficiency are presented. To illustrate the results obtained, typical examples with optimal protocols (external potentials) are also presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: Shanghai Key Laboratory for Contemporary Applied Mathematics, Centre for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China. Email: [email protected].
Optimization of stochastic thermodynamic machines
Yunxin Zhang
Abstract
The study of stochastic thermodynamic machines is one of the main topics in nonequilibrium thermodynamics. In this study, within the framework of Fokker-Planck equation, and using the method of characteristics of partial differential equation as well as the variational method, performance of stochastic thermodynamic machines is optimized according to the external potential, with the irreversible work , or the total entropy production equivalently, reaching its lower bound. Properties of the optimal thermodynamic machines are discussed, with explicit expressions of upper bounds of work output , power , and energy efficiency are presented. To illustrate the results obtained, typical examples with optimal protocols (external potentials) are also presented.
Keywords:
Variational Method; characteristic curves; heat engine; total entropy production.
††journal: J. Stat. Phys.
1 Introduction
How we can improve the performance of thermodynamic machines is always an interesting but difficult problem in thermodynamics since the pioneer work of Carnot and Clausius Carnot1824 ; Clausius1856 . According to the second law of thermodynamics, the energy efficiency, defined as with the output work and the heat uptake from the hot heat bath, is not more than the Carnot efficiency , where and are absolute temperatures of hot and cold heat baths, respectively. However, it is commonly thought the Carnot efficiency can only be achieved in quasistatic limit with output power vanished. So, in recent decades, most studies turned to discuss the efficiency at (or near) maximum power (EMP) . For example, the Curzon-Ahlborn efficiency at maximum power is obtained in Yvon1955 ; Chambadal1957 ; Novikov1958 ; Curzon1975 ; Broeck2005 . The bounds of EMP, , is obtained in Schmiedl2008Efficiency1 ; Esposito2010Efficiency ; Izumida2011Efficiency . By methods of linear irreversible thermodynamics, general expressions for maximum power and maximum efficiency are obtained in Proesmans2016 . The upper bound of efficiency at arbitrary power is discussed in Holubec2016 ; Ryabov2016 ; Pietzonka2018 . For more studies which try to optimize the performance of thermodynamic machines, see Benenti2011 ; Izumida2011Efficiency ; Golubeva2012Efficiency ; Allahverdyan2013 ; Holubec2015 ; Calvo2015 ; Reyes2017 ; Polettini2017 . Or see Sekimoto2010 ; Seifert2012Stochastic ; Mart2016Colloidal ; Giuliano2017 for general reviews. Meanwhile, methods to operate heat engines infinitely close to the Carnot bound but with nonzero power are also suggested recently Benenti2011 ; Allahverdyan2013 ; Mart2016Brownian ; Lee2017Carnot ; Polettini2017 ; Holubec2018 , though there are also studies which show that it is impossible Hondou2000Unattainability ; Verley2014The .
One of the main difficulties in the study of stochastic thermodynamic machines is that, except for few special cases, no expressions for work , power and efficiency can be obtained explicitly Schmiedl2007 ; Schmiedl2008Efficiency1 ; Holubec2014 ; Holubec2015 ; Zhang2019 . So, in references, they are usually optimized by choosing optimal work period to get maximum power, or just through extensive numerical calculations to get optimal values of thorough protocols Then2008 ; Horowitz2018 . In this study, using variational method, the external time-dependent potential, as well as the boundary distributions of isothermal steps in each work cycle, will be optimized generally to achieve the optimal performance of stochastic thermodynamic machines.
The same as in Schmiedl2008Efficiency1 ; Holubec2014 ; Zhang2019 , this study considers a thermodynamic machine with potential , which depends on spatial (state) variable and time variable . For overdamped cases, the time evaluation of probability to find the system at position at time is governed by the following Fokker-Planck equation
[TABLE]
with the flux of probability, the instantaneous velocity given by , and the friction coefficient, which satisfies Einstein relation . Here is the Boltzmann constant, is the absolute temperature, and is the free diffusion constant.
The thermodynamic machine discussed in this study is assumed to work cyclicly, including two isothermal processes and two adiabatic transitions. In each work cycle, it performs sequently through the following four subprocesses Schmiedl2008Efficiency1 ; Zhang2019 . (1) Isothermal process with high temperature during time interval . (2) Adiabatic transition (instantaneously) from high temperature to low temperature at time . (3) Isothermal process with low temperature during time interval . (4) Adiabatic transition from low temperature to high temperature at time . The same as in Schmiedl2008Efficiency1 ; Holubec2014 ; Zhang2019 , adiabatic transitions are idealized as sudden jumps of potential, and assumed to occur instantaneously without heat exchange. Therefore, probability and system entropy do not change during adiabatic transitions. For convenience, the work period is denoted as .
It is obvious that most of physical quantities of the thermodynamic machine, including output work in each work cycle, power , and energy efficiency , are functions of duration (or ), and period , or even functions of temperatures , friction coefficients , and diffusion constants , and so can be optimized according to them, as have been done previously Schmiedl2008Efficiency1 ; Esposito2010Efficiency ; Izumida2011Efficiency ; Holubec2016 ; Golubeva2012Efficiency ; Allahverdyan2013 ; Van2005Thermodynamic ; Cleuren2009Universality ; Van2012Efficiency ; Wang2013Efficiency ; Hooyberghs2013Efficiency . However, it is no doubt that , and are also functionals of external potential (or called protocols sometimes), probabilities and . One main contribution of this study is that the performance of thermodynamic machine is optimized according to potential , as well as to probabilities and .
2 Methods of optimization
As stated above, this study assumes that the work cycle begins from the isothermal process with high temperature . By definition, the heat uptake from the hot heat bath during one work cycle is
[TABLE]
In the following, we assume . For other cases, such as , or , results are similar, and corresponding examples will be presented at the end of this study. Here is a positive real number, which is used to describe the scale of the thermodynamic system, see Sec. 2.2 for some details.
From Eq. (1), and with the no flux boundary conditions at and , it can be verified that potential can be formally reformulated as
[TABLE]
for and , with an arbitrary function of time . Substituting into Eq. (2), and through routine calculations, one can get
[TABLE]
where the equality is from the definition of instantaneous velocity , and the equality is because
[TABLE]
The entropy production with system entropy , and the irreversible work is defined by
[TABLE]
In Eq. (5), is the distribution function of the system at time , which satisfies (see Eq. (1) and the no flux boundary condition at )
[TABLE]
Eq. (6) implies that, along its characteristic curves, which are given by , distribution function satisfies
[TABLE]
Which shows that is constant along any characteristic curve of Eq. (6). For simplicity, this study assumes that, for any , is reversible as a function of , or equivalently, probability at any given time is positive at almost everywhere. Or, in other words, there is no interval with , such that in it, i.e., at the most for a zero measure set Loeb2016 . But all results obtained in this study hold true for any general cases (note that, more complicated details need to be added to discuss the general cases).
Similarly, heat exchange between thermodynamic machine and cold heat bath in one work cycle can be written as , with irreversible work defined similarly. By definition, the output work during one work cycle is
[TABLE]
Two strategies to improve work will be presented in this study. (I) For given probabilities and , i.e. for given , find the minimum value of irreversible work by optimization according to potential . (II) For given probability , find the maximum value of by optimization according to probability .
2.1 Optimization of distribution function
From Eq. (5), it can be shown that the variation of irreversible work according to the distribution function is as follows,
[TABLE]
where is an arbitrary variation of , which satisfies
[TABLE]
The reason of these zero boundary conditions is that satisfies , and .
Since at the minimum of (as a functional of distribution function ) for any variation , Eq. (8) indicates that, for the optimal distribution function ,
[TABLE]
Due to Eq. (6), . Therefore,
[TABLE]
Eq. (9) shows that, for the optimal distribution function , the slope of its characteristic curves satisfies
[TABLE]
It means that, the slope of any characteristic curves of the optimal distribution function is constant. Therefore, characteristic curves of , which are given by , are all straight lines.
For convenience, we denote , , and define a map which satisfies , or equivalently
[TABLE]
Note, for extreme cases that at least one of functions and is irreversible, map can also be well defined but with some complicated and tedious details, so will not be presented here. Obviously, map satisfies , and it increases monotonically with , since .
The above analysis shows that, for optimal distribution function , the characteristic curve of Eq. (6), which is initiated from any at time , satisfies
[TABLE]
Which means that the characteristic curve of Eq. (6), which begins from any at time , is a straight line connecting point and point in the 2-dimensional - plane. For any given at time , the position obtained by Eq. (10) satisfies , see Eq. (6). For convenience, we denote the inverse function of by . Then, for any given position and time , satisfies . Finally, Eq. (10) is actually obtained by definition from the following ordinary differential equation, , with . While for the optimal distribution function .
Using function and its inverse function , probability corresponding to the optimal distribution function can be obtained by . From Eq. (5), it can be shown that, the irreversible work corresponding to the optimal distribution function is
[TABLE]
Here, the third equality is obtained by change of variable .
Due to , or . So, it can be shown that
[TABLE]
Here, the last equality is obtained by change of variable . Meanwhile, it can be verified that
[TABLE]
So, by definition,
[TABLE]
From Eqs. (4,11,13), with the optimal distribution function ,
[TABLE]
[TABLE]
where . Note, .
Generally, in one work cycle, the total entropy production of the system can be obtained by Brandner2015 ; Proesmans2015 . Within the framework of Fokker-Planck equation, can be reformulated as . For the particular case with optimal distribution function , it can be shown that the lower bound of total entropy production is
[TABLE]
One can easily show that, with optimal distribution function ,
[TABLE]
where . Eq. (16) looks the same as the one assumed in discussions of the low dissipation cases Esposito2010Efficiency ; Izumida2011Efficiency ; Calvo2015 ; Gonzalezayala2016 ; Reyes2017 . However, Eq. (16) holds for any durations and , and may be very large, though they are corresponding to the least values of entropy production. While in discussions of low dissipation cases, expressions like Eq. (16) are only the first order of their real values, and are reasonable for larger durations and .
2.2 Properties of thermodynamic machine with optimal distribution function
To discuss the influence of the scale of , normalized probabilities , normalized distribution functions , and the corresponding map are defined as follows,
[TABLE]
with . It can be easily verified that , , , , , and , .
[TABLE]
[TABLE]
Note that . Eq. (20) indicates that the value of work depends on parameters (or ), as well as functions and , i.e. .
From Eq. (20), the stalling time of the machine, defined by (see Schmiedl2008Efficiency1 ), is as follows,
[TABLE]
For given functions , , and period , the work reaches its maximum when . This can be easily obtained by minimizing under constraint . It can be shown that
[TABLE]
It can also be shown that power reaches its maximum when period (see Appendix). The corresponding work for period is
[TABLE]
and the heat absorbed from hot heat bath is
[TABLE]
So the efficiency at maximum power is
[TABLE]
As has been found previously, Schmiedl2008Efficiency1 ; Esposito2010Efficiency ; Izumida2011Efficiency ; Johal2017Heat ; Giuliano2017 ; Reyes2017 . It tends to lower bound if , and tends to upper bound if . For particular cases with , with which work is maximized ( and are minimized), . If , . It can be easily known that, for , the optimal probability is mirror symmetrical to time , i.e., for . For such cases, .
It can be verified that if and only if . This condition is equivalent to the one given in Schmiedl2008Efficiency1 , where the Curzon-Ahlborn efficiency is recovered if , with parameter in this study, see the above Eq. (25) and the Eq. (31) in Schmiedl2008Efficiency1 . For special cases with , inequality reduces to . While for cases with , it reduces to . So, with optimal distribution function and optimal ratio of duration , and if diffusion constant , which means , then the efficiency at maximum power is exactly the Curzon-Ahlborn efficiency .
With optimal distribution function , the efficiency for any cyclic period is, see Eqs. (18,20),
[TABLE]
where is the characteristic time of free diffusion, is a functional of and ,
[TABLE]
Eq. (26) shows that efficiency increases with both durations (fractions) and , while decreases with both characteristic times and . Meanwhile, increases with cyclic period but decreases with scale parameter . Moreover, it can be shown from Eq. (26) that
[TABLE]
where the equality holds if , or .
Denote as the maximum of power , then for any ,
[TABLE]
which equals to zero when or , and reaches it maximum 1 when , see Eq. (A2) in Appendix. Meanwhile, efficiency can be reformulated as (see Eq. (A3) in Appendix)
[TABLE]
It can be easily shown that , so efficiency increases monotonically from 0 to as the cyclic period increases from stalling time to infinity. For , i.e., when power reaches its maximum , efficiency get its value as given in Eq. (25).
From Eqs. (27, 28), the ratio can be reformulated as a function of efficiency ,
[TABLE]
which equals to zero if or , and reaches its maximum 1 when as given in Eq. (25). It can be shown that increases with for , while decreases with for .
It can also be shown that (see Appendix),
[TABLE]
Therefore, increases from 0 to 2 as period increases from *stalling time * to infinity, or equivalently as efficiency increases from 0 to . , i.e., , if and only if , or equivalently. As implied by Eq. (23), .
Finally, from Eqs. (27, 30), it can be obtained that
[TABLE]
So, ratio increases first and then decreases with ratio .
2.3 Optimal probability density
From Eq. (23), the variation of according to map is,
[TABLE]
where is a (small) arbitrary variation of function . Due to boundary conditions and , variation satisfies .
Eq. (32) indicates that reaches its maximum when , or equivalently when , see Eq. (17). Notice that, means and , while means and . In any way,
[TABLE]
2.4 Explanatory notes
As mentioned before, work is not only a function of period and duration (or ), but also a functional of the potential and probability (or equivalently). So there are many ways to do optimization. Optimization according to distribution function is actually equivalent to optimization according to potential , and optimization according to map is equivalent to optimization according to probability . In references, ratio is usually optimized to get the maximum of work , while period is usually optimized to get the maximum of power . However, in this study, for any given work period , the potential is optimized to get the minimum of entropy production , or equivalently to get the maximum of work .
From Eq. (17), . Therefore, increases (logarithmically) with scale parameter if the initial entropy is fixed, see Eq. (33). However, to keep power to be maximum, the optimal period should increase like , see Eq. (21). Therefore, the corresponding value of maximum power actually decreases with scale parameter .
For convenience, we denote the spatial coordinate of a point on the characteristic curve of Eq. (6), which begins from at time , by , or for simplicity. That is to say, with satisfies Eq. (10). Particularly, by definition . In the following, methods to get the optimal probability and optimal potential , which correspond to the optimal distribution function , will be presented for only . For , methods are almost the same.
By definition, along any characteristic curves of Eq. (6). This means that distribution function satisfies for any . Therefore, . By taking derivatives on both side of , one gets . So
[TABLE]
Particularly, for , .
By definition , so , with an arbitrary function of time . For cases with optimal distribution function , the instantaneous velocity . Therefore, the optimal potential is
[TABLE]
Here the last equality is obtained from Eqs. (10, 34) and .
Since thermodynamic machines work cyclically, function should satisfies for any time . One can easily find that irreversible works are independent of the choice of function , so in all the following illustrative examples, we always set simply.
3 Illustrative examples with optimal control
To illustrate the results obtained in this study, we present several examples of thermodynamic machine with the optimal distribution function , or equivalently with the optimal potential .
3.1 Examples for with a finite number
(I) Let
[TABLE]
one can show that
[TABLE]
From Eq. (10) and the definition of , we have
[TABLE]
Therefore,
[TABLE]
and potential can be obtained by Eq. (35). For this particular case, the work output in one cyclic period (see Eq. (15)) is
[TABLE]
and the efficiency (see Eq. (26)) is
[TABLE]
with
[TABLE]
(II) Next, for
[TABLE]
it can be shown that , , and
[TABLE]
So
[TABLE]
Similarly, can be obtained by Eq. (35). For this particular case, the work (see Eq. (15)) is
[TABLE]
and the efficiency (see Eq. (26)) is
[TABLE]
with
[TABLE]
3.2 An example for
We now give an example for . Let , and
[TABLE]
i.e., both and are Weibull distributions. Then for the optimal case, one can get , and
[TABLE]
Consequently,
[TABLE]
with . From Eq. (23), the work at maximum power is
[TABLE]
which decreases with while increases with .
For , tends to Dirac delta function . While for , tends to a constant. This is consistent with previous discussion for finite scale parameter cases, in which maximum of is reached when is constant. As before, the optimal potential can be obtained by Eq. (35). The work for this particular case is (see Eq. (15))
[TABLE]
with the Gamma function defined as
[TABLE]
For given value of parameter in the expression of initial probability , there exists an optimal parameter
[TABLE]
with which work reaches its maximum. The efficiency for these particular cases (see Eq. (26) for definition) is also but with
[TABLE]
One can easily show that increases with for , and when , but with which the work .
3.3 An example for
Finally, we give an example in which state (spatial) variable is defined in the whole real domain, i.e., , ). As before, for convenience, let , but assume the following initial and final probabilities
[TABLE]
Usually, distributions with probability like these are called Laplace distributions. For this example, , and
[TABLE]
So the optimal distribution function
[TABLE]
with
[TABLE]
Here is the sign function. It can be shown that the optimal probability is
[TABLE]
For these particular cases, the output work is
[TABLE]
Compared with the results for Weibull distribution with parameter (Note, ), an extra term is added to the irreversible work , see Eqs. (11, 12, 36, 37). This is due to the spatial translocation of the system from one mean position to another mean position . If , then properties of work and efficiency are the same as the ones discussed in the above subsection for Weibull distribution with parameter .
4 Conclusions and Remarks
In this study, optimization methods for stochastic thermodynamic machines are discussed, and general properties of optimal machines are analyzed. To illustrate the results, several explicit examples with optimal protocol are also presented. The main idea of optimization is to find the optimal external potential, with which the heat dissipation into environment reaches its minimum. It should be noted that, within the theoretical framework of Langevin stochastic processes and using the method of Monge-Kantorovich optimal mass transport, similar results about the minimum of heat dissipation have been obtained generally in Aurell2011 ; Aurell2012 ; Bo2013 . But differently, in this study, the minimum of heat dissipation is obtained by the method of characteristics and variation, and within the framework of Fokker-Planck equation, which makes it easier to construct illustrating examples and to know more clearly what protocol (external potential) we should choose to reach the optimal one.
It seems that several results obtained in this study are similar as the ones have been presented in previous references, including those for efficiency and work at maximum power. But with the optimal distribution function , and consequently with the optimal potential , the period required to get the maximum power is less than those given in previous references. In fact, this study indicates that the period , see Eq. (21), required to get the maximum power is the least one. In other words, the thermodynamic machines designed in this study, with optimal potential , can output work more fast and have the smallest stalling time than any other ones. One of the main contributions of this study is that, the least value of irreversible work , or equivalently the least value of entropy production , dissipated in one work cycle is obtained, and the optimal protocol which can lead to this lower bound is also presented.
Appendix: Details of analysis on maximum power
For convenience, in this appendix, the following notations will be employed,
[TABLE]
With these notations, and from Eqs. (14, 15, 21), one can easily show that stalling time , optimal period , work , heat , power , and the maximum power . Therefore,
[TABLE]
which equals to zero when or , and reaches its maximum 1 for . Meanwhile, with notations in Eq. (A1), the efficiency can be written as , which gives that
[TABLE]
Since , it can be shown that efficiency increases monotonically from 0 to as the period increases from to infinity. For , i.e., when power reaches its maximum , efficiency get its value .
From Eqs. (A2, A3), the ratio can be rewritten as follows
[TABLE]
which equals to zero if or , and reaches its maximum 1 if .
From Eqs. (23, A1), the work at maximal power is . So . Obviously, increases monotonically from 0 to 2 as period increases from *stalling time * to infinity, or equivalently as efficiency increases from 0 to Carnot efficiency .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) S. Carnot. Refléxions sur la Puissance Motrice du Feu, etsur les Machines Propres à Développer cette Puissance . 1824.
- 2(2) R. Clausius. On a modified form of the second fundamental theorem in the mechanical theory of heat. Phil. Mag. Ser. , 4(12):81–98, 1856.
- 3(3) J. Yvon. Proceedings of the International Conference on Peaceful Uses of Atomic Energy . United Nations, Geneva, 1955, p. 387.
- 4(4) P. Chambadal. Les Centrales Nucléaries. Armand Colin , 11:41–58, 1957.
- 5(5) I. I. Novikov. The efficiency of atomic power stations (a review). J. Nucl. Energy , 7:125–128, 1958.
- 6(6) F. L. Curzon and B. Ahlborn. Effciency of a carnot engine at maximum power output. Phil. Mag. Ser. , 43:22–24, 1975.
- 7(7) C. Van den Broeck. Thermodynamic efficiency at maximum power. Phys. Rev. Lett. , 95(19):190602, 2005.
- 8(8) T. Schmiedl and U. Seifert. Efficiency at maximum power: An analytically solvable model for stochastic heat engines. Europhys. Lett. , 81(2):20003, 2008.
