Fluctuation Theorems Containing Information for Autonomous Maxwell's Demon-assisted Machines
Xuehao Ding, H. T. Quan

TL;DR
This paper develops new fluctuation theorems that incorporate information for autonomous Maxwell's demon-assisted machines, deriving a Landauer's principle-based second law and analyzing a novel information device.
Contribution
It introduces two types of fluctuation theorems with information for autonomous Maxwell's demon systems, extending the theoretical framework of thermodynamics with information.
Findings
Derived fluctuation theorems containing information.
Formulated Landauer's principle for the entire process.
Analyzed a new information device using the developed theorems.
Abstract
In this article, we introduce two kinds of Fluctuation Theorems (FT) containing information for autonomous Maxwell's demon-assisted machines. Using Jensen's Inequality, we obtain Landauer's principle formulation of the second law for the whole process of the machine. Finally we make use of our results to analyze a new information device. \pacs{05.70.Ln, 05.40.-a, 89.70.Cf}
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Advanced Thermodynamics and Statistical Mechanics · Quantum Computing Algorithms and Architecture
Fluctuation Theorems Containing Information for Autonomous Maxwell’s Demon-assisted Machines
Xuehao Ding
School of Physics, Peking University, Beijing 100871, China
H. T. Quan
School of Physics, Peking University, Beijing 100871, China
Collaborative Innovation Center of Quantum Matter, Beijing 100871, China
Abstract
In this article, we introduce two kinds of Fluctuation Theorems (FT) containing information for autonomous Maxwell’s demon-assisted machines. Using Jensen’s Inequality, we obtain Landauer’s principle formulation of the second law for the whole process of the machine. Finally we make use of our results to analyze a new information device.
pacs:
05.70.Ln, 05.40.-a, 89.70.Cf
I INTRODUCTION
In 1871 Maxwell (1871), James C. Maxwell conceiced an intelligent creature, now known as Maxwell’s demon Leff et al. (1992) to challenge the second law of thermodynamics. In 1982, based on Landauer’s princeple Landauer (1961), C. H. Bennett gave an explanation Bennett (1982) of Maxwell’s demon, involving the concept of the information entropy (or Shannon entropy), and finally solved the lone-time dispute. The study of Maxwell’s demon has triggered a lot of interests in the exploration of physics of information Quan et al. (2006); Horowitz and Vaikuntanathan (2010); Vaikuntanathan and Jarzynski (2011); Abreu and Seifert (2012); Deffner and Jarzynski (2013); Barato and Seifert (2014); Lu et al. (2014); Horowitz and Esposito (2014); Parrondo et al. (2015); Pekola (2015); Kutvonen et al. (2016); Rana and Jayannavar (2016); Merhav (2017); Boyd et al. (2017); Strasberg et al. (2017).
Following the understanding of C. H. Bennett, we can simplify the autonomous Maxwell’s demon-assisted engine Mandal and Jarzynski (2012) to a system coupled to a heat bath and a tape (see Fig. 1). The tape acts as the Maxwell’s demon’s memory. When the machine converts heat into work, information will be written to the tape, or conversely when work is cost and converted into heat, the information on the tape will be (partially) erased. In the former case, the sum of the entropy production of the system and the heat bath can be negative, which is compensated by the increase of the information entropy of the tape. Along this line, some exactly solvable models of Maxwell’s demon-assisted machine, such as the information refrigerator Mandal et al. (2013), the information pump Cao et al. (2015) are proposed. Under some conditions, they can also work as information erasers. The evolution of the composite system (the system plus the bit flow) is governed by a Master equation, and is periodically interrupted by new coming bits. It has been demonstrated that the second law of thermodynamics is valid in these autonomous information machines Mandal and Jarzynski (2012); Mandal et al. (2013); Cao et al. (2015) as long as the information entropy of the bits is put on the same footing as thermodynamic entropy. However, a detailed understanding of these fluctuating quantities, such as information, heat and work in these machines is still lacking. It is thus desirable to explore stronger relations among these quantities, such as Fluctuation Theorems in these information machines.
In 2005, Udo Seifert introduced the entropy for the microstate Seifert (2005a) in stochastic thermodynamics, that is , where denotes the microstate of the system, and denotes the probability distribution of the microstate . Using this definition, Seifert derived an Integral Fluctuation Theorem (IFT) for stochastic processes,
[TABLE]
where is the fluctuating total entropy change, and is the fluctuating entropy change of the heat bath. Using Jensen’s Inequality, one can immediately obtain the second law of thermodynamics,
[TABLE]
where is the total entropy change, and are the ensemble averages of the fluctuating entropy changes.
However, the IFT Eq.(1) can not be directly applied to the information machine. Because the IFT Eq.(1) is valid when the evolution of the system is governed by a Master equation for a single period (see, e.g., Refs. Jarzynski and Mazonka (1999); Min et al. (2005); Seifert (2005b); Lacoste et al. (2008)), but the evolution of the information machine consists of many periods which are interrupted by new coming bits. The evolution of the composite system cannot be reduced to a process governed by a single Master equation. For this reason, FTs describing the information machine are still lacking. In this article, we derive the FTs containing information for a class of information machines. Using the IFTs, we can derive the second law for the whole process. Our FTs containing information content are different from the FT with measurement and feedback Sagawa and Ueda (2010, 2012) in three aspects. First, we use the information entropy to characterize the information content, while in the FT with measurement and feedback, the information content is characterized by the mutual entropy between the system and the demon’s memory. Second, similar to Seifert’s FT Eq.(1), the FT with measurement and feedback Sagawa and Ueda (2010, 2012) is also valid for a single process governed by a Master equation but does not apply to information machines because the evolution consists of many periods. Third, the information machine is autonomous. There is no measurements and feedback controls. Furthermore, we make use of our results to analyze a new information device Mcgrath et al. (2016).
This paper is organized as follows: In Sec. II we derive the IFTs, the Detailed Fluctuation Theorems (DFT) and the second law for this class of models. In Sec. III We use our methods to study the IFT for a new model proposed recently by Thomas McGrath, et al. In Sec. IV we give some discussions and summarize our paper.
II FLUCTUATION THEOREMS FOR INFORMATION MACHINES
Let us recall the models of information machine proposed in Refs. Mandal and Jarzynski (2012); Mandal et al. (2013); Cao et al. (2015). In these three models, the information engine, the information pump and the information refrigerator, there is a system and a tape. Every bit of the tape interacts with the system for a fixed time interval t consecutively. There is only one bit interacting with the system at a time.
The information engine and the information pump are very similar. The system itself has three states. And when the system is coupled to a bit, the whole system can be viewed as a composite system of six states (see Fig. 2). The allowed transitions between these states are illustrated with solid lines in Fig. 2. When a transition between A (ES) and C (EP) occurs, an accompanied flip of the bit will occur. Meanwhile, for the information engine energy will flow into (out of) the work reservoir, and for the information pump a chemical reaction will occur.
The information refrigerator is slightly different from the information engine and the information pump. The system itself has two states with different energies. When coupled to a bit, the system can be viewed as a composite system of four states (see Fig. 3). The transition between and is due to its coupling to a hot heat bath, while the other transitions are due to the coupling to a cold heat bath. When a transition between and occurs, the bit will flip.
We can summarize this class of models of information machine as follows: First, the system and the incoming bit form a composite system. Second, coupled to one or more heat baths, the whole system evolves under a Master equation for a time interval t. Third, at the end of the period, the evolution ends. Fourth, a new period starts, the initial state of the system is the final state of the system of the last period, but the initial state of the bit probably changes because a new incoming bit replaces the old one. The machine repeats these steps.
During an interacting interval, the probability distribution of the states of the composite system is governed by the Master equation,
[TABLE]
where is the transition rate matrix between the states of the composite system Mandal and Jarzynski (2012); Mandal et al. (2013); Cao et al. (2015).
For the convenience of the following analysis, we introduce the following notations. denotes the entropy of the system, denotes the information entropy of the bit, denotes the mutual entropy between the system and the bit, denotes the entropy of the heat bath, () denotes the probability of state 0 (1) of the incoming bit.
II.1 DERIVATION AND DISCUSSIONS OF THE IFTS
For the definition of the information entropy, we adapt base-e logarithm instead of base-2 logarithm. Then the information entropy has the same form as the thermodynamic entropy (we set Boltzmann constant =1 in this article),
[TABLE]
By employing the definition of the entropy for the microstate, we can define the fluctuating information entropy of a bit and the fluctuating mutual entropy between the system and the bit,
[TABLE]
[TABLE]
The ensemble averages of these fluctuating quantities correspond to the information entropy and the mutual entropy,
[TABLE]
[TABLE]
Now we consider a trajectory and its time-reversed trajectory. We set the initial distribution of the backward process as the final distribution of the forward process. In Fig. 4, every solid arrow represents the evolution of the whole system during every interacting time interval, and every dashed arrow represents a process of replacing the old bit with the incoming new bit. For the forward process, is the transition rate matrix between the states (see example Eq.(18)). We use and to denote the initial and the final states of the system and bit in the i-th period. At the end of every period, the state of the system is recycled as the initial state of the system for the next period . The bit is replaced by a new bit, and the initial state of the new bit is randomly sampled from the given distribution of the incoming bit flow , where denotes the initial distribution of the bit in the i-th period in the forward process. If we use , , , ….., to denote a forward trajectory. Its time-reversed trajectory in the backward process is , , , ….., . For the backward process, the transition rate matrix is denoted as , which is identical to , i.e., in our paper. Now we can discuss two different choices of the initial distribution of every bit in the backward process. They correspond to and respectively, where denotes the initial distribution of the bit in the i-th period conditioned on the final state of the system in the (i+1)-th period in the backward process, denotes the final distribution of the bit in the i-th period in the forward process, denotes the final distribution of the bit in the i-th period conditioned on the final state of the system in the i-th period in the forward process. Please note that the initial distribution of every bit is the same in the forward process, but it is not necessarily the same in the backward process. Corresponding to these two scenarios we can derive two different Fluctuation Theorems.
II.1.1 CASE A: WITH THE DECORRELATION PROCESS AT THE END OF EVERY PERIOD
We choose the initial distribution of bits in the backward process as the distribution of the outgoing bits in the forward process . The physical meaning of this choice of the backward process is the following, after finishing the forward process from the 0th to the nth bit, reverse the tape and let it go through the machine from the nth to the 0th bit, and what you get is just the backward process.
From the setting of the initial distribution of every bit in the backward process, one can obtain the ratio of probabilities between a pair of forward and backward trajectories,
[TABLE]
where and represent the probability distributions of the backward and the forward trajectories of the composite system, and represent the probability distributions of the states of the composite system at the beginning and the end of the i-th period in the forward process, and represent the marginal probability distributions of the states of the bit at the beginning and the end of the i-th period in the forward process, and represent the probability distributions of the trajectories in the i-th period conditioned on the initial state of this period in the forward and the backward processes. From Eq.(9), we can obtain
[TABLE]
where and represent the fluctuating entropy change of the system and the fluctuating information change of the bit in the i-th period. The detailed derivation of this equation can be found in Appendix A. This is one of the main results in our paper. It is an IFT for a class of information machines. We would like to emphasize that this IFT is very general. It is valid for an arbitrary initial distribution of the system and an arbitrary initial distribution of the bit flow. Also, it is valid for an arbitrary number of periods. Using Jensen’s Inequality we can immediately obtain the second law,
[TABLE]
where and represent the ensemble averaged entropy change of the system and the information change of the bit in the i-th period.
Please notice that there is a difference between the LHS of Eq.(11) and the total entropy production (see Eq.(16)), which is the sum of the mutual entropies between the system and the bit at the end of each period. One can understand the physical process underlying Eq.(11) as follows: A decorrelation process between the system and the bit occurs before it interacts with a new bit. That is, there is an entropy increment following the dynamic evolution of the composite system, which is denoted by the dashed arrow in Fig. 4. This imaginary decorrelation process is reasonable. We remember that before the system interacts with the first bit, the system and the tape are uncorrelated, so before it interacts with the latter bits, it could be uncorrelated with the tape either. We would like to emphasize that, the imaginary decorrelation processes do not influence the work extraction in the information engine, but they influence the total entropy production.
What’s more, if we consider the periodic steady states, the term vanishes, and we obtain
[TABLE]
where is used. This is Laudauer’s principle Landauer (1961).
We can also consider a DFT. If the initial distribution of the backward process is equal to the final distribution of the forward process, by summing up the probabilities of those trajectories with the same total entropy production, we can rewrite Eq.(9) as
[TABLE]
where , and and represent the probability distributions of the total entropy production in the forward and the backward processes respectively. Eq.(13) is the DFT in our models.
We would like to emphasize that the DFT in steady states proposed by Seifert (see Eq.(21) of Ref. Seifert (2005a)) does not exist in the periodic steady states of our current information machine. The reason is that in the periodic steady states, the distribution of the system does not change in each period, but the distribution of the bit changes.
II.1.2 CASE B: WITHOUT THE DECORRELATION PROCESS AT THE END OF EVERY PERIOD
In Case A, the total entropy change in the Fluctuation Theorems (Eq.(10) and Eq.(13)) does not contain the mutual entropy at the end of every period between the system and every bit. In this section, we would like to derive the IFT and the DFT related to the real total entropy production.
If in the forward process there is no decorrelation process at the end of every period, in the backward process, the probability distribution of the incoming bit is given by . Notice that now the transition probability of the backward process not only depends on the initial condition, as is the case in Case A, but also depends on the final state of the system in the i-th period in the forward process . Because we have set , the backward process can be constructed in the following way, at the end of every period, the probability distribution of the next bit will depend on the state of the system which is . This choice implies that for different state sequence of the system , the choices of the initial distribution of the bits in the backward process are different. However, in Case A, the choice of the initial distribution of the bits does not depend on the state sequence. For this reason, we should subgroup the trajectories in the backward process according to the the state sequence . The physical meaning of this choice is as follows, suppose that we have an ensemble of tapes already gone through the machine, then we pick up the ones that have the same sequence to form a new ensemble, and let them go through the machine reversely. For different subgroups , are different.
Now, we can similarly derive an IFT as follows,
[TABLE]
where and represent the marginal probability distributions of the states of the system at the beginning and the end of the i-th period in the forward process, represents the fluctuating mutual entropy change between the system and the bit in the i-th period. And the detailed balance condition Eq.(22) is used. After taking the average, we obtain the IFT,
[TABLE]
from which we can also derive the second law,
[TABLE]
where represents the mutual entropy change between the system and the bit in the i-th period. This relation also appears in the supplemental material of Ref. Mandal et al. (2013). Notice that , so Eq.(16) is tighter than Eq.(11). Similar to Case A, we also have the DFT,
[TABLE]
where . Similar to Case A, the relations Eq.(15), Eq.(16), Eq.(17) are valid for arbitrary initial states of the system and the bit. If the system has reached the periodic steady states, one can find a tighter relation than the Landauer’s principle Eq.(12) for this model.
II.2 NUMERICAL RESULTS OF THE TOTAL ENTROPY PRODUCTION
In order to show the results above visually, we calculate the probability distributions of the total entropy production in the two IFTs (Eq.(10), Eq.(15)) numerically. The model we choose is the information engine Mandal and Jarzynski (2012), whose transition rate matrix is given by
[TABLE]
where , where denotes the marginal distribution of the bit in equilibrium, denotes the energy difference between the states and the states, T denotes the temperature of the heat bath. In our calculation, we choose and we consider a process that consists of three periods in the periodic steady states. The results are illustrated in Fig. 5 and Fig. 6.
From these figures, we can see that when and =0.25, the distributions can reach a delta distribution centered at (see Fig. 5, Fig. 5, Fig. 6 and Fig. 6).
This result can be understood as follows. If the initial distribution of the states of the bits is equal to the equilibrium distribution, the distribution of the six states will not change under the evolution governed by the Master equation. So that the system and the bit will not be correlated . What’s more, the probability distribution of the three states will be just equal to each other. cancels out with . So the changes of the fluctuating total entropy will always be equal to 0 in both cases.
The length of the interacting interval also influences the distributions of the fluctuating total entropy production in the periodic steady states. The shorter the interacting interval of every period t is, the closer are the distributions of the fluctuating entropy production to the delta distribution. This result can be understood as follows, the shorter is the time interval of every period, the less will the heat bath influence the system and the bit.
III APPLICATION TO A NEW MODEL
In this section, we will exploit our previous methods to study a new information-device model proposed by Thomas McGrath, et al. recently Mcgrath et al. (2016). Their model is about chemical reactions, polymers and enzymes. But the mathematical essence of their device is quite similar to the model which we discuss in Sec. II. The main difference is that it has two tapes instead of one.
As shown in Fig. 7, tape Y is just the same as the tape in the previous models, while tape Z is an extra one. In the process, the bit of tape Z does not change its value, and its function is to control the transition paths. When the value of the bit of tape Z passing through the system is 1, the allowed transitions will be just the same as before shown in Fig. 2(a). On the contrary, when the value is 0, the transition between C0 and A1 will be forbidden.
This model is interesting when there is correlation between tape Y and tape Z. For example, when these two tapes are fully correlated and the values of the bits are complementary. That is, if the bit in tape Y is 1 (0), the accompanied bit in tape Z must be 0 (1). In this situation, one can see that in every single period the device can only do positive work or zero work. It can never do negative work, because if the initial value of tape Y is 1, the transition between C0 and A1 will be forbidden so that the system cannot go down to the b=0 states to do negative work. Therefore, the total work increases at the price of (partially) destroying the correlation between the two tapes.
Now, we can study the Fluctuation Theorem for this model. First, we can view these two tapes as one tape. This tape consists of many pairs of bits. Every pair of bits has 4 possible values: 00, 01, 10 and 11. With this in mind, the analysis and the results of the Fluctuation Theorem for this model are the same as those in Sec. II. So Eq.(15), Eq.(16) and Eq.(17) also hold true for this model.
Furthermore, we can decompose the total entropy production of the tape in this way. The fluctuating and the ensemble averaged information entropy of every pair of bits can be expressed as and , where , and denote the fluctuating entropy changes of tape Y, tape Z, and the fluctuating mutual entropy change between them, , and denote the entropy productions of tape Y, the tape Z, and the mutual entropy change between them. Because the values of the bits in tape Z do not change, . Then Eq.(15) and Eq.(16) become
[TABLE]
[TABLE]
where denotes the fluctuating mutual entropy change between the system and the composite tape Y and Z, and denotes the mutual entropy change between the system and the composite tape Y and Z.
Now, we can see that the device can do work by exploiting not only the information entropy, but also the mutual entropy between the two tapes. We would like to emphasize that in many previous studies of Fluctuation Theorems containing information, the information content is about the mutual entropy only Sagawa and Ueda (2010, 2012). But in our current study, the information content includes both the information entropy and the mutual entropy.
Actually, we can view tape Z as a measurement-feedback device, measuring the initial state of tape Y in every period, and then doing a feedback to control the transition between C0 and A1. If the tapes are fully correlated (so the mutual entropy is just equal to the entropy of one tape), the corresponding measurement will be viewed as a perfect measurement. Otherwise the measurement will be viewed as imperfect. Therefore, we can view this device as a combination of a low-entropy-tape-fueled device Mandal and Jarzynski (2012) and a measurement-feedback device Sagawa and Ueda (2010, 2012).
IV CONCLUSION
In summary, we have derived the FTs containing information and the second law for a class of information machines which can be viewed as autonomous Maxwell’s demon-assisted machines. Because the mutiple-bits process in our information machine cannot be described by a single Master equation, the FTs can not be regarded as special cases of the standard FT Seifert (2005a). From our IFT, we have straightforwardly obtained the Landauer’s principle Eq.(12). Finally we studied an application of our results to a new information device.
In the future it might be interesting to explore whether the results also hold true for the quantum version. Because in quantum mechanics, the entropy of entanglement is very similar to the mutual entropy in classical cases.
Acknowledgements.
H. T. Quan gratefully acknowledges support from the National Science Foundation of China under grants 11375012, 11534002, and The Recruitment Program of Global Youth Experts of China.
Appendix A Derivation of Eq.(10)
Because and , Eq.(9) can be written as
[TABLE]
where we have used the detailed balance condition
[TABLE]
where denotes the fluctuating entropy change of the heat bath in the i-th period. Then, we can calculate the summation of both sides of Eq.(21),
[TABLE]
which is Eq.(10).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Maxwell (1871) J. C. Maxwell, Theory of Heat. (Appleton, London, 1871).
- 2Leff et al. (1992) H. S. Leff, A. F. Rex, and D. L. Hogenboom, American Journal of Physics 60 , 282 (1992).
- 3Landauer (1961) R. W. Landauer, Ibm Journal of Research and Development 5 , 183 (1961).
- 4Bennett (1982) C. H. Bennett, International Journal of Theoretical Physics 21 , 905 (1982).
- 5Quan et al. (2006) H. T. Quan, Y. D. Wang, Y. Liu, C. P. Sun, and F. Nori, Physical Review Letters 97 , 180402 (2006).
- 6Horowitz and Vaikuntanathan (2010) J. M. Horowitz and S. Vaikuntanathan, Physical Review E 82 , 061120 (2010).
- 7Vaikuntanathan and Jarzynski (2011) S. Vaikuntanathan and C. Jarzynski, Physical Review E 83 , 061120 (2011).
- 8Abreu and Seifert (2012) D. Abreu and U. Seifert, Physical Review Letters 108 , 030601 (2012).
