Solution to the key problem of statistical physics -- calculations of partition function of many-body systems
Bo-Yuan Ning, Le-Cheng Gong, Tsu-Chien Weng, Xi-Jing Ning

TL;DR
This paper introduces a novel, faster method for calculating the partition function of many-body systems, enabling accurate predictions of thermal properties in complex molecules and condensed matter, with significant improvements over existing algorithms.
Contribution
The authors present a new computational approach that significantly accelerates partition function calculations, allowing for precise thermal property predictions in large and complex systems.
Findings
Method is at least four times faster than existing algorithms.
Achieves higher precision in thermodynamic property calculations.
Successfully reproduces the isochoric equation of state for solid argon.
Abstract
The key problem of statistical physics standing over one hundred years is how to exactly calculate the partition function (or free energy) of many-body interaction systems, which severely hinders application of the theory for realistic systems. Here we present a novel approach that works at least four orders faster than state-of-the-art algorithms to the problem and can be applied to predict thermal properties of large molecules or macroscopic condensed matters via \emph{ab initio} calculations.The method was demonstrated by C molecules, solid and liquid copper (up to GPa), solid argon, graphene and silicene on substrate, and the derived internal energy or pressure is in a good agreement with the results of vast molecular dynamics simulations in a temperature range up to K, achieving a precision at least one order higher than previous methods. And, for the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · History and advancements in chemistry
Solution to the key problem of statistical physics
– calculations of partition function of many-body systems
Bo-Yuan Ning
Center for High Pressure Science Technology Advanced Research, Shanghai, 202103, China
Le-Cheng Gong
Institute of Modern Physics, Fudan University, Shanghai, 200433, China
Applied Ion Beam Physics Laboratory, Fudan University, Shanghai, 200433, China
Tsu-Chien Weng
Center for High Pressure Science Technology Advanced Research, Shanghai, 202103, China
Xi-Jing Ning
Institute of Modern Physics, Fudan University, Shanghai, 200433, China
Applied Ion Beam Physics Laboratory, Fudan University, Shanghai, 200433, China
Abstract
The key problem of statistical physics standing over one hundred years is how to exactly calculate the partition function (or free energy) of many-body interaction systems, which severely hinders application of the theory for realistic systems. Here we present a novel approach that works at least four orders faster than state-of-the-art algorithms to the problem and can be applied to predict thermal properties of large molecules or macroscopic condensed matters via ab initio calculations. The method was demonstrated by C60 molecules, solid and liquid copper (up to GPa), solid argon, graphene and silicene on substrate, and the derived internal energy or pressure is in a good agreement with the results of vast molecular dynamics simulations in a temperature range up to K, achieving a precision at least one order higher than previous methods. And, for the first time, the realistic isochoric equation of state for solid argon was reproduced directly from the partition function.
I Introduction
By the end of th century, the born of statistical physics brought a promising prospect that products of complex chemical reactionsChipot and Pohorille (2007) and all the thermodynamic properties of macroscopic systems, such as equation of states (EOSs)D Stacey (2005); Ross and Young (1993) and phase transitionsMonson and Kofke (2000), can be thoroughly predicted without empirical data by calculating the free energy (FE) or partition function (PF). Nevertheless, it was soon realized that such an implementation is impossible for large molecules and condensed mattersUshcats et al. (2016) because PF is in principle determined by all the possible microstates over entire phase space and a -fold integral has to be solved to obtain the PF for a system consisting of particles, which goes far beyond the capability of modern supercomputers if the standard algorithm of numerical integral is appliedAllen and Tildesley (1987). For instance, a rough calculation of PF for a C60 molecule, involving a -fold integral, would cost at least years by using even the fastest high-performance computing facility with FP64 operations per second.
Alternative routes are resorted to sampling approaches, that is, either following the formalism of molecular dynamics (MD) simulation to trace the trajectories by integrating Newton’s equation of motionRapaort (2004), or, in a manner of Monte Carlo (MC) way to explore the microstates with substantial contributions to PF by stochastic walk in phase spaceLandau and Kurt (2005). Although time-average-based MD and ensemble-average-based MC algorithms have been developed over half a century and proved to be impressive to gain mechanical propertiesAllen and Frenkel (1989), when it comes to thermodynamic properties, the long-standing problem in face of both schemes remains that a balance has to be reluctantly sought for between limited-length sampling and the demanding requirement of ergodicity as the size of a system increasesBallard et al. (2015). Progress has been made in evaluations of the relative difference of PF (or free energy)Jarzynski (1997); Moustafa et al. (2015), and more attentions are being paid to the density of state to calculate absolute PFWang and Landau (2001); *DOS; *nxj; *multicanonical; *transitionmatrix. Nested sampling may be state-of-the-art techniqueSkilling (2004), which aims at uniformly sampling a series of fixed fractions partitioned by potential energies (PEs) in configurational spacePártay et al. (2010); *nestsample7 and has been applied in several systems described by empirical potentialsWilson et al. (2015); Pártay et al. (2014); Do et al. (2011); Coe et al. (2009); Baldock et al. (2016); Do and Wheatley (2013, 2016); Do et al. (2012); Burkoff et al. (2012); Baldock et al. (2017); Bolhuis and Csányi (2018). Despite of its improved - orders of computational efficiency, NS can hardly work with ab initio calculations because of too much computational cost, and even if pairwise interaction potentials are employed, the affordable systems are limited to a scale of hundreds particles.
In this work, instead of tackling PF in the fashion of sampling, we established a direct integral approach (DIA) to solve the -fold integral on the basis of reinterpretation of original sense of integral, which works at least four orders faster than NS algorithm. Validations of the method were made in the systems including C60 clusters, condensed copper, solid argon, graphene and silicene on substrate, where the internal energies or EOSs obtained by DIA were compared to MD simulations, and for solid argon, the computed isochoric EOSs under high pressure zone were directly compared to experiments. Excellent agreements confirmed the accuracy of DIA. Ultrahigh efficiency of DIA paves a way to calculate the FE of large molecules and macroscopic condensed matters with ab initio computations.
II Direct Integral Approach to Partition Function
The original sense of one-fold () integral is interpreted as the sum of infinite number of rectangles with area , and . Here, we interpret the integral from a different angle: The length of the element at is modulated by to be a new length element and . In other words, the D integral is mapped to a summation of length elements instead of area elements and equals to an effective length of (see left in Fig.1). Similarly, a two-fold integral equals to an effective area of because the area element is enlarged (or shrunk) by giving rise to an effective area element (see right in Fig.1). Followed by this notion, an -fold integral equals to an effective volume of .
When the integrand is in a form of with being positive definite within the integral domain and having minimum at the origin (), the effective length of is defined as
[TABLE]
and the effective volume approximates to a product (see proof in Supplementary Information), i.e.,
[TABLE]
Now consider the configurational integral (CI) in PF () for a continuum system consisting of particles at a given temperature (see details in Supplementary Information),
[TABLE]
where with the Boltzmann constant, the Cartesian coordinates of particles and the potential function. Although the integrand is of the same form as required by Eq.(2), it may not be positive definite or have no minimum at the origin (). Letting the set be the coordinates of particles in state of the lowest potential energy , we may introduce a function
[TABLE]
where . By inserting Eq.(4) into Eq.(3), we obtain
[TABLE]
Clearly, is positive definite within all the integral domain and has minimum at the origin (). According to Eq.(2), the integral in Eq.(5) equals to an effective -fold volume,
[TABLE]
where the effective length on the th degree of freedom is defined as
[TABLE]
In this way, the -fold integral is turned into one-fold integrals.
For some homogeneous systems with certain geometric symmetry, such as perfect one-component crystals, all the particles are equivalent and felt by one particle moving along may be the same as the one along (or ). In such a case, Eq.(6) turns into
[TABLE]
where is the effective length determined by Eq.(7). Otherwise, it is needed to calculate the effective length, , , by Eq.(7) of an arbitrary particle, and Eq.(6) turns into
[TABLE]
The procedure can be extended to systems composed of different particle species by calculating the effective length of each species respectively.
For inhomogeneous systems, such as defects or interfaces existed, particles may be grouped into sets numbered by with each containing equivalent particles, and CI becomes
[TABLE]
where denotes the effective volume of an arbitrary particle in the th set with Cartesian coordinates , , .
To implement DIA, the first step is to find the most stable structure (MSS) of the system for determining , which can be accomplished in principle by several well-developed methods, such as global optimizationsOganov and Glass (2006); *glopt2; *glopt3; *glopt4 or dynamic dampingZhang and Buch (1990); Ye et al. (2009). Actually, the MSS for crystals can be immediately obtained by placing the particles right at the lattice sites. Then, we move a particle along its one degree of freedom to obtain while its other degrees of freedom and all the other particles are kept fixed. Clearly, this is an easy task for ab initio calculations and therefore, the PF of a -particle system can be obtained even if we have no knowledge about the analytical expression of potential function , which is usually hard to be constructed precisely for realistic systems composed of more than one kind of particles.
For testing the DIA, we may perform ab initio calculations of on some realistic systems and compare the derived results with related experiments. However, results from first principle calculations may be strongly dependent on the specific algorithms, such as different types of exchange-correlation functions in density functional theory, and the experimental data are usually insufficient for extensive comparisons. In such cases, even if the results derived from the PF are in good agreement with the experimental data, it would be yet doubted of the accuracy of DIA. In order to have a stringent test, empirical potentials were used in the computations of PF, and the derived results were compared to the MD simulations using the same potentials to see if there exist some deficiency in DIA.
III Results and discussions
III.1 For isomers of C60 molecule
For a cluster consisting of carbon atoms, the most appealing impression may be the discovery of buckminsterfullerene (BF) with a football-cage structurekroto et al. (1985). Following investigations revealed that, besides the lowest potential-energy BF, there exists a bunch of C60 isomers, such as those introduced by the Stone-Wales (SW) rearrangementHeggie et al. (2016). For a long time, researchesZhao et al. (2003); Bettinger et al. (2003); Li and Ning (2004) have been attempting to answer the very questions whether the isomers are able to survive in realistic systems at finite temperatures and what the probability relative to BF is. The most reasonable argument should be the ratio of the PF of isomers () to that of BF (), but to our best knowledge, no such theoretical works have been tried out. Among all the isomers, the stack- SW (SW) isomer (see bottom in Fig.2(a)) has the lowest PE, which is eV higher than that of BF when using many-body Brenner potentialLi and Ning (2004). Here DIA was applied to calculate to determine the relative surviving probability of SW isomer in a temperature range from K to K.
To implement DIA, we firstly determined the MSS of the BF and the SW isomer by the Polak-Ribiere conjugate gradient algorithm. For the MSS of BF as shown in the top panel of Fig.2(a), each atom is shared by two hexagons and one pentagon, indicating that all the atoms are geometrically equivalent and thus only one atom is needed for the calculation of . For an arbitrary atom, the felt PEs along its three Cartesian coordinates are obviously not the same, so , and has to be calculated respectively and Eq.(9) was applied. The selected atom was moved step by step with an interval of Å along its Cartesian -axis (or , -axis, 0.5 Å in positive and negative direction) to obtain the corresponding potential-energy curves, (or , ), during which, the other two coordinates and all the other atoms were kept fixed (see details in Supplementary Information). Fig.2(b1) shows the internal energy of BF derived from the PF, , where the atom labeled No.1 in the top panel of Fig.2(b1) was selected to calculate . As a comparison, MD simulations were performed to calculate the internal energy and the Nose-Hoover algorithm Evans and Holian (1985) for canonical ensemble was employed with a time step fs. The systems were allowed to relax 20 ps initially and continued to run for another 50 ps, during which averages of were recorded in every 10 fs. All the computations concerning the C60 molecules were carried out in the Large-scale Atomic/Molecular Massively Parallel Simulator software package Plimpton (1995), and the many-body Brenner potentialBrenner et al. (2002) was selected to characterize the interactions between carbon atoms. As we can see, obtained from are in an excellent agreement with those from MD simulations and the relative difference of , RDE(), is up to K, which verifies the accuracy of the PF obtained by DIA. It should be noted that, based on our derivations, DIA is independent of how the directions of the Cartesian coordinates are chosen. To test this, we arbitrarily selected four more atoms, labeled No.- in the top panel of Fig.2(a), to calculate the respectively. Although the potential-energy curves for different atoms are not the same, i.e., differs a lot from , the calculated internal energies based on the five atoms are the same (see detailed data listed in Supplementary Information).
For the MSS of SW isomer shown in the bottom panel of Fig.2(a), atoms were divided into three groups, where the first group is the atoms shared by two hexagons and one pentagon (colored in brown), the second one is the atoms shared by one hexagon and two pentagons (colored in magenta), and the last one is the atoms shared by three hexagons (colored in black). Accordingly, Eq.(10) was used to calculate the , and, an arbitrary atom in each group, surrounded by dashed lines in bottom panel of Fig.2(a), was selected to obtain the corresponding potential-energy curves respectively. The selected atoms were moved Å in positive and negative direction, and PEs were recorded to obtain the corresponding potential-energy curves. Fig.2(b2) shows the comparison of the internal energy derived from and MD simulations, where the RDE is within the whole temperature range. To test the universality of DIA, we additionally chose other different atoms in the three groups to calculate the respectively and the obtained results are the same as those shown in Fig.2(b2) (see detailed data listed in Supplementary Information).
Fig.2(b3) shows the obtained FEs of BF and SW isomer. Although the FE of BF remains smaller than that of SW isomer within the whole temperature range, the difference is quite small, which makes it difficult to conclude whether BF is the more stable one than the isomer or not. On the other hand, an unambiguous picture was presented by calculating the ratio of PF between these two clusters. Apparently, SW isomer has negligible probability to survive at low temperature zone compared to BF, while, for K, the surviving probability of SW prominently increases and ends up to at K, exhibiting that there is a considerable chance of the isomer to form at higher temperature zone. Such a finding qualitatively agrees with previous results by ns-long MD simulationsLi and Ning (2004) but the exact value of differs from each other, which calls for further clarifications by future experiments.
III.2 For solid and liquid Copper
Considering the huge computational cost of the MD simulations rather than DIA, the number of copper (Cu) atoms in our model is limited to , which were confined in a cubic box with periodic boundary condition (PBC) applied. The tight-binding (TB) potentialCleri and Rosato (1993) was employed to describe the interatomic interactions. For solid Cu, the MSS was found by arranging the atoms at the FCC lattice sites. In consideration of the Fm-3m symmetry of FCC lattice, the Cartesian -axis of atoms is set to [] direction so that the potential felt by an atom moving along the -axis (or -axis) is the same as the one along the -axis. To obtain , the coordinate of the geometry-center atom was changed step by step with its and coordinates fixed to record the potential energies, during which all the other atoms stay fixed as well. For the liquid system, the atoms were heated up to K in MD simulation to generate a uniform distribution and a damped trajectory methodZhang and Buch (1990) was used to determine the MSS and potential energy (see details in Supplementary Information). Different from the cases in crystal Cu, the potential felt by a liquid atom moving along the -, - or -axis may not be the same, so , and were calculated respectively and Eq.(9) was applied to obtain the CI.
Common procedures for MD simulations of a canonical ensembleYe et al. (2009) was employed to produce the internal energy () and pressure () of the systems contacted with a thermal bath at given temperatures, and the Verlet algorithmVerlet (1967) was employed for integrating the equations of motion with time step fs and fs for solid and liquid respectively. We first fully relaxed the systems for steps, and ran another (or ) steps to record the values of and for solid (or liquid). Attentions have to be paid that the commonly used Virial equationsAllen and Tildesley (1987) are inaccurate to compute pressure in the case of many-body potential with PBC appliedLouwerse and Baerends (2006); Thompson et al. (2009), and we employed the method proposed by TsaiTsai (1979) that considers stress and momentum flux across an area for conducting the statistic of .
As shown in Fig.3, for the solid systems with atomic volume ranging from to Å3/atom, the internal energy and pressure derived from the PF are in excellent agreement with those ( and ) obtained by MD simulations at temperatures from K to K. For K, the relative difference of internal energy RDE) is less than ((Fig.3a, see data in Supplementary Information). As the temperature rises up to K, the difference gets a bit larger (the maximum is ), which may be attributed to the statistical fluctuations of MD simulations because the relative standard deviations of internal energy (RSDE), , increase by about three times of those () below K. As shown in Fig.3b, the relative difference of pressure (RDP) is on the level of or less, which is larger than the RDE. The reason should be that the MD method for statistic of pressure needs more simulation time since the relative standard deviations of pressure (RSDP), , are about two orders larger than the RSDE. In order to confirm this conjecture, we did similar computations except that the TB potential was replaced with a Lennard-Jones (L-J) potentialAgrawal et al. (2002), for which the Virial theorem can be applied in the MD simulations to calculate the pressure exactly. For the volumes of the system ranging from to Å3/atom at K, the RSDP reduces down to , and correspondingly, the RDP gets smaller to a level of (see data in Supplementary Information).
Fig.4 shows that the internal energy and pressure of liquid Cu obtained by DIA are in good agreement with the MD simulations. The RDE for all the systems is less than , which is about ten times larger than that for solid Cu. This difference may stem from the fact that the felt by a liquid atom differs a little from that felt by other atoms, which is not the same as the situation in the solid systems. So, we can perform DIA for more liquid atoms to obtain more accurate results. As to the pressure, the RDP is less than for most systems except for the one with a density of g/cc, which displays larger RDP and can be understood because the RSDP is apparently larger than others (see Fig.4(b3)).
It should be noted that the precision of DIA is so high that it has reached limit of the MD simulations. For instance, the internal energy obtained by DIA of solid Cu with the atomic volume of Å3/atom is eV for K and eV for K, which are almost the same as eV and eV obtained by the MD simulations with RSDE of ‰. The gradual increases of RDE and RDP should be attributed to the fluctuation rise of the MD simulations (see Fig.3(a3) and (b3)), for which smaller time step should be applied to integrate the equations of motion at high temperatures to ensure the precision.
As shown in Fig.5, the isothermal EOS of solid Cu derived from the PF using TB potentialCleri and Rosato (1993) exhibits the same trend as the experimental results though the pressures are about larger than the measured valuesKraus et al. (2016); Dewaele et al. (2004). This discrepancy should be attributed to the inaccuracy of the empirical potential because the pressures obtained by DIA and MD simulations coincide with each other quite well. By contrast, we also used the L-J potentialAgrawal et al. (2002) to calculate the EOS by DIA. Although the outcome coincides well with the corresponding MD simulations, it deviates much more from the experiments. Accordingly, it calls for a more accurate interatomic potential, which may resort to ab initio calculations in the future.
III.3 For Solid Argon
Considering that solid argon (Ar) has been extensively studied experimentally and is believed to be well characterized by L-J pair potentialAllen and Tildesley (1987), we calculated the PF of solid Ar by DIA to produce the EOS and compared the results with experiments. To our best knowledge, no accurate EOS has been put forward for solid Ar by directly solving the PF, though various EOSs towards Ar systems have been put forward either based on a fitting-parameter procedure (see detailed reviews in Refs.Tegeler et al. (1999); Young et al. (2016)), or in a MC sampling way for disordered liquid and gas statesDesgranges and Delhommelle (2012); Baldock et al. (2016).
In our work, the system consists of argon atoms placed at FCC sites and the computational procedure of DIA was the same as described in the system of solid Cu according to Eq.(8) except for the interatomic potential being replaced with L-J potentialAllen and Tildesley (1987). As shown in Fig.6(a) (see data in Supplementary Information), the pressure () derived from the PF coincides quite well with the experimental measurements ()Lewis et al. (1974). For the atomic volume ranging from to Å3/atom, the relative difference (RDP) between the theoretical pressures and the experimental ones is within in most conditions as demonstrated in Fig.6(b). The maximum RDP of occurs in the system with an atomic volume of Å3/atom at K, and may result from the fact, according to the original referenceLewis et al. (1974), that the uncertainty of the measured pressure ranges from bars above K and becomes larger for lower temperatures. Considering the possible influence from the empirical L-J potentials, we may reach a conclusion that the isochoric EOS obtained by DIA is in a good agreement with the experiment.
III.4 Comparisons with Nested Sampling
It is very necessary and interesting to compare DIA with NS, the state-of-the-art approach to PF, in terms of computational efficiency and precision. On the efficiency, the computational cost of the two methods is determined by the number of times, , to calculate the total potential energy. In previous work employing NS in solid Al (or NiTi) systems of atomsBaldock et al. (2016), is about (or , while for DIA applied in the solid Cu of atoms is , showing that DIA is at least four orders faster. The real computer time of DIA to calculate the PF of Cu atoms characterized by the many-body TB potentialCleri and Rosato (1993) is about minutes with full use of a desktop 8-core AMD Ryzen X CPU (GHz per core), and, for the Ar atoms characterized by the pairwise L-J potentialAllen and Tildesley (1987), is about minutes using one physical core of the CPU. For a solid argon system of atoms described by L-J potential at temperatures ranging from K to K, we ran a NS algorithm with and DIA with (real computer time is about hours for NS and about seconds for DIA respectively by using one physical core of the CPU) to calculate internal energy and pressure, which were compared to the MD simulations using the same potential function, demonstrating that the precision of DIA is about times higherGong et al. (2019). The accuracy of DIA has also been proved by calculating the internal energy of graphene or -graphyne materials on Cu substrate using Brener potential functionLiu et al. (2019a), and silicene on Ag substrate using Tersoff potential functionLiu et al. (2019b). Due to the ultrahigh efficiency, DIA has been successfully applied to predict the optimal conditions for silicene growth on Ag substrate with ab initio calculationsLiu et al. (2019b), which should be the first time to calculate the absolute free energy at finite temperatures up to thousands Kelvins with first-principle laws.
IV Conclusion
In summary, by our reinterpretation of integral, DIA to PF of large molecules and condensed systems was established. The accuracy of DIA was strictly validated by vast MD simulations for C60 clusters, condensed Cu, solid argon, graphene and silicene on substrate, and by experiments for solid Ar respectively. Compared to state-of-the-art method for PF, DIA works at least four orders faster with about one order more precise. The new approach will find its vast applications in investigating the products of complex chemical reactions, thermodynamic properties of large molecules and macroscopic systems, which highly relates to designing novel material, predicting various phase transitions and parameter-free EOS under extreme conditions.
V acknowledgement
TCW acknowledges the support from National Natural Science Foundation of China under Grant No.21727801.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Chipot and Pohorille (2007) C. Chipot and A. Pohorille, eds., Free Energy Calculations: Theory and Applications in Chmistry and Biology (Springer Berlin, 2007).
- 2D Stacey (2005) F. D Stacey, Rep. Prog. Phys. 68 , 341 (2005).
- 3Ross and Young (1993) M. Ross and D. A. Young, Annu. Rev. Phys. Chem. 44 , 61 (1993) . · doi ↗
- 4Monson and Kofke (2000) P. A. Monson and D. A. Kofke, Adv. Chem. Phys. 115 , 113 (2000).
- 5Ushcats et al. (2016) M. V. Ushcats, L. A. Bulavin, V. M. Sysoev, V. Y. Bardik, and A. N. Alekseev, J. Mol. Liq. 224 , 694 (2016).
- 6Allen and Tildesley (1987) M. P. Allen and D. J. Tildesley, Computer Simulation of Liquids (Oxford University Press, 1987).
- 7Rapaort (2004) D. C. Rapaort, The Art of Molecular Dynamics Simulation (Cambridge University Press, 2004).
- 8Landau and Kurt (2005) D. P. Landau and B. Kurt, A Guide to Monte Carlo Simulations in Statistical Physics (Cambridge University Press, 2005).
