Making the most of data: Quantum Monte Carlo Post-Analysis Revisited
Tom Ichibha, Kenta Hongo, Ryo Maezono, Alex J.W. Thom

TL;DR
This paper evaluates and compares different error estimation methods for energy estimators in quantum Monte Carlo simulations, proposing a hybrid approach for reliable error analysis across various series lengths.
Contribution
It introduces a hybrid error estimation method and heuristic schemes for identifying the equilibrated phase in QMC energy series analysis.
Findings
Hybrid analysis provides reliable error estimates for all series lengths.
MSER heuristic effectively identifies the start of the equilibrated phase.
Comparison of three error estimation methods demonstrates their strengths and limitations.
Abstract
In quantum Monte Carlo (QMC) methods, energy estimators are calculated as the statistical average of the Markov chain sampling of energy estimator along with an associated statistical error. This error estimation is not straightforward and there are several choices of the error estimation methods. We evaluate the performance of three methods, Straatsma, an autoregressive model, and a blocking analysis based on von Neumann's ratio test for randomness, for the energy time-series given by Diffusion Monte Carlo, Full Configuration Interaction Quantum Monte Carlo and Coupled Cluster Monte Carlo methods. From these analyses we describe a hybrid analysis method which provides reliable error estimates for series of all lengths. Equally important is the estimation of the appropriate start point of the equilibrated phase, and two heuristic schemes are tested, establishing that MSER (mean squared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Making the most of data:
Quantum Monte Carlo Post-Analysis Revisited
Tom Ichibha
School of Information Science, JAIST, 1-1 Asahidai, Nomi, Ishikawa, 923-1292, Japan.
Kenta Hongo
Research Center for Advanced Computing Infrastructure, JAIST, 1-1 Asahidai, Nomi, Ishikawa 923-1292, Japan.
Center for Materials Research by Information Integration, Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, 1-2-1 Sengen, Tsukuba 305-0047, Japan.
PRESTO, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi-shi, Saitama 322-0012, Japan.
Computational Engineering Applications Unit, RIKEN, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
Ryo Maezono
School of Information Science, JAIST, 1-1 Asahidai, Nomi, Ishikawa, 923-1292, Japan.
Computational Engineering Applications Unit, RIKEN, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan.
Alex J.W. Thom
Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, U.K.
Abstract
In quantum Monte Carlo (QMC) methods, energy estimators are calculated as the statistical average of the Markov chain sampling of energy estimator along with an associated statistical error. This error estimation is not straightforward and there are several choices of the error estimation methods. We evaluate the performance of three methods, Straatsma, an autoregressive model, and a blocking analysis based on von Neumann’s ratio test for randomness, for the energy time-series given by Diffusion Monte Carlo, Full Configuration Interaction Quantum Monte Carlo and Coupled Cluster Monte Carlo methods. From these analyses we describe a hybrid analysis method which provides reliable error estimates for series of all lengths. Equally important is the estimation of the appropriate start point of the equilibrated phase, and two heuristic schemes are tested, establishing that MSER (mean squared error rule) gives reasonable and constant estimations independent of the length of time-series.
I Introduction
With the increase in availability of large-scale computers, Quantum Monte Carlo (QMC) methods have spread rapidly owing to the embarassing parallelizability of such algorithms. 1; 2; 3 QMC is one of the most accurate ab initio methods, and it is often used for systems which cannot be sufficiently accurately described by Density Functional Theory (DFT) 4; 5; 6; 7 or which are too large to apply post Hartree–Fock methods.8
Diffusion Monte Carlo (DMC)9 is one such QMC method with a computational scaling of for a system of electrons, and, as such, it can be applied even to large-sized systems including more than 1000 electrons. 5 The drawback of DMC is the requirement to use the fixed-node approximation10 to avoid the sign problem, which introduces a systematic error dependent upon the quality of the nodes of a trial wavefunction, and, although there are ways to suppress this error,11; 12 they make calculations considerably more expensive.
Two newer QMC methods in quantum chemistry have attracted interest of late, as they are not constrained by the fixed-node approximation: Firstly the full-configuration interaction QMC (FCIQMC) method,13; 14; 15 which stochastically solves the equations of full-configuration interaction (FCI), by sampling with discrete particles. Although the scaling of calculation cost is still exponential15 in the number of electrons like FCI, the prefactor of scaling curve is significantly reduced. Thus, this method can be applied to medium-sized systems.16 Secondly is Coupled Cluster Monte Carlo (CCMC), which stochastically solves Coupled Cluster (CC) equations.17; 18 Since the parameter space of a truncated CC calculation is smaller than that of FCI, CCMC will in general have a smaller memory cost than FCIQMC.
QMC methods commonly provide an energy estimator as the statistical average of sampling a Markov chain, also producing an estimate of the statistical error. It is difficult to estimate the error reliably due to the following reasons:19 (i) The samples are not independent of each other but correlated along the simulation time evolution. (ii) When the distribution of sampling is non-normal, the probability distribution of the mean value is also non-normal unless the number of sampling is large enough to satisfy the central limit theorem. In this work, we examine the performance of three characteristic automatic error estimation methods, Straatsma, 20, the AutoRegressive (AR) model 21, and blocking analysis based on von Neumann’s ratio test for randomness 22; 23 (von Neumann blocking) for the energy time-series obtained by applying DMC, FCIQMC, and CCMC to the neon atom. From these data we establish recommendations for the most reliable error estimation method for different lengths of time series, and devise a new hybrid scheme applicable to any length of time series.
Another important issue on the post-analysis of QMC is to determine the length of the pre-equilibration (warm-up) phase.19 Underestimation of the length gives a systematic error in the energy but its overestimation also increases the statistical error. In this work, we tested two heuristic methods to estimate the warm-up steps. One is MSER (mean squared error rule) 24 and the other is min-WREE (minimization of weighted relative error of the error) inspired by the post-analysis implemented in HANDE QMC code for stochastic quantum chemistry. 25; 26 Our analysis establishes that the estimation of warm-up steps by min-WREE changes depending on the length of time-series. On the other hand, MSER makes reasonable and constant estimation of warm-up steps, independent of the length of time-series.
II Error estimation method
In this section we elucidate the error estimation methods used: Straatsma,20 AR model,21 and von Neumann blocking.23
Straatsma
Straatsma et al. show that the variance of the statistical average of stationary time-series including samples is given as follows without any assumptions:20
[TABLE]
Here, is the estimation of time-correlation length (steps).000 The correlation length is differently defined in two papers, 20; 21 but they give the same definition of error. We took the newer definition of .21
is the autocorrelation function with lag , which is approximately given by the finite number of samples as:
[TABLE]
Here, the approximation of is inaccurate when the number of terms to be summed up is small. Thus, in equation 2 we limit the summation over to values before becomes negative for the first time, since later oscillates around zero as shown in Figure 1. The resulting is then used to calculate .
AutoRegressive (AR) Model
The AR model assumes that the random process of Markov chain sampling can be reasonably described by 21
[TABLE]
The -th sample is given as a linear combination of the previous steps and a random Gaussian noise with average 0 and variance . The coefficients and the variance are fitted to the given time-series using Yule–Walker equation.28 The number of coefficients is decided based on Akaike’s Information Criterion (AIC). 29 Large number of coefficients are needed to accurately describe the stochastic process of the given time-series, but, if it is too large, it becomes over-fitting. AIC aims to provide an appropriate compromise value.
The estimation of the correlation length is calculated by
[TABLE]
where are defined by equation 3.
von Neumann blocking
Von Neumann blocking takes into account the non-normality of the distribution of the sampling, in contrast to the two above-mentioned methods. The given time-series is divided into blocks with block size , and a new time-series produced:
[TABLE]
Both the non-normality of the distribution and the correlation length are reduced in the new time-series . The block size is decided such that each can be regarded to be sampled from independent and identical normal distributions, based on von Neumann’s rate test for randomness.22 After blocking, the variance of the statistical average is given by , .
Here, we also note the very commonly used blocking approach by Flyvbjerg and Petersen.30 This method performs integration of autocorrelation functions almost equivalent to equation 1 yet utilizing blocking, aiming to reduce the computational cost of analysis: It is mathematically similar to Straatsma 20 but it suffers a systematic bias stemming from blocking as well as von Neumann blocking.
III Estimation schemes of warm-up steps
We introduce two schemes for warm-up steps estimation in this section, MSER24 and min-WREE, which are implemented in HANDE code.25; 26
Mean squared error rule (MSER)
MSER aims to give an adequate estimate of warm-up steps as minimizing a sum of the systematic error from the warm-up phase and the statistical error. The number of warm-up steps, , is determined by minimizing the following quantity:
[TABLE]
Here, is a constant independent of corresponding to the case where does not include a warm-up phase. When a warm-up phase is present, increases from the constant value according to how much warm-up phase remains in . Meanwhile monotonically increases according to , and the value of minimizing their product gives an appropriate estimate of the number of warm-up steps.
Minimization of weighted relative error of the error (min-WREE)
This scheme estimates the warm-up steps as minimizing the relative error of error of the statistical average, , weighted by :
[TABLE]
We evaluated using von Neumann blocking 23 in this work.
IV QMC calculation details
We employed CASINO31 for DMC calculations. We used a Slater-Jastrow trial wave-function.9 The determinant is generated by the Hartree–Fock method using a STO-6G Gaussian basis set.32 The Jastrow factor consists of one- and two-body terms and includes 42 parameters in total. For the DMC calculations, the target population of walkers is set to be 1024 and the time step is 0.005 a.u.-1. We also used the cusp correction scheme,33 which replaces the shape of orbitals nearby ionic cores with Slater functions to satisfy the Kato cusp conditions.34 Each sample of the energy time-series is given by averaging the local energies9 over all of the walkers for every QMC iteration. The influence of the population fluctuation and the population control31 is not considered in this work.
We performed FCIQMC and CCMC calculations with HANDE.25; 26 The reference Slater determinant is prepared by Hartree–Fock method with cc-pVDZ Gaussian basis set35 using Psi4.36 The target number of walker population is 500 and the time step is 2.0 a.u.-1. Each sample of the energy time-series is given as the instantaneous projected energy, which is a ratio between and for every QMC iteration. The influence of the population fluctuation and the population control 31 is not considered in this work.
V Results and Discussion
We prepared one thousand different energy time-series for the neon atom, with the same calculation settings but with the different random seeds, using DMC, FCIQMC, and CCMC methods, for different lengths of time-series, respectively. We applied the error estimation methods to them and surveyed the concordance rate between the energy means and the reference mean value within the estimated errors with confidential interval (CI) as shown in Figure 2. The concordance rate for a 1 CI is ideally 68.27%. Thus, when the observed rate is closer to this value, the error estimation is regarded to be more reliable. Here, the reference mean value is given by taking an average of very long length of time-series, and the error is just less than 3% of those of the energy means of 1000 time-series.
First, we discuss the case of FCIQMC/CCMC (see Figure 2bc). All of the error estimation methods give lower concordance rate than the ideal value for confidential interval, 68.27%, when the length of time-series is small. This is typically observed for error estimation.21 For comparatively short lengths of time-series, the AR model shows the highest concordance rate among them. The comparison of the estimated errors shown in Figure 3bc further distinguishes the AR model from the others: Only the AR model reproduces that the estimated error normally decreases in proportion to . It clearly proves the advantage of taking AR model of equation 4. On the other hand, the lowest concordance rate is measured for von Neumann blocking. The von Neumann’s criteria to check randomness and normality tends to be not effective for small numbers of data,22 so it underestimates the correlation length for small length of time-series.
Straatsma gives the intermediate concordance rates for small lengths . The calculated fully depends on the autocorrelation function through equations 1 and 2, so we examined how the shape of the autocorrelation function changes according to the length of time-series in Figure 4: The autocorrelation function becomes negative more quickly for smaller by the oscillating since it is estimated by insufficient number of terms, , through equation 3. The truncation of the sum in equation 2 is so drastic that Straatsma underestimates the correlation time. In contrast, the oscillation of does not much affect the AR model, although its estimation also depends on autocorrelation functions through equation 6. This is because taking a product with drastically reduces the contribution of with large lag : Figure 5 shows the expansion of the parameters , where they are terminated or converged to zero only within a few terms. Therefore, just a few terms of from small lag is used to calculate in AR model.
When the length of time-series is comparatively large, the concordance rate of AR model converges to 68.27 % the most slowly. This is because the assumption of equation 4 in AR model cannot fully describe the target random process, and therefore its reliability is reduced. To summarize, the AR model (Straatsma) is the most reliable for small (middle/large) length time-series, respectively. We have therefore devised a hybrid scheme of AR model and Straatsma, which works reasonably for any length of time-series. It simply adopts the larger of the errors estimated by both methods:
The concordance rate for the hybrid method is shown as ‘AR model + Straatsma’, in Figure 2 and always comparatively close to 68.27 %.
We performed the same test of the error estimation methods for DMC time-series. Von Neumann blocking gives the closest concordance rate to the ideal value, 68.27%, for any length of time-series, and only the concordance rate of this method reaches 68.27 %. The difference from the case of FCIQMC/CCMC comes from that the distribution of DMC energy time-series tends to be non-normal: Figure 6 clearly shows that the distributions of the time-series given by DMC have larger skewness and kurtosis than those of FCIQMC/CCMC. As mentioned in section II, only von Neumann blocking can take into account non-normality for error estimation, which would be the reason why von Neumann blocking the most works in the case of DMC. This difference in non-normality also explains why the AR model gives the lowest concordance rate: the AR model assumes that the randomness between the neighboring steps is expressed by normally distributed noise, so it would not be possible to make a description when the distribution of time-series is non-normal.
Finally, we discuss the performance of the estimation schemes of warm-up steps, MSER and min-WREE. We apply these schemes to the time-series including non-plateau part and removed the estimated warm-up steps . Then, we applied ‘von Neumann blocking’ to obtain the concordance rate and the statistical error. Figure 2 shows that MSER basically gives higher concordance rate and smaller statistical error especially in the case of DMC: MSER is superior to min-WREE on the whole. The advantage is further distinguished comparing the average of warm-up steps. They are shown in Figure 7 with the standard errors. It clearly shows that the estimations of min-WREE strongly depends on the length of time-series and largely scattered. On the other hand, MSER estimates constant warm-up steps with small variances, independent of the length of time-series.
VI Conclusion
We compared the reliability of three kinds of error estimation methods, Straatsma,20 AR model,21 and von Neumann blocking,22; 23 in a statistical manner, when they are applied to the energy time-series given by applying DMC9, FCIQMC13, and CCMC17; 18 to the neon atom. In the case of FCIQMC/CCMC, it is shown that Straatsma (the AR model) is the most reliable for comparatively long (short) length of time-series, respectively. We established that the assumption in the AR model significantly reduces the influence of the oscillation of autocorrelation functions for short time-series lengths, but we concluded that the systematic error from the assumption is pronounced for long lengths. We devised a hybrid scheme, which takes the larger error from the ones estimated by Straatsma and AR model, and established it works for any length of time-series. In contrast, for DMC, we showed that von Neumann blocking is the most reliable for any length of time-series. This method has an advantage of considering the non-normality of the distribution of time-series, and we showed an strong evidence that the advantage is essential to analyse the DMC results. We have also tested two kinds of warm-up steps estimation schemes, MSER24 and min-WREE25, and established that MSER gives constant and reasonable estimations of warm-up steps, independent of the length and for time-series made by any of the QMC methods.
Acknowledgments
The computation in this work has been performed using the facilities of the Research Center for Advanced Computing Infrastructure (RCACI) at JAIST. T.I. is grateful for financial suport from Grant-in-Aid for JSPS Research Fellow (18J12653). K.H. is grateful for financial support from a KAKENHI grant (JP17K17762), a Grant-in-Aid for Scientific Research on Innovative Areas “Mixed Anion” project (JP16H06439) from MEXT, PRESTO (JPMJPR16NA) and the Materials research by Information Integration Initiative (MI2I) project of the Support Program for Starting Up Innovation Hub from Japan Science and Technology Agency (JST). R.M. is grateful for financial supports from MEXT-KAKENHI (17H05478 and 16KK0097), from Toyota Motor Corporation, from I-O DATA Foundation, and from the Air Force Office of Scientific Research (AFOSR-AOARD/FA2386-17-1-4049). R.M. and K.H. are also grateful to financial supports from MEXT-FLAGSHIP2020 (hp170269, hp170220). A.J.W.T. thanks the Royal Society for a University Research Fellowship (UF110161).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Spencer et al. (2018) J. S. Spencer, V. A. Neufeld, W. A. Vigor, R. S. T. Franklin, and A. J. W. Thom, The Journal of Chemical Physics 149 , 204103 (2018) , https://doi.org/10.1063/1.5047420 . · doi ↗
- 2Blunt et al. (2015) N. S. Blunt, S. D. Smart, J. A. F. Kersten, J. S. Spencer, G. H. Booth, and A. Alavi, The Journal of Chemical Physics 142 , 184107 (2015) , https://doi.org/10.1063/1.4920975 . · doi ↗
- 3Mathuriya et al. (2017) A. Mathuriya, Y. Luo, R. C. Clay, III, A. Benali, L. Shulenburger, and J. Kim, in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis , SC ’17 (ACM, New York, NY, USA, 2017) pp. 38:1–38:12. · doi ↗
- 4Ichibha et al. (2017) T. Ichibha, Z. Hou, K. Hongo, and R. Maezono, Sci. Rep. 7 , 2011 (2017) . · doi ↗
- 5Luo et al. (2016) Y. Luo, A. Benali, L. Shulenburger, J. T. Krogel, O. Heinonen, and P. R. C. Kent, New Journal of Physics 18 , 113049 (2016) .
- 6Trail et al. (2017) J. Trail, B. Monserrat, P. López Ríos, R. Maezono, and R. J. Needs, Phys. Rev. B 95 , 121108 (2017) . · doi ↗
- 7Ken et al. (2018) Q. S. Ken, T. Ichibha, K. Hongo, and R. Maezono, “Difficulty to capture non-additive enhancement of stacking energy by conventional ab initio methods,” (2018), ar Xiv:1807.04168 .
- 8Benali et al. (2018) A. Benali, Y. Luo, H. Shin, D. Pahls, and O. Heinonen, The Journal of Physical Chemistry C 122 , 16683 (2018) , https://doi.org/10.1021/acs.jpcc.8b 02368 . · doi ↗
