Fractional charge by fixed-node diffusion Monte Carlo
Matej Ditte, Matus Dubecky

TL;DR
This paper investigates how fixed-node diffusion Monte Carlo (FNDMC) can accurately reproduce the linear energy behavior with fractional electron numbers, demonstrating its effectiveness in electronic structure calculations involving fractional charges.
Contribution
It shows that FNDMC, when combined with mean-field trial wave functions, can restore the piecewise linearity of energy versus fractional electron number, improving charge localization accuracy.
Findings
FNDMC restores piecewise linearity in $E(N)$ for fractional charges.
FNDMC effectively localizes charge in systems with fractional electrons.
FNDMC outperforms other methods in charged noncovalent systems.
Abstract
Fixed-node diffusion Monte Carlo (FNDMC) is a stochastic quantum many-body method that has a great potential in electronic structure theory. We examine how FNDMC satisfies exact constraints, linearity and derivative discontinuity of total energy vs. fractional electron number , if combined with mean-field trial wave functions that miss such features. H and Cl atoms with fractional charge reveal that FNDMC is well able to restore the piecewise linearity of . The method uses ensemble and projector ingredients to achieve the correct charge localization. Water-solvated Cl complex illustrates superior performance of FNDMC for charged noncovalent systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fractional charge by fixed-node diffusion Monte Carlo
Matej Ditte
Department of Physics, University of Ostrava, 30. dubna 22, 701 03 Ostrava, Czech Republic
Matúš Dubecký
Department of Physics, University of Ostrava, 30. dubna 22, 701 03 Ostrava, Czech Republic
ATRI, Slovak University of Technology, Paulínska 16, 917 24 Trnava, Slovakia
Abstract
Fixed-node diffusion Monte Carlo (FNDMC) is a stochastic quantum many-body method that has a great potential in electronic structure theory. We examine how FNDMC satisfies exact constraints, linearity and derivative discontinuity of total energy vs. fractional electron number , if combined with mean-field trial wave functions that miss such features. H and Cl atoms with fractional charge reveal that FNDMC is well able to restore the piecewise linearity of . The method uses ensemble and projector ingredients to achieve the correct charge localization. Water-solvated Cl- complex illustrates superior performance of FNDMC for charged noncovalent systems.
Motivation: fixed-node diffusion Monte Carlo (FNDMC) is a many-body stochastic quantum Monte Carlo (QMC) projector methodAnderson1975 ; Reynolds1982 ; Umrigar1993 ; Mitas1991 that has been popular in electronic structure theory for its accuracy, scalability and versatilityKolorenc2011rev ; Luchow2011rev ; Austin2012rev ; Morales2014rev ; Dubecky2016rev . FNDMC acts in a continuum position space; for a Hamiltonian it projects out an exact ground state (GS) that has non-zero overlap with the antisymmetric trial state , in imaginary time :
[TABLE]
where is an offset energy that keeps the norm of asymptotically constant. The total FNDMC energy is an upper bound to the exact energyMoskowitz1982 and the related FN bias goes quadratically with the nodal displacement errorMitas1991 . can be well sophisticated so that the related FN bias becomes negligibleMorales2012 ; Clay2015 , nevertheless, practical FNDMC feasible for large systems requires simple and efficient ansätze like the popular Slater-Jastrow wave function Ceperley1977 , , where is a single Slater determinant and is a positive-definite explicit correlation Jastrow termJastrow1955 .
Mean-field theories used to produce , including DFTPer2012 , as well as the states and (see below), however miss fundamental constraints on exact electronic structure theory: the total energy as a continuous function of a particle number must show piecewise linear relationship with possible derivative discontinuities at integer Perdew1982 ; Yang2000 ; Mori2009 . Lack thereof has been referred to as (de)localization error and it has severe consequences; it leads to an artificial charge adjustment and related spurious energy minimization which causes poor predictivity of charge-transfer/reaction barriers, band gaps, or noncovalent interactions (e.g., solvation of ions)Cohen2008 ; Mori2014rev . In addition, must also satisfy constancy condition of fractional spin Mori2009 important for correct description of strong correlation.
Here we examine how FNDMC satisfies linearity of for fractional , and, derivative discontinuity at integer , if combined with based on popular spin-restricted mean-field that show unphysical convex dependencyMori2014rev . Such an analysis is important, for instance, to understand if the method is able to provide right energetics of charged noncovalent systemsCohen2008 for right reasons. We consider atoms with fractional charge, H and Cl, to show that FNDMC is well able to restore the piecewise linearity of from the states that do not possess such a property. Walker population analyzes indicate that ensemble and projector features are both operative in this achievement. Insights gained from water-solvated Cl- complex suggest that accuracy and robustness of FNDMC in charge-involving noncovalent interactions relate to the accurate charge localization.
Theory: the effective fractional charge per atom was achieved for both, H and Cl, by modeling cube (system) composed of 8 atoms (subsystems) of the same type separated enough (infinite-separation limit) so that the interactions between the subsystems can be neglected. The Hamiltonian of such a system with electrons,
[TABLE]
satisfies the Schrödinger equation,
[TABLE]
where
[TABLE]
is a -th -times degenerate antisymmetric GS wave function assembled as an antisymmetrized product of individual subsystem -times degenerate wave functions that each satisfies
[TABLE]
is an -electron antisymmetrization operator, and,
[TABLE]
is a corresponding degenerate total energy of the system. Addition of an integer charge to such a system adds effective charge per subsystem thus enabling fractional charge as well.
QMC simulations of systems in large separation limit require additional considerations for the lack of VMC thermalization within usual simulation times (see Methods), and, non-ergodicity (electron localization on specific subsystems) due to the branching term in FNDMC that eliminates configurations with small weights, including those with electrons possibly attempting to diffuse between the distant subsystems. We argue that the total energy expectation values from such FNDMC simulations are energetically correct.
One needs to realize that within a properly initialized non-ergodic equilibrium FNDMC simulation (starting from ), any walker , irrespective of its specific localization , i.e. a specific set of occupation numbers of the subsystems labeled by , randomly samples one of the degenerate symmetry-broken GS wave functions and contributes to according to the so-called mixed distribution . For a totally symmetric , like, e.g., our based on a spin-restricted Slater determinant, that satisfies for all , it is not hard to accept that the population-based , that is of our sole interest here, is not affected by the symmetry breaking.
Hydrogen cube: first, we present the results obtained for H8 cubeMori2014rev with an edge of 1000 Å, 0-16 electrons and singlet spin multiplicity (), allowing us to study essentially isolated H atom with an effective fractional charge in range , i.e., along HHH*-* pathway and 0.25 increments (Fig. 1). In agreement with the previous fractional-charge studies of HMori2009 ; Mori2014 ; Mori2014rev , we find that the considered mean-field methods, HF and DFT with PBEPerdew1996 functional, miss linearity and derivative discontinuity of and show unphysical convex behavior instead (Fig. 1, red, blue).
Interestingly, FNDMC with related (cf. Fig. 3) recovered piecewise linear dependency within the statistical resolution (Fig. 1, black). The results were indistinguishable for HF and PBE (more types were thus not considered). FNDMC with based on PBE orbitals produced electron affinity EA(H)=0.713(1) eV to be compared vs. experimental value of 0.754 eVLykke1991 . Slight discrepancy of 0.04 eV is attributed to the use of effective core potential (ECP) and residual FN bias due to 1-determinantGasperich2017 . Note that if FNDMC method satisfies linearity condition, the quality of the nodal surface determines only the slope of the actual segment.
Chlorine cube: next we consider Cl8 cube (1000 Å) in order to examine how FNDMC describes fractional charge in many-electron system. We consider only singlet states (). Effective fractional charge per Cl atom ranges between (1ClClCl*-*) with 0.25 increments (Fig. 2). We observe that although the mean-field methods (HF, PBE, HSE06Heyd2006 ) miss linearity, and derivative discontinuityCohen2008 at point per atom (neutral Cl), FNDMC well recovers both features from the related and produces accurate EA(Cl)=3.64(2) eV consistent with experiment (3.613 eVBerzinsh1995 ). It appears that FNDMC is well able to produce correct energetics in fractional-charge systems.
Wave function analysis: let us now focus on how FNDMC achieves linearity of in fractional charge models. As mentioned above, the electrons within each walker localize on specific centers, leading to the correspondence between the explicit (alive) walker position (varying along simulation) and subsystem occupation vector in occupation representation (fixed during the non-ergodic simulation unless the walker dies out). Since each walker randomly samples one of the degenerate GS wave functions , and the subsystem occupations randomly vary between the walkers, we find it convenient to introduce irreducible subsystem occupation vector obtained as a descending-ordered set of subsystem occupations , . In addition, we introduce a corresponding vector of total subsystem spin momenta ordered in a descending way for each subset of identical . The pair , labels the state of a subsystem (after reordering) with electrons and a total spin momentum , as sampled by the walker . We arrive at the more convenient correspondence, , that enables equal-footing comparisons between the walkers irrespective of the specific localization (and local position), and, unambiguous analysis of the system state within FNDMC simulation. We are interested in coefficients of an expansion of irreducible GS
[TABLE]
subject to , that could be obtained from a stochastic realization of sampled by walkers within equilibrium QMC simulation,
[TABLE]
by virtue of the projectors,
[TABLE]
For electrons in the system with a specific partition between the subsystems, the GS consistent with the subsystem total energy convexity conditionPerdew1982 ,
[TABLE]
requires that each distant subsystem contains electrons, and, for nonzero , subsystems contain 1 additional electron each. is such that it minimizes the total energy. For , any 2 of the 8 subsystems must contain 2 electrons and the remaining subsystems 1 electron each, , assuming a spin GS, e.g., for a singlet H8 cube. Other partitions (or non-GS ) represent excited-state configurations, e.g., , or, . Therefore, for the special case of distant subsystems, an exact GS may be conveniently identified by 1-term eq. 7 expansions.
In Fig. 3, we compare the total energies obtained for H cube by VMC sampling of and , and FNDMC using . Clearly, contrary to the piecewise linear FNDMC, the VMC total energies show convex behavior, although in case of SJ, it seems that the derivative discontinuity starts to develop due to the use of Jastrow termGoetz2017 . For , the curve is indeed consistent with HF in Fig. 1 within the statistical error.
Analysis of the related walker ensembles in terms of configurations revealed multi-configuration states containing large fractions of excited-states for VMC, and, only GS configurations for FNDMC equilibrium ensembles, for all , orbital types, and Cl cube as well (cf. Fig. 5, Appendix I). For instance, / of PBE orbitals of neutral H cube with electrons contained only 6%/11% of GS configurations and 6/5 types of configurations, whereas sampled by FNDMC using contained GS configurations only. For , / contained 22%/38% of GS configurations and 6/4 configuration types, whereas, again, FNDMC contained 100% of GS. Details of such analyzes for H and Cl cubes, DFT and HF orbitals, VMC and FNDMC, and all considered , are reported in Appendix III.
The observation that FNDMC produces 100% of GS configurations in our fractional-charge models is very important as it may improve our understanding of the method robustness in chargedWineman-Fisher2019 and charge-transfer systems (e.g., reactionsSwann2017 ; Krongchon2017 , excited statesValsson2010 ; Dubecky2011 ). It has been shownYang2000 that the linearity of for fractional , and thus accurate charge localization, directly relates to the subsystem GS occupations. Namely, if the total energy of the system can be expressed, using , as
[TABLE]
where and were defined above, then for is a piecewise linear function. The success of FNDMC in recovering the piecewise linearity is thus attributed to the ensemble nature of the method causing that each electron resides on a specific subsystem at a time (fractional density is only possible via ensemble), and, projector property that apparently projects out proper (GS) configurations.
Cl-(H2O)6: Finally, we consider a practical application of the concepts discussed above for noncovalent interactions of charged systemsWineman-Fisher2019 . It was found that FNDMC provides correct dependency for Cl cube contrary to mean-field (Fig. 2). In systems with a charged Cl atom, one would thus generally expect that FNDMC would be more robust than, say, DFT, and insensitive to the orbitals. We have considered interaction energiesDubecky2016rev () of Cl-(H2O)6 complex by various DFT approximations and FNDMC using the same DFT functionals. Fig. 4 summarizes the results and illustrates superior robustness of FNDMC that reproduces the reference CCSD(T)/complete-basis-set results to within 1% for all . In contrast, from DFT and HF altogether varies by more than 20% due to the known (de)localization error.
This is now easy to understand. Randomly chosen DFT determines fractional charge from a certain interval according to its intrinsic delocalization error. In addition, it produces energy on a convex curve with its intrinsic curvature, that varies between functionalsCohen2006 . As a consequence, for a large set of DFT approximations, one obtains a large distribution of biases in the total energy of a complex, and, consequently in , as observed. On the other hand, if FNDMC predicts a correct charge localization, as expected from the results reported above, the fractional charge in subsystems of a complex is accurate and well defined. Inaccuracy of dictates only the slope of the linear dependency, which leaves only a limited interval of bias that can be realized with varying (reasonable) . The dispersion of the results is thus much smaller, or, in other words, the method is much more robust, as observed, and we attribute its success to the accurate charge localization.
Summary: Fractional-charge computations of H and Cl atoms revealed that FNDMC is well able to restore the correct piecewise linear behavior from that does not have such a property. Insights from the walker population analyses, in combination with formal discussions, indicate that FNDMC correctly describes linearity thanks to its ensemble nature and projection property. Inaccuracy of is expected to determine only the slope of segments. This finding enables understanding of FNDMC robustness, as demonstrated in noncovalent Cl-(H2O)6 complex, from a new perspective. Study of fractional spin by FNDMC is underway.
Methods: Naive simulations of cube models with an edge of Å would not be possible because the VMC equilibration times would vastly overcome the maximum simulation times available. In order to overcome this issue in an automated fashion (avoiding manual initialization of configurations that may be biased by user imagination), the walkers for large-cube FNDMC simulations were initialized by ergodic VMC thermalization and stepwise quasi-adiabatic extension of the system size starting from small . We verified that the walker distributions produced in this way sample , and, in case of a Slater determinant with Hartree-Fock (HF) orbitals, VMC reproduced the HF total energy for both, H and Cl, and all considered . This was not the case for walkers initialized directly in a large cube.
FNDMC computations used QMCPACKKim2018 code with imaginary time step of 0.005 a.u. and T-movesCasula2006 . Nuclei were represented by Burkatzki-Filippi-Dolg effective core potentialsBurkatzki2008 . Single-determinant Slater-Jastrow were based on orbitals expanded in 1s-augmented VTZ one-particle basis sets without highest angular momentum channelsDubecky2017b obtained with a tight SCF convergence (Gaussian G09G09 ). Jastrows containing up to electron-electron-nucleus terms, with a cutoff radius of 10 Bohr, were optimized by the linear methodToulouse2007 , as usual, but cube simulations reused Jastrows optimized for H- and Cl, respectively. The structure of Cl-(H2O)6 (reported in Appendix II) was optimized at the MP2/aug-VTZ level (G09).
Discussions with A. Gendiar have been fruitful. Financial support by Czech Science Foundation (18-24321Y, 18-25128S), University of Ostrava (IRP201826), and ERDF (ITMS 26220220179), is gratefully acknowledged. The computations used IT4Innovations National Supercomputing Center (LM2015070).
Appendix
I Chlorine cube
II Structure of solvated chlorine-anion complex
The structure of the considered Cl-(H2O)6 complex optimized at the MP2/aug-VTZ level in G09 (xyz-format):
19
Cl 0.000000 0.000000 0.000000 O 3.290138 0.000000 0.000000 O -3.290138 0.000000 0.000000 O 0.000000 3.264842 0.000000 O 0.000000 -3.264842 0.000000 O 0.000000 0.000000 3.353781 O 0.000000 0.000000 -3.353781 H 2.664747 0.734646 0.000000 H 2.664747 -0.734646 0.000000 H -2.664747 0.734646 0.000000 H -2.664747 -0.734646 0.000000 H 0.000000 2.636302 0.731619 H 0.000000 2.636302 -0.731619 H 0.000000 -2.636302 0.731619 H 0.000000 -2.636302 -0.731619 H 0.000000 0.732695 2.727551 H 0.000000 -0.732695 2.727551 H 0.000000 0.732695 -2.727551 H 0.000000 -0.732695 -2.727551
III Analysis of walkers
Walker ensemble analyzes for H and Cl cube models, various methods (VMC, FNDMC), and, trial states (Slater determinant , Slater-Jastrow ). For each (in electrons per atom), reported are %, and . GS configurations are indicated by the bold font.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J. B. Anderson. J. Chem. Phys. , 63:1499–1502, 1975.
- 2(2) P. J. Reynolds, D. M. Ceperley, B. J. Alder, and W. A. Jr. Lester. J. Chem. Phys. , 77:5593–5603, 1982.
- 3(3) C. J. Umrigar, M. P. Nightingale, and K. J. Runge. J. Chem. Phys. , 99:2865–2890, 1993.
- 4(4) L. Mitas, E. L. Shirley, and D. M. Ceperley. J. Chem. Phys. , 95:3467–3475, 1991.
- 5(5) J. Kolorenč and L. Mitas. Rep. Prog. Phys. , 74:026502, 2011.
- 6(6) A. Lüchow. WIR Es Comput. Mol. Sci. , 1:388–402, 2011.
- 7(7) B. M. Austin, D. Y. Zubarev, and W. A. Lester. Chem. Rev. , 112:263–288, 2012.
- 8(8) M. A. Morales, R. Clay, C. Pierleoni, and D. M. Ceperley. Entropy , 16:287–321, 2014.
