Equilibrium configurations of large nanostructures using the embedded saturated-fragments stochastic density functional theory
Eitam Arnon, Eran Rabani, Daniel Neuhauser, Roi Baer

TL;DR
This paper introduces an ab initio Langevin dynamics method based on stochastic density functional theory with an embedded saturated fragment formalism, enabling scalable simulations of large covalently bonded nanostructures like silicon nanocrystals.
Contribution
It develops a linear-scaling, stochastic DFT-based Langevin dynamics approach applicable to large covalent systems using an embedded fragment formalism.
Findings
Successfully simulated silicon nanocrystals up to 3 nm in diameter.
Generated canonical configurations across a range of temperatures.
Analyzed surface structure and geometry reconstruction of nanocrystals.
Abstract
An \emph{ab initio} Langevin dynamics approach is developed based on stochastic density functional theory (sDFT) within a new \emph{embedded saturated } \emph{fragment }formalism, applicable to covalently bonded systems. The forces on the nuclei generated by sDFT contain a random component natural to Langevin dynamics and its standard deviation is used to estimate the friction term on each atom by satisfying the fluctuation\textendash dissipation relation. The overall approach scales linearly with system size even if the density matrix is not local and is thus applicable to ordered as well as disordered extended systems. We implement the approach for a series of silicon nanocrystals (NCs) of varying size with a diameter of up to nm corresponding to electrons and generate a set of configurations that are distributed canonically at a fixed temperature, ranging fromâŚ
| NC | T(K) | |||||
| H | Si | |||||
| 30 | 0.12 | 0.04 | 120 | 1.2 | 1 | |
| 300 | 0.04 | 0.04 | 30 | 1.2 | 1 | |
| 30 | 0.12 | 0.04 | 120 | 1.2 | 2 | |
| 300 | 0.12 | 0.04 | 30 | 1.2 | 2 | |
| 30 | 0.12 | 0.04 | 120 | 1.2 | 10 | |
| 300 | 0.12 | 0.04 | 92 | 1.2 | 10 | |
| Shell | ||||
|---|---|---|---|---|
| 0 | 5.5 | 35 | 52 | |
| 5.5 | 8.5 | 113 | 158 | |
| 9.0 | 11.6 | 153 | 163 | |
| 11.6 | 13.6 | 200 | 189 | |
| 13.6 | 15.1 | 205 | 168 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Equilibrium configurations of large nanostructures using the embedded
saturated-fragments stochastic density functional theory
Eitam Arnon
Fritz Haber Center for Molecular Dynamics, Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
ââ
Eran Rabani
Department of Chemistry, University of California and Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, U.S.A.
The Raymond and Beverly Sackler Center for Computational Molecular and Materials Science, Tel Aviv University, Tel Aviv, Israel 69978
ââ
Daniel Neuhauser
Department of Chemistry, University of California at Los Angeles, CA-90095 USA
ââ
Roi Baer
Fritz Haber Center for Molecular Dynamics, Institute of Chemistry, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
Abstract
An ab initio Langevin dynamics approach is developed based on stochastic density functional theory (sDFT) within a new *embedded saturated * *fragment *formalism, applicable to covalently bonded systems. The forces on the nuclei generated by sDFT contain a random component natural to Langevin dynamics and its standard deviation is used to estimate the friction term on each atom by satisfying the fluctuationâdissipation relation. The overall approach scales linearly with system size even if the density matrix is not local and is thus applicable to ordered as well as disordered extended systems. We implement the approach for a series of silicon nanocrystals (NCs) of varying size with a diameter of up to nm corresponding to electrons and generate a set of configurations that are distributed canonically at a fixed temperature, ranging from cryogenic to room temperature. We also analyze the structure properties of the NCs and discuss the reconstruction of the surface geometry.
I Introduction
*Ab initio *molecular dynamics based on density functional theory (DFT) is becoming an important tool for studying the plethora of structural and dynamical processes in a broad range of systems in material science, chemistry, biology and physics.Car and Parrinello (1985); Barnett et al. (1991); Payne et al. (1992); Kresse and Hafner (1993); Tuckerman et al. (1995); Marx and Hutter (2000); Schlegel et al. (2001); Herbert and Head-Gordon (2005); Raty, Gygi, and Galli (2005); Bockstedte et al. (1997); Cawkwell, Niklasson, and Dattelbaum (2015) The application of this approach to very large systems is still limited by the computational scaling of the electronic structure portion of the calculation, regardless of whether one uses a Lagrangian-based or Born-Oppenheimer-based methods. This is because of the cubic scaling involved in solving the Kohn-Sham equations coupled with the need to iterate to self-consistency or to propagate the Kohn-Sham (KS) orbitals, as both of these options further increases the computational times by an order of magnitude.
Significant advances in these respects have been made along two major directions. One primary direction is based on a Lagrangian formulation of density functional theory Car and Parrinello (1985); Cawkwell, Niklasson, and Dattelbaum (2015) and circumvents the need for SCF iterations by propagation of the KS orbitals. This venue does not eliminate the cubic scaling and is therefore limited to relatively small systems. Another approach is based on linear-scaling techniques Goedecker (1999); Gillan et al. (2007); Baer and Head-Gordon (1997a); Soler et al. (2002), that reduces the algorithmic complexity by finding the density matrix directly, relying on its asymptotic sparseness in real-space. However, sparsity sets in only for very large systems, limiting the applicability sparse-matrix methods, especially in 3D.
In a recent set of papers we have introduced the stochastic DFT (sDFT) methods Baer, Neuhauser, and Rabani (2013); Neuhauser, Baer, and Rabani (2014); Gao et al. (2015); Neuhauser et al. (2015) which scales linearly (or even sublinearly) with the system size and does not rely on the sparsity of the density matrix. sDFT is a general approach to electronic structure based on a stochastic process and is applicable to extended ordered as well as disordered materials. Some of the techniques we use, based on the stochastic trace formula.Hutchinson (1990), have been developed for tight-binding electronic structure Drabold and Sankey (1993); Wang (1994); RÜder et al. (1997), for molecular electronics Baer et al. (2004) and for multi-exciton generation in nanocrsytals Baer and Rabani (2012). The success of sDFT in reducing the scaling comes at a price of introducing a stochastic error in all its predictions, including forces, and that precludes application to *ab initio *molecular dynamics.
In this paper we show that sDFT can be used to study equilibrium structural properties of large NCs, despite the statistical fluctuations in the force estimates. For this, we invoke the Langevin equation following the work of Attaccalite and Sorella,Attaccalite and Sorella (2008) and generate a sequence of configurations distributed according to the canonical ensemble. These configurations can be used in a variety of applications for studying the structural, electronic and optical properties of NCs. Here we demonstrate their use for studying the structural properties of silicon nanocrystals (NCs) with a diameter of up to  nm, and electrons.
The paper includes development of the embedded saturated fragments method which allows reducing the statistical errors in sDFT. This new method is inspired by, but more general than, the embedded fragments method developed in Ref. Neuhauser, Baer, and Rabani, 2014. It uses small saturated fragments of the system, and carves out the relevant part of the density to be embedded in the system. Hence, it is applicable not only to clusters of molecules, like Ref. Neuhauser, Baer, and Rabani, 2014, but also to covalently-bonded systems such as silicon NCs. The method is described in detail in Appendix A.
II Methods
II.1 Stochastic DFT
Kohn-Sham density functional theory Hohenberg and Kohn (1964); Kohn and Sham (1965) maps a system of interacting electrons in an external electron-nucleus potential , where () are the nuclei positions and are their charge ( is the electron charge), onto a system of *non-interacting *electrons (the KS system), having the same ground-state density . This mapping is performed by solving the KS equations Hohenberg and Kohn (1964); Kohn and Sham (1965)
[TABLE]
where the KS Hamiltonian is:
[TABLE]
and the KS potential is the sum of the external electron-nuclear potential , the density-dependent Hartree potential , and the exchange-correlation potential :
[TABLE]
In the KS system, the density is expressed in terms of the normalized single electron KS eigenstates and eigenvalues
[TABLE]
where is the Heaviside function and is the chemical potential chosen so that . Eqs. (1)-(4) must be solved self-consistently, since depends on the density. While the entire scheme is a significant simplification over the original many-electron problem, it remains a challenge for large systems since the computational effort scales as .
An important step towards reducing the computational scaling of KS-DFT was recently proposed by Baer*,* Neuhauser, and Rabani (BNR),Baer, Neuhauser, and Rabani (2013) where the density of Eq. (4) was expressed as a trace over the projected density operator:Baer, Neuhauser, and Rabani (2013)
[TABLE]
The problem now shifts into calculating self-consistently the trace in Eq. (5) (since depends on ) rather than solving the KS equations by brute-force diagonalization. When the trace is performed using the KS eigenstates, the computational cost remains similar to the traditional approach. However, since the trace is invariant to the basis, alternative schemes that potentially lead to improved scaling can be used. One such scheme is based on the concept of a stochastic trace formula, which reduces the scaling of the trace operation by introducing a controlled statistical error.Hutchinson (1990)
Using the stochastic trace formula, the density can be estimated as a symmetrized stochastic trace formula, given by:Baer, Neuhauser, and Rabani (2013)
[TABLE]
where denotes an average over stochastic orbitals , defined as:
[TABLE]
for each grid point , the parameter (not to be confused with the KS Hamiltonian operator) is the grid spacing, and are statistically independent random variables in the range (). The density is, strictly speaking, given by the limit and we approximate it with a finite . The Heaviside function in Eq. (6) is smoothed by the function , where is a large constant satisfying , where is the KS-DFT fundamental gap. Throughout this paper we set the value of to . The action of on is evaluated by a Chebyshev expansion in powers of the sparse KS Hamiltonian, .Tal-Ezer and Kosloff (1984) The length of the Chebyshev series is determined by the value of and where and are the minimal and maximal eigenvalues of . Under the conditions of the present systems, the length of the series is ~3000 terms.
The stochastic trace evaluation (Eq. 6) reduces the computational scaling of KS-DFT to and for certain properties even to a sub-linear scaling.Baer, Neuhauser, and Rabani (2013) Linear-scaling complexity is achieved due to the following facts:
- the application of a Hamiltonian to a stochastic orbital, requires a linear scaling effort (irrespective of the structure of the orbital); 2) The length of the Chebyshev series is independent (or at most weakly dependent) of system size and 3) Only a system-size independent number of stochastic orbitals are required. This type of assumptions is different from the linear-scaling approaches depending on density matrix sparsity,Baer and Head-Gordon (1997b); Goedecker (1999) which assume that locality of orbitals is not completely destroyed by the repeated operation of the Hamiltonian.
A converged self-consistent solution of Eq. (6) provides an estimate of the electron density and in addition can be used to generate other quantities, such as the density of states (DOS), the total energy per electron, and the forces acting on the nuclei. All estimates contain a statistical error that can be controlled by increasing the number of stochastic orbitals () used to evaluate the trace in Eq. (6). Of particular relevance to this work are the Cartesian forces exerted by the electrons on nuclei (), which can be evaluated through the Hellmann-Feynman theorem:Hellman (1937); Feynman (1939)
[TABLE]
It should be stressed that for finite sampling, these forces are only approximately commensurate with the stochastic estimate of the energy (which is not used in the sampling procedure at all), as discussed in Appendix B. The stochastic estimate of the Hellmann-Feynman forces is an excellent estimator of the deterministic forces, as can be seen in Fig. 1, where the mean absolute deviation is dominated by the fluctuations and not by additional bias terms.
These sDFT forces can be expressed as:
[TABLE]
where is the deterministic (generally unknown) force, is the pure fluctuating term, and is the bias expected to be proportional to in leading order. The choice of should be large enough to reduce to negligible values and the only source of error in the procedure is then the statistical fluctuations proportional to with vanishing mean ().
II.2 Embedded saturated fragments sDFT
The reduction of the scaling in sDFT is achieved by replacing the deterministic, numerically exact, trace evaluation with a stochastic sampling of the density. In return, this leads to statistical errors in the computed observables. To reduce the size of the statistical fluctuations, an embedded saturated fragments method is introduced inspired by (but different from) the method of Ref. Neuhauser, Baer, and Rabani, 2014 . The latter approach was suitable mainly for systems composed of proximate but chemically separated molecules (like clusters of water molecules, for examples). The present method is applicable for fragmenting covalently bonded systems, like silicon NCs.
In this approach, the system is divided into small fragments that are possibly overlapping. The division to fragments is flexible, and any desired physically motivated fragmentation can be used. The density is then a sum of the fragment density and a small correction term:
[TABLE]
where is the density generated by the individual fragments obtained from a deterministic KS-DFT calculation for each fragment and is a correction term evaluated using stochastic orbitals. Here, is given by Eq. (6) and is a sum over a *stochastic *estimate of the fragments density. In the limit , Eqs. (6) and (10) are identical and equal to the deterministic density. For finite values of , the size of the statistical fluctuations of the two approaches are quite different. Since the deterministic fragmented density, , provides a reasonable approximation for the full density , the correction term, , which is evaluated stochastically, is rather small, leading to a reduced variance in the relevant observables (forces, DOS, total energy per electron, etc.) compared to the direct stochastic approach of Eq. (6). An equivalent viewpoint is that the fragmentation is a device for reducing the variance in the stochastic evaluation of the density. This is evident by rewriting Eq. (10) in the following form
[TABLE]
and the implementation of this form is described in Appendix (A).
To assess the accuracy of the embedded saturated fragmented sDFT (efsDFT), we calculated the standard deviations (STDs) and mean absolute deviations with respect the deterministic DFT (MADs) of the atomic forces in a NC using hydrogen passivated fragments. The results are shown in Fig. 1.111All calculations in this work use real-space grids of spacing , Troullier-Martins norm-conserving pseudopotentials Troullier1991 within the Kleinman-Bylander approximation.Kleinman1982 Fast Fourier Transforms were used for applying the kinetic energy operator and for determining the Hartree potentials and the method of Ref. Martyna1999 was used for treating the long range Coulomb interactions in a finite simulation cell with periodic boundary conditions. DFT calculations were performed under the local density approximation (LDA). The STDs and MADs decrease as , indicating that the bias in the force estimation is negligible. The standard deviations in the sDFT forces are larger by a factor of compared to those of efsDFT. This implies that the required number of stochastic orbitals in efsDFT is nearly an order of magnitude smaller than in sDFT for similar STDs. The STDs can be further reduced by using larger fragments as discussed below (cf., Fig. 7).
II.3 Langevin dynamics based on
efsDFT
The standard approach to generate canonically distributed configurations using ab initio techniques is based on molecular dynamics, which requires as input accurate force estimates for each atomic degree of freedom. Since the forces generated by efsDFT contain a stochastic component, we use Langevin dynamics (LD) instead of molecular dynamics to sample configurations according to the Boltzmann distribution. A LD trajectory Allen and Tildesley (1987); Van Kampen (1992); Zwanzig (2001); Frenkel and Smit (2002) is a sequence of configurations at discrete âtimesâ , where is the time step, and and are the Cartesian coordinates and conjugate momenta, respectively, for the atoms, The trajectory is a solution of the Langevin equation (LE) of motion:Langevin (1908)
[TABLE]
where is the mass of the atom , is its friction constant, and is the total efsDFT force acting on it, including deterministic and fluctuating parts (see Eq. 9). The bias is assumed negligible, so that . In Eq. (12) is an additional uncorrelated white-noise force introduced so as to satisfy the fluctuation-dissipation (FD) relation. We require that the total random fluctuation on each atom obey:
[TABLE]
and
[TABLE]
where are atom indices, designates average over the atomic force distribution, is the unit matrix, and is the atomic force STD of atom , which is taken to satisfy the fluctuation-dissipation relation:
[TABLE]
We use the Verlet-like algorithm Grønbech-Jensen and Farago (2013) for numerically integrating the LE of motion at a fixed temperature and a predefined time-step . The positions and momenta in time step depend on the positions and momenta in time step as well as on the forces in time step and the additional white noise is sampled from a Gaussian distribution such that the discretized version of Eq. (13) holds: \left\langle\left(\boldsymbol{\eta}{}_{\alpha}^{m}+\boldsymbol{f}_{\alpha}^{\text{fluc}}\right)\otimes\left(\boldsymbol{\eta}{}_{\alpha^{\prime}}^{n}+\boldsymbol{f}_{\alpha^{\prime}}^{\text{fluc}}\right)\right\rangle\Delta t=\text{\boldsymbol{\text{I}}}\sigma_{\alpha}^{2}\delta_{\alpha\alpha^{\prime}}\delta_{mn}:
[TABLE]
where and . The algorithm allows for stable and accurate solutions of the LE with time step comparable to that used in molecular dynamics simulations for similar systems. It treats the additional white noise component of the force differently from the force that result from the efsDFT calculation containing deterministic and fluctuating components that cannot be separated.
In Fig. 2 we plot for the running average of the transient temperature, , calculated from the kinetic energy
[TABLE]
and from the virial estimator,
[TABLE]
In the above, is the time average of the coordinate of atom . The initial positions of the Si atoms were taken from the bulk values. All surface Si atoms with more than two dangling bonds were removed and the remaining surface Si atoms were passivated using one or two H atoms placed in a tetrahedral geometry at the Si-H distance of Ă . The momenta were sampled from a Boltzmann distribution at . This non-equilibrium initial configuration relaxes towards equilibrium.
The agreement in Fig. 2 between the two temperature estimators is consistent with a proper sampling of the canonical distribution of both positions and velocities. The small discrepancies at the longest averaging time are due to the large fluctuations of the transient temperature, particularly when using the virial estimator. We have also calculated the fluctuations in the kinetic energy and found good agreement with the corresponding analytical value (see caption of the figure). The two atomic species have a slightly deviations in the temperatures. These may result from several factors such as the finite timestep of the Langevin propagator Grønbech-Jensen and Farago (2013), incomplete SCF convergence,MartĂnez et al. (2015) insufficiently accurate estimate of the amount of white noise in Eq. (13) required to fulfill the fluctuation-dissipation relation.
II.4 Determining the optimal friction
The effect of on the configurational relaxation and on the velocity autocorrelation decay is illustrated in Fig. 3 for . In order to decrease the number of unknown parameters we set the values of and to be identical for all atoms of the same type (i.e. Si or H in the systems studied here). To achieve such a uniform value of we introduced white noise for each degree of freedom (see Eq. 13), where is estimated by a separate set of runs on the initial NC configuration using several independent sets of stochastic orbitals. Note, that we have tested that the magnitude of the sDFT force fluctuation is not sensitive to the particular configuration used.
As expected, the configurational relaxation time increases with increasing values of with the opposite trend for the decay time of the velocity autocorrelation function. Based on the results of presented in Fig. 3 we conclude that a friction coefficient of is sufficiently small for this system, with respect to minimizing both velocity and pair distance autocorrelation times. Although lower values of the friction coefficients could decrease the correlation time further, they would require reducing the statistical noise, which would be expensive to achieve using sDFT. Thus we chose the friction constants, for Silicon and for the lighter H atoms. These values were used for the larger systems described in the next section (see Table 1). Note that the results shown in Fig. 3, which were generated using LD under dDFT, could have been equally well generated under efsDFT. This is shown explicitly for (dotted red line) proving that the relaxation times are similar to those of the dDFT based LD calculation with the same value of .
II.5 Validation of LD within efsDFT
Validation of the structure obtained using efsDFT based LD is demonstrated using the pair distribution function .Allen and Tildesley (1987) For finite size NCs the average number of neighbors at a distance is expected to be smaller than the bulk value due to surface atoms with a smaller number of neighbors. Fig. 4 shows a close agreement between the dDFT and efsDFT based LD estimates of of the NC at two temperatures. The inset focuses on the first peak in , comparing the efsDFT to dDFT at and .
III Results
In the previous sections we presented the methods and assessed the accuracy and validity of the efsDFT based LD. Here we apply the method to study structural properties of larger NCs exceeding electrons. The Si-Si pair distribution functions at two temperatures ( and ) are displayed in the upper panel of Fig. 5 for and . Temperature broadens the peaks by a factor of without significantly changing the peak position.
In the lower panel of Fig. 5 we plot the transient and relaxed at for the two systems, focusing on the first, nearest neighbor peak. As described also for , the initial positions of the Si atoms for both systems were taken from the experimental bulk values and all surface Si atoms with more than two dangling bonds were removed. The remaining surface Si atoms were then passivated using one or two H atoms placed in a tetrahedral position at the Si-H distance of à . The initially sharp peak broadens and shifts to longer Si-Si bond lengths as the system relaxes towards thermal equilibrium. For , the relaxation times are and fs for and , respectively. For they are and fs respectively.
The relaxation transient is studied in greater detail in Fig.6, where the average nearest-neighbor bond lengths are shown for (spherical shells A-B) and (shells A-E); see Table 2 for the definition and properties of the shells. In the deep layer shells (A-D) relax slower than those near the surface showing that relaxation progresses from the surface inwards. The difference between the relaxation times of the two systems is correlated with the smaller frequency, , of the breathing mode of the larger NC. In the limit of an over damped motion (as is the case here since ), the relaxation is dominated by two timescales proportional to and . The former leads to a fast relaxation while the latter is slower and depends on the value of . The ratio of the breathing mode frequency for the two particles is (L/S for large/small) assuming that the breathing mode frequency scales linearly with the NC diameter.Ghavanloo et al. (2015) This is similar to the ratio of the relaxation times () for the lower temperature. At the higher temperature, one needs to consider anharmonic effects which are more pronounced in the large NC with lower acoustic phonons. Another noticeable feature in Fig. 6 is that the Si-Si bonds seem slightly shorter in than in Si_{705}$$H_{300}. This results from the difference in the bond distance of atoms in the outer shell, while the inner shell atoms have similar bond distances.
IV Conclusions
In this paper we developed an ab initio Langevin dynamics approach based on a new embedded saturated fragment stochastic DFT method. We showed how the noisy forces resulting from the efsDFT calculation are used to generate a set of configurations that are distributed canonically at cryogenic and room temperatures. By proper choice of the friction coefficients and the number of stochastic orbitals, thermalization is reached within time steps for these materials, since the method is trivially parallelizable, larger computer resources we allow to easily reduce the friction coefficients thus greatly improving the sampling efficiency. While the methods presented here have already allowed impressive achievements, such as determining structural properties of silicon NCs of diameter containing more than electrons, larger systems still, of unprecedented size, are now coming within our grasp due to the fact that linear-scaling highly parallelizable features of sDFT.
Acknowledgements.
E.R. acknowledges support from the Physical Chemistry of Inorganic Nanostructures Program, KC3103, Office of Basic Energy Sciences of the United States Department of Energy under Contract DE-AC02-05CH11232. D.N. acknowledge support by the NSF Grant DMR/BSF-1611382. R.B. acknowledges the US-Israel Binational Science foundation support under the BSF-NSF program, Grant 2015687.
Appendix A The embedded saturated fragments
approach
Here, we provide the technical details for the embedded fragment method described in Section II. The method corrects the stochastic estimate for the expectation value of a one-body operator , using calculations performed on separate fragments (see Eq. (11)):
[TABLE]
where the stochastic correction due to fragment is
[TABLE]
and the deterministic and the stochastic estimates, and are calculated directly on the fragment itself.
Previous implementations of embedded fragment sDFT were applied to systems of many weakly interacting molecules where the selection of fragments or clusters of such molecules was natural.Neuhauser, Baer, and Rabani (2014) We now describe a new method for defining and carrying calculations with fragments which can break up covalently bonded systems such as silicon NCs. The large system is divided into small fragments composed of one or more bonded atoms each. The surface dangling bonds of the fragment are passivated using a H atom placed in Ă from the Si atom, in the direction of the neighboring atom which is not included in the fragment. This forms a saturated fragment. For a saturated fragment , the deterministic KS-DFT method is applied to determine the KS eigenvalues and eigenfunctions . Further, occupation numbers are introduced for determining the saturated fragment density . The fragment density is âcarved outâ of using a carving function . Thus:
[TABLE]
where, inspired by Hirshfeld partitioning, Hirshfeld (1977) the carving function is defined as:
[TABLE]
where is the spherical density of neutral atom . The temperature parameter in the definition of the population is chosen be the same value as that of the sDFT calculation, while the chemical potential of each fragment is determined by the condition of neutrality of the fragment:
[TABLE]
Defining non-orthogonal functions , the fragment density of Eq. (20) becomes , so the chemical potential is determined from the condition:
[TABLE]
After determining and in order to construct the reduced density matrix (RDM), we orthogonalize the functions by diagonalizing the overlap matrix , obtaining the unitary matrix of eigenvectors and the eigenvalues (so that ). The orthogonal wavefunctions are: and the norm is . Using the new wave functions, the unsaturated fragment density is given by:
[TABLE]
and the RDM by
[TABLE]
Using the RDM we express the unsaturated fragment expectation value appearing in Eq. (19) as:
[TABLE]
where
[TABLE]
By choosing the fragment grid-points to be a subset of the full system grid, each stochastic orbital () of the full system appears as a stochastic orbital on the fragment grid and can be used to perform the stochastic estimate appearing in Eq. (19) as:
[TABLE]
where the subscript on the left denotes integration over the fragment grid. The difference in Eq. (19) can now be written in a unified form as:
[TABLE]
where:
[TABLE]
Hence, by calculating the matrix all types of expectation value corrections can be obtained from Eq. (24).
The efficacy of embedded fragments in sDFT force calculations is achieved through a reduction of the STD of a force component. The STD is proportional to , where is the number of stochastic orbitals and the proportionality constant, denoted , is called the *inherent *STD. This quantity depends on the NC characteristics but not on the number of stochastic orbitals. In Fig. 7 we plot the inherent STD on each atom for and as a function of fragment size. Even the use of the smallest fragments reduces the inherent force STD by a significant factor, (for ) to (for ). Using larger fragments reduces the STD by an additional factor of , with increasing effect for larger systems, since the electron density in the larger fragments is similar to that of the full system. It is interesting to see that for the forces there is no noticeable sublinear scaling: the inherent STD for both systems is similar, with the larger system having a slightly () STD.
In summary, the embedded fragment sDFT method serves as a way to expedite the sDFT calculation by a judicious choice of fragment size and composition. As the fragment size grows, the numerical effort invested in sDFT decreases (due to reduction of STD) while in dDFT it increases. For example, consider Fig. 7 where we showed that increasing the fragment size by a factor of reduces the STD by a factor of and therefore the sDFT CPU time by a factor of . On the other hand since the fragments are ten-fold larger, the amount of dDFT work on them increases (cubically) by a factor of more than. Clearly then, the optimal fragment size is system dependent. Embedded fragments have the additional benefit of providing an initial density for the SCF calculation, significantly reducing the number of SCF cycles.
Appendix B Stochastic estimates of the forces
and energy perturbations
In the stochastic method, the electronic density is (see Eq. (6)):
[TABLE]
where is the Chebyshev expansion of the projection operator, depending on and , on the occupied space of , and the energy is
[TABLE]
where is the Hartree-exchange-correlation energy functional, depending only on the electronic density . Under variation in position of nuclei :
[TABLE]
which using
[TABLE]
can be written as:
[TABLE]
The average of the second term on the right leads to the work of the Hellmann-Feynman force Eq. (8) while the first term can be shown to vanish when a full sampling is made on and when for then .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Car and Parrinello (1985) R. Car and M. Parrinello, Phys. Rev. Lett. 55 , 2471 (1985).
- 2Barnett et al. (1991) R. N. Barnett, U. Landman, A. Nitzan, and G. Rajagopal, J. Chem. Phys. 94 , 608 (1991).
- 3Payne et al. (1992) M. C. Payne, M. P. Teter, D. C. Allan, T. A. Arias, and J. D. Joannopoulos, Rev. Mod. Phys. 64 , 1045 (1992).
- 4Kresse and Hafner (1993) G. Kresse and J. Hafner, Phys. Rev. B 47 , 558 (1993) .
- 5Tuckerman et al. (1995) M. Tuckerman, K. Laasonen, M. Sprik, and M. Parrinello, The Journal of chemical physics 103 , 150 (1995).
- 6Marx and Hutter (2000) D. Marx and J. Hutter, âAb initio molecular dynamics: Theory and implementation,â in Modern Methods and Algorithms of Quantum Chemistry, Proceedings , NIC Series,, Vol. 3, edited by J. Grotendorst (John von Neumann Institute for Computing, Julich, 2000) p. 329.
- 7Schlegel et al. (2001) H. B. Schlegel, J. M. Millam, S. S. Iyengar, G. A. Voth, A. D. Daniels, G. E. Scuseria, and M. J. Frisch, The Journal of Chemical Physics 114 , 9758 (2001).
- 8Herbert and Head-Gordon (2005) J. M. Herbert and M. Head-Gordon, Phys. Chem. Chem. Phys. 7 , 3269 (2005).
