A complete OSV-MP2 analytical gradient theory for molecular structure and dynamics simulations
Ruiyi Zhou, Qiujiang Liang, Jun Yang

TL;DR
This paper introduces an exact algorithm for computing analytical gradients in OSV-MP2 theory, enabling accurate molecular structure and dynamics simulations with applications demonstrated on water cations and ethanol rotors.
Contribution
It presents the first exact gradient algorithm for OSV-MP2 with explicit OSV relaxation, improving accuracy and enabling molecular dynamics simulations.
Findings
High accuracy of gradients demonstrated on selected molecules
Successful BOMD simulations of water cations using OSV-MP2
Computed free energy surface of ethanol rotors
Abstract
We propose an exact algorithm for computing the analytical gradient within the framework of the orbital-specific-virtual (OSV) second-order M{\o}ller-Plesset (MP2) theory in resolution-of-identity (RI) approximation. We implement the exact relaxation of perturbed OSVs through the explicit constraints of the perturbed orthonormality, the perturbed diagonality and the perturbed singular value condition. We explicitly show that the rotation of OSVs within the retained OSV subspace makes no contribution to gradients, as long as the iterative solution of the unperturbed Hylleraas residual equation is well converged. The OSV relaxation is solved as the perturbed non-degenerate singular value problem between the retained and discarded OSV subspaces. The detailed derivation and preliminary implementations for gradient working equations are discussed. The coupled-perturbed localization method is…
| def2-SVP | def2-TZVPP | def2-QZVPP | ||
|---|---|---|---|---|
| ME | 0.071 | 0.097 | 0.151 | |
| MAE | 0.081 | 0.166 | 0.167 | |
| max | 0.380 | 0.570 | 0.620 | |
| ME | 0.009 | 0.013 | 0.016 | |
| MAE | 0.014 | 0.017 | 0.019 | |
| max | 0.050 | 0.070 | 0.080 | |
| ME | 0.000 | -0.002 | 0.005 | |
| MAE | 0.009 | 0.016 | 0.012 | |
| max | 0.040 | 0.080 | 0.060 |
| DTFS (Si-N) | RESVAN (S-S) | ||||
| Thresholda | r (pm) | rRI-MP2 (pm) | r (pm) | rRI-MP2 (pm) | |
| DLPNO-MP2b | Loose | 1.94 | 214.91b | 9.36 | 390.66b |
| Normal | 0.78 | 2.98 | |||
| Tight | 0.33 | 0.91 | |||
| OSV-MP2b | Loose | 0.70 | 211.80c | 6.90 | 385.80c |
| Normal | 0.40 | 2.70 | |||
| Tight | 0.00 | 0.80 | |||
| a Predefined in Figure 9. | |||||
| b DLPNO-MP2 results with frozen core approximation from Ref.70. | |||||
| c Our results without frozen core approximation. | |||||
| osv | def2-SVP | def2-TZVPP | def2-QZVPP | |
|---|---|---|---|---|
| MAE | 0.05 | 0.08 | 0.10 | |
| max | 1.10 | 0.80 | 1.00 | |
| MAE | 0.02 | 0.03 | 0.03 | |
| max | 0.10 | 0.10 | 0.20 | |
| MAE | 0.02 | 0.03 | 0.02 | |
| max | 0.10 | 0.30 | 0.20 |
| Basis set | Thresholda | RI-MP2 | OSV-MP2 | PNO-MP2b | DLPNO-MP2 |
|---|---|---|---|---|---|
| def2-SVP | Loose | 138.8 | 137.2 | - | -c |
| Normal | 138.7 | - | 137.5 | ||
| Tight | 138.7 | - | 137.6 | ||
| def2-TZVPP | Loose | 142.2 | 139.9 | 92.6 | -c |
| Normal | 142.0 | 143.2 | 138.9 | ||
| Tight | 142.2 | 142.3 | 139.3 | ||
| def2-QZVPP | Loose | 142.1 | 138.7 | 138.4 | -c |
| Normal | 141.9 | 141.3 | 139.2 | ||
| Tight | 142.1 | 140.6 | 139.5 | ||
| a Predefined in Figure 9. | |||||
| b PNO-MP2 results without PNOs relaxation from Ref.67. | |||||
| c DLPNO-MP2 reported not converged. | |||||
| Bond length | Bond angle | Dihedral angle | |
|---|---|---|---|
| MAE (pm) | MAE (∘) | AE (∘) | |
| 0.043 | 0.043 | 0.6 | |
| 0.021 | 0.028 | 0.3 | |
| 0.019 | 0.023 | 0.5 | |
| 0 | 0.019 | 0.025 | 0.2 |
| Molecules | RI-MP2 | DLPNO-MP2 | OSV-MP2 | Speedups | Percentages | ||
|---|---|---|---|---|---|---|---|
| (Gly)4 | 611 | 1502 | 2 | 7 | 6 | 0.4 | 99.96%, 99.96% |
| 49 | 28 | 14 | 3.5 | ||||
| (Gly)6 | 895 | 2200 | 14 | 14 | 15 | 0.9 | 99.96%, 99.95% |
| 163 | 65 | 40 | 4.1 | ||||
| (Gly)8 | 1179 | 2898 | 53 | 22 | 32 | 1.7 | 99.96%, 99.95% |
| 475 | 112 | 90 | 5.3 | ||||
| (Gly)10 | 1463 | 3596 | 145 | 30 | 65 | 2.2 | 99.95%, 99.95% |
| 1069 | 171 | 198 | 5.4 | ||||
| (Gly)12 | 1747 | 4294 | 355 | 39 | 112 | 3.2 | 99.95%, 99.95% |
| 2796 | 243 | 332 | 8.4 | ||||
| (Gly)14 | 2031 | 4992 | 751 | 52 | 194 | 3.9 | 99.95%, 99.95% |
| 5937 | 328 | 569 | 10.4 | ||||
| Nonactin | 1996 | 4912 | 598 | 135 | 198 | 3.0 | 99.91%, 99.89% |
| 4728 | 598 | 697 | 6.8 |
| Molecule | (K) | (kJ/mol) | RMSD (kJ/mol) | ||
|---|---|---|---|---|---|
| H9O | 0.000 | 149.3 | -0.45 | 0.41 | |
| 0.001 | 149.9 | -0.50 | 0.40 | ||
| 0.010 | 152.0 | 0.94 | 0.65 | ||
| 0.020 | 152.1 | 1.21 | 1.21 | ||
| 0.000 | 149.3 | 0.36 | 0.32 | ||
| 0.000 | 150.4 | 0.00 | 0.17 | ||
| 0.001 | 149.5 | -0.04 | 0.18 | ||
| 0.010 | 150.8 | 0.08 | 0.13 | ||
| 0.020 | 149.1 | 0.08 | 0.14 | ||
| H13O | 0.000 | 153.0 | -0.98 | 0.55 | |
| 0.001 | 191.1 | 50.97 | 16.14 | ||
| 0.000 | 146.6 | -0.02 | 0.21 | ||
| 0.000 | 148.5 | 0.06 | 0.22 | ||
| 0.001 | 149.2 | -0.04 | 0.25 | ||
| 0.010 | 150.3 | 1.57 | 0.62 | ||
| 0.020 | 150.0 | -0.53 | 0.34 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A complete OSV-MP2 analytical gradient theory for molecular structure and dynamics simulations
††Equal contributions
Ruiyi Zhou†
Qiujiang Liang†
Jun Yang
[
Abstract
We propose an exact algorithm for computing the analytical gradient within the framework of the orbital-specific-virtual (OSV) second-order Møller-Plesset (MP2) theory in resolution-of-identity (RI) approximation. We implement the relaxation of perturbed OSVs through the explicit constraints of the perturbed orthonormality, the perturbed diagonality and the perturbed eigenvalue condition. We show that the rotation of OSVs within the retained OSV subspace makes no contribution to gradients, as long as the unperturbed Hylleraas energy functional reaches minimum. The OSV relaxation is solved as the perturbed non-degenerate eigenvalue problem between the retained and discarded OSV subspaces. The detailed derivation and preliminary implementations for gradient working equations are discussed. The coupled-perturbed localization method is implemented for meta-Löwdin localization function. The numerical accuracy of computed OSV-MP2 gradients is demonstrated for the geometries of selected molecules that are often discussed in other theories. From OSV-MP2 with the normal OSV selection, the canonical RI-MP2/def2-TZVP gradients can be reproduced within a.u. The OSV-MP2/def2-TZVPP covalent bond lengths, angles and dihedral angles are in good agreement with canonical RI-MP2 structures by 0.017 pm, and , respectively. No particular accuracy gains have been observed for molecular geometries compared to the recent local pair-natural-orbital MP2 by using the predefined orbital domains. Moreover, the OSV-MP2 analytical gradients can generate atomic forces that are utilized to drive the Born-Oppenheimer molecular dynamics (BOMD) simulation for studying structural and vibrational properties with respect to OSV selections. By performing the OSV-MP2 BOMD calculation using the normal OSV selection, the structural and vibrational details of protonated water cations are well reproduced. The 200 picoseconds well-tempered metadynamics at 300 K has been simulated to compute the OSV-MP2 rotational free energy surface of coupled hydroxyl and methyl rotors for ethanol molecule.
The University of Hong Kong] Department of Chemistry, The University of Hong Kong, Hong Kong SAR, P.R. China
1 INTRODUCTION
Ab-initio electronic structure theory has been significantly progressed with many theoretical and algorithmic developments. Reduced-scaling post-Hartree-Fock methods are now capable of efficiently computing molecular systems of substantially increased size by managing trade-offs between the accuracy that can be achieved and the resource that can be accessed1, 2. The reduced-scaling techniques are often based on the unique strength of the spatial locality, i.e., the short-range behaviour of electron correlation that emerges as the size of a system increases. Many schemes have been devised and implemented to compute the energies of large molecules by operating on a sufficiently accurate and reduced subset of of Hilbert space in which an approximate wavefunction can be efficiently represented, manipulated and stored3, 4, 5, 6.
The locality of dynamic electron correlation was introduced by Pulay7 and initially implemented by Sæbø8, 9, 10. This has led to a fruitful variety of wavefunction representations by which the unphysical steep computational scaling can be drastically diminished. Notably, a hierarchy of Møller-Plesset perturbation and coupled-cluster (CC) methods has been developed by employing projected atomic orbitals (PAO) by Werner, Schütz and coworkers 11, 12, 13, 14, 15, 16, pair-nature-orbitals (PNOs) pioneered by Meyer et al. 17, 18 and revitalized by Neese19, and orbital-specific-virtuals (OSVs) by Chan 20, 21, 22, 23. By construction, the PNO and OSV are both inherently local and specific to a single orbital pair and orbital, respectively. The hybrid near-linear-scaling PNO-MP2 and PNO-CCSD schemes by mixing PAO/OSV/PNO have been demonstrated to further reduce the number of PNOs that are required to compress the cluster operators by employing PAO or OSV as an intermediate stage 24, 25, 26, 27, 28. The hybrid PNO schemes ensure the most compact virtual space for recovering a certain percentage of correlation energy. In addition, explicitly correlated CCSD(T) methods in the PNO framework have been developed to reduce basis set error 29, 30, 31, 32, 33, 34. Open-shell PNO-CCSD35, PNO variants of state-specific multi-reference perturbation and CC theories 36, 37, 38, 39, 40, as well as PNO-based EOM-CC241/CCSD42, 43, CIS(D)44, ADC(2)-x45 for excited states in both state-specific and state-average approaches have also been implemented and demonstrated.
A wide range of chemistry problems, such as molecular geometries, reaction pathways, thermal and spectroscopic properties, and so on, involves the physical motion of atoms. In essence, these molecular properties require an efficient computation of analytical energy gradients46, 47 with respect to relaxations of molecular orbitals and/or other parameters of a deterministic electronic wavefunction. Apparently, analytical gradient techniques are highly specific to the way in which wavefunction of the system is constructed. In the past decades, for instance, for computing analytical gradients with manageable cost-accuracy balance, the implementations have adopted very different reduced-scaling strategies for the variants of MP2 48, 6, 49, 50, 51, 52, 53, 54 and CC methods 55, 56, 57, 58, 59, 60, 61.
For applications to large molecules, the PAO-based analytical gradients have been established for the local MP262, 63, CC264 and CCSD65 models. In the context of the more recent PNO and OSV schemes, the implementation of PNO- or OSV-based analytical gradients is more limited, primarily due to the complexity of PNO or OSV related approximations. The performance of the simulated PAO-, PNO- and OSV-based CCSD for non-resonant optical properties was assessed by Crawford and coworker using linear response theory66. The PNO-MP2 and PNO-CCSD analytical energy gradients were developed by Hättig67 and Neese68, respectively, without accounting for the relaxation of PNOs. Most recently, the PNO relaxation problem was circumvented for the domain-based local PNO-MP2 (DLPNO-MP2) by enforcing a block-diagonal semi-canonical external pair density matrix which assumes zero off-blocks between the retained and discarded PNO orbitals69, 70, making the PNO-MP2 energy invariant to the rotation among the kept PNO orbitals.
In the present work, we turn our attention to developing the exact OSV-MP2 analytical gradient theory for its much simpler way of constructing OSVs relative to the hybrid PNOs in DLPNO-MP2 model. Here, we implement the OSV relaxation explicitly as a perturbed eigenvalue problem by using both orthonormality and eigenvalue conditions for perturbed OSVs. We also show that the degenerate eigenvalue issue that may break OSV relaxation69, 70 does not occur for reasons that will be described in our formalism and implementation. The resulting OSV relaxation vectors may be further scrutinized in a way that their intrinsic sparsity can be explored to pre-select important OSV relaxations making most contributions to OSV-MP2 gradients. In addition, OSV-MP2 was shown to produce smooth potential energy curve with respect to molecular structures even with small pair domains20. This is essential to efficient simulations of Born-Oppenheimer molecular dynamics (BOMD)71.
The paper is organized as follows. In Sec. II we describe the details of our OSV-MP2 analytical gradient theory and implementation. We implemented the algorithm in a standalone Python program, and PYSCF72 has been used for obtaining the one- and two-electron integrals, their derivatives and the RHF reference wavefunction. In Sec. III we compute the optimized molecular structures and assess the accuracy of the OSV-MP2 analytical gradients with respect to the selection parameters for OSVs and orbital pairs. The results are also compared with canonical MP2 and DLPNO-MP2 results available in the literature. In Sec. IV, we carry out the ab-initio BOMD simulations driven by OSV-MP2 analytical gradients. Illustrative applications of OSV-MP2 metadynamics are demonstrated to protonated water cations and ethanol molecule. Here our focus is to find out whether the errors due to the OSV approximations for cost-accuracy trade-off have any significance in reproducing the energy, structural and vibrational details by BOMD simulations at a finite-temperature. Our work is concluded in Sec. V.
2 THEORY AND IMPLEMENTATION
2.1 OSV-MP2 wavefunction
In the previous work by one of the authors, the OSV-based single-reference local MP2, CCSD and CCSD(T) methods20, 22, 23 were developed. In this section, we briefly review the algorithm for introducing the notations relevant to the OSV-MP2 gradient algorithm. We use to denote the occupied localized molecular orbitals (LMOs), canonical virtual MOs, the OSVs associated with an occupied MO , while and pertain to generic indices of MOs and atomic orbitals (AOs), respectively. Here LMOs refer to the spatial orbital basis. The bra-ket symbol is used to evaluate the matrix trace through the discussion.
In OSV ansätz, a sparse structure of the amplitudes and the first-order wavefunction can be explored by constructing a compact virtual space in a transformative OSV adaption to the occupied space by associating a set of OSVs with each occupied orbital ,
[TABLE]
The compactness of the OSV space is determined by the tensorial character of the transformation matrix for each occupied orbital. An excellent yet simple choice20 of is to require its column vector to be the orthonormal eigenvector of the MP2 diagonal pair amplitudes for each by performing the diagonalization,
[TABLE]
with the orthonormality . According to the magnitude of eigenvalues , a single parameter is utilized as a measure to select a set of OSVs pertaining to each occupied orbital by which is solved efficiently without losing too much accuracy. The elements of in Eq. (2) are computed as
[TABLE]
are the diagonal elements of the Fock matrix.
The OSV wavefunction and amplitudes are associated with a collated excitation manifold in which the occupied orbital excites to its own OSV set () as well as the exchange set (),
[TABLE]
The doubly excited configuration is built through the spin-free excitations operator in terms of the creation and annihilation operators for all spins acting on the zero-order wavefunction . The OSV amplitudes are computed iteratively by solving the residual equation for an pair,
[TABLE]
In the OSV basis, , and denote the two-electron integrals, overlap and Fock matrices for an pair, respectively. is adopted to represent a generic composite matrix assembled between and elements as needed. In essence, is a projection of from the canonical virtual MOs to OSVs basis
[TABLE]
Since is hermitian in canonical MO basis, permuting and pairs yields the self-adjoint property of ,
[TABLE]
In the OSV basis, the MP2 Hylleraas73 correlation energy has the following form of Lagrangian,
[TABLE]
This energy Lagrangian essentially imposes the vanishing residual condition with the corresponding multiplier . An elimination of the linear dependency in the OSV-concatenated pair domain is essential for solving , and can be effectively carried out by preconditioning in a transformation made by nonredundant vectors20.
2.2 Perturbed OSVs and relaxation
2.2.1 OSV orbital rotation
The OSVs are defined as the eigenvectors of the semi-canonical MP2 diagonal pair amplitude associated with a specific occupied orbital , as given in Eqs. (1) and (2). Upon a perturbation acting on the system, the perturbed OSVs can be expanded exactly in a linear combination of the complete unperturbed OSV basis , with the unknown combination coefficient matrix that must be specific to the occupied orbital as well,
[TABLE]
The exact OSV relaxation is thus given in terms of the relaxation matrix
[TABLE]
Given the perturbation , the perturbed OSV amplitudes must fulfill the perturbed residual equation , analogous to Eq. (5). The perturbed quantity of Eq. (6) exhibits a dependence on the perturbation and can be evaluated with reference to the unperturbed ,
[TABLE]
Using the OSV relaxation matrix in Eq. (10), the OSV derivative in is therefore
[TABLE]
with the curly brackets specifying the derivatives of OSVs accounting for the OSV relaxation. Here we introduce an OSV pair-specific relaxation matrix for pair in a block diagonal form,
[TABLE]
The perturbed OSVs for each orbital must be always orthonormal
[TABLE]
which implies that the OSV relaxation matrix must be antisymmetric,
[TABLE]
2.2.2 OSV relaxation as perturbed non-degenerate eigenvalue problem
Assuming real values of the antisymmetric , all diagonal elements of the OSV relaxation matrix must vanish
[TABLE]
Now we discuss an approach in which the off-diagonal can be explicitly solved based on the perturbation analysis74, 75 to the perturbed eigenvalue problem as
[TABLE]
with the diagonal eigenvalue matrix. Differentiating the above equation, we arrive at
[TABLE]
Multiplying onto both sides and using the OSV orthonormality, there is
[TABLE]
The derivative gives the relaxation of semi-canonical MP2 diagonal amplitudes upon a perturbation. However, since the canonicality and does not necessarily hold and in fact is not required in general for a perturbed Fock matrix, can not be evaluated directly by taking the derivative of Eq. (3). Instead, it must be computed by differentiating the MP2 residual equation assuming the generic Fock matrix for a diagonal pair, which leads to the following expression
[TABLE]
Above, canonical and are composed of the derivatives with respect to both AOs () and MOs [] of the exchange integral and Fock matrix, respectively, for instances,
[TABLE]
Here the MO-specific derivatives and are given later according to Eqs. (43).
The diagonal part of Eq. (19) yields the relaxation of eigenvalues,
[TABLE]
When has all distinct eigenvalues, the off-diagonal part of Eq. (19) leads to the OSV relaxation matrix , expressed in Hadamard product below
[TABLE]
where . And the pair-specific relaxation matrix is
[TABLE]
with
[TABLE]
Therefore the computation of the off-diagonal element of requires only the first derivative of matrix.
We can prove (c.f. S2 in Supporting Information) that the gradient of OSV-MP2 energy of Eq. (8) is invariant with the rotations among all retained OSVs ,
[TABLE]
As long as this invariance holds, the orbital rotation must be made between the discarded and kept OSVs belonging to the subsets of different eigenvalues. Therefore the non-degenerate formalism of Eq. (25) is precisely applicable to .
The eigenvalue matrix can be understood as the projection of the semi-canonical MP2 diagonal amplitude in the OSV basis, which is diagonal and uniquely defined for each orbital. As we can show (c.f. S3 in Supporting Information), the relaxation must always remain rigorously diagonal as
[TABLE]
With correct through the first-order expansion, we conclude then that the perturbed OSV-projected amplitudes must be diagonal as well between subspaces belonging to different eigenvalues. This imposed diagonal constraint, similar to the canonical condition of Hartree-Fock gradients, has some convenience, for example, of allowing in principle different (usually smaller) OSV gradient domains from original energy domains for more efficient gradient computation, which will be the subject of our future work.
2.3 OSV-MP2 analytical gradient theory
The analytical gradient of the OSV-MP2 correlation energy with respect to a perturbation (eg, an atomic position displacement) can be computed in terms of the derivatives of both and in the OSV basis
[TABLE]
whereas the amplitudes make no contribution as they are simply variational to . It is obvious that the derivatives and must be jointly determined through the responses of the OSVs, LMOs and AOs. The MP2 energy gradient of Eq. (30) thus consists of the relaxation contributions from OSVs (), MOs () and AOs (), respectively,
[TABLE]
2.3.1 OSV-specific energy gradient
The OSV-specific energy gradient is determined by
[TABLE]
which requires the OSV derivatives of the quantities associated with , , and pairs, such as the exchange integral , overlap and the OSV block of the Fock matrix , according to Eq. (12). Here the intermediate is specific to the pair , arising from the residual contribution in the second term of Eq. (5),
[TABLE]
The OSV-OSV blocks of the unrelaxed overlap- and energy-weighted density matrices are hermitian and defined as and , respectively,
[TABLE]
[TABLE]
Since the gradients are invariant with the rotations among all kept OSVs , and must involve the discarded OSVs at the dimensions attached to , while the amplitudes and remain within the retained OSV subspace.
The evaluation of employs the non-degenerate formalism using all pairs of distinct eigenvalues associated with the discarded and retained OSV subspaces, respectively. Substituting Eq. (25), can be rewritten as
[TABLE]
with
[TABLE]
and is given in Eq. (27). The numerical stability of computed can be demonstrated by illustrating the maximum element of for each orbital in Figure 1. To this end, we choose N2 and C6H6 which own high symmetry and thus a larger number of near-degenerate eigenvalues of the semi-canonical MP2 diagonal amplitudes. As seen in the insets of Figure 1, it is evident that the large values of due to the vanishingly small difference are largely compensated by , which in fact yields smooth analytical gradients, without instability hurdles in practice.
2.3.2 MO-specific energy gradient
The sparse structure of the OSV-MP2 amplitudes is most favorably exploited with the locality of LMOs. The occupied canonical MOs are localized using Pipek-Mezey (PM)76 with meta-Löwdin atomic charges for their good transferability in different molecular environments created by the variation of atomic positions77. For evaluating the meta-Löwdin charges, the core and valence orbitals are distinguished based on the locality of the predefined NAO (natural atomic orbital), and then Löwdin-orthogonalized within their own space. The localization procedure introduces a new transformation matrix that transforms the occupied canonical MOs into the orthonormal LMOs , which must hold as well for a system under the perturbation ,
[TABLE]
with the orthonormal condition . The LMO response therefore arises from both derivative contributions of and ,
[TABLE]
The coupled-perturbed localization (CPL) described in Ref.62 for PM localization function and the coupled-perturbed Hartree-Fock equations are solved to determine and , respectively. However, neither nor is explicitly computed or stored in our implementation for reasons of computational efficiency, and their contributions are merged into the OSV-based Z-vector equation.
As seen in Eqs. (5) and (8), apparently the MO-specific is determined by the quantities that involve the derivatives with respect to LMOs and canonical virtual MOs, i.e., the derivatives of the exchange integral and the Fock matrix,
[TABLE]
The occupied-occupied elements of the unrelaxed density matrix is
[TABLE]
According to Eq. (6), we have
[TABLE]
where
[TABLE]
with superscripts for the MO derivatives. Substituting Eqs. (42)–(43) into Eq. (40) and utilizing the particle permutation symmetry, we arrive at the MO-specific energy gradient
[TABLE]
Above, and are associated with the relaxation of one virtual MO,
[TABLE]
and are the derivatives with respect to one of the LMOs, respectively, which can be evaluated according to Eq. (39),
[TABLE]
[TABLE]
In Eqs. (53) and (54), the symmetric block of is transformed into LMOs, and depends solely on the AO derivative of overlap matrix according to the MO orthonormal condition. However, the off-diagonal block of accounts for the rotation of MOs between the occupied and virtual spaces, which is solved in the OSV Z-vector approach.
2.3.3 AO-specific energy gradient
simply evaluates the energy expression of Eq. (8) in terms of AO derivative integrals, the occupied-occupied block (Eq. (41)) and OSV-OSV block (Eq. (34)) of the unrelaxed density matrices,
[TABLE]
The OSV overlap \mathbf{S}_{(ij,kl)}=\left({\begin{array}[]{c}\mathbf{Q}_{i}^{\dagger}\\ \mathbf{Q}_{j}^{\dagger}\end{array}}\right)\left({\begin{array}[]{cc}\mathbf{Q}_{k}&\mathbf{Q}_{l}\end{array}}\right) makes no contribution here to the AO-specific energy gradient. The corresponding two- and one-electron derivative integrals for an pair are computed using their AO derivative integrals, including the AO derivatives of the exchange integral matrix , the OSV-OSV block of the Fock matrix , and the occupied-occupied Fock elements .
2.4 Implementation scheme
Computing the OSV-, MO- and AO-specific two-electron contributions to the OSV-MP2 energy gradient according to Eqs. (32), (51) and (55) would be straightforward with yet unfortunately very demanding expenses. The primary bottleneck originates from the evaluation and transformation of the subsumed exchange integral and the AO/MO derivatives , and involving more than two virtual MO indices. Both computational storage and operation costs increase rapidly with sizes of molecule. Significant savings can be achieved by employing the resolution of identity (RI) technique 78, 79. In the present work, RI approximate exchange integrals and their derivatives are implemented in adaption to OSV basis for accelerated evaluation and transformation. According to the RI scheme in the Coulomb metric, the four-center two-electron (4c2e) integral is approximated as a simple product of the lower-rank three-center two-electron (3c2e) integrals and , specific to each LMO and , respectively,
[TABLE]
with the 3c2e matrix element in terms of a set of auxiliary basis functions , and denotes the Coulomb metric matrix
[TABLE]
In the following, we use and for the occupied and virtual blocks, respectively.
In our OSV-MP2 gradient formulation, we must however deal with the integrals and in both kept and discarded OSV basis for treating OSV relaxation. The number of these integrals for all pairs grows as , and the storage becomes rather unfavorable for large molecules if they are explicitly computed. To avoid such high storage costs, we have exploited an implementation in which the 3c2e MO integrals are transformed into an intermediate accounting for two-electron contributions to the OSV-MP2 gradient from both MO and OSV rotations,
[TABLE]
where
[TABLE]
with the symbols and denoting the upper and lower diagonal blocks. are computed and accessed on the fly for each pair. The one-index transformations made in are carried out with the kept () and discarded () OSV orbitals. Both , AO derivative and are of the row dimension and column dimension , and can be conveniently stored on disk as their total number grows as , forming no major obstacle for a usual range of molecular sizes. In our implementation, the dominant formal operation scales as for computing and for , where and are the number of the kept and discarded OSVs, respectively. Nevertheless, when working with reasonably selected OSVs and pairs for a good accuracy-cost balance, the actual computational cost can be reduced to .
By combining , and , our working equation for evaluating the OSV-MP2 energy gradient can be written in terms of the AO-derivatives of Fock ( and ), overlap ( and ) and 3c2e integral () matrices,
[TABLE]
The unrelaxed () and relaxed () density matrices are utilized in MO basis,
[TABLE]
and the energy-weighted unrelaxed () and relaxed () density matrices are,
[TABLE]
[TABLE]
The fourth term needs the matrix,
[TABLE]
as well as matrix that are obtained by solving the following linear CPL equation for PM localization constraint,
[TABLE]
Finally, of the last term in Eq. (LABEL:eq:ec1e2e) collects all AO-derivatives in the Fock and overlap matrices, compuated only once and for all,
[TABLE]
where
[TABLE]
for which the two-electron integrals are evaluated with RI approximation. The remaining Z-vector must be solved in the other linear equation
[TABLE]
The source term takes the form below,
[TABLE]
Finally, the explicit mathematical forms of the intermediates , and are specified in Eqs. (28)–(30) in Ref.62, and thus will not be repeated here.
3 APPLICATIONS TO MOLECULAR STRUCTURES
3.1 Accuracy of OSV-MP2 analytical gradients
The correctness of our implementation has been examined by comparing the OSV-MP2 analytical gradients with OSV-MP2 numerical gradients for N2 and water clusters (H2O)n (). The root mean square deviations (RMSDs) of the gradient differences are about – a.u. for various OSV selections (, and ).
To assess the convergence of OSV-MP2 gradients with respect to the OSV selection thresholds, the RMSDs between the gradients of OSV-MP2 and RI-MP2 reference are presented in Figure 6 for molecules of varying sizes and bonding types in the Baker test set80. As shown in Figure 6(a), the average RMSDs among all computed molecules are , and for , and , respectively. For , the RMSDs range from – for smaller molecules (the molecule number lower than 15), and increase to about – for larger molecules.
The effect of the OSV relaxation is illustrated in Figs. 6(b)-(d) by comparing the OSV-MP2 gradients computed with and without OSV relaxation. The OSV-MP2 analytical gradient without OSV relaxation merely considers the MO- and AO-specific gradient contributions described in Secs. 2.3.2 and 2.3.3. It is obvious that the inclusion of the OSV relaxation considerably reduces the RMSDs by an order of magnitude. For instance, with (Figure 6(c)), the average RMSDs decrease from around to . Nevertheless, the exclusion of OSV relaxations appears less significant when more OSVs are selected according to (Figure 6(d)) by which the resulting gradient RMSDs are less than , virtually comparable to results with (Figure 6(c)).
The gradient RMSDs of OSV-MP2 are compared with those of DLPNO-MP2 available in a recent publication70. To be as consistent as possible with the corresponding PNO thresholds (, and ), we adopted the OSV threshold as for comparison since the PNOs are chosen according to eigenvalues of semi-canonical pair density matrices, that is about the squared eigenvalues of the associated semi-canonical amplitudes. Nonetheless, a rigorous accuracy comparison between DLPNO-MP2 and OSV-MP2 is difficult, since at the same level of truncation (e.g, vs ) nondiagonal pair amplitudes are represented in a much more compact basis in the DLPNO approach than in the OSV approach.
As seen in Figure 6(b) by comparing loose OSVs and PNOs, the RMSDs of two methods are generally similar especially for larger molecules, yet with marginally better performance for OSV-MP2 than DLPNO-MP2 for smaller molecules. For / in Figure 6(d), the RMSDs of OSV-MP2 are remarkably smaller than those of DLPNO-MP2. We note that benzidine ( molecule 29) is peculiar here for DLPNO-MP2 with an RMSD above even using . The OSV-MP2 analytical gradient however yields no significant RMSDs which are consistently below and for and .
3.2 Optimized molecular structures
Bond lengths
The statistical errors of OSV-MP2 bond lengths relative to the reference data of RI-MP2 are summarized in Table 1. For all basis sets, tighter OSV thresholds lead to decreased errors of bond lengths. Notably, the OSV-MP2 optimization with is sufficiently accurate and increasing basis set sizes only slightly increases MAEs. However, the calculations with yields much larger errors.
In Figure 9(a), the MAEs of bond lengths with different basis sets and OSV/PNO selection thresholds are compared between OSV-MP2, PNO-MP2 and DLPNO-MP2. DLPNO-MP2 yields lower MAEs than OSV-MP2 with the loose threshold for all basis sets, but is overtaken by OSV-MP2 with tighter thresholds. For def2-TZVPP and loose threshold, the MAE for OSV-MP2 is significantly lower than PNO-MP2 by around 0.3 pm, but larger than DLPNO-MP2. The performances of the three methods are comparable for normal and tight calculations.
By repeating the OSV-MP2 geometry optimization, the long interatomic distances of noncovalent bonds have been examined in Table 2. We have chosen DTFS and RESVAN molecules out of LB12 set for which the DLPNO-MP2 optimized Si-N and S-S bond distances report quite large errors in Ref.70. All electrons are correlated in OSV-MP2 calculations, and both OSV-MP2 geometries are well converged for three OSV-MP2 thresholds. It is observed that is necessary in order to reduce the errors below 1.0 pm. However, appears to be sufficient for achieving relative deviations below , which is acceptable for such long bond distances. In general, the OSV-MP2 outperforms DLPNO-MP2 for loose selection, and both methods are comparable for normal and tight selections.
Bond and dihedral angles
The errors of bond angles are reported in Table 3 for selected Baker’s test molecules according to the specification in Ref.67. In general, the MAEs are smaller than for all values in combination with all basis sets. results in relatively large maximum errors about . Both and substantially reduce the maximum errors by about an order of magnitude and are recommended for accurate structure optimizations. The performances of OSV-MP2, PNO-MP2 and DLPNO-MP2 in bond angles are compared in Figure 9(b). Most notably PNO-MP2 without the PNO relaxation yields larger errors than OSV-MP2 and DLPNO-MP2, in particular for def2-TZVPP and def2-QZVPP basis sets. The performances of OSV-MP2 and DLPNO-MP2 are similar with normal and tight thresholds.
The dihedral angles of benzidine molecule are compared between OSV-MP2, PNO-MP2 and DLPNO-MP2 in Table 4. Overall, OSV-MP2 performs much better than PNO-MP2 and DLPNO-MP2 for all thresholds and basis sets, and the deviations from RI-MP2 dihedral angles are less than for OSV-MP2/normal and OSV-MP2/tight.
Performance with pair screening
The use of pair screening can considerably accelerate the OSV-MP2 calculations by discarding the pairs of occupied orbitals that make little contribution to the total correlation energy. By exploring the orbital locality and the definition of OSVs, the OSV overlap matrix elements associated with a pair exhibit an exponential decay with the separation between and . Therefore the relevant pairs entering OSV-MP2 calculations are chosen according to the previous simple scheme20 in which the renormalized OSV overlap matrix is computed for a given pair and compared to a predefined pair screening threshold . When a looser (greater value) is used, more orbital pairs will be screened and not participate in the OSV-MP2 energy and gradient computation. The MAEs of bond lengths, bond angles and dihedral angles are reported with respect to in Table 5. It is shown that the MAEs at are similar to those without pair screening for both bond lengths and angles. However, there is a significant increase of MAEs as is increased from to . Interestingly, for dihedral angles, the errors for all thresholds are less than .
Timing comparison
We finally compare the elapsed times between RI-MP2, DLPNO-MP2 and OSV-MP2 for both energy and gradient evaluations on a single CPU. For all molecules considered in Table 6, our current OSV-MP2 implementation achieves speedups of 3-10 folds for gradients and 0.4-4.0 for energies compared to RI-MP2, respectively. In particular, the OSV-MP2 gradient computation is faster than RI-MP2 by an order of magnitude for the longest molecule (Gly)14. For Nonactin molecule similar to (Gly)14 in size, OSV-MP2 gradient calculation exhibits a poorer speedup than (Gly)14 due to more kept pairs of Nonactin (6737 out of 20100 pairs) than (Gly)14 (4218 out 23220 pairs), since apparently the pair screening is less effective to the cyclic Nonactin structure than the linear (Gly)14. The average pair domain sizes of Nonactin and (Gly)14 are similar, i.e., both own 96 OSVs, which is much larger than DLPNO pair domains (about 17-20 PNOs). On the other hand, DLPNO-MP2 retains 14937 and 8192 pairs for (Gly)14 and Nonactin, respectively, that are much larger than those of OSV-MP2. Moreover, the OSV-MP2 energy and gradient scalings are and , respectively, as shown in Figure S1. This is still higher than DLPNO-MP2 and leads to longer elapsed time than DLPNO-MP2 by nearly 2 folds for large glycine chains. However, for smaller molecules, OSV-MP2 gradient computation can be 2 times faster than DLPNO-MP2, which makes it attractive for driving efficient BOMD simulations on molecules of similar size.
4 OSV-MP2-DRIVEN AB-INITIO BOMD
4.1 Protonated Eigen and Zundel water cations
We have performed the constant simulation for protonated Eigen (H9O4+) and Zundel water cluster (H13O6+). They are not only structural units of biological and chemical significance, but also the benchmark systems that have been extensively used to establish accuracy of other theories.
Energy drifts
In OSV-MP2 simulation, the OSV-MP2 approximated trajectories propagate according to the numerical integration over a finite time step which may break the energy conservation by a range of drifts at long simulation time. Therefore such drifts must be examined carefully with respect to both OSV and pair selections. The results are reported in Table 7 for benchmarking OSV-MP2 BOMD accuracy. When no pairs are screened (), all energy drifts are very small. The total energies of all OSV-MP2/10 ps trajectories with are conserved within 1.0 kJ/mol, the energy drifts are substantially reduced with by two and one orders of magnitude for H9O4+ and H13O6+, respectively. The RMSDs, which measure the time-dependent energy fluctuation statistically, are as small as half kJ/mol for and – kJ/mol for . The difference of the computed between and is about 1 K for H9O4+ and 5 K for H13O6+, respectively.
Table 7 suggests that the use of pair screenings yields larger statistical errors than the OSV selection. Nevertheless, a proper combination of selected and can produce results of acceptable accuracy. For instance, for , the choice of the medium pair screening does not lead to significant shifts of energy (both and RMSD) and temperature. However, with and , the energy conservation is not well sustained. As seen in Figure S2, with more pair screenings for Zundel cluster, the simulation after about 4.5 ps leads to a hotter Zundel cation by 1 kJ/mol, probably arising from a more drastic change of the number of the kept pairs with time.
Radial distribution function (RDF)
The trajectory specification of computing RDFs of the O-O and O-H distances was adopted according to the description of Ref.83. As seen in Figure 14, the OSV-MP2 BOMD calculations with are capable of retrieving all O-O and O-H structural details including the RDF landscape and peak positions for both Eigen and Zundel clusters, and also in excellent agreement with the canonical MP2 BOMD reference results83. However, for Zundel cluster, the calculations with the loose OSV selection do not well resolve two innermost peaks of the O-O RDF at about Å (Zundel-like O-O distance) and Å (Eigen-like O-O distance), but rather predict a more dominating Eigen-like solvation shell. It is demonstrated in Figure S3 that the pair screenings, when combined with the normal OSV selection , have little effects on the RDF landscapes yet with a small broadening of the RDF peaks at longer O-H and O-O distances by increasing .
Vibrational density of states (VDOS)
Vibrational density of states are computed as the Fourier transform of the velocity autocorrelation function according to the Ref.83. However, our initial structures are generated from RI-MP2 optimization, with the momentum corresponding to 300 K. We compare the computed VDOS spectra in Figures 17 and S4. The positions of significant peaks can be hardly affected by the OSV selection and pair screening. In particular, for Eigen cluster in Figures 17 (a) and S4(a), the weak peaks at about 3000 cm*-1* representing the proton stretch mode are well reproduced84 in all OSV-MP2 BOMD calculations. For Zundel cluster in Figures 17 (b) and S4(b), the two peaks of medium intensity around 4000 cm*-1* are clearly resolved. However, the peak intensities are largely influenced by the combined and . For instance, the peaks at both low and high frequency regions are relatively intensified by decreasing . On the other hand, a large pair screening appears to substantially weaken the 4000 cm*-1* peak at the lower frequency side.
4.2 Rotational free energy of ethanol
The OSV-MP2/cc-pvTZ simulations were carried out for computing the rotational free energies of the coupled hydroxyl and methyl groups in ethanol molecule at 300 K. The simulation features thermal energy exchange which may compensate the electronic energy loss due to selected OSVs through adding a thermostat into Hamiltonian for coupling the system and reservoir. This thus opens up the feasibility of making OSV-MP2 BOMD available for simulating systems at a finite temperature. However, the detailed investigation on the interplay between the thermal coupling and the OSV selection is not the subject of this work and will be probed in future applications. In the current work, the Nosé-Hoover thermostat was employed with the temperature coupling time constant of 100 fs. The simulation temperature is conserved within a drift of only -0.051 K for and .
Ethanol can exist in two conformers, the trans-ethanol with the hydroxyl group trans to the methyl group, and the gauche-ethanol with the hydroxyl group gauche to the methyl group. The gauche-ethanol stability computed by single point DFT is close to the trans-ethanol by, for instances, 0.01 kcal/mol for B3LYP/cc-pVTZ85 and -0.08 kcal/mol PBE-TS/cc-pVTZ86. Our OSV-MP2/cc-pVTZ simulation predicts that the trans-ethanol conformer is more stable than the gauche-ethanol conformer by 0.22 kcal/mol, as shown in Figure 18. The OSV-MP2/cc-pVTZ also finds the free energy barriers of 1.00 kcal/mol and 0.62 kcal/mol to the hydroxyl rotation and trans-to-gauge transformation, respectively. Recently, Chmiela et al.86, 87 reported that the corresponding CCSD(T) barriers are 0.11 kcal/mol, 1.30 kcal/mol and 1.18 kcal/mol, by training the symmetrized gradient-domain machine learning (sGDML) model for the CCSD(T) force field in MD simulations. It was observed however that the gauche is more stable than the trans by repeating the same calculation with sGDML@DFT(PBE-TS)86. Compared to sGDML@CCSD(T), our OSV-MP2 simulation seems to underestimate the energy level of the transition state for tran-to-gauche transformation by 0.56 kcal/mol. This disagreement may be ascribed to the difference of the levels in describing electron correlations between MP2 and CCSD(T) methods. Nevertheless, single point calculations88 corresponding to 0 K show that the energy barriers are in fact similar between MP2 and CCSD(T), with differences of only a few hundredth kcal/mol. Therefore it remains a question whether such a subtle difference of electron correlation between MP2 and CCSD(T) for ethanol has any significance due to a thermal fluctuation of kcal/mol at 300 K. More importantly, we realize that in our computational setting for metadynamics simulation, a relatively large time constant of 100 fs was used in order to achieve a small temperature drift ( K) and avoid poor coupling in a long time equilibration. However, this inevitably results in a more wild distribution of Nosé-Hoover frequencies and thus a larger thermal fluctuation. More detailed studies on this issue within the OSV-MP2 framework are underway.
5 CONCLUSIONS
In this work, we have described the algorithm and implementation for analytically computing the energy derivatives from all OSV-MP2 energy contributions with local molecular orbitals. We have shown that it is possible to evaluate the OSV relaxation by explicitly solving non-degenerate perturbed eigenvalue problem in which exact OSV rotations can be implemented between the retained and discarded OSV subspaces. The simplicity of the OSV construction leads to the block-diagonal structure of pair-specific OSV relaxation matrix which decouples OSV rotations within a single orbital pair. The solution of pair-specific OSV relaxation elements enters the source of a single Z-vector equation along with the MO relaxation and the localization constraint, as solved in a conventional way that is independent of the degrees of freedom.
The accuracy of this approach has been benchmarked on a set of well studied molecules for optimized geometries and molecular dynamics simulations. The OSV relaxation effects are significant and can be recovered with the normal OSV selection for practical use of reproducing canonical RI-MP2 molecular structures. Moreover, the classical molecular dynamics with OSV-MP2 input gradients has been implemented. It has been demonstrated that using a normal OSV selection, all major peaks of the O-O/O-H radial distribution functions and vibrational densities of states for protonated water tetramer and hexamer can be well identified. A 200 ps well-tempered metadynamics simulation with OSV-MP2 gradients at 300 K has been shown to be capable of distinguishing the gauche and trans conformers of ethanol molecule.
There is much to explore for improving the current implementation by noting the aspects as follows. (1) Solving the Z-vector equation and two-electron integral transformation therein in MO basis become one bottleneck step for large molecules. (2) The evaluation of energy gradients through Eq. (LABEL:eq:ec1e2e) does not yet take advantage of OSV savings and therefore scales quickly with system sizes. (3) Embarrassing parallelization schemes seem obvious within the OSV-MP2 framework by distributing local orbitals over many processes. (4) Finally, the OSV-MP2 gradient computation is currently much slower than OSV-MP2 energy by 3-4 folds. An appropriate scheme for pruning out insignificant OSV relaxations and associated pairs shall further speed up gradient computation. The efforts along these directions are being made and will be reported in future.
{acknowledgement}
J.Y. acknowledges financial supports from the Hong Kong Research Grant Council (RGC) Early Career Scheme (ECS) through Grant No. ECS27307517, and the Hui’s fund provided by department of chemistry at the University of Hong Kong. R.Y.Z. thanks Prof. Roberto Car for hosting his summer research and valuable discussions. J.Y. thanks Dr. Qiming Sun for general assistance in PySCF package.
{suppinfo}
The file Supporting supporting.pdf contains further results of the computations and is available free of charge.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Zaleśny et al. 2011 Zaleśny, R.; Papadopoulos, M. G.; Mezey, P. G.; Leszczynski, J. Linear-Scaling Techniques in Computational Chemistry and Physics: Methods and Applications ; Springer Science & Business Media, 2011; Vol. 13
- 2Gordon 2017 Gordon, M. S. Fragmentation: Toward Accurate Calculations on Complex Molecular Systems ; John Wiley & Sons, 2017
- 3Martinez and Carter 1994 Martinez, T. J.; Carter, E. A. Pseudospectral Møller–Plesset perturbation theory through third order. J. Chem. Phys. 1994 , 100 , 3631–3638
- 4Ayala and Scuseria 1999 Ayala, P. Y.; Scuseria, G. E. Linear scaling second-order Møller–Plesset theory in the atomic orbital basis for large molecular systems. J. Chem. Phys. 1999 , 110 , 3660–3671
- 5Reynolds et al. 1996 Reynolds, G.; Martinez, T. J.; Carter, E. A. Local weak pairs spectral and pseudospectral singles and doubles configuration interaction. J. Chem. Phys. 1996 , 105 , 6455–6470
- 6Lee et al. 2000 Lee, M. S.; Maslen, P. E.; Head-Gordon, M. Closely approximating second-order Mo/ller–Plesset perturbation theory with a local triatomics in molecules model. J. Chem. Phys. 2000 , 112 , 3592–3601
- 7Pulay 1983 Pulay, P. Localizability of dynamic electron correlation. Chem. Phys. Lett. 1983 , 100 , 151–154
- 8Sæbø and Pulay 1985 Sæbø, S.; Pulay, P. Local configuration interaction: An efficient approach for larger molecules. Chem. Phys. Lett. 1985 , 113 , 13–18
