A complete OSV-MP2 analytical gradient theory for molecular structure   and dynamics simulations

Ruiyi Zhou; Qiujiang Liang; Jun Yang

arXiv:1908.03674·physics.chem-ph·December 17, 2019

A complete OSV-MP2 analytical gradient theory for molecular structure and dynamics simulations

Ruiyi Zhou, Qiujiang Liang, Jun Yang

PDF

TL;DR

This paper introduces an exact algorithm for computing analytical gradients in OSV-MP2 theory, enabling accurate molecular structure and dynamics simulations with applications demonstrated on water cations and ethanol rotors.

Contribution

It presents the first exact gradient algorithm for OSV-MP2 with explicit OSV relaxation, improving accuracy and enabling molecular dynamics simulations.

Findings

01

High accuracy of gradients demonstrated on selected molecules

02

Successful BOMD simulations of water cations using OSV-MP2

03

Computed free energy surface of ethanol rotors

Abstract

We propose an exact algorithm for computing the analytical gradient within the framework of the orbital-specific-virtual (OSV) second-order M{\o}ller-Plesset (MP2) theory in resolution-of-identity (RI) approximation. We implement the exact relaxation of perturbed OSVs through the explicit constraints of the perturbed orthonormality, the perturbed diagonality and the perturbed singular value condition. We explicitly show that the rotation of OSVs within the retained OSV subspace makes no contribution to gradients, as long as the iterative solution of the unperturbed Hylleraas residual equation is well converged. The OSV relaxation is solved as the perturbed non-degenerate singular value problem between the retained and discarded OSV subspaces. The detailed derivation and preliminary implementations for gradient working equations are discussed. The coupled-perturbed localization method is…

Tables7

Table 1. Table 1: Mean error (ME), mean absolute error (MAE) and the maximum error (max) in bond lengths (pm) of selected Baker test molecules.

$l_{o s v}$		def2-SVP	def2-TZVPP	def2-QZVPP
$10^{- 3}$	ME	0.071	0.097	0.151
	MAE	0.081	0.166	0.167
	max	0.380	0.570	0.620
$10^{- 4}$	ME	0.009	0.013	0.016
	MAE	0.014	0.017	0.019
	max	0.050	0.070	0.080
$10^{- 5}$	ME	0.000	-0.002	0.005
	MAE	0.009	0.016	0.012
	max	0.040	0.080	0.060

Table 2. Table 2: Comparisons of the optimized interatomic distances in DTFS and RESVAN molecules between OSV-MP2 and DLPNO-MP2. The long noncovalent interatomic distances are specified in Ref. 70

^a Predefined in Figure 9.
		DTFS (Si-N)		RESVAN (S-S)
	Threshold^a	$Δ$ r (pm)	r_RI-MP2 (pm)	$Δ$ r (pm)	r_RI-MP2 (pm)
DLPNO-MP2^b	Loose	1.94	214.91^b	9.36	390.66^b
	Normal	0.78		2.98
	Tight	0.33		0.91
OSV-MP2^b	Loose	0.70	211.80^c	6.90	385.80^c
	Normal	0.40		2.70
	Tight	0.00		0.80
^b DLPNO-MP2 results with frozen core approximation from Ref.⁷⁰.
^c Our results without frozen core approximation.

Table 3. Table 3: Mean absolute error (MAE) and the maximum error (max) in bond angles ( ∘ ) of selected Baker test molecules.

$l$ _osv		def2-SVP	def2-TZVPP	def2-QZVPP
$10^{- 3}$	MAE	0.05	0.08	0.10
	max	1.10	0.80	1.00
$10^{- 4}$	MAE	0.02	0.03	0.03
	max	0.10	0.10	0.20
$10^{- 5}$	MAE	0.02	0.03	0.02
	max	0.10	0.30	0.20

Table 4. Table 4: Dihedral angles of benzidine for OSV-MP2, PNO-MP2 and DLPNO-MP2 with the basis sets of def2-TZVPP and def2-QZVPP.

^a Predefined in Figure 9.
Basis set	Threshold^a	RI-MP2	OSV-MP2	PNO-MP2^b	DLPNO-MP2
def2-SVP	Loose	138.8	137.2	-	-^c
	Normal		138.7	-	137.5
	Tight		138.7	-	137.6
def2-TZVPP	Loose	142.2	139.9	92.6	-^c
	Normal		142.0	143.2	138.9
	Tight		142.2	142.3	139.3
def2-QZVPP	Loose	142.1	138.7	138.4	-^c
	Normal		141.9	141.3	139.2
	Tight		142.1	140.6	139.5
^b PNO-MP2 results without PNOs relaxation from Ref.⁶⁷.
^c DLPNO-MP2 reported not converged.

Table 5. Table 5: Mean absolute error (MAE) of bond lengths, bond angles and absolute error (AE) of dihedral angles with respect to pair screening thresholds.

	Bond length	Bond angle	Dihedral angle
$l_{p a i r}$	MAE (pm)	MAE (^∘)	AE (^∘)
$10^{- 2}$	0.043	0.043	0.6
$10^{- 3}$	0.021	0.028	0.3
$10^{- 4}$	0.019	0.023	0.5
0	0.019	0.025	0.2

Table 6. Table 6: Timing comparisons for polyglycine chains (Gly) n and Nonactin (C 40 H 64 O 12 ) with respect to the number of basis ( N 𝑁 N ) and auxiliary functions ( N a u x subscript 𝑁 𝑎 𝑢 𝑥 N_{aux} ) between RI-MP2 and OSV-MP2 computations. Elapsed times (minutes) of a single-point energy and gradient are reported for each molecule in the first and second row entry, respectively. Percentages of correlation energy recovery are reported for DLPNO-MP2 and OSV-MP2 as the first and second element in the last Column, respectively. These calculations were carried out serially on a single CPU (Intel Xeon E5-2640 [email protected]) with def2-TZVP basis. l o s v = 10 − 4 subscript 𝑙 𝑜 𝑠 𝑣 superscript 10 4 l_{osv}=10^{-4} and l p a i r = 10 − 3 subscript 𝑙 𝑝 𝑎 𝑖 𝑟 superscript 10 3 l_{pair}=10^{-3} were used for OSV selection and pair screening. RI-MP2 was performed on the quantum chemistry program ORCA 82 . For DLPNO-MP2/RIJCOSX, TCutPNO= 10 − 8 superscript 10 8 10^{-8} and TCutDO= 10 − 2 superscript 10 2 10^{-2} were set for constructing PNOs and associated PAO domains with Foster-Boys localization 81 , respectively.

Molecules	$N$	$N_{a u x}$	RI-MP2	DLPNO-MP2	OSV-MP2	Speedups	Percentages
(Gly)₄	611	1502	2	7	6	0.4	99.96%, 99.96%
			49	28	14	3.5
(Gly)₆	895	2200	14	14	15	0.9	99.96%, 99.95%
			163	65	40	4.1
(Gly)₈	1179	2898	53	22	32	1.7	99.96%, 99.95%
			475	112	90	5.3
(Gly)₁₀	1463	3596	145	30	65	2.2	99.95%, 99.95%
			1069	171	198	5.4
(Gly)₁₂	1747	4294	355	39	112	3.2	99.95%, 99.95%
			2796	243	332	8.4
(Gly)₁₄	2031	4992	751	52	194	3.9	99.95%, 99.95%
			5937	328	569	10.4
Nonactin	1996	4912	598	135	198	3.0	99.91%, 99.89%
			4728	598	697	6.8

Table 7. Table 7: Comparison of statistical energy conservation properties with respect to OSV and pair selections for N V E 𝑁 𝑉 𝐸 NVE /6-31+g(d,p) simulation. T av subscript 𝑇 av T_{\mathrm{av}} is the average temperature computed according to the equipartition theorem for the average kinetic energy. δ E 𝛿 𝐸 \delta E is the energy drift that is the difference of the linear least-square fit to all energies at the first and last time step. RMSDs are given among all energies relative to this linear fit.

Molecule	$l_{o s v}$	$l_{p a i r}$	$T_{av}$ (K)	$δ E$ (kJ/mol)	RMSD (kJ/mol)
H₉O ${}^{+}_{4}$	$10^{- 3}$	0.000	149.3	-0.45	0.41
	$10^{- 3}$	0.001	149.9	-0.50	0.40
	$10^{- 3}$	0.010	152.0	0.94	0.65
	$10^{- 3}$	0.020	152.1	1.21	1.21
	$5.0 \times 10^{- 4}$	0.000	149.3	0.36	0.32
	$10^{- 4}$	0.000	150.4	0.00	0.17
	$10^{- 4}$	0.001	149.5	-0.04	0.18
	$10^{- 4}$	0.010	150.8	0.08	0.13
	$10^{- 4}$	0.020	149.1	0.08	0.14
H₁₃O ${}^{+}_{6}$	$10^{- 3}$	0.000	153.0	-0.98	0.55
	$10^{- 3}$	0.001	191.1	50.97	16.14
	$5.0 \times 10^{- 4}$	0.000	146.6	-0.02	0.21
	$10^{- 4}$	0.000	148.5	0.06	0.22
	$10^{- 4}$	0.001	149.2	-0.04	0.25
	$10^{- 4}$	0.010	150.3	1.57	0.62
	$10^{- 4}$	0.020	150.0	-0.53	0.34

Equations124

∣ \overset{μ}{ˉ}_{k} ⟩ = a \sum Q_{a \overset{μ}{ˉ}}^{k} ∣ a ⟩

∣ \overset{μ}{ˉ}_{k} ⟩ = a \sum Q_{a \overset{μ}{ˉ}}^{k} ∣ a ⟩

[Q_{k}^{†} T_{k k} Q_{k}]_{\overset{μ}{ˉ}_{k} \overset{ν}{ˉ}_{k}} = ω_{\overset{μ}{ˉ}_{k}} δ_{\overset{μ}{ˉ} \overset{ν}{ˉ}},

[Q_{k}^{†} T_{k k} Q_{k}]_{\overset{μ}{ˉ}_{k} \overset{ν}{ˉ}_{k}} = ω_{\overset{μ}{ˉ}_{k}} δ_{\overset{μ}{ˉ} \overset{ν}{ˉ}},

[T_{k k}]_{ab} = \frac{( k a ∣ k b )}{f _{aa} + f _{bb} - 2 f _{k k}} .

[T_{k k}]_{ab} = \frac{( k a ∣ k b )}{f _{aa} + f _{bb} - 2 f _{k k}} .

\lvert\mathbf{\Phi}_{(ij,ij)}\rangle=\left({\begin{array}[]{cc}\lvert\Phi_{ij}^{\bar{\mu}_{i}\bar{\nu}_{i}}\rangle&\lvert\Phi_{ij}^{\bar{\mu}_{i}\bar{\xi}_{j}}\rangle\\ \lvert\Phi_{ij}^{\bar{\sigma}_{j}\bar{\nu}_{i}}\rangle&\lvert\Phi_{ij}^{\bar{\sigma}_{j}\bar{\xi}_{j}}\rangle\end{array}}\right)^{\mathrm{T}},\hfill\mathbf{T}_{(ij,ij)}=\left({\begin{array}[]{cc}t_{ij}^{\bar{\mu}_{i}\bar{\nu}_{i}}&t_{ij}^{\bar{\mu}_{i}\bar{\xi}_{j}}\\ t_{ij}^{\bar{\sigma}_{j}\bar{\nu}_{i}}&t_{ij}^{\bar{\sigma}_{j}\bar{\xi}_{j}}\end{array}}\right).

\lvert\mathbf{\Phi}_{(ij,ij)}\rangle=\left({\begin{array}[]{cc}\lvert\Phi_{ij}^{\bar{\mu}_{i}\bar{\nu}_{i}}\rangle&\lvert\Phi_{ij}^{\bar{\mu}_{i}\bar{\xi}_{j}}\rangle\\ \lvert\Phi_{ij}^{\bar{\sigma}_{j}\bar{\nu}_{i}}\rangle&\lvert\Phi_{ij}^{\bar{\sigma}_{j}\bar{\xi}_{j}}\rangle\end{array}}\right)^{\mathrm{T}},\hfill\mathbf{T}_{(ij,ij)}=\left({\begin{array}[]{cc}t_{ij}^{\bar{\mu}_{i}\bar{\nu}_{i}}&t_{ij}^{\bar{\mu}_{i}\bar{\xi}_{j}}\\ t_{ij}^{\bar{\sigma}_{j}\bar{\nu}_{i}}&t_{ij}^{\bar{\sigma}_{j}\bar{\xi}_{j}}\end{array}}\right).

R_{(ij, ij)} = K_{(ij, ij)} + k \sum S_{(ij, ik)} T_{(ik, ik)} [δ_{k j} F_{(ik, ij)} - f_{k j} S_{(ik, ij)}] + [δ_{ik} F_{(ij, k j)} - f_{ik} S_{(ij, k j)}] T_{(k j, k j)} S_{(k j, ij)} .

R_{(ij, ij)} = K_{(ij, ij)} + k \sum S_{(ij, ik)} T_{(ik, ik)} [δ_{k j} F_{(ik, ij)} - f_{k j} S_{(ik, ij)}] + [δ_{ik} F_{(ij, k j)} - f_{ik} S_{(ij, k j)}] T_{(k j, k j)} S_{(k j, ij)} .

\mathbf{A}_{(ij,kl)}=\left({\begin{array}[]{c}\mathbf{Q}_{i}^{\dagger}\\ \mathbf{Q}_{j}^{\dagger}\end{array}}\right)\mathbf{A}\left({\begin{array}[]{cc}\mathbf{Q}_{k}&\mathbf{Q}_{l}\end{array}}\right).

\mathbf{A}_{(ij,kl)}=\left({\begin{array}[]{c}\mathbf{Q}_{i}^{\dagger}\\ \mathbf{Q}_{j}^{\dagger}\end{array}}\right)\mathbf{A}\left({\begin{array}[]{cc}\mathbf{Q}_{k}&\mathbf{Q}_{l}\end{array}}\right).

A_{(ij, k l)}^{†} = A_{(k l, ij)} .

A_{(ij, k l)}^{†} = A_{(k l, ij)} .

E_{c} = ij \sum ⟨ K_{(ij, ij)} \overline{T}_{(ij, ij)} ⟩ + ⟨ R_{(ij, ij)} \overline{T}_{(ij, ij)} ⟩ .

E_{c} = ij \sum ⟨ K_{(ij, ij)} \overline{T}_{(ij, ij)} ⟩ + ⟨ R_{(ij, ij)} \overline{T}_{(ij, ij)} ⟩ .

Q_{k} (λ) = Q_{k}^{0} O_{k} (λ) .

Q_{k} (λ) = Q_{k}^{0} O_{k} (λ) .

Q_{k}^{λ} = Q_{k}^{0} O_{k}^{λ} .

Q_{k}^{λ} = Q_{k}^{0} O_{k}^{λ} .

\mathbf{A}_{(ij,kl)}(\lambda)=\left({\begin{array}[]{c}\mathbf{O}_{i}^{\dagger}\\ \mathbf{O}_{j}^{\dagger}\end{array}}\right)\mathbf{A}_{(ij,kl)}^{0}\left({\begin{array}[]{cc}\mathbf{O}_{k}&\mathbf{O}_{l}\end{array}}\right).

\mathbf{A}_{(ij,kl)}(\lambda)=\left({\begin{array}[]{c}\mathbf{O}_{i}^{\dagger}\\ \mathbf{O}_{j}^{\dagger}\end{array}}\right)\mathbf{A}_{(ij,kl)}^{0}\left({\begin{array}[]{cc}\mathbf{O}_{k}&\mathbf{O}_{l}\end{array}}\right).

A_{(ij, k l)}^{{λ}} = O_{ij}^{† λ} A_{(ij, k l)}^{0} + A_{(ij, k l)}^{0} O_{k l}^{λ}

A_{(ij, k l)}^{{λ}} = O_{ij}^{† λ} A_{(ij, k l)}^{0} + A_{(ij, k l)}^{0} O_{k l}^{λ}

O_{k l}^{λ} = d ia g (O_{k}^{λ}, O_{l}^{λ})

O_{k l}^{λ} = d ia g (O_{k}^{λ}, O_{l}^{λ})

Q_{k}^{†} (λ) Q_{k} (λ) = 1

Q_{k}^{†} (λ) Q_{k} (λ) = 1

O_{k}^{† λ} + O_{k}^{λ} = 0 .

O_{k}^{† λ} + O_{k}^{λ} = 0 .

[O_{k}^{λ}]_{\overset{μ}{ˉ} \overset{μ}{ˉ}} = 0.

[O_{k}^{λ}]_{\overset{μ}{ˉ} \overset{μ}{ˉ}} = 0.

T_{k k} (λ) Q_{k} (λ) = Q_{k} (λ) Ω_{k} (λ) .

T_{k k} (λ) Q_{k} (λ) = Q_{k} (λ) Ω_{k} (λ) .

T_{k k}^{λ} Q_{k}^{0} + T_{k k}^{0} Q_{k}^{0} O_{k}^{λ} = Q_{k}^{0} Ω_{k}^{λ} + Q_{k}^{0} O_{k}^{λ} Ω_{k}^{0} .

T_{k k}^{λ} Q_{k}^{0} + T_{k k}^{0} Q_{k}^{0} O_{k}^{λ} = Q_{k}^{0} Ω_{k}^{λ} + Q_{k}^{0} O_{k}^{λ} Ω_{k}^{0} .

Q_{k}^{0 †} T_{k k}^{λ} Q_{k}^{0} + Ω_{k}^{0} O_{k}^{λ} = Ω_{k}^{λ} + O_{k}^{λ} Ω_{k}^{0} .

Q_{k}^{0 †} T_{k k}^{λ} Q_{k}^{0} + Ω_{k}^{0} O_{k}^{λ} = Ω_{k}^{λ} + O_{k}^{λ} Ω_{k}^{0} .

[T_{k k}^{λ}]_{ab} = \frac{[ K _{k k}^{λ} + T _{k k} F ^{λ} + F ^{λ} T _{k k} - 2 T _{k k} f _{k k}^{λ} ] _{ab}}{f _{aa} + f _{bb} - 2 f _{k k}} .

[T_{k k}^{λ}]_{ab} = \frac{[ K _{k k}^{λ} + T _{k k} F ^{λ} + F ^{λ} T _{k k} - 2 T _{k k} f _{k k}^{λ} ] _{ab}}{f _{aa} + f _{bb} - 2 f _{k k}} .

K_{k k}^{λ}

K_{k k}^{λ}

F^{λ}

f_{k k}^{λ}

ω_{\overset{μ}{ˉ}_{k}}^{λ} = [Q_{k}^{†} T_{k k}^{λ} Q_{k}]_{\overset{μ}{ˉ} \overset{μ}{ˉ}} .

ω_{\overset{μ}{ˉ}_{k}}^{λ} = [Q_{k}^{†} T_{k k}^{λ} Q_{k}]_{\overset{μ}{ˉ} \overset{μ}{ˉ}} .

O_{k}^{λ} = Δ G_{k} \circ [Q_{k}^{†} T_{k k}^{λ} Q_{k}]

O_{k}^{λ} = Δ G_{k} \circ [Q_{k}^{†} T_{k k}^{λ} Q_{k}]

O_{k l}^{λ} = Δ G_{k l} \circ d ia g (Q_{k}^{†} T_{k k}^{λ} Q_{k}, Q_{l}^{†} T_{l l}^{λ} Q_{l})

O_{k l}^{λ} = Δ G_{k l} \circ d ia g (Q_{k}^{†} T_{k k}^{λ} Q_{k}, Q_{l}^{†} T_{l l}^{λ} Q_{l})

Δ G_{k l} = d ia g (Δ G_{k}, Δ G_{l}) .

Δ G_{k l} = d ia g (Δ G_{k}, Δ G_{l}) .

\frac{\partial E _{c}^{λ}}{\partial [ O _{k}^{λ} ] _{\overset{μ}{ˉ} \overset{ν}{ˉ}}} = 0.

\frac{\partial E _{c}^{λ}}{\partial [ O _{k}^{λ} ] _{\overset{μ}{ˉ} \overset{ν}{ˉ}}} = 0.

Ω_{k}^{λ} = d ia g (ω_{1}^{λ}, ω_{2}^{λ}, \dots) .

Ω_{k}^{λ} = d ia g (ω_{1}^{λ}, ω_{2}^{λ}, \dots) .

E_{c}^{λ} = \frac{d E _{c}}{d λ} = ij \sum ⟨ K_{(ij, ij)}^{λ} \overline{T}_{(ij, ij)} ⟩ + ⟨ R_{(ij, ij)}^{λ} \overline{T}_{(ij, ij)} ⟩,

E_{c}^{λ} = \frac{d E _{c}}{d λ} = ij \sum ⟨ K_{(ij, ij)}^{λ} \overline{T}_{(ij, ij)} ⟩ + ⟨ R_{(ij, ij)}^{λ} \overline{T}_{(ij, ij)} ⟩,

E_{c}^{λ} = E_{c}^{{λ}} + E_{c}^{[λ]} + E_{c}^{(λ)} .

E_{c}^{λ} = E_{c}^{{λ}} + E_{c}^{[λ]} + E_{c}^{(λ)} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A complete OSV-MP2 analytical gradient theory for molecular structure and dynamics simulations

††Equal contributions

Ruiyi Zhou†

Qiujiang Liang†

Jun Yang

[email protected]

[

Abstract

We propose an exact algorithm for computing the analytical gradient within the framework of the orbital-specific-virtual (OSV) second-order Møller-Plesset (MP2) theory in resolution-of-identity (RI) approximation. We implement the relaxation of perturbed OSVs through the explicit constraints of the perturbed orthonormality, the perturbed diagonality and the perturbed eigenvalue condition. We show that the rotation of OSVs within the retained OSV subspace makes no contribution to gradients, as long as the unperturbed Hylleraas energy functional reaches minimum. The OSV relaxation is solved as the perturbed non-degenerate eigenvalue problem between the retained and discarded OSV subspaces. The detailed derivation and preliminary implementations for gradient working equations are discussed. The coupled-perturbed localization method is implemented for meta-Löwdin localization function. The numerical accuracy of computed OSV-MP2 gradients is demonstrated for the geometries of selected molecules that are often discussed in other theories. From OSV-MP2 with the normal OSV selection, the canonical RI-MP2/def2-TZVP gradients can be reproduced within $10^{-4}$ a.u. The OSV-MP2/def2-TZVPP covalent bond lengths, angles and dihedral angles are in good agreement with canonical RI-MP2 structures by 0.017 pm, $0.03^{\circ}$ and $0.2^{\circ}$ , respectively. No particular accuracy gains have been observed for molecular geometries compared to the recent local pair-natural-orbital MP2 by using the predefined orbital domains. Moreover, the OSV-MP2 analytical gradients can generate atomic forces that are utilized to drive the Born-Oppenheimer molecular dynamics (BOMD) simulation for studying structural and vibrational properties with respect to OSV selections. By performing the OSV-MP2 $NVE$ BOMD calculation using the normal OSV selection, the structural and vibrational details of protonated water cations are well reproduced. The 200 picoseconds $NVT$ well-tempered metadynamics at 300 K has been simulated to compute the OSV-MP2 rotational free energy surface of coupled hydroxyl and methyl rotors for ethanol molecule.

The University of Hong Kong] Department of Chemistry, The University of Hong Kong, Hong Kong SAR, P.R. China

1 INTRODUCTION

Ab-initio electronic structure theory has been significantly progressed with many theoretical and algorithmic developments. Reduced-scaling post-Hartree-Fock methods are now capable of efficiently computing molecular systems of substantially increased size by managing trade-offs between the accuracy that can be achieved and the resource that can be accessed1, 2. The reduced-scaling techniques are often based on the unique strength of the spatial locality, i.e., the short-range behaviour of electron correlation that emerges as the size of a system increases. Many schemes have been devised and implemented to compute the energies of large molecules by operating on a sufficiently accurate and reduced subset of of Hilbert space in which an approximate wavefunction can be efficiently represented, manipulated and stored3, 4, 5, 6.

The locality of dynamic electron correlation was introduced by Pulay7 and initially implemented by Sæbø8, 9, 10. This has led to a fruitful variety of wavefunction representations by which the unphysical steep computational scaling can be drastically diminished. Notably, a hierarchy of Møller-Plesset perturbation and coupled-cluster (CC) methods has been developed by employing projected atomic orbitals (PAO) by Werner, Schütz and coworkers 11, 12, 13, 14, 15, 16, pair-nature-orbitals (PNOs) pioneered by Meyer et al. 17, 18 and revitalized by Neese19, and orbital-specific-virtuals (OSVs) by Chan 20, 21, 22, 23. By construction, the PNO and OSV are both inherently local and specific to a single orbital pair and orbital, respectively. The hybrid near-linear-scaling PNO-MP2 and PNO-CCSD schemes by mixing PAO/OSV/PNO have been demonstrated to further reduce the number of PNOs that are required to compress the cluster operators by employing PAO or OSV as an intermediate stage 24, 25, 26, 27, 28. The hybrid PNO schemes ensure the most compact virtual space for recovering a certain percentage of correlation energy. In addition, explicitly correlated CCSD(T) methods in the PNO framework have been developed to reduce basis set error 29, 30, 31, 32, 33, 34. Open-shell PNO-CCSD35, PNO variants of state-specific multi-reference perturbation and CC theories 36, 37, 38, 39, 40, as well as PNO-based EOM-CC241/CCSD42, 43, CIS(D)44, ADC(2)-x45 for excited states in both state-specific and state-average approaches have also been implemented and demonstrated.

A wide range of chemistry problems, such as molecular geometries, reaction pathways, thermal and spectroscopic properties, and so on, involves the physical motion of atoms. In essence, these molecular properties require an efficient computation of analytical energy gradients46, 47 with respect to relaxations of molecular orbitals and/or other parameters of a deterministic electronic wavefunction. Apparently, analytical gradient techniques are highly specific to the way in which wavefunction of the system is constructed. In the past decades, for instance, for computing analytical gradients with manageable cost-accuracy balance, the implementations have adopted very different reduced-scaling strategies for the variants of MP2 48, 6, 49, 50, 51, 52, 53, 54 and CC methods 55, 56, 57, 58, 59, 60, 61.

For applications to large molecules, the PAO-based analytical gradients have been established for the local MP262, 63, CC264 and CCSD65 models. In the context of the more recent PNO and OSV schemes, the implementation of PNO- or OSV-based analytical gradients is more limited, primarily due to the complexity of PNO or OSV related approximations. The performance of the simulated PAO-, PNO- and OSV-based CCSD for non-resonant optical properties was assessed by Crawford and coworker using linear response theory66. The PNO-MP2 and PNO-CCSD analytical energy gradients were developed by Hättig67 and Neese68, respectively, without accounting for the relaxation of PNOs. Most recently, the PNO relaxation problem was circumvented for the domain-based local PNO-MP2 (DLPNO-MP2) by enforcing a block-diagonal semi-canonical external pair density matrix which assumes zero off-blocks between the retained and discarded PNO orbitals69, 70, making the PNO-MP2 energy invariant to the rotation among the kept PNO orbitals.

In the present work, we turn our attention to developing the exact OSV-MP2 analytical gradient theory for its much simpler way of constructing OSVs relative to the hybrid PNOs in DLPNO-MP2 model. Here, we implement the OSV relaxation explicitly as a perturbed eigenvalue problem by using both orthonormality and eigenvalue conditions for perturbed OSVs. We also show that the degenerate eigenvalue issue that may break OSV relaxation69, 70 does not occur for reasons that will be described in our formalism and implementation. The resulting OSV relaxation vectors may be further scrutinized in a way that their intrinsic sparsity can be explored to pre-select important OSV relaxations making most contributions to OSV-MP2 gradients. In addition, OSV-MP2 was shown to produce smooth potential energy curve with respect to molecular structures even with small pair domains20. This is essential to efficient simulations of Born-Oppenheimer molecular dynamics (BOMD)71.

The paper is organized as follows. In Sec. II we describe the details of our OSV-MP2 analytical gradient theory and implementation. We implemented the algorithm in a standalone Python program, and PYSCF72 has been used for obtaining the one- and two-electron integrals, their derivatives and the RHF reference wavefunction. In Sec. III we compute the optimized molecular structures and assess the accuracy of the OSV-MP2 analytical gradients with respect to the selection parameters for OSVs and orbital pairs. The results are also compared with canonical MP2 and DLPNO-MP2 results available in the literature. In Sec. IV, we carry out the ab-initio BOMD simulations driven by OSV-MP2 analytical gradients. Illustrative applications of OSV-MP2 metadynamics are demonstrated to protonated water cations and ethanol molecule. Here our focus is to find out whether the errors due to the OSV approximations for cost-accuracy trade-off have any significance in reproducing the energy, structural and vibrational details by BOMD simulations at a finite-temperature. Our work is concluded in Sec. V.

2 THEORY AND IMPLEMENTATION

2.1 OSV-MP2 wavefunction

In the previous work by one of the authors, the OSV-based single-reference local MP2, CCSD and CCSD(T) methods20, 22, 23 were developed. In this section, we briefly review the algorithm for introducing the notations relevant to the OSV-MP2 gradient algorithm. We use $i,j,k,\cdots$ to denote the occupied localized molecular orbitals (LMOs), $a,b,c,\cdots$ canonical virtual MOs, $\bar{\mu}_{k},\bar{\nu}_{k},\bar{\xi}_{k},\cdots$ the OSVs associated with an occupied MO $k$ , while $p,q,r,\cdots$ and $\alpha,\beta,\cdots$ pertain to generic indices of MOs and atomic orbitals (AOs), respectively. Here LMOs refer to the spatial orbital basis. The bra-ket symbol $\langle\cdots\rangle$ is used to evaluate the matrix trace through the discussion.

In OSV ansätz, a sparse structure of the amplitudes $\mathbf{T}_{(ij,ij)}$ and the first-order wavefunction $\lvert\mathbf{\Phi}_{(ij,ij)}\rangle$ can be explored by constructing a compact virtual space in a transformative OSV adaption to the occupied space by associating a set of OSVs $\{\bar{\mu}_{k}\}$ with each occupied orbital $k$ ,

[TABLE]

The compactness of the OSV space is determined by the tensorial character of the transformation matrix $\mathbf{Q}_{k}$ for each occupied orbital. An excellent yet simple choice20 of $\mathbf{Q}_{k}$ is to require its column vector to be the orthonormal eigenvector of the MP2 diagonal pair amplitudes $\mathbf{T}_{kk}$ for each $k$ by performing the diagonalization,

[TABLE]

with the orthonormality $\mathbf{Q}_{k}^{\dagger}\mathbf{Q}_{k}=\mathbf{1}$ . According to the magnitude of eigenvalues $\omega_{\bar{\mu}_{k}}$ , a single parameter $l_{osv}$ is utilized as a measure to select a set of OSVs pertaining to each occupied orbital $k$ by which $\mathbf{T}_{(ij,ij)}$ is solved efficiently without losing too much accuracy. The elements of $\mathbf{T}_{kk}$ in Eq. (2) are computed as

[TABLE]

$f_{kk},f_{aa},f_{bb}$ are the diagonal elements of the Fock matrix.

The OSV wavefunction $\lvert\mathbf{\Phi}_{(ij,ij)}\rangle$ and amplitudes $\mathbf{T}_{(ij,ij)}$ are associated with a collated excitation manifold in which the occupied orbital $i$ excites to its own OSV set $\{\bar{\mu}_{i}\}$ ( $i\rightarrow\bar{\mu}_{i}$ ) as well as the exchange set $\{\bar{\nu}_{j}\}$ ( $i\rightarrow\bar{\nu}_{j}$ ),

[TABLE]

The doubly excited configuration $\lvert\Phi_{ij}^{\bar{\mu}_{k}\bar{\nu}_{l}}\rangle$ is built through the spin-free excitations operator $\hat{E}_{i}^{\bar{\mu}_{k}}=\sum_{\sigma}\hat{a}^{\dagger}_{\bar{\mu}_{k}\sigma}\hat{a}_{i\sigma}$ in terms of the creation and annihilation operators for all spins $\sigma=\uparrow,\downarrow$ acting on the zero-order wavefunction $\lvert\Phi_{ij}^{\bar{\mu}_{k}\bar{\nu}_{l}}\rangle=\hat{E}_{i}^{\bar{\mu}_{k}}\hat{E}_{j}^{\bar{\nu}_{l}}\lvert\Psi^{(0)}\rangle$ . The OSV amplitudes $\mathbf{T}_{(ij,ij)}$ are computed iteratively by solving the residual equation $\mathbf{R}_{(ij,ij)}$ for an $(i,j)$ pair,

[TABLE]

In the OSV basis, $\mathbf{K}_{(ij,ij)}$ , $\mathbf{S}_{(ik,ij)}$ and $\mathbf{F}_{(ik,ij)}$ denote the two-electron integrals, overlap and Fock matrices for an $(i,j)$ pair, respectively. $\mathbf{A}_{(ij,kl)}$ is adopted to represent a generic composite matrix assembled between $\{\bar{\mu}_{i},\bar{\nu}_{j}\}$ and $\{\bar{\sigma}_{k},\bar{\xi}_{l}\}$ elements as needed. In essence, $\mathbf{A}_{(ij,kl)}$ is a projection of $\mathbf{A}$ from the canonical virtual MOs to OSVs basis

[TABLE]

Since $\mathbf{A}$ is hermitian in canonical MO basis, permuting $(ij)$ and $(kl)$ pairs yields the self-adjoint property of $\mathbf{A}_{(ij,kl)}$ ,

[TABLE]

In the OSV basis, the MP2 Hylleraas73 correlation energy $E_{c}$ has the following form of Lagrangian,

[TABLE]

This energy Lagrangian essentially imposes the vanishing residual condition $\mathbf{R}_{(ij,ij)}=0$ with the corresponding multiplier $\overline{\mathbf{T}}_{(ij,ij)}=2\mathbf{T}_{(ij,ij)}-\mathbf{T}_{(ij,ij)}^{\dagger}$ . An elimination of the linear dependency in the OSV-concatenated pair domain is essential for solving $\mathbf{R}_{(ij,ij)}=0$ , and can be effectively carried out by preconditioning $\mathbf{R}_{(ij,ij)}$ in a transformation made by nonredundant vectors20.

2.2 Perturbed OSVs and relaxation

2.2.1 OSV orbital rotation

The OSVs are defined as the eigenvectors $\mathbf{Q}_{k}$ of the semi-canonical MP2 diagonal pair amplitude associated with a specific occupied orbital $k$ , as given in Eqs. (1) and (2). Upon a perturbation $\lambda$ acting on the system, the perturbed OSVs can be expanded exactly in a linear combination of the complete unperturbed OSV basis $\mathbf{Q}_{k}^{0}$ , with the unknown combination coefficient matrix $\mathbf{O}_{k}$ that must be specific to the occupied orbital $k$ as well,

[TABLE]

The exact OSV relaxation $\mathbf{Q}_{k}^{\lambda}=\frac{\partial\mathbf{Q}_{k}(\lambda)}{\partial\lambda}$ is thus given in terms of the relaxation matrix $\mathbf{O}_{k}^{\lambda}$

[TABLE]

Given the perturbation $\lambda$ , the perturbed OSV amplitudes $\mathbf{T}_{(ij,ij)}(\lambda)$ must fulfill the perturbed residual equation $\mathbf{R}_{(ij,ij)}(\lambda)=0$ , analogous to Eq. (5). The perturbed quantity $\mathbf{A}_{ij,kl}(\lambda)$ of Eq. (6) exhibits a dependence on the perturbation and can be evaluated with reference to the unperturbed $\mathbf{A}_{ij,kl}^{0}$ ,

[TABLE]

Using the OSV relaxation matrix in Eq. (10), the OSV derivative in $\mathbf{A}_{(ij,kl)}$ is therefore

[TABLE]

with the curly brackets $\{\}$ specifying the derivatives of OSVs accounting for the OSV relaxation. Here we introduce an OSV pair-specific relaxation matrix $\mathbf{O}_{kl}^{\lambda}$ for $(k,l)$ pair in a block diagonal form,

[TABLE]

The perturbed OSVs for each orbital must be always orthonormal

[TABLE]

which implies that the OSV relaxation matrix $\mathbf{O}_{k}^{\lambda}$ must be antisymmetric,

[TABLE]

2.2.2 OSV relaxation as perturbed non-degenerate eigenvalue problem

Assuming real values of the antisymmetric $\mathbf{O}_{k}$ , all diagonal elements of the OSV relaxation matrix must vanish

[TABLE]

Now we discuss an approach in which the off-diagonal $\mathbf{O}_{k}^{\lambda}$ can be explicitly solved based on the perturbation analysis74, 75 to the perturbed eigenvalue problem as

[TABLE]

with $\mathbf{\Omega}_{k}(\lambda)=diag\left[\omega_{1}(\lambda),\omega_{2}(\lambda),\cdots\right]$ the diagonal eigenvalue matrix. Differentiating the above equation, we arrive at

[TABLE]

Multiplying $\mathbf{Q}_{k}^{0\dagger}$ onto both sides and using the OSV orthonormality, there is

[TABLE]

The derivative $\mathbf{T}_{kk}^{\lambda}$ gives the relaxation of semi-canonical MP2 diagonal amplitudes upon a perturbation. However, since the canonicality $f_{ij}(\lambda)=f_{ii}(\lambda)\delta_{ij}$ and $f_{ab}(\lambda)=f_{aa}(\lambda)\delta_{ab}$ does not necessarily hold and in fact is not required in general for a perturbed Fock matrix, $\mathbf{T}_{kk}^{\lambda}$ can not be evaluated directly by taking the derivative of Eq. (3). Instead, it must be computed by differentiating the MP2 residual equation assuming the generic Fock matrix for a diagonal $kk$ pair, which leads to the following expression

[TABLE]

Above, canonical $\mathbf{K}_{kk}^{\lambda}$ and $\mathbf{F}^{\lambda}$ are composed of the derivatives with respect to both AOs ( $\lambda$ ) and MOs [ $\lambda$ ] of the exchange integral and Fock matrix, respectively, for instances,

[TABLE]

Here the MO-specific derivatives $\mathbf{K}_{kk}^{[\lambda]}$ and $\mathbf{F}^{[\lambda]}$ are given later according to Eqs. (43).

The diagonal part of Eq. (19) yields the relaxation of eigenvalues,

[TABLE]

When $\mathbf{T}_{kk}$ has all distinct eigenvalues, the off-diagonal part of Eq. (19) leads to the OSV relaxation matrix $\mathbf{O}^{\lambda}_{k}$ , expressed in Hadamard product below

[TABLE]

where $\left[\Delta\mathbf{G}_{k}\right]_{\bar{\mu}\bar{\nu}}=\frac{1}{\omega_{\bar{\nu}_{k}}-\omega_{\bar{\mu}_{k}}}~{}\mathrm{with~{}}\bar{\mu}\neq\bar{\nu}$ . And the pair-specific relaxation matrix is

[TABLE]

with

[TABLE]

Therefore the computation of the off-diagonal element of $\mathbf{O}_{k}^{\lambda}$ requires only the first derivative of $\mathbf{T}_{kk}$ matrix.

We can prove (c.f. S2 in Supporting Information) that the gradient $E_{c}^{\lambda}=\frac{\partial E_{c}}{\partial\lambda}$ of OSV-MP2 energy of Eq. (8) is invariant with the rotations among all retained OSVs $\{\bar{\mu}_{k}\}$ ,

[TABLE]

As long as this invariance holds, the orbital rotation $\mathbf{O}_{k}^{\lambda}$ must be made between the discarded $\{\bar{\mu}^{\prime}\}$ and kept $\{\bar{\nu}\}$ OSVs belonging to the subsets of different eigenvalues. Therefore the non-degenerate formalism of Eq. (25) is precisely applicable to $\mathbf{O}_{k}^{\lambda}$ .

The eigenvalue matrix $\mathbf{\Omega}_{k}=\mathbf{Q}_{k}^{\dagger}\mathbf{T}_{kk}\mathbf{Q}_{k}$ can be understood as the projection of the semi-canonical MP2 diagonal amplitude $\mathbf{T}_{kk}$ in the OSV basis, which is diagonal and uniquely defined for each orbital. As we can show (c.f. S3 in Supporting Information), the relaxation $\mathbf{\Omega}_{k}^{\lambda}$ must always remain rigorously diagonal as

[TABLE]

With $\mathbf{\Omega}_{k}(\lambda)=\mathbf{\Omega}_{k}^{0}+\lambda\mathbf{\Omega}_{k}^{\lambda}$ correct through the first-order expansion, we conclude then that the perturbed OSV-projected amplitudes $\mathbf{\Omega}_{k}(\lambda)$ must be diagonal as well between subspaces belonging to different eigenvalues. This imposed diagonal constraint, similar to the canonical condition of Hartree-Fock gradients, has some convenience, for example, of allowing in principle different (usually smaller) OSV gradient domains from original energy domains for more efficient gradient computation, which will be the subject of our future work.

2.3 OSV-MP2 analytical gradient theory

The analytical gradient of the OSV-MP2 correlation energy with respect to a perturbation $\lambda$ (eg, an atomic position displacement) can be computed in terms of the derivatives of both $\mathbf{K}_{(ij,ij)}^{\lambda}$ and $\mathbf{R}_{(ij,ij)}^{\lambda}$ in the OSV basis

[TABLE]

whereas the amplitudes $\overline{\mathbf{T}}_{(ij,ij)}$ make no contribution as they are simply variational to $E_{c}$ . It is obvious that the derivatives $\mathbf{K}_{(ij,ij)}^{\lambda}$ and $\mathbf{R}_{(ij,ij)}^{\lambda}$ must be jointly determined through the responses of the OSVs, LMOs and AOs. The MP2 energy gradient of Eq. (30) thus consists of the relaxation contributions from OSVs ( $E^{\{\lambda\}}_{c}$ ), MOs ( $E^{[\lambda]}_{c}$ ) and AOs ( $E^{(\lambda)}_{c}$ ), respectively,

[TABLE]

2.3.1 OSV-specific energy gradient $E^{\{\lambda\}}_{c}$

The OSV-specific energy gradient $E_{c}^{\{\lambda\}}$ is determined by

[TABLE]

which requires the OSV derivatives of the quantities $\mathbf{A}_{(ij,kl)}$ associated with $(i,k)$ , $(i,l)$ , $(j,k)$ and $(j,l)$ pairs, such as the exchange integral $\mathbf{K}_{(ij,kl)}$ , overlap $\mathbf{S}_{(ij,kl)}$ and the OSV block of the Fock matrix $\mathbf{F}_{(ij,kl)}$ , according to Eq. (12). Here the intermediate $\mathbf{M}_{ij}$ is specific to the pair $ij$ , arising from the residual contribution $\mathbf{R}_{(ij,ij)}$ in the second term of Eq. (5),

[TABLE]

The OSV-OSV blocks of the unrelaxed overlap- and energy-weighted density matrices are hermitian and defined as $\mathbf{D}_{(ij,kl)}$ and $\mathbf{D}_{(ij,kl)}^{\prime}$ , respectively,

[TABLE]

Since the gradients $E_{c}^{\{\lambda\}}$ are invariant with the rotations among all kept OSVs $\{\bar{\mu}_{k}\}$ , $\mathbf{K}_{(ij,ij)}$ and $\mathbf{M}_{ij}$ must involve the discarded OSVs at the dimensions attached to $\mathbf{O}_{ij}^{\{\lambda\}}$ , while the amplitudes $\mathbf{T}_{(ij,ij)}$ and $\overline{\mathbf{T}}_{(ij,ij)}$ remain within the retained OSV subspace.

The evaluation of $\mathbf{O}^{\lambda}_{ij}$ employs the non-degenerate formalism using all pairs of distinct eigenvalues associated with the discarded and retained OSV subspaces, respectively. Substituting Eq. (25), $E_{c}^{\{\lambda\}}$ can be rewritten as

[TABLE]

with

[TABLE]

and $\Delta\mathbf{G}_{ij}^{\mathrm{T}}$ is given in Eq. (27). The numerical stability of computed $E_{c}^{\{\lambda\}}$ can be demonstrated by illustrating the maximum element of $\mathbf{N}_{ij}\circ\Delta\mathbf{G}_{ij}^{\mathrm{T}}$ for each orbital in Figure 1. To this end, we choose N2 and C6H6 which own high symmetry and thus a larger number of near-degenerate eigenvalues of the semi-canonical MP2 diagonal amplitudes. As seen in the insets of Figure 1, it is evident that the large values of $\frac{1}{\Delta\omega_{i}}$ due to the vanishingly small difference $\Delta\omega_{i}=\omega_{\bar{\mu}_{i}}-\omega_{\bar{\nu}_{i}^{\prime}}$ are largely compensated by $\mathbf{N}_{ij}$ , which in fact yields smooth analytical gradients, without instability hurdles in practice.

2.3.2 MO-specific energy gradient $E^{[\lambda]}_{c}$

The sparse structure of the OSV-MP2 amplitudes is most favorably exploited with the locality of LMOs. The occupied canonical MOs are localized using Pipek-Mezey (PM)76 with meta-Löwdin atomic charges for their good transferability in different molecular environments created by the variation of atomic positions77. For evaluating the meta-Löwdin charges, the core and valence orbitals are distinguished based on the locality of the predefined NAO (natural atomic orbital), and then Löwdin-orthogonalized within their own space. The localization procedure introduces a new transformation matrix $\mathbf{L}(\lambda)$ that transforms the occupied canonical MOs $\mathbf{C}_{o}(\lambda)$ into the orthonormal LMOs $\mathbf{\tilde{C}}_{o}(\lambda)$ , which must hold as well for a system under the perturbation $\lambda$ ,

[TABLE]

with the orthonormal condition $\mathbf{L}^{\dagger}(\lambda)\mathbf{L}(\lambda)=\mathbf{1}$ . The LMO response $\mathbf{\tilde{C}}_{o}^{[\lambda]}$ therefore arises from both derivative contributions of $\mathbf{U}^{[\lambda]}$ and $\mathbf{\tilde{U}}^{[\lambda]}$ ,

[TABLE]

The coupled-perturbed localization (CPL) described in Ref.62 for PM localization function and the coupled-perturbed Hartree-Fock equations are solved to determine $\mathbf{\tilde{U}}^{[\lambda]}$ and $\mathbf{U}^{[\lambda]}$ , respectively. However, neither $\mathbf{\tilde{U}}^{[\lambda]}$ nor $\mathbf{U}^{[\lambda]}$ is explicitly computed or stored in our implementation for reasons of computational efficiency, and their contributions are merged into the OSV-based Z-vector equation.

As seen in Eqs. (5) and (8), apparently the MO-specific $E^{[\lambda]}_{c}$ is determined by the quantities that involve the derivatives with respect to LMOs and canonical virtual MOs, i.e., the derivatives of the exchange integral and the Fock matrix,

[TABLE]

The occupied-occupied elements of the unrelaxed density matrix is

[TABLE]

According to Eq. (6), we have

[TABLE]

where

[TABLE]

with superscripts $[\lambda]$ for the MO derivatives. Substituting Eqs. (42)–(43) into Eq. (40) and utilizing the particle permutation symmetry, we arrive at the MO-specific energy gradient

[TABLE]

Above, $\mathbf{F}^{0[\lambda]}$ and $\mathbf{K}_{ij}^{0[\lambda]}$ are associated with the relaxation of one virtual MO,

[TABLE]

$\mathbf{K}_{ij^{[\lambda]}}$ and $f_{ij^{[\lambda]}}$ are the derivatives with respect to one of the LMOs, respectively, which can be evaluated according to Eq. (39),

[TABLE]

In Eqs. (53) and (54), the symmetric block of $\mathbf{U}^{[\lambda]}$ is transformed into LMOs, and depends solely on the AO derivative of overlap matrix according to the MO orthonormal condition. However, the off-diagonal block of $\mathbf{U}^{[\lambda]}$ accounts for the rotation of MOs between the occupied and virtual spaces, which is solved in the OSV Z-vector approach.

2.3.3 AO-specific energy gradient $E^{(\lambda)}_{c}$

$E^{(\lambda)}_{c}$ simply evaluates the energy expression of Eq. (8) in terms of AO derivative integrals, the occupied-occupied block (Eq. (41)) and OSV-OSV block (Eq. (34)) of the unrelaxed density matrices,

[TABLE]

The OSV overlap $\mathbf{S}_{(ij,kl)}=\left({\begin{array}[]{c}\mathbf{Q}_{i}^{\dagger}\\ \mathbf{Q}_{j}^{\dagger}\end{array}}\right)\left({\begin{array}[]{cc}\mathbf{Q}_{k}&\mathbf{Q}_{l}\end{array}}\right)$ makes no contribution here to the AO-specific energy gradient. The corresponding two- and one-electron derivative integrals for an $ij$ pair are computed using their AO derivative integrals, including the AO derivatives of the exchange integral matrix $\mathbf{K}_{(ij,ij)}^{(\lambda)}$ , the OSV-OSV block of the Fock matrix $\mathbf{F}_{(ij,ij)}^{(\lambda)}$ , and the occupied-occupied Fock elements $f_{ij}^{(\lambda)}$ .

2.4 Implementation scheme

Computing the OSV-, MO- and AO-specific two-electron contributions to the OSV-MP2 energy gradient according to Eqs. (32), (51) and (55) would be straightforward with yet unfortunately very demanding expenses. The primary bottleneck originates from the evaluation and transformation of the subsumed exchange integral $\mathbf{K}_{(ij,ij)}$ and the AO/MO derivatives $\mathbf{K}_{ij}^{(\lambda)}$ , $\mathbf{K}_{ij^{[\lambda]}}$ and $\mathbf{K}_{ij}^{0[\lambda]}$ involving more than two virtual MO indices. Both computational storage and operation costs increase rapidly with sizes of molecule. Significant savings can be achieved by employing the resolution of identity (RI) technique 78, 79. In the present work, RI approximate exchange integrals and their derivatives are implemented in adaption to OSV basis for accelerated evaluation and transformation. According to the RI scheme in the Coulomb metric, the four-center two-electron (4c2e) integral $(ip|jq)$ is approximated as a simple product of the lower-rank three-center two-electron (3c2e) integrals $\mathbf{J}_{i}$ and $\mathbf{J}_{j}$ , specific to each LMO $i$ and $j$ , respectively,

[TABLE]

with the 3c2e matrix element $[\mathbf{J}_{i}]_{Ap}=\sum_{B}[\mathbf{V}^{-\frac{1}{2}}]_{AB}(B\rvert ip)$ in terms of a set of auxiliary basis functions $\{A,B,\cdots\}$ , and $\mathbf{V}$ denotes the Coulomb metric matrix

[TABLE]

In the following, we use $\mathbf{J}_{i}^{o}$ and $\mathbf{J}_{i}^{v}$ for the occupied $p=j$ and virtual $p=a$ blocks, respectively.

In our OSV-MP2 gradient formulation, we must however deal with the integrals $(i\bar{\mu}_{j}|A)$ and $(i\bar{\mu}_{j}^{\prime}|A)$ in both kept and discarded OSV basis for treating OSV relaxation. The number of these integrals for all $(i,j)$ pairs grows as $\mathcal{O}(O^{2}VN_{aux})$ , and the storage becomes rather unfavorable for large molecules if they are explicitly computed. To avoid such high storage costs, we have exploited an implementation in which the 3c2e MO integrals $\mathbf{J}_{i}$ are transformed into an intermediate $\mathbf{Y}_{i}$ accounting for two-electron contributions to the OSV-MP2 gradient from both MO and OSV rotations,

[TABLE]

where

[TABLE]

with the symbols $\top$ and $\bot$ denoting the upper and lower diagonal blocks. $\mathbf{X}_{ij}^{\top,~{}\bot}$ are computed and accessed on the fly for each $(i,j)$ pair. The one-index transformations made in $\mathbf{X}_{ij}$ are carried out with the kept ( $\mathbf{Q}_{i}$ ) and discarded ( $\mathbf{Q}_{i}^{\prime\dagger}$ ) OSV orbitals. Both $\mathbf{J}_{i}$ , AO derivative $\mathbf{J}_{i}^{(\lambda)}$ and $\mathbf{Y}_{i}$ are of the row dimension $N_{aux}$ and column dimension $V$ , and can be conveniently stored on disk as their total number grows as $\mathcal{O}(OVN_{aux})$ , forming no major obstacle for a usual range of molecular sizes. In our implementation, the dominant formal operation scales as $\mathcal{O}(O^{2}N_{osv}VN_{aux})$ for computing $\mathbf{Y}_{i}$ and $\mathcal{O}(ON_{osv}N_{osv}^{\prime}N_{aux})$ for $\mathbf{X}_{ij}$ , where $N_{osv}$ and $N_{osv}^{\prime}$ are the number of the kept and discarded OSVs, respectively. Nevertheless, when working with reasonably selected OSVs and pairs for a good accuracy-cost balance, the actual computational cost can be reduced to $\mathcal{O}(N^{3\sim 4})$ .

By combining $E_{c}^{(\lambda)}$ , $E_{c}^{[\lambda]}$ and $E_{c}^{\{\lambda\}}$ , our working equation for evaluating the OSV-MP2 energy gradient can be written in terms of the AO-derivatives of Fock ( $\mathbf{F}^{(\lambda)}$ and $f_{ij}^{(\lambda)}$ ), overlap ( $\mathbf{S}^{(\lambda)}$ and $S_{ij}^{(\lambda)}$ ) and 3c2e integral ( $\mathbf{J}_{i}^{v(\lambda)}$ ) matrices,

[TABLE]

The unrelaxed ( $\mathbf{D}$ ) and relaxed ( $\overline{\mathbf{D}}$ ) density matrices are utilized in MO basis,

[TABLE]

and the energy-weighted unrelaxed ( $\mathbf{D}^{\prime}$ ) and relaxed ( $\overline{\mathbf{D}}^{\prime}$ ) density matrices are,

[TABLE]

The fourth term needs the $\mathbf{\Gamma}$ matrix,

[TABLE]

as well as $\mathbf{\Lambda}$ matrix that are obtained by solving the following linear CPL equation for PM localization constraint,

[TABLE]

Finally, $\mathbf{B}^{(\lambda)}$ of the last term in Eq. (LABEL:eq:ec1e2e) collects all AO-derivatives in the Fock and overlap matrices, compuated only once and for all,

[TABLE]

where

[TABLE]

for which the two-electron integrals are evaluated with RI approximation. The remaining Z-vector $\mathbf{Z}$ must be solved in the other linear equation

[TABLE]

The source term takes the form below,

[TABLE]

Finally, the explicit mathematical forms of the intermediates $\mathcal{A}_{ij,\alpha\beta}$ , $\mathcal{B}_{kl,ai}$ and $\mathcal{C}_{kl,ij}$ are specified in Eqs. (28)–(30) in Ref.62, and thus will not be repeated here.

3 APPLICATIONS TO MOLECULAR STRUCTURES

3.1 Accuracy of OSV-MP2 analytical gradients

The correctness of our implementation has been examined by comparing the OSV-MP2 analytical gradients with OSV-MP2 numerical gradients for N2 and water clusters (H2O)n ( $n=1-3$ ). The root mean square deviations (RMSDs) of the gradient differences are about $10^{-6}$ – $10^{-7}$ a.u. for various OSV selections ( $l_{osv}=10^{-3}$ , $10^{-4}$ and $10^{-7}$ ).

To assess the convergence of OSV-MP2 gradients with respect to the OSV selection thresholds, the RMSDs between the gradients of OSV-MP2 and RI-MP2 reference are presented in Figure 6 for molecules of varying sizes and bonding types in the Baker test set80. As shown in Figure 6(a), the average RMSDs among all computed molecules are $1.3\times 10^{-4}$ , $3.2\times 10^{-5}$ and $6.3\times 10^{-6}$ for $l_{osv}=10^{-3.5}$ , $10^{-4}$ and $10^{-4.5}$ , respectively. For $l_{osv}=10^{-4.0}$ , the RMSDs range from $10^{-5}$ – $10^{-7}$ for smaller molecules (the molecule number lower than 15), and increase to about $5\times 10^{-4}$ – $5\times 10^{-5}$ for larger molecules.

The effect of the OSV relaxation is illustrated in Figs. 6(b)-(d) by comparing the OSV-MP2 gradients computed with and without OSV relaxation. The OSV-MP2 analytical gradient without OSV relaxation merely considers the MO- and AO-specific gradient contributions described in Secs. 2.3.2 and 2.3.3. It is obvious that the inclusion of the OSV relaxation considerably reduces the RMSDs by an order of magnitude. For instance, with $l_{osv}=10^{-4}$ (Figure 6(c)), the average RMSDs decrease from around $10^{-4}$ to $10^{-5}$ . Nevertheless, the exclusion of OSV relaxations appears less significant when more OSVs are selected according to $l_{osv}=10^{-4.5}$ (Figure 6(d)) by which the resulting gradient RMSDs are less than $10^{-4}$ , virtually comparable to results with $l_{osv}=10^{-4}$ (Figure 6(c)).

The gradient RMSDs of OSV-MP2 are compared with those of DLPNO-MP2 available in a recent publication70. To be as consistent as possible with the corresponding PNO thresholds ( $l_{pno}=10^{-7}$ , $10^{-8}$ and $10^{-9}$ ), we adopted the OSV threshold as $l_{osv}=\sqrt{l_{pno}}$ for comparison since the PNOs are chosen according to eigenvalues of semi-canonical pair density matrices, that is about the squared eigenvalues of the associated semi-canonical amplitudes. Nonetheless, a rigorous accuracy comparison between DLPNO-MP2 and OSV-MP2 is difficult, since at the same level of truncation (e.g, $l_{pno}=10^{-8}$ vs $l_{osv}=10^{-4}$ ) nondiagonal pair amplitudes are represented in a much more compact basis in the DLPNO approach than in the OSV approach.

As seen in Figure 6(b) by comparing loose OSVs and PNOs, the RMSDs of two methods are generally similar especially for larger molecules, yet with marginally better performance for OSV-MP2 than DLPNO-MP2 for smaller molecules. For $l_{osv}=10^{-4.5}$ / $l_{pno}=10^{-9}$ in Figure 6(d), the RMSDs of OSV-MP2 are remarkably smaller than those of DLPNO-MP2. We note that benzidine ( molecule 29) is peculiar here for DLPNO-MP2 with an RMSD above $10^{-4}$ even using $l_{pno}=10^{-9}$ . The OSV-MP2 analytical gradient however yields no significant RMSDs which are consistently below $10^{-4}$ and $10^{-5}$ for $l_{osv}=10^{-4.0}$ and $10^{-4.5}$ .

3.2 Optimized molecular structures

Bond lengths

The statistical errors of OSV-MP2 bond lengths relative to the reference data of RI-MP2 are summarized in Table 1. For all basis sets, tighter OSV thresholds lead to decreased errors of bond lengths. Notably, the OSV-MP2 optimization with $l_{osv}=10^{-4}$ is sufficiently accurate and increasing basis set sizes only slightly increases MAEs. However, the calculations with $l_{osv}=10^{-3}$ yields much larger errors.

In Figure 9(a), the MAEs of bond lengths with different basis sets and OSV/PNO selection thresholds are compared between OSV-MP2, PNO-MP2 and DLPNO-MP2. DLPNO-MP2 yields lower MAEs than OSV-MP2 with the loose threshold for all basis sets, but is overtaken by OSV-MP2 with tighter thresholds. For def2-TZVPP and loose threshold, the MAE for OSV-MP2 is significantly lower than PNO-MP2 by around 0.3 pm, but larger than DLPNO-MP2. The performances of the three methods are comparable for normal and tight calculations.

By repeating the OSV-MP2 geometry optimization, the long interatomic distances of noncovalent bonds have been examined in Table 2. We have chosen DTFS and RESVAN molecules out of LB12 set for which the DLPNO-MP2 optimized Si-N and S-S bond distances report quite large errors in Ref.70. All electrons are correlated in OSV-MP2 calculations, and both OSV-MP2 geometries are well converged for three OSV-MP2 thresholds. It is observed that $l_{osv}=10^{-4.5}$ is necessary in order to reduce the errors below 1.0 pm. However, $l_{osv}=10^{-4.0}$ appears to be sufficient for achieving relative deviations below $1\%$ , which is acceptable for such long bond distances. In general, the OSV-MP2 outperforms DLPNO-MP2 for loose selection, and both methods are comparable for normal and tight selections.

Bond and dihedral angles

The errors of bond angles are reported in Table 3 for selected Baker’s test molecules according to the specification in Ref.67. In general, the MAEs are smaller than $0.1^{\circ}$ for $l_{osv}$ all values in combination with all basis sets. $l_{osv}=10^{-3}$ results in relatively large maximum errors about $1.0^{\circ}$ . Both $l_{osv}=10^{-4}$ and $10^{-5}$ substantially reduce the maximum errors by about an order of magnitude and are recommended for accurate structure optimizations. The performances of OSV-MP2, PNO-MP2 and DLPNO-MP2 in bond angles are compared in Figure 9(b). Most notably PNO-MP2 without the PNO relaxation yields larger errors than OSV-MP2 and DLPNO-MP2, in particular for def2-TZVPP and def2-QZVPP basis sets. The performances of OSV-MP2 and DLPNO-MP2 are similar with normal and tight thresholds.

The dihedral angles of benzidine molecule are compared between OSV-MP2, PNO-MP2 and DLPNO-MP2 in Table 4. Overall, OSV-MP2 performs much better than PNO-MP2 and DLPNO-MP2 for all thresholds and basis sets, and the deviations from RI-MP2 dihedral angles are less than $0.2^{\circ}$ for OSV-MP2/normal and OSV-MP2/tight.

Performance with pair screening

The use of pair screening can considerably accelerate the OSV-MP2 calculations by discarding the pairs of occupied orbitals that make little contribution to the total correlation energy. By exploring the orbital locality and the definition of OSVs, the OSV overlap matrix elements associated with a pair $(i,j)$ exhibit an exponential decay with the separation between $i$ and $j$ . Therefore the relevant pairs entering OSV-MP2 calculations are chosen according to the previous simple scheme20 in which the renormalized OSV overlap matrix is computed for a given $(i,j)$ pair and compared to a predefined pair screening threshold $l_{pair}$ . When a looser $l_{pair}$ (greater value) is used, more orbital pairs will be screened and not participate in the OSV-MP2 energy and gradient computation. The MAEs of bond lengths, bond angles and dihedral angles are reported with respect to $l_{pair}$ in Table 5. It is shown that the MAEs at $l_{pair}=10^{-4}$ are similar to those without pair screening for both bond lengths and angles. However, there is a significant increase of MAEs as $l_{pair}$ is increased from $10^{-4}$ to $10^{-2}$ . Interestingly, for dihedral angles, the errors for all $l_{pair}$ thresholds are less than $0.8^{\circ}$ .

Timing comparison

We finally compare the elapsed times between RI-MP2, DLPNO-MP2 and OSV-MP2 for both energy and gradient evaluations on a single CPU. For all molecules considered in Table 6, our current OSV-MP2 implementation achieves speedups of 3-10 folds for gradients and 0.4-4.0 for energies compared to RI-MP2, respectively. In particular, the OSV-MP2 gradient computation is faster than RI-MP2 by an order of magnitude for the longest molecule (Gly)14. For Nonactin molecule similar to (Gly)14 in size, OSV-MP2 gradient calculation exhibits a poorer speedup than (Gly)14 due to more kept pairs of Nonactin (6737 out of 20100 pairs) than (Gly)14 (4218 out 23220 pairs), since apparently the pair screening is less effective to the cyclic Nonactin structure than the linear (Gly)14. The average pair domain sizes of Nonactin and (Gly)14 are similar, i.e., both own 96 OSVs, which is much larger than DLPNO pair domains (about 17-20 PNOs). On the other hand, DLPNO-MP2 retains 14937 and 8192 pairs for (Gly)14 and Nonactin, respectively, that are much larger than those of OSV-MP2. Moreover, the OSV-MP2 energy and gradient scalings are $N^{2.74}$ and $N^{2.96}$ , respectively, as shown in Figure S1. This is still higher than DLPNO-MP2 and leads to longer elapsed time than DLPNO-MP2 by nearly 2 folds for large glycine chains. However, for smaller molecules, OSV-MP2 gradient computation can be 2 times faster than DLPNO-MP2, which makes it attractive for driving efficient BOMD simulations on molecules of similar size.

4 OSV-MP2-DRIVEN AB-INITIO BOMD

4.1 Protonated Eigen and Zundel water cations

We have performed the constant $NVE$ simulation for protonated Eigen (H9O4+) and Zundel water cluster (H13O6+). They are not only structural units of biological and chemical significance, but also the benchmark systems that have been extensively used to establish accuracy of other theories.

Energy drifts

In OSV-MP2 $NVE$ simulation, the OSV-MP2 approximated trajectories propagate according to the numerical integration over a finite time step which may break the energy conservation by a range of drifts at long simulation time. Therefore such drifts must be examined carefully with respect to both OSV and pair selections. The results are reported in Table 7 for benchmarking OSV-MP2 BOMD accuracy. When no pairs are screened ( $l_{pair}=0.0$ ), all energy drifts are very small. The total energies of all OSV-MP2/10 ps trajectories with $l_{osv}=10^{-3}$ are conserved within 1.0 kJ/mol, the energy drifts are substantially reduced with $l_{osv}=10^{-4}$ by two and one orders of magnitude for H9O4+ and H13O6+, respectively. The RMSDs, which measure the time-dependent energy fluctuation statistically, are as small as half kJ/mol for $l_{osv}=10^{-3}$ and $0.1$ – $0.2$ kJ/mol for $l_{osv}=10^{-4}$ . The difference of the computed $T_{\mathrm{av}}$ between $l_{osv}=10^{-3}$ and $l_{osv}=10^{-4}$ is about 1 K for H9O4+ and 5 K for H13O6+, respectively.

Table 7 suggests that the use of pair screenings yields larger statistical errors than the OSV selection. Nevertheless, a proper combination of selected $l_{osv}$ and $l_{pair}$ can produce results of acceptable accuracy. For instance, for $l_{osv}=10^{-4}$ , the choice of the medium pair screening $l_{pair}=0.001$ does not lead to significant shifts of energy (both $\delta E$ and RMSD) and temperature. However, with $l_{pair}=0.01$ and $l_{pair}=0.02$ , the energy conservation is not well sustained. As seen in Figure S2, with more pair screenings for Zundel cluster, the $l_{osv}=10^{-4}/l_{pair}=0.01$ simulation after about 4.5 ps leads to a hotter Zundel cation by 1 kJ/mol, probably arising from a more drastic change of the number of the kept pairs with time.

Radial distribution function (RDF)

The trajectory specification of computing RDFs of the O-O and O-H distances was adopted according to the description of Ref.83. As seen in Figure 14, the OSV-MP2 BOMD calculations with $l_{osv}=10^{-4}$ are capable of retrieving all O-O and O-H structural details including the RDF landscape and peak positions for both Eigen and Zundel clusters, and also in excellent agreement with the canonical MP2 BOMD reference results83. However, for Zundel cluster, the calculations with the loose OSV selection $l_{osv}=10^{-3}$ do not well resolve two innermost peaks of the O-O RDF at about $2.4$ Å (Zundel-like O-O distance) and $2.8$ Å (Eigen-like O-O distance), but rather predict a more dominating Eigen-like solvation shell. It is demonstrated in Figure S3 that the pair screenings, when combined with the normal OSV selection $l_{osv}=10^{-4}$ , have little effects on the RDF landscapes yet with a small broadening of the RDF peaks at longer O-H and O-O distances by increasing $l_{pair}$ .

Vibrational density of states (VDOS)

Vibrational density of states are computed as the Fourier transform of the velocity autocorrelation function according to the Ref.83. However, our initial structures are generated from RI-MP2 optimization, with the momentum corresponding to 300 K. We compare the computed VDOS spectra in Figures 17 and S4. The positions of significant peaks can be hardly affected by the OSV selection and pair screening. In particular, for Eigen cluster in Figures 17 (a) and S4(a), the weak peaks at about 3000 cm*-1* representing the proton stretch mode are well reproduced84 in all OSV-MP2 BOMD calculations. For Zundel cluster in Figures 17 (b) and S4(b), the two peaks of medium intensity around 4000 cm*-1* are clearly resolved. However, the peak intensities are largely influenced by the combined $l_{osv}$ and $l_{pair}$ . For instance, the peaks at both low and high frequency regions are relatively intensified by decreasing $l_{osv}$ . On the other hand, a large pair screening appears to substantially weaken the 4000 cm*-1* peak at the lower frequency side.

4.2 Rotational free energy of ethanol

The OSV-MP2/cc-pvTZ $NVT$ simulations were carried out for computing the rotational free energies of the coupled hydroxyl and methyl groups in ethanol molecule at 300 K. The $NVT$ simulation features thermal energy exchange which may compensate the electronic energy loss due to selected OSVs through adding a thermostat into Hamiltonian for coupling the system and reservoir. This thus opens up the feasibility of making OSV-MP2 BOMD available for simulating systems at a finite temperature. However, the detailed investigation on the interplay between the thermal coupling and the OSV selection is not the subject of this work and will be probed in future applications. In the current work, the Nosé-Hoover thermostat was employed with the temperature coupling time constant of 100 fs. The simulation temperature is conserved within a drift of only -0.051 K for $l_{osv}=10^{-4}$ and $l_{pair}=0.0$ .

Ethanol can exist in two conformers, the trans-ethanol with the hydroxyl group trans to the methyl group, and the gauche-ethanol with the hydroxyl group gauche to the methyl group. The gauche-ethanol stability computed by single point DFT is close to the trans-ethanol by, for instances, 0.01 kcal/mol for B3LYP/cc-pVTZ85 and -0.08 kcal/mol PBE-TS/cc-pVTZ86. Our OSV-MP2/cc-pVTZ $NVT$ simulation predicts that the trans-ethanol conformer is more stable than the gauche-ethanol conformer by 0.22 kcal/mol, as shown in Figure 18. The OSV-MP2/cc-pVTZ $NVT$ also finds the free energy barriers of 1.00 kcal/mol and 0.62 kcal/mol to the hydroxyl rotation and trans-to-gauge transformation, respectively. Recently, Chmiela et al.86, 87 reported that the corresponding CCSD(T) barriers are 0.11 kcal/mol, 1.30 kcal/mol and 1.18 kcal/mol, by training the symmetrized gradient-domain machine learning (sGDML) model for the CCSD(T) force field in MD simulations. It was observed however that the gauche is more stable than the trans by repeating the same calculation with sGDML@DFT(PBE-TS)86. Compared to sGDML@CCSD(T), our OSV-MP2 simulation seems to underestimate the energy level of the transition state for tran-to-gauche transformation by 0.56 kcal/mol. This disagreement may be ascribed to the difference of the levels in describing electron correlations between MP2 and CCSD(T) methods. Nevertheless, single point calculations88 corresponding to 0 K show that the energy barriers are in fact similar between MP2 and CCSD(T), with differences of only a few hundredth kcal/mol. Therefore it remains a question whether such a subtle difference of electron correlation between MP2 and CCSD(T) for ethanol has any significance due to a thermal fluctuation of $\approx 0.6$ kcal/mol at 300 K. More importantly, we realize that in our computational setting for metadynamics simulation, a relatively large time constant of 100 fs was used in order to achieve a small temperature drift ( $\approx-0.051$ K) and avoid poor coupling in a long time equilibration. However, this inevitably results in a more wild distribution of Nosé-Hoover frequencies and thus a larger thermal fluctuation. More detailed studies on this issue within the OSV-MP2 framework are underway.

5 CONCLUSIONS

In this work, we have described the algorithm and implementation for analytically computing the energy derivatives from all OSV-MP2 energy contributions with local molecular orbitals. We have shown that it is possible to evaluate the OSV relaxation by explicitly solving non-degenerate perturbed eigenvalue problem in which exact OSV rotations can be implemented between the retained and discarded OSV subspaces. The simplicity of the OSV construction leads to the block-diagonal structure of pair-specific OSV relaxation matrix which decouples OSV rotations within a single orbital pair. The solution of pair-specific OSV relaxation elements enters the source of a single Z-vector equation along with the MO relaxation and the localization constraint, as solved in a conventional way that is independent of the degrees of freedom.

The accuracy of this approach has been benchmarked on a set of well studied molecules for optimized geometries and molecular dynamics simulations. The OSV relaxation effects are significant and can be recovered with the normal OSV selection for practical use of reproducing canonical RI-MP2 molecular structures. Moreover, the classical molecular dynamics with OSV-MP2 input gradients has been implemented. It has been demonstrated that using a normal OSV selection, all major peaks of the O-O/O-H radial distribution functions and vibrational densities of states for protonated water tetramer and hexamer can be well identified. A 200 ps well-tempered metadynamics simulation with OSV-MP2 gradients at 300 K has been shown to be capable of distinguishing the gauche and trans conformers of ethanol molecule.

There is much to explore for improving the current implementation by noting the aspects as follows. (1) Solving the Z-vector equation and two-electron integral transformation therein in MO basis become one bottleneck step for large molecules. (2) The evaluation of energy gradients through Eq. (LABEL:eq:ec1e2e) does not yet take advantage of OSV savings and therefore scales quickly with system sizes. (3) Embarrassing parallelization schemes seem obvious within the OSV-MP2 framework by distributing local orbitals over many processes. (4) Finally, the OSV-MP2 gradient computation is currently much slower than OSV-MP2 energy by 3-4 folds. An appropriate scheme for pruning out insignificant OSV relaxations and associated pairs shall further speed up gradient computation. The efforts along these directions are being made and will be reported in future.

{acknowledgement}

J.Y. acknowledges financial supports from the Hong Kong Research Grant Council (RGC) Early Career Scheme (ECS) through Grant No. ECS27307517, and the Hui’s fund provided by department of chemistry at the University of Hong Kong. R.Y.Z. thanks Prof. Roberto Car for hosting his summer research and valuable discussions. J.Y. thanks Dr. Qiming Sun for general assistance in PySCF package.

{suppinfo}

The file Supporting supporting.pdf contains further results of the computations and is available free of charge.

Bibliography88

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Zaleśny et al. 2011 Zaleśny, R.; Papadopoulos, M. G.; Mezey, P. G.; Leszczynski, J. Linear-Scaling Techniques in Computational Chemistry and Physics: Methods and Applications ; Springer Science & Business Media, 2011; Vol. 13
2Gordon 2017 Gordon, M. S. Fragmentation: Toward Accurate Calculations on Complex Molecular Systems ; John Wiley & Sons, 2017
3Martinez and Carter 1994 Martinez, T. J.; Carter, E. A. Pseudospectral Møller–Plesset perturbation theory through third order. J. Chem. Phys. 1994 , 100 , 3631–3638
4Ayala and Scuseria 1999 Ayala, P. Y.; Scuseria, G. E. Linear scaling second-order Møller–Plesset theory in the atomic orbital basis for large molecular systems. J. Chem. Phys. 1999 , 110 , 3660–3671
5Reynolds et al. 1996 Reynolds, G.; Martinez, T. J.; Carter, E. A. Local weak pairs spectral and pseudospectral singles and doubles configuration interaction. J. Chem. Phys. 1996 , 105 , 6455–6470
6Lee et al. 2000 Lee, M. S.; Maslen, P. E.; Head-Gordon, M. Closely approximating second-order Mo/ller–Plesset perturbation theory with a local triatomics in molecules model. J. Chem. Phys. 2000 , 112 , 3592–3601
7Pulay 1983 Pulay, P. Localizability of dynamic electron correlation. Chem. Phys. Lett. 1983 , 100 , 151–154
8Sæbø and Pulay 1985 Sæbø, S.; Pulay, P. Local configuration interaction: An efficient approach for larger molecules. Chem. Phys. Lett. 1985 , 113 , 13–18

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A complete OSV-MP2 analytical gradient theory for molecular structure and dynamics simulations

Abstract

1 INTRODUCTION

2 THEORY AND IMPLEMENTATION

2.1 OSV-MP2 wavefunction

2.2 Perturbed OSVs and relaxation

2.2.1 OSV orbital rotation

2.2.2 OSV relaxation as perturbed non-degenerate eigenvalue problem

2.3 OSV-MP2 analytical gradient theory

2.3.1 OSV-specific energy gradient Ec{λ}E^{\{\lambda\}}_{c}Ec{λ}​

2.3.2 MO-specific energy gradient Ec[λ]E^{[\lambda]}_{c}Ec[λ]​

2.3.3 AO-specific energy gradient Ec(λ)E^{(\lambda)}_{c}Ec(λ)​

2.4 Implementation scheme

3 APPLICATIONS TO MOLECULAR STRUCTURES

3.1 Accuracy of OSV-MP2 analytical gradients

3.2 Optimized molecular structures

Bond lengths

Bond and dihedral angles

Performance with pair screening

Timing comparison

4 OSV-MP2-DRIVEN AB-INITIO BOMD

4.1 Protonated Eigen and Zundel water cations

Energy drifts

Radial distribution function (RDF)

Vibrational density of states (VDOS)

4.2 Rotational free energy of ethanol

5 CONCLUSIONS

2.3.1 OSV-specific energy gradient $E^{\{\lambda\}}_{c}$

2.3.2 MO-specific energy gradient $E^{[\lambda]}_{c}$

2.3.3 AO-specific energy gradient $E^{(\lambda)}_{c}$