Non-uniqueness of quantum transition state theory and general dividing surfaces in the path integral space
Seogjoo Jang, Gregory A. Voth

TL;DR
This paper analyzes the non-uniqueness of quantum transition state theory (QTST) and shows that recent formulations like HA-QTST are limited to specific dividing surfaces, revealing ambiguities in their general applicability.
Contribution
The work provides a detailed theoretical analysis of HA-QTST, demonstrating its limitations and clarifying its relation to RPMD rate theory for various path integral dividing surfaces.
Findings
HA-QTST matches RPMD rate theory for quadratic low-frequency modes
Different results arise for higher frequency imaginary time paths
HA-QTST does not generally derive RPMD-TST, indicating ambiguity
Abstract
Despite the fact that quantum mechanical principles do not allow the establishment of an exact quantum analogue of the classical transition state theory (TST), the development of a quantum TST (QTST) with a proper dynamical justification, while recovering the TST in the classical limit, has been a long standing theoretical challenge in chemical physics. One of the most recent efforts of this kind was put forth by Hele and Althorpe (HA) [ J. Chem. Phys. 138 , 084108 (2013)], which can be specified for any cyclically invariant dividing surface defined in the space of the imaginary time path integral. The present work revisits the issue of the non-uniqueness of QTST and provides a detailed theoretical analysis of HA-QTST for a general class of such path integral dividing surfaces. While we confirm that HA-QTST reproduces the result based on the ring polymer molecular dynamics (RPMD) rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Non-uniqueness of quantum transition state theory and general dividing surfaces in the path integral space
Seogjoo Jang111Email:[email protected]
Department of Chemistry and Biochemistry, Queens College, City University of New York, 65-30 Kissena Boulevard, Queens, New York 11367222mailing address & PhD programs in Chemistry and Physics, and Initiative for the Theoretical Sciences, Graduate Center, City University of New York, 365 Fifth Avenue, New York, NY 10016, USA
Gregory A. Voth333Email:[email protected]
Department of Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, 5735 S. Ellis Avenue, Chicago, Illinois 60637, USA
(Accepted to the Journal of Chemical Physics on April 11, 2017)
Abstract
Despite the fact that quantum mechanical principles do not allow the establishment of an exact quantum analogue of the classical transition state theory (TST), the development of a quantum TST (QTST) with a proper dynamical justification, while recovering the TST in the classical limit, has been a long standing theoretical challenge in chemical physics. One of the most recent efforts of this kind was put forth by Hele and Althorpe (HA) [ J. Chem. Phys. 138, 084108 (2013)], which can be specified for any cyclically invariant dividing surface defined in the space of the imaginary time path integral. The present work revisits the issue of the non-uniqueness of QTST and provides a detailed theoretical analysis of HA-QTST for a general class of such path integral dividing surfaces. While we confirm that HA-QTST reproduces the result based on the ring polymer molecular dynamics (RPMD) rate theory for dividing surfaces containing only a quadratic form of low frequency Fourier modes, we find that it produces different results for those containing higher frequency imaginary time paths which accommodate greater quantum fluctuations. This result confirms the assessment made in our previous work [J. Chem. Phys. 144, 084110 (2016)] that HA-QTST does not provide a derivation of RPMD-TST in general, and points to a new ambiguity of HA-QTST with respect to its justification for general cyclically invariant dividing surfaces defined in the space of imaginary time path integrals. Our analysis also offers new insights into similar path integral based QTST approaches.
I Introduction
Certain concepts in classical mechanics cannot be carried over to the quantum regime no matter how useful they may be. Transition state theory (TST)eyring-jcp3 ; wigner-jcp5 in its rigorous definitionwigner-jcp5 ; pechukas-r1 ; pechukas-jcp58 ; chandler-jcp68 ; miller-acc26 is tied to such concepts of classical mechanics, namely, the definability of deterministic trajectories in the phase space. Therefore, no exact analogue of the classical TST can be found in the quantum regime.miller-acc26 ; mclafferty-cpl27 Nonetheless, attempts to develop a quantum TST (QTST) have remained an ongoing and challenging theoretical subject. This is because there is a significant benefit in calculating a rate constant without carrying out any actual real time quantum dynamics calculation. The insight available from a quantum correction factor for the TST also serves as an important motivation. However, a major conceptual issue to be settled in developing a QTST is that distinct starting points of the rate formulation,mclafferty-cpl27 ; yamamoto-jcp123 ; miller-jcp79 ; affleck-prl46 ; gillan-jp-c20 ; voth-jcp91 ; hansen-jcp ; stuchebrukhov-jcp95 ; cao-jcp105 ; pollak-jcp108 which all lead to the same TST rate expression in the classical limit, can produce different results in the genuine quantum regime. In fact, this non-uniqueness of QTST is not unexpected. It rather reflects the fundamental principles of quantum mechanics, and should be recognized as an intrinsic quantum feature. In this sense, it is imperative to understand clearly the assumptions and approximations involved in a given QTST because such an understanding offers a correct assessment of the implications and utility of that particular QTST.
In a recent work,jang-jcp144 we analyzed a new kind of QTST formulated by Hele and Althorpe (HA).hele-jcp138 Our analysis offered a context for this theory within the general consensus on the QTST, as described above, by clarifying key assumptions implicit in HA-QTST. First, we pointed out that the starting expression of HA-QTSThele-jcp138 ; althorpe-jcp139 was missing a clear physical origin tied to response or scattering theories, unlike those introduced by Yamamotoyamamoto-jcp123 and by Miller and coworkers.miller-jcp79 Second, we analyzedjang-jcp144 an apparent approximation HA employed for the evaluation of their flux-side correlation function,hele-jcp138 and have also provided an alternative and exact evaluation. The resulting form based on our evaluation suggested that HA-QTST and ring polymer molecular dynamics (RPMD) TSTcraig-jcp122 ; craig-jcp123 are different in general.
An important issue that was not discussed in detail in our workjang-jcp144 was the concept of a general dividing surface defined in the imaginary time path integral space, which was introduced as a new means to optimize RPMD-TST by Richardson and Althorperichardson-jcp131 and had also previously been considered by otherscao-jcp105 ; mills-cpl278 in the context of improving the path integral QTST (PI-QTST).voth-jcp91 It is important to first note that the introduction of such a dividing surface is a departure from the original formulation of the RPMD rate theory by Craig and Manolopouloscraig-jcp122 ; craig-jcp123 because not every dividing surface in the imaginary time path integral space can be expressed as a ring polymer average of classical ones, namely, an equal weight linear combination of classical dividing surfaces over all the beads of the ring polymer. For this reason, our previous analysisjang-jcp144 was limited to the centroid dividing surface, an obvious ring polymer averaged quantity. For this latter case, we have confirmed that HA-QTST reproduces the PI-QTST rate expression above the crossover temperature,voth-jcp91 to which the RPMD-TST expression proposed a number of years later was also shown to be equivalent.craig-jcp123 We note that this outcome is not an exact derivation of PI-QTST or RPMD-TST because neither of these formulations of QTST are exact except in certain limits. In a more recent article,hele-jcp144 HA provided further analysis of the general expression obtained in our work,jang-jcp144 and presented a result that seems to suggest again the equivalence between HA-QTST and RPMD-TSTrichardson-jcp131 for general dividing surfaces. However, as will be shown here, this is not always true.
Before presenting our new analysis addressing the issue of general path integral dividing surfaces, we want to stress that our assessmentjang-jcp144 of HA-QTST as being one of many non-unique versions of QTST remains intact. In particular, our assessment of the assumption implicit in employing the limit of a specially constructed flux-side time correlation function, namely, one that neglects a certain part of the Feynman diagram and thus in effect assumes that certain operators can commute, requires no further clarification. Without such an assumption, and without any coarse-graining in time, the exact limit of the flux-side correlation function will always vanish in the quantum limit.costley-cpl83 ; wolynes-prl47 With this issue clarified, we will here focus our attention mostly on the subject of general path integral dividing surfaces. These general dividing surfaces arise from the specially constructed time correlation function within the approximation of HA-QTST.
An important condition underlying the analysis by HA in their recent workhele-jcp144 is that the dividing surface is a smooth function of imaginary time, an important assumption that should not be overlooked. We will show here that this analysis does not hold true for more general and yet cyclically invariant dividing surfaces in the imaginary time path integral space. This result confirms our previous conclusion that HA-QTSThele-jcp138 is not an exact derivation of RPMD-TSTcraig-jcp122 ; craig-jcp123 ; richardson-jcp131 in general because an exact quantum mechanical identity should be independent of the nature of the dividing surface.
This paper is organized as follows. Section II provides a brief overview of HA-QTST and our recent analysis, and summarizes its relationship to RPMD-TST. Section III provides a detailed evaluation of our final expression for HA-QTST for two types of cyclically invariant paths, and demonstrates our central point that HA-QTST does not lead to RPMD-TST for general cyclic paths. Section IV provides concluding remarks.
II Summary of HA-QTST and its relationship with RPMD-TST
The formulation of HA-QTST,hele-jcp138 for the case where the population function is the step function and assuming one dimensional reaction coordinate, starts from the following generalized Kubo-transformed side-side correlation function (GKSCF):
[TABLE]
where represents the dividing surface11endnote: 1 Although is defined in the space of imaginary time path integral, this may not always be a true dividing surface of the multidimensional ring polymer potential energy profile. Rather, this can be viewed as a some kind of path integral generalization of the classical dividing surface in general. defined in the space of imaginary time path integral, and
[TABLE]
with , and
[TABLE]
As mentioned in the Introduction and in our previous work,jang-jcp144 the quantum dynamical meaning of Eq. (1) is not clear as it is not derived from, e.g., any response theory. The primary motivation provided by HAhele-jcp138 for using this time correlation function is that the limit of its time derivative is nonzero and positive, although the value of the exact time correlation function at time is zero.jang-jcp144 The purely mathematical basis for the classical-like singularity in the correlation function that imparts its nonzero value at is easy to understand, but its physical meaning is much less clear in the context of quantum mechanics. HA then utilized this singular behavior of the time derivative of GKSCF to define the following rate expression:hele-jcp138
[TABLE]
and evaluated this by employing a short time approximation for real time propagators, , within the definition of {\mathcal{G}}({\bf q},\mbox{\boldmath\eta},{\bf z};t), Eq. (3). In this procedure, a natural classical-like variable defined as
[TABLE]
emerges and plays an important role in the derivation of the final rate expression by HA.hele-jcp138 However, on the other hand, we have shownjang-jcp144 through partial integration that Eq. (4) can be expressed as
[TABLE]
where and are average imaginary time momenta defined as follows:
[TABLE]
No approximation has been made in deriving Eq. (6). When evaluated up to only the leading order of ,
[TABLE]
Equation (6) is in contrast with the expression derived by HA,hele-jcp138 which amounts to replacing the imaginary time momenta and with defined by Eq. (5). As noted in our earlier work,jang-jcp144 this is clearly an approximation.
In our earlier work,jang-jcp144 we then showed that Eq. (6) can be evaluated further employing a normal mode transformation used by HA.hele-jcp138 To this end, it is convenient to introducehele-jcp138 ; jang-jcp144 the following functions:
[TABLE]
Then, employing Eqs. (9)-(10), and performing integrations over all the modes of and perpendicular to the vector formed by ’s, we have obtained the following expression:jang-jcp144
[TABLE]
where (\mbox{\boldmath\eta}_{0})_{k}=\tilde{\eta}_{0}({\bf q})T_{k}({\bf q}) and \rho({\bf q},\mbox{\boldmath\eta}_{0}) can be approximated as
[TABLE]
In the above expression, is the conventional imaginary time path integral representation of the diagonal position element of the canonical density operator defined as
[TABLE]
with the cyclic boundary condition , and
[TABLE]
In their recent work,hele-jcp144 HA argued that Eq. (14) becomes equivalent, in the limit of , to the following version of the RPMD-TST expression:richardson-jcp131
[TABLE]
where is the normalization factor defined by Eq. (12) and, through Eq. (11), can also be expressed as
[TABLE]
In order for Eqs. (14) and (18) to be equivalent in the limit of , as claimed by HA,hele-jcp144 contributions of and in the former should become negligible compared to others in that limit. To be more precise, the two become equivalent if the following conditions are satisfied:
[TABLE]
For the type of smooth dividing surfaces assumed by HA,hele-jcp138 ; hele-jcp144 the above conditions are satisfied. However, in the following section, we show that the analysis does not hold true for more general cyclically invariant dividing surfaces in the imaginary time path integral space.
III Dependence of HA-QTST on Dividing Surface
Because of the cyclic invariance of the summation in Eq. (17), can also be expressed as follows.
[TABLE]
where Eq. (11) has been used in the second equality. The main argument made by HAhele-jcp144 is that and that , for which Eqs. (20) and (21) are satisfied and Eq. (14) reduces to Eq. (18) after Gaussian integration over . However, such an agreement is true only for a certain simple class of dividing surfaces in which the contribution of the quantum kinetic energy is vanishingly small, and hence so are the quantum path fluctuations. For general cyclically invariant dividing surfaces, this is not true. This can be shown by demonstrating results for two specific examples of the dividing surface as described below.
III.1 Dividing surface containing a quadratic form in the Fourier modes of the imaginary time path
Generalizing the dividing surfaces introduced by Althorpe and coworkers,richardson-jcp131 ; hele-jcp138 we here consider a class of cyclically invariant dividing surfaces defined as
[TABLE]
where is a real phase factor that can be determined or optimized separately, is a nonnegative integer less than or equal to , and
[TABLE]
The above function corresponds to the norm of the th Fourier mode of the cyclic imaginary time path. It is worthwhile to note here that there can be additional restriction for in order for to be a genuine dividing surface in the path integral space. An obvious condition is that cannot be zero or close to it because the information on the average position of ring polymers gets lost in such case. With this issue clarified and under the assumption that a physically appropriate choice of can be made, below we examine whether the conditions of Eqs. (20) and (21) are indeed satisfied for any choice of .
First, taking derivative of Eq. (23) with respect to , we find that
[TABLE]
Using the above expression in Eq. (12), we also fine that
[TABLE]
Therefore, based on the definitions of Eqs. (11) and (17), and after employing standard trigonometric manipulation, we obtain the following general expressions:
[TABLE]
Employing the fact that , from Eq. (27), we identify the following three types of scaling behavior depending on the value of :
[TABLE]
Similarly, from Eq. (28), we find the following three types of scaling behavior:
[TABLE]
The results shown in Eqs. (29) and (30) confirm that the conclusionhele-jcp144 by HA is true only for . In fact, this limitation was noted briefly in the Appendix A of Ref. hele-jcp144, , but has not been stated anywhere else. For the case of , the term involving does not vanish but remains finite. Therefore, the difference between HA-QTST and RPMD-TST can be accounted for by a finite factor. For , both the terms involving and cause HA-QTST to be different from RPMD-TST. In fact, in this case, it is not clear whether HA-QTST results in a finite value. Neglecting these cases amount to projecting the imaginary time path integral space onto the subspace where the quantum kinetic energy is not fully accounted for, which it seems cannot be overlooked in any QTST that aspires to more fully incorporate quantum effects.
As an example, let us consider the case where is even and . Equation (25) in this case becomes
[TABLE]
where and . Using the above expression in Eq. (12), we also find that
[TABLE]
Using the above expression in Eq. (11), we can calculate that
[TABLE]
In addition, from Eq. (22), we also find that
[TABLE]
The above two expressions, Eqs. (33) and (34), serve as explicit demonstration of the scaling behavior of Eqs. (29) and (30) for and thus prove that both conditions of Eqs. (20) and (21) are indeed violated.
As an alternative and more general example, let us consider a specific realization of the cyclic path along which remains constant as follows:
[TABLE]
where , , and are fixed parameters. For this choice, it can be shown that
[TABLE]
Inserting Eq. (37) into Eq. (28), we find that
[TABLE]
which clearly follows the scaling behavior of Eq. (30). Equation (27) for this cyclic path can also be calculated and simplifies to
[TABLE]
The above expression follows the scaling behavior shown in Eq. (29). Figure 1 shows this behavior more clearly for and for three specific choices of , , and assuming .
Ultimately, what contributes to the difference between HA-QTST and RPMD-TST is the difference of the summation in Eq. (14) and that in Eq. (18). This can also be calculated explicitly as follows:
[TABLE]
The above expression shows that the actual difference between the two sums decreases further for and due to some cancellation, but the scaling behavior predicted from Eq. (29) remains the same for .
III.2 Dividing surface containing a quadratic form defined in terms of the imaginary time path coordinates
We here consider another kind of cyclically invariant dividing surface defined as
[TABLE]
where
[TABLE]
Because of the cyclic boundary condition for , this function is invariant with respect to cyclic permutation. In Eq. (42), is an appropriate normalization factor that makes an order of unity. For example, for , . For , .
For the dividing surface defined above,
[TABLE]
and it can be shown that
[TABLE]
For the case where , and . Therefore, assuming that for , , and , we obtain the following estimates.
[TABLE]
This is similar to the case of for the dividing surface in the previous subsection, for which HA-QTST may not be well defined.
On the other hand, for , and . As a result,
[TABLE]
The behavior shown above is a mixture of those for and for the dividing surface in the previous subsection, and also renders HA-QTST ill-defined.
The analysis in this subsection thus demonstrates that, for the case where the dividing surface function mixes different Fourier modes of the imaginary time path in a nonlinear manner, the assumption of the analysis by HAhele-jcp144 does not hold true in general. As a result, our exact evaluationjang-jcp144 of HA-QTSThele-jcp138 expression becomes different from the RPMD-TST expression.richardson-jcp131
IV Concluding Remarks
In this work, we have presented a new and detailed analysis of the effect of general cyclic imaginary time path integral dividing surfaces on the corresponding result of HA-QTST.hele-jcp138 What is believed to make HA-QTST unique is the fact that it is defined for an arbitrary form of cyclically invariant dividing surface constructed in the space of the imaginary time path integral. However, HA-QTST in its present form does not offer any independent and self-consistent prescription of the best choice out of the infinite number of dividing surface possibilities available, and its implementation practice has been confined to the dividing surfaces of the type in Eq. (23) with followed by a limited variational optimization. This is different from the instanton theory,affleck-prl46 ; gillan-jp-c20 which provides a definition of the dividing surface as the saddle point within the semiclassical approximation for the imaginary time action and can also be connected to the PI-QTST.cao-jcp105 ; jang-jpca103-2 Although HA-QTST has been shown to work well for the kind of dividing surface given by Eq. (23) with , and it becomes equivalent to RPMD-TST for , the analysis presented in this work clarifies that different outcomes are expected for more general dividing surfaces. Along with our previous work,jang-jcp144 which was focused more on the underlying quantum mechanical expressions and approximations inherent in HA-QTST, the present work again emphasizes the major challenges in developing a unique and well defined QTST while utilizing the full quantum free energy expression defined in the imaginary time path integral space.
Regardless of our analyses presented here and in a recent work,jang-jcp144 one can justifiably argue that HA-QTSThele-jcp138 or RPMD-TSTcraig-jcp123 for a smooth dividing surface defined in the imaginary time path integral space is useful in practice. However, one should not misinterpret this degree of usefulness as a quantum mechanical validation of some key assumptions and approximations behind HA-QTST and RPMD-TST.
We conclude this work by offering a few additional insights into RPMD-TST and HA-QTST. First, it is relatively easy to understand RPMD-TST.craig-jcp123 As is clear from Eq. (18), RPMD-TST for a general dividing surface is the relative thermal weight for the dividing surface multiplied by the average “classical” flux across it. Therefore, this can be seen as an extension of the PI-QTST above the crossover temperaturevoth-jcp91 for a general dividing surface defined in the imaginary time path integral space. The fact that this limit is obtained from the RPMD rate theorycraig-jcp122 ; craig-jcp123 is the result of the principles governing classical mechanics and does not have a quantum mechanical origin.
The implications inherent in HA-QTSThele-jcp138 are more subtle as its starting point, GKSCF, already takes both classical and ensemble concepts and encodes them into the theory. Unlike ring polymer averages defined in the RPMD rate theory, which can be considered as ensemble averages of actual physical observables, the GKSCF defines a function of imaginary time paths (a dividing surface in the path integral space) as the argument of the population measurement. Can this kind of population be measured? Such a measurement requires measuring positions for the quantum canonical ensemble and then giving a weight only when a nonlinear (in general) function constructed from the measured positions meets the criterion of the population. This is not in general an ensemble average of a physical measurement but rather a subset of the ensemble of measured values that are selected and weighted in certain way. Even if such a measurement were possible, assuming a time correlation of such a quantity exists implies that it does not disturb the initial quantum state, which is why its time derivative has a finite limit. Thus, GKSCF is different from the original exact Kubo-transformed time correlation functionkubo-jpsj-12 ; kubo-jpsj-12-2 which represents a well-defined response of a quantum ensemble following a physical perturbation.
The fact that HA-QTST reduces to RPMD-TST for only smooth dividing surfaces, but may not be well defined for general cyclic dividing surfaces in the path integral space as demonstrated in this work, also reveals an additional feature of the theory. This result means that HA-QTST becomes equivalent to RPMD-TST only when the effect of operator ordering along the dividing surface of the imaginary time path integral becomes vanishingly small. In practice, confining the dividing surface to smooth functions ensures that it remains close to the instanton trajectory below the crossover temperature, which may be sufficient. Interestingly, HA-QTST, in its practice of using a smooth dividing surface, may be viewed as a heuristic approximation that interpolates the PI-QTST above the crossover temperaturevoth-jcp91 and the instanton theory below the crossover temperature,affleck-prl46 but without a change in the pre-exponential factor (a change that can be argued to be “needed”cao-jcp105 ). Alternatively, HA-QTST might be considered to be a semi-analytical approximation for a numerical approach developed some years ago that proposesmills-cpl278 to calculate the exact saddle point of the action in the full imaginary time path integral space.cao-jcp105
Acknowledgements.
SJ acknowledges the support for this research from the National Science Foundation (CHE-1362926) and the Office of Basic Energy Sciences, Department of Energy (DE-SC0001393). GAV acknowledges the support of the National Science Foundation (NSF) through grant CHE-1465248. We thank David Reichman for discussion in the early stage of this work and Eli Pollak for helpful comments and suggesting detailed consideration of the case .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) H. Eyring, J. Chem. Phys. 410 , 107 (2005).
- 2(2) E. P. Wigner, J. Chem. Phys. 5 , 720 (1937).
- 3(3) P. Pechukas, in Dynamics of Molecular Collisions, Part B , edited by W. H. Miller (Plenum, New York, 1976).
- 4(4) P. Pechukas and F. J. Mc Lafferty, J. Chem. Phys. 58 , 1622 (1973).
- 5(5) D. Chandler, J. Chem. Phys. 68 , 2959 (1978).
- 6(6) W. H. Miller, Acc. Chem. Res. 26 , 174 (1993).
- 7(7) F. J. Mc Lafferty and P. Pechukas, Chem. Phys. Lett. 27 , 511 (1974).
- 8(8) T. Yamamoto, J. Chem. Phys. 123 , 104101 (2005).
