Geminal-Based Configuration Interaction
Thomas M. Henderson, Gustavo E. Scuseria

TL;DR
This paper introduces a geminal-based configuration interaction method that enhances the accuracy of AGP wave functions, especially for systems with strong pairing correlations, by constructing an excitation manifold that generalizes standard CI.
Contribution
It develops a geminal CI approach that improves upon AGP and can incorporate more general geminal wave functions, advancing the modeling of strongly correlated systems.
Findings
Geminal CI outperforms AGP in the reduced BCS Hamiltonian.
The method reduces to standard CI when AGP simplifies to a single determinant.
It naturally extends to more general geminal wave functions.
Abstract
The antisymmetrized geminal power (AGP) wave function has a long history and considerable conceptual appeal, but in many situations its accuracy is wanting. Here, we consider a form of configuration interaction (CI) based upon the AGP wave function and taking advantage of its killing operators to construct an excitation manifold. Our geminal CI reduces to standard single-determinant--based CI in the limit in which AGP reduces to a single determinant. It substantially improves upon AGP in the reduced BCS Hamiltonian, which serves as a prototype for the kinds of strong pairing correlations relevant in Bardeen-Cooper-Schrieffer--style superconductivity. Moreover, our geminal CI naturally generalizes to add correlation to more general geminal-based wave functions than AGP.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Geminal-Based Configuration Interaction
Thomas M. Henderson
Department of Chemistry, Rice University, Houston, TX 77005-1892
Department of Physics and Astronomy, Rice University, Houston, TX 77005-1892
Gustavo E. Scuseria
Department of Chemistry, Rice University, Houston, TX 77005-1892
Department of Physics and Astronomy, Rice University, Houston, TX 77005-1892
Abstract
The antisymmetrized geminal power (AGP) wave function has a long history and considerable conceptual appeal, but in many situations its accuracy is wanting. Here, we consider a form of configuration interaction (CI) based upon the AGP wave function and taking advantage of its killing operators to construct an excitation manifold. Our geminal CI reduces to standard single-determinant–based CI in the limit in which AGP reduces to a single determinant. It substantially improves upon AGP in the reduced BCS Hamiltonian, which serves as a prototype for the kinds of strong pairing correlations relevant in Bardeen-Cooper-Schrieffer–style superconductivity. Moreover, our geminal CI naturally generalizes to add correlation to more general geminal-based wave functions than AGP.
I Introduction
Suppose one has an approximate wave function which one wishes to improve. A simple route toward doing so is to write a new wave function
[TABLE]
where the states are linearly-independent of and the coefficients are parameters to be optimized. By the variational principle, almost certainly has lower energy than does ; at the very least, it cannot be higher.
The trick, of course, is to make a clever choice of states . We would like to be linearly independent of and we would like Hamiltonian and matrix elements between and and between states and to be readily evaluated. If has a symmetry of the exact wave function, we would like to do likewise. And of course we would like to choose states which provide the most improvement to for the least amount of work.
One convenient way to choose the states is to look for operators whose action on is to annihilate it:
[TABLE]
which we shall refer to as killing operators of . Then the subset of all operators which are not themselves killing operators provides a convenient way to construct the states , which we can create as simply
[TABLE]
These states are automatically orthogonal to (and therefore linearly independent of it). Further, matrix elements can be readily evaluated because we can adopt a normal order in which all operators appear adjacent to the bra and all operators appear adjacent to the ket; by reordering the various operators in a given matrix element with the aid of commutation or anticommutation rules, we can extract the non-zero portion of the matrix element as that piece which contains neither nor , and we are left with expectation values only with respect to which we presume can be easily computed. Additionally, if we wish to enforce that the state , like the state , is an eigenstate of some symmetry operator with eigenvalue , then we merely need to further limit our choice of killing operators to those which commute with . The (hermitian adjoints of) the killing operators thus have many desirable properties which make them well-suited to add corrections to .
This observation is by no means new. Indeed, the standard particle-hole construction in configuration interaction (CI) or coupled cluster theory proceeds entirely along these lines, and similar concerns motivate development in multi-reference theories. In this manuscript we seek to follow the same path, using the antisymmetrized geminal power (AGP) wave functionColeman (1965) as the reference . To do so, we first need to construct killing operators of the AGP wave function; this, in turn, requires us to say a few words about AGP.
II AGP And Its Killing Operators
The AGP wave function can be written as
[TABLE]
where is the physical vacuum, is the number of electron pairs in the system, and creates a geminal:
[TABLE]
Here, creates an electron in level and another in a level paired with :
[TABLE]
The AGP wave function has a symmetry known as seniority in which levels and are either both occupied or both empty in every determinant in the expansion of the AGP wave function. General Hamiltonians, of course, do not have seniority as a symmetry, so typically we will need killing operators which do not necessarily conserve seniority. We shall have more to say on this point later (and indeed, such operators for AGP appeared in the literature many years agoWeiner and Goscinski (1983)), but for now we will limit our attention to seniority-conserving killing operators, which we may construct by making use of the operators as well as
[TABLE]
which also preserve seniority.
The only one-body operators we have at hand which preserve both seniority and particle number are the number operators ; as they are Hermitian, if a linear combination of the ’s is a killer, so is its adjoint (we assume that the parameters and the coefficients in any linear combination of operators intended for use as a killing operator are all real, which guarantees that the wave function also respects complex conjugation symmetry). Thus, the relevant seniority- and number-conserving killing operators of the AGP wave function are two-body or higher. We can write these killing operators as
[TABLE]
where we assume (for , the operator is Hermitian and thus of no use to us).
A simple way to guarantee that is a killing operator is to adjust the coefficients , , , , and such that commutes with ; if this is the case, then
[TABLE]
and as annihilates the physical vacuum, it would then also annihilate the AGP state.
Using the commutation relations
[TABLE]
one can verify that up to an overall multiplicative factor, the unique operator which has these properties is
[TABLE]
Note that . For , we have , so we can hope to write a sort of geminal configuration interaction (GCI) state as
[TABLE]
This is the wave function we intend to investigate in this work. We will limit our attention to AGP wave functions which respect spin symmetry, so that the orbitals and are the -spin and -spin spinorbitals for the same spatial orbital. Further, because we have chosen to consider only the seniority-conserving kiling operators, we should limit ourselves to Hamiltonians for which seniority is a symmetry; accordingly, we will specialize to the pairing or reduced BCS Hamiltonian, which can be written entirely in terms of the seniority zero generators , , and and for which the exact energy can be obtained via a form of Bethe ansatz which simply requires solving the nonlinear Richardson equations.Richardson (1963); Dukelsky and Pittel (2004) Note that even for general Hamiltonians, only the portion of the Hamiltonian which itself conserves seniority contributes to AGP expectation values involving , , and .
III The Pairing Hamiltonian
The pairing Hamiltonian can be written as
[TABLE]
While other choices are possible, we take and . Note that for positive the Hamiltonian has an attractive two-body interacton rather than the more familiar repulsive two-body interaction. The mean-field symmetry which spontaneously breaks is, accordingly, number symmetry rather than spin symmetry, giving rise to the well-known Bardeen-Cooper-Schrieffer (BCS) wave function.Bardeen, Cooper, and Scrieffer (1957) Note also that the interaction is infinite in range.
This Hamiltonian was introduced as a phenomenological model to describe pairing correlations in which electron pairs interact with the hole pairs they have left behind, but has also been used to model ultrasmall superconducting grains.Braun and von Delft (1998); Dukelsky and Sierra (1999); Sierra et al. (2000)
Remarkably, the Hamiltonian can be solved exactly. Its eigenstates can be written as
[TABLE]
where the energy is given by the sum of the rapidities , which satisfy a set of coupled nonlinear equations. Note that the exact wave function is simply an antisymmetric product of non-orthogonal geminals.
Although the exact solution is readily available, this Hamiltonian is very difficult for conventional electronic structure methods such as single-reference coupled cluster theoryDukelsky et al. (2003); Henderson et al. (2014) and even extended coupled cluster.Henderson, Bulik, and Scuseria (2015) This can readily be seen by Fig. 1, which shows the fraction
[TABLE]
of the exact correlation energy with respect to Hartree-Fock recovered as a function of . where refers to the value of at which the mean-field undergoes spontaneous symmetry breaking toward a number-broken mean-field solution. But while sophisticated coupled cluster approaches fail for this problem, AGP provides good albeit imperfect results for almost all and is an eigenstate in the limit.111The manner of plotting in Fig. 1 suggests errors in AGP for small , but in fact AGP is exact for ; Fig. 1 merely reflects the fact that AGP delivers the wrong contribution to the total energy Note that in the repulsive case the situation is very different, and traditional coupled cluster performs exceptionally well. The attractive pairing Hamiltonian thus forms an interesting test case for our geminal CI. Our hope is that if we can correctly describe this exactly solvable model, we can also correctly describe related but more physical problems for which no exact solution exists but which pose similar computational challenges.
IV Results
To test the accuracy of our geminal CI ansatz, we return to the half-filled 16-site pairing Hamiltonian, which is shown in Fig. 2, the left panel of which can be directly compared to Fig. 1. Although the geminal CI is not exact, it has successfully remedied almost all of the deficiencies of AGP. This is made much clearer by examination of the right panel of Fig. 2, which shows the fraction of correlation with respect to AGP recovered by the geminal CI wave function and shows that GCI removes, in this case, more than 99% of the energy error made by AGP.
The right panel also reveals a curious (and unpleasant) feature of the geminal CI. While for small the AGP reference becomes more like a single determinant (see below) for which we expect the number of double excitations to be where is the number of pairs and is the number of levels, the number of killing operators we construct is . Several must be linearly-dependent for small , and these linearly-dependent states must be removed from the diagonalization. This, we expected. What was unexpected is that we always encounter one linearly-dependent mode even for larger , and this appears to be true independent of the number of pairs and number of levels . Put differently, there always appears to be at least one linear combination of operators which is a killing operator of AGP and which must therefore be excluded. We do not at present have a better prescription for finding this mode than by diagonalizing the metric and eliminating the mode with zero eigenvalue. Note that this zero eigenvalue is not simply a small eigenvalue, but rather one which vanishes to the double precision in which we have implemented our geminal CI.
To get a sense of how our method performs as we adjust system size, Fig. 3 shows the fraction of correlation energy recovered in several different pairing Hamiltonians. Clearly our results deteriorate somewhat as system size increases, but the accuracy remains uniformly excellent. Results away from half-filling (not shown) are similar. Neither AGP nor truncated CI are fully extensive methods, and we ought not expect geminal CI to be extensive either. The pairing Hamiltonian, unfortunately, is not a good place to test the error in extensivity, because the exact energy is not linear in system size in the first place since the interaction has infinite range. Nonetheless, as the right panel of Fig. 3 makes clear, a large portion of the size-dependence in the geminal CI is due to the underlying AGP rather than to the geminal CI per se.
Although obtaining accurate energies is perhaps the chief priority, we would also like similar accuracy for the calculation of properties (or, in other words, we would like to get both the energy and the wave function right). To assess this, Fig. 4 shows the errors in the occupation numbers for the 16-site, half-filled pairing Hamiltonian at . This value is large enough that we expect to see occupations differ appreciably from 0 and 2 (the mean-field result) without being too large. Clearly the geminal CI significantly reduces the error in the occupation numbers, and thus in typical single-particle properties. We should note that the plot has a symmetry around which is just a consequence of a symmetry of the half-filled pairing Hamiltonian.Hirsch et al. (2002)
V Discussion
We have seen that for the pairing Hamiltonian, our geminal CI is remarkably accurate. Here, we wish to discuss the extension to more realistic Hamiltonians which do not have seniority as a symmetry.
For such Hamiltonians, the relevant killing operators are one-body in nature, and can be written as
[TABLE]
for in terms of operators
[TABLE]
which become the standard spin-integrated unitary group generators when levels and are the - and -spinorbitals corresponding to the same spatial orbital. Killers of AGP were first discussed in quantum chemistry,Weiner and Goscinski (1983) and recently applied by Dukelsky and collaborators to superfluid nuclei,Dukelsky et al. albeit not in the same form as above. We may think of the operators , , and as a basis of one-body operators and so in principle can use them to write a normal order of the form .
Our killing operators are essentially just ; while not strictly identical, and act identically on seniority-zero states. For problems such as the molecular Hamiltonian, we can generalize our geminal CI to the form
[TABLE]
The expectation value of the Hamiltonian with respect to this wave function can readily be taken and minimized with respect to the variational parameters, and we shall report on its applications in due time. The anticommutator in the quadratic term is present to ensure that it corresponds to a genuinely two-body operator (because the commutator of with is a non-zero one-body operator).
Let us say a few words about the single-determinant limit. If we have only non-zero parameters , then the AGP wave function reduces to a single determinant. Using the conventional notation in which and index unoccupied orbitals with and and index occupied orbitals with , then
[TABLE]
The adjoint operators annihilate the reference determinant because , while the adjoint operators are, up to an overall factor, the usual particle-hole excitation operators. Thus, in the single-determinant limit of AGP, our geminal CI reduces to standard single-reference configuration interaction when using the killers (and to the CI analog of pair coupled cluster doubles when using the killers).
Our geminal CI bears close resemblance to some methods which merge correlation and symmetry projection, essentially because AGP is a symmetry-projected mean-field method (AGP is equivalent to number-projected BCS). Although the two methods are distinct, geminal CI is particularly closely related to the projected configuration interaction of Ripoche et al.Ripoche et al. (2017) (and also to a spin-projected analog due to Tshuchimochi and Ten-noTsuchimochi and Ten-no (2016a, b)). A coupled-cluster generalization of our geminal CI would likewise approximate symmetry-projected coupled cluster theory.Duguet (2015); Duguet and Signoracci (2017); Tsuchimochi and Ten-no (2017); Qiu et al. (2017, 2019) These methods write a correlated wave function which uses the killing operators of BCS and then act a number projector on the correlated state. In contrast, our method works directly with those operators which annihilate the projected mean-field state. Figure 5 shows that for a small pairing model, geminal CI provides accuracy similar to symmetry-projected coupled cluster with singles and doubles without needing to worry about the highly technical details of how one might carry out the symmetry projection of a correlated state.
Finally, a word on the name “geminal CI”. We have chosen this name because while we have considered only the AGP case, the ideas discussed in this manuscript generalize to other geminal wave functions. Consider, for example, a geminal wave function
[TABLE]
where the geminals created by the different are strongly orthogonal (i.e. are expanded in mutually orthogonal single-particle orbital spaces):
[TABLE]
This geminal wave function is a kind of cluster mean-field state.Jiménez-Hoyos and Scuseria (2015) It includes AGP on the one hand (when there is a single geminal so that ) and the antisymmetrized product of strongly orthogonal geminalsSurján (1999); Rassolov (2002); Surján et al. (2012); Limacher et al. (2013); Pernal (2014) (APSG) on the other (when the various are all equal to 1). Such wave functions are annihilated by our killing operators and and thus form a perfectly valid reference state for our geminal CI. An APSG-based version of our geminal CI would be related to both Kállay and Surján’s APSG-based CI methodKállay and Surján (1999) and Goddard’s generalized valence bond CI.Carter and Goddard III (1988); Faglioni and Goddard III (1999) While we have limited our attention in this manuscript to the AGP case, the theory is considerably more general.
Acknowledgements.
This work was supported by the U.S. National Science Foundation under Grant CHE-1762320. G.E.S. is a Welch Foundation Chair (C-0036). We thank Jorge Dukelsky for providing his AGP code and Armin Khamoshi for providing code to evaluate AGP density matrices.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Coleman (1965) A. J. Coleman, J. Math. Phys. 6 , 1425 (1965).
- 2Weiner and Goscinski (1983) B. Weiner and O. Goscinski, Phys. Rev. A 27 , 57 (1983).
- 3Richardson (1963) R. W. Richardson, Phys. Lett. 3 , 277 (1963).
- 4Dukelsky and Pittel (2004) J. Dukelsky and S. Pittel, Rev. Mod. Phys. 76 , 643 (2004).
- 5Bardeen, Cooper, and Scrieffer (1957) J. Bardeen, L. N. Cooper, and J. R. Scrieffer, Phys. Rev. 108 , 1175 (1957).
- 6Braun and von Delft (1998) F. Braun and J. von Delft, Phys. Rev. Lett. 81 , 4712 (1998).
- 7Dukelsky and Sierra (1999) J. Dukelsky and G. Sierra, Phys. Rev. Lett. 83 , 172 (1999).
- 8Sierra et al. (2000) G. Sierra, J. Dukelsky, G. G. Dussel, J. von Delft, and F. Braun, Phys. Rev. B 61 , 11890 (2000).
