Perdew-Zunger self-interaction correction: How wrong for uniform densities and large-Z atoms?
Biswajit Santra, John P. Perdew

TL;DR
This paper analyzes the limitations of the Perdew-Zunger self-interaction correction, revealing significant errors for uniform densities and large-Z atoms, and suggests the need for a more accurate generalized SIC.
Contribution
The study provides analytical and extrapolated estimates of PZ SIC errors for large atoms, highlighting its inaccuracies for uniform densities and proposing directions for improved SIC methods.
Findings
PZ SIC introduces large errors in uniform gas correlation energies.
Estimated errors for large atoms are +5.5% (LSDA-SIC) and -3.5% (PBE/SCAN-SIC).
Errors may explain PZ SIC's poor performance in equilibrium properties.
Abstract
Semi-local density functionals for the exchange-correlation energy of a many-electron system cannot be exact for all one-electron densities. In 1981, Perdew and Zunger (PZ) subtracted the fully-nonlocal self-interaction error orbital-by-orbital, making the corrected functional exact for all collections of separated one-electron densities, and making no correction to the exact functional. Although the PZ self-interaction correction (SIC) eliminates many errors of semi-local functionals, it is often worse for equilibrium properties of sp-bonded molecules and solids. Non-empirical semi-local functionals are usually designed to be exact for electron gases of uniform density, and thus also make 0% error for neutral atoms in the limit of large atomic number Z, but PZ SIC is not so designed. For localized SIC orbitals, we show analytically that the LSDA-SIC correlation energy per electron of…
| Functional | ||||
|---|---|---|---|---|
| Exact | 0.2209Cancio et al. (2018) | -0.224Burke et al. (2016) | 0.02073Cancio et al. (2018) | 0.0372Burke et al. (2016) |
| LSDA | 0.2209Cancio et al. (2018) | -0.0Burke et al. (2016) | 0.02073Cancio et al. (2018) | -0.00451Cancio et al. (2018) |
| PBE | 0.2209Cancio et al. (2018) | -0.215Elliott and Burke (2009) | 0.02073Cancio et al. (2018) | 0.03936Burke et al. (2016) |
| 10 | -12.108 | -10.9668 | -12.0269 | -12.1413 | -12.4636 | -12.0385 | -11.9934 |
|---|---|---|---|---|---|---|---|
| 18 | -30.188 | -27.8122 | -29.9697 | -30.2563 | -31.1554 | -29.8464 | -29.7817 |
| 36 | -93.890 | -88.5356 | -93.3685 | -94.0488 | -97.6783 | -92.3723 | -92.1973 |
| 54 | -179.200 | -170.5124 | -178.2248 | -179.2896 | -186.5765 | -175.5691 | -175.2490 |
| 10 | -0.3910 | -0.7398 | -0.3469 | -0.3444 | -0.4108 | -0.2834 | -0.3446 |
|---|---|---|---|---|---|---|---|
| 18 | -0.7254 | -1.4232 | -0.7042 | -0.6916 | -0.7952 | -0.5718 | -0.6911 |
| 36 | -1.8504 | -3.2683 | -1.7632 | -1.7591 | -1.8579 | -1.4797 | -1.7584 |
| 54 | -3.0002 | -5.1782 | -2.9159 | -2.9059 | -2.9586 | -2.4692 | -2.9047 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Chemical Physics Studies · Inorganic Fluorides and Related Compounds · Magnetic and transport properties of perovskites and related materials
Perdew-Zunger self-interaction correction: How wrong for uniform densities and large- atoms?
Biswajit Santra
Department of Physics, Temple University, Philadelphia, PA 19122
John P. Perdew
Department of Physics, Temple University, Philadelphia, PA 19122
Department of Chemistry, Temple University, Philadelphia, PA 19122
Abstract
Semi-local density functionals for the exchange-correlation energy of a many-electron system cannot be exact for all one-electron densities. In 1981, Perdew and Zunger (PZ) subtracted the fully-nonlocal self-interaction error orbital-by-orbital, making the corrected functional exact for all collections of separated one-electron densities, and making no correction to the exact functional. Although the PZ self-interaction correction (SIC) eliminates many errors of semi-local functionals, it is often worse for equilibrium properties of -bonded molecules and solids. Non-empirical semi-local functionals are usually designed to be exact for electron gases of uniform density, and thus also make 0% error for neutral atoms in the limit of large atomic number , but PZ SIC is not so designed. For localized SIC orbitals, we show analytically that the LSDA-SIC correlation energy per electron of the uniform gas in the high-density limit makes an error of -50 in the spin-unpolarized case, and -100% in the fully-spin-polarized case. Then we extrapolate from the Ne, Ar, Kr, and Xe atoms to estimate the relative errors of the PZ SIC exchange-correlation energies (with localized SIC orbitals) in the limit of large atomic number: about +5.5% for the local spin density approximation (LSDA-SIC), and about -3.5% for nonempirical generalized gradient (PBE-SIC) and meta-generalized gradient (SCAN-SIC) approximations. The SIC errors are considerably larger than those that have been estimated for LSDA-SIC by approximating the localized SIC orbitals for the uniform gas, and may explain the errors of PZ SIC for equilibrium properties, opening the door to a generalized SIC that is more widely accurate.
I Introduction
A realistic description of bonding between atoms in molecules and solids within Kohn-Sham (KS) ground-state density functional theory Kohn and Sham (1965); Jones (2015) requires a realistic approximation to the exact spin-density functional for the exchange-correlation energy Jones (2015). Local or semi-local functionals such as the local spin density approximation (LSDA), the generalized gradient approximation (GGA), and the meta-GGA are widely employed. These functionals are single integrals over position space of energy densities that are functions of ingredients readily available at each position from self-consistent solution of the KS or generalized KS one-electron Schrödinger equation. Standard ingredients include the spin-resolved electron densities (LSDA), and sometimes also their gradients (GGA) and even orbital kinetic energy densities (meta-GGA). Semi-local functionals are computationally efficient and often usefully accurate, and lend themselves to non-empirical construction by satisfaction of many exact constraints and appropriate norms Sun et al. (2015). The earliest appropriate norms were the uniform electron gases Kohn and Sham (1965); Perdew and Wang (1992), for which nearly all practical functionals are exact by construction.
Perhaps the simplest density is that for one isolated electron (e.g., ), for which the exact exchange-correlation energy must exactly cancel the spurious Hartree electrostatic self-interaction. But, since the Hartree energy
[TABLE]
is a fully-nonlocal functional of the density, no local or semi-local approximation can be exact for all possible one-electron densities. In 1981, Perdew and Zunger (PZ) Perdew and Zunger (1981) proposed to subtract the spurious self-interaction from each occupied orbital,
[TABLE]
where is the -th occupied () orbital density of spin , and thus a one-electron density. Eq. (I) is exact for any collection of ground- or excited-state one-electron densities. PZ self-interaction correction (SIC), applied to the exact functional, makes no change in it.
PZ also realized that the right orbitals for SIC cannot be the often-delocalized Kohn-Sham orbitals, but must be localized to make a properly size-extensive theory in which the energy of a periodic array of separated atoms always becomes times the energy of one atom in the limit of large separation. Lin and collaborators Pederson et al. (1984) found unitary transformations in the occupied-orbital space that minimize the PZ SIC energy. More recently, Pederson et al. Pederson et al. (2014) have found a way to limit this transformation to Fermi-Löwdin orbitals (FLO’s) that are guaranteed to be localized, because each has a Fermi orbital descriptor or position where
[TABLE]
PZ SIC leads to an almost linear (and thus almost exact Perdew et al. (1982)) variation of the energy of an open system with non-integer average electron number, between adjacent integer electron numbers, and thus often avoids spurious fractional charges on dissociated atoms Ruzsinszky et al. (2006). And it corrects many other qualitative failures of the semi-local functionals, yielding a correct exchange-correlation potential at the edges of a system, bound negative atomic ions, raised and improved energy barriers to chemical reactions Patchkovskii and Ziegler (2002), etc.
For equilibrium properties like atomization energies and equilibrium bond lengths, the non-empirical semi-local functionals have improved in accuracy from LSDA Kohn and Sham (1965); Perdew and Wang (1992) to GGA (e.g., Ref. Perdew et al., 1996) and meta-GGA (e.g., Ref. Sun et al., 2015). But PZ SIC applied to GGA Vydrov and Scuseria (2004); Vydrov et al. (2006) and meta-GGA Shahi et al. is actually less accurate than GGA or meta-GGA, respectively, for these properties. Part of the explanation for this lies in the noded character of overlapped orthogonal localized real orbitals. Using complex SIC orbitals leads to a lower total energy, and reduces but does not eliminate the errors in the equilibrium properties Klüpfel et al. (2011). We have recently argued Shahi et al. that lobed one-electron densities (including the familiar stretched one-electron densities and the less familiar noded ones) are problematic for semi-local approximation. And some of the troublesome lobed character survives after the nodes are removed.
Another part of the explanation could be that PZ SIC loses the exact uniform-density limit that the non-empirical semi-local functionals all respect. We now know that, in the limit of large atomic number , most of the electron density in a neutral atom is slowly-varying on the scale of the local Fermi wavelength Perdew et al. (2006), and that the local density approximation, based only upon the uniform electron gas, becomes relatively exact for the exchange energy and even for the correlation energy (according to numerical evidence) Burke et al. (2016). The exact large- asymptotic expansions for the exchange Schwinger (1981); Elliott and Burke (2009) and correlation Burke et al. (2016); Cancio et al. (2018) energies of neutral atoms are:
[TABLE]
[TABLE]
Clearly the ratio tends to zero as tends to infinity. LSDA Kohn and Sham (1965); Perdew and Wang (1992), the PBE GGA Perdew et al. (1996), and the SCAN (strongly-constrained and appropriately-normed) meta-GGA Sun et al. (2015) have similar expansions, and all of those semi-local approximations are exact to leading order. PBE and SCAN, but not LSDA, are also accurate for the next-order terms in in Eqs. (4) and (5), as shown in Table 1. (While SCAN was fitted to the exact large- limits, the values of its coefficients for Eqs. (4) and (5) are unpublished.)
Before the relevance of the uniform gas to real systems was fully understood, it was hard to determine the errors of PZ SIC for the uniform electron gas, because there was no reliable construction of the localized SIC orbitals. Relative errors in the PZ LSDA-SIC negative exchange energy of about -2.5% Norman (1983) or about +0.8% or more Pederson et al. (1989) were estimated. But we can readily construct the energy-minimizing real Fermi-Löwdin orbitals for the atoms Ne, Ar, Kr, and Xe, then extrapolate the exchange energy to large using the known asymptotic expansions of Eqs. (4) and (5). The resulting relative errors in the negative exchange energy turn out to about +5.5% for PZ LSDA-SIC and about -3.5% for PZ PBE-SIC and PZ SCAN-SIC, and thus not negligible. Since and are negative, a positive percentage error means that an approximation is too negative, and a negative percentage error means that the approximation is not negative enough.
In the high-density limit, exchange completely dominates over correlation. Since the PZ LSDA-SIC exchange energy per electron of a uniform electron gas is too low with real SIC orbitals, minimizing the total SIC energy over complex orbitals can only make it lower and less accurate. A consequence of our work is that, in a high-density uniform electron gas, a real unitary transformation that minimizes the total energy will yield localized SIC orbitals in PZ LSDA-SIC, but (unless constrained as in the Fermi-Löwdin-orbital approachPederson et al. (2014)) delocalized SIC orbitals in PZ PBE-SIC and PZ SCAN-SIC. Although delocalized or Kohn-Sham SIC orbitals would make the PZ PBE-SIC and PZ SCAN-SIC exact for uniform electron gases, they are ruled out by our requirement that SIC be size extensive.
While the exact exchange energy of Kohn-Sham theory requires a multiplicative exchange potential , it is for practical purposes equivalent to the Hartree-Fock exchange energy for the same electron spin densities. The difference is only about 0.03% for the Ar atomGörling and Ernzerhof (1995), and tends to 0% as . The exact exchange energy and all reasonable approximations to it (including Eq. (I) with replaced everywhere by ) are homogeneous functionals that obey a uniform-density-scaling equality Levy and Perdew (1985). Thus, while the large- limit for neutral atoms corresponds to high electron densities, the relative (%) error of a reasonable approximation to the exchange energy functional is the same for all uniform electron densities.
Since SCAN is self-correlation-free, it requires and gets no SIC for correlation, but the LSDA and PBE correlation functionals do. The leading term for the correlation energy in the high-density limit, with or without SIC, can be found exactly by analytic derivation, and will thus be presented before and used in the extrapolation for the exchange-correlation energy. The PZ SIC relative errors for the correlation energy in the high-density limit are -50% to -100%, for the spin-unpolarized and fully-polarized cases respectively. The special property of the logarithm, , makes it possible to find these relative errors without constructing the localized SIC orbitals for the uniform electron gas.
Understanding the errors that PZ SIC makes for uniform densities and for large- atoms is a necessary first step toward the development of a generalized SIC Per that would be more widely accurate than PZ SIC. Such a correction might modify the semi-local part of the PZ SIC of Eq. (I) in regions where the localized SIC orbitals overlap, as in Ref. Vydrov et al., 2006 but without modifying the fully non-local part.
Our calculations for atoms are made with the FLOSIC flo molecular code, which in turn is based upon and uses the default all-electron Gaussian basis set of the NRLMOL code Porezag and Pederson (1999). The basis sets allow us to treat Ne, Ar, Kr, and Xe, but not Rn (which has electrons). The integration mesh for SCAN is about three times bigger than the default mesh. The integration mesh in the FLOSIC code is variational Pederson and Jackson (1990) and for all calculations the maximal error margin for the integration mesh is set to . The descriptors or positions of the FLOs are optimized using LSDA-SIC until the maximum component of the forcesPederson (2015) are lower than Hartree/Bohr. The LSDA-SIC optimized descriptors are used to self-consistently minimize the FLOs and the corresponding PZ SIC total energy Yang et al. (2017) with a threshold of Hartree for all PZ SIC functionals employed here. The exchange and correlation energies obtained from these calculations are given in Tables 2 and 3.
II Correlation energy at high density: Analytic derivation
The exact correlation energy per electron for a uniform electron gas in the high-density limit is given by the random phase approximationGell-Mann and Brueckner (1957); Misawa (1965):
[TABLE]
where is the total density and is the relative spin polarization (0 for the unpolarized case, and for the fully spin-polarized case). Note thatMisawa (1965) . We are interested in the unpolarized case, but will need the fully-polarized case to make the SIC.
In PZ SIC the localized SIC orbital density is
[TABLE]
Eq. (7) has the proper uniform-density scalingLevy and Perdew (1985), can integrate to 1 for all , and can satisfy Eq. (3) for all . The shape of this orbital density is unknown, but will not contribute to the correlation energy per electron of order . Thus the relative error in the high-density limit will be the same whether the SIC orbitals are real or complex. For LSDA, which are exact for all uniform densities, the PZ SIC correlation energy per electron of the spin-unpolarized uniform electron gas in the high-density limit is
[TABLE]
Using Eq. (7) and applying the special property of the logarithm to Eq. (II) we obtain:
[TABLE]
Neglecting the lower-order term (a constant independent of ) and using , we find
[TABLE]
for a -50% error. Following similar reasoning, the PZ LSDA-SIC correlation energy per electron for fully spin-polarized uniform electron gas in the high-density limit is 0 times , for a -100% error.
In the large- limit the exact correlation energy of the (spin-unpolarized) atom is , where Cancio et al. (2018). Thus in the large- limit the LSDA-SIC correlation energy is
[TABLE]
for a -50% error.
For the PBE-SIC correlation energy, however, the relative error in the large- limit is 0%. This follows from Eq. (9) of Ref. Perdew et al., 1996, and from the fact that the reduced density gradient for the SIC orbital density does not approach 0 in the large- limit. As a result, the self-interaction correction to the PBE correlation energy per electron scales as , and not as , when .
Using the self-consistent density for each functional and nearly-exact correlation energiesBurke et al. (2016) for the rare-gas atoms Ne, Ar, Kr, and Xe, we have computed the relative errors of the various correlation energy functionals in Fig. 1. We have then fitted them to the formula
[TABLE]
where . This formula was constructed by keeping just the first two terms in the expansion of Eq. (5), for both the exact and the approximate functionals, with the coefficients of the leading terms as derived in this section, making for LSDA, PBE, SCAN, and PBE-SIC, but for LSDA-SIC. SCAN, which is already self-correlation-free, needs no SIC for correlation. From this, we constructed the large- expansion of Eq. (12) for the relative error of each approximate functional. Enough terms in that expansion were kept to provide three parameters , , and to be least-squares fitted to the four considered atoms. (Four parameters would overfit the data.)
It is a long extrapolation from (Xe) to . The data of Fig. 1 for each functional prefigure but do not pinpoint that functional’s analytic large- limit.
III Exchange and Exchange-Correlation energy: Extrapolation of the relative errors for Ne, Ar, Kr, and Xe
Using the self-consistent density for each functional and Hartree-Fock exchange energiesBecke (1988) for the rare-gas atoms Ne, Ar, Kr, and Xe, we have plotted the relative errors of the various approximate exchange functionals in Fig. 2. We have then fitted them to the formula
[TABLE]
where . This formula was constructed by keeping just the first three terms of Eq. (4) for both the exact and the approximate exchange energies (Appendix A). From this, we constructed the large- expansion of Eq. (13), keeping just enough terms to provide three parameters , , and to be least-squares fitted to the four considered atoms.
Although we know that the parameter should be 0% for LSDA, PBE, and SCAN, we have not constrained to this exact value. Values of less than 0.5% emerge for all three approximations (Appendix B), suggesting that the considerably larger +5.8% for LSDA-SIC, -3.3% for PBE-SIC, and -3.6% for SCAN-SIC can be trusted within . These large- limits are clearly pre-figured in the data for Ne, Ar, Kr, and Xe.
The exchange-correlation energy is just the sum of the exchange and correlation energies, and is presented for completeness in Fig. 3. The fitting formula is
[TABLE]
where . and are exact values from Table 1. is 0% for LSDA, PBE, SCAN, PBE-SIC, and SCAN-SIC, but -50% for LSDA-SIC. Eq. (III) was constructed (Appendix A) almost as Eq. (13) was, but includes the exact terms derived in section II. The or limit is dominated by exchange, and yields conclusions similar to those of Fig. 2: LSDA-SIC makes a +5.3% error in this limit, while PBE-SIC and SCAN-SIC respectively make -3.6% and -3.9% errors.
IV Conclusion
The Perdew-Zunger self-interaction correction (PZ SIC) to a semi-local density-functional approximation for the exchange-correlation energy imposes some additional exact constraints, such as exactness for all collections of separated one-electron densities and (nearly) linear variation of the total energy between adjacent integer electron numbers. But it loses other exact constraints or appropriate norms, such as exactness for all uniform densities and 0% error for neutral atoms in the limit of large atomic number. This could explain PZ SIC’s curious performance history of striking successes and equally striking failures, which are in a way the complements to those of the semi-local approximations.
The difficulty in constructing energy-minimizing but localized SIC orbitals for the uniform electron gas has led until recently to large uncertainties in the relative errors of PZ SIC for this important appropriate norm. We have found analytically that PZ LSDA-SIC correlation makes an error of -50% for spin-unpolarized uniform densities in the high-density limit or for large- atoms, and -100% for fully-spin-polarized uniform densities in the high-density limit, independent of the details of the localized SIC orbitals. By extrapolating the relative error of the exchange energy from Ne, Ar, Kr and Xe, guided by the known asymptotics of the large- limit, and employing real localized orbitals, we have estimated that for the uniform electron gas the LSDA-SIC exchange energy is about 5.5% too negative, while the PBE-SIC and SCAN-SIC exchange energies are not negative enough by about 3.5%. The estimated uncertainties due to extrapolation, based upon our results for the semi-local functionals without SIC, are less than 0.5%.
The complementary strengths and weaknesses of PZ SIC and a semi-local functional like SCAN Sun et al. (2015) to which it is applied suggest that a widely-useful approximation might be found from a generalized self-interaction correction to SCAN (actually just to SCAN exchange, since SCAN is self-correlation-free) which preserves the best features of both, being accurate for real atoms, large- atoms, and uniform densities. This possibility is being exploredPer .
We have seen that a knowledge of the asymptotic forms (4) and (5), and data from the sequence of real atoms Ne, Ar, Kr, and Xe, are sufficient to determine the large- limits for the LSDA, PBE, and SCAN exchange energy functionals within . There is of course no guarantee that the SIC exchange functionals approach their large- limits as rapidly as these semi-local functionals do. But we know that real atoms are also appropriate normsJones (2015); Cancio et al. (2018) for functionals beyond LSDA. Just correcting the large relative errors of SIC exchange for the real atoms Ne, Ar, Kr, and Xe (Fig. 2) might improve the performance of SIC for the equilibrium properties of molecules and solids.
Acknowledgement
BS, who co-designed the project and performed the calculations, was supported by the US Department of Energy, Office of Science, Basic Energy Sciences, under Award No. DE-SC0018331 as part of the Computational Chemical Sciences Program. JPP, who co-designed the project and wrote the first draft of the manuscript, was supported by the US National Science Foundation under Grant No. DMR-1607868.
Appendix A Derivation of the fitting formulas, Eqs. (III) and (13)
We define the exact and the approximate exchange-correlation energy by considering the sum of the first three terms of the exact large- asymptotic expansions for the exchange energy (Eq. (4)) and the first two terms of the exact large- asymptotic expansions for the correlation energy (Eq. (5)).
[TABLE]
[TABLE]
The relative error is defined as
[TABLE]
where , , , and .
Now dividing numerator and denominator by the leading term we get
[TABLE]
Neglecting the lower-order terms and using two free parameters, to replace and to replace , we get:
[TABLE]
Now we consider another free parameter to replace and define . Then we use and to obtain
[TABLE]
Here, and are exact values given in Table 1 of the main article. is 0 for LSDA, PBE, SCAN, PBE-SIC, and SCAN-SIC. is -50% for LSDA-SIC since .
The above Eq. (20) is the same as Eq. (III). When exchange-only terms are considered, the second term vanishes and the equation reduces to Eq. (13).
Appendix B Parameters of the fits to Eqs. (13) and (III)
The parameters () obtained from least-square fits to the asymptotic Eq. (13) are: LSDA (); PBE (); SCAN (); LSDA-SIC (); PBE-SIC (); SCAN-SIC (, , ).
The parameters () obtained from least-square fits to the asymptotic Eq. (III) are: LSDA (); PBE (); SCAN (); LSDA-SIC (); PBE-SIC (); SCAN-SIC (, , ).
We have also examined the effect of the choice of electron density on the fitted parameter which indicates the relative (%) error in DFT exchange energy in the large- limit (using Eq. (13)). The parameter computed at self consistent density () for LSDA, PBE, and SCAN functionals are , , and , respectively. In comparison, the parameter computed at the exact Hartree-Fock densityRoetti and Clementi (1974) for LSDA, PBE, and SCAN functionals are , , and , respectively. The influence of the electron density is negligible in determining the parameter in Eq. (13).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Kohn and Sham (1965) W. Kohn and L. J. Sham, Phys. Rev. 140 , A 1133 (1965) . · doi ↗
- 2Jones (2015) R. O. Jones, Rev. Mod. Phys. 87 , 897 (2015) . · doi ↗
- 3Sun et al. (2015) J. Sun, A. Ruzsinszky, and J. P. J. Perdew, Phys. Rev. Lett. 115 , 036402 (2015) . · doi ↗
- 4Perdew and Wang (1992) J. P. Perdew and Y. Wang, Phys. Rev. B 45 , 13244 (1992) . · doi ↗
- 5Perdew and Zunger (1981) J. P. Perdew and A. Zunger, Phys. Rev. B 23 , 5048 (1981) . · doi ↗
- 6Pederson et al. (1984) M. R. Pederson, R. A. Heaton, and C. C. Lin, J. Chem. Phys. 80 , 1972 (1984) . · doi ↗
- 7Pederson et al. (2014) M. R. Pederson, A. Ruzsinszky, and J. P. Perdew, J. Chem. Phys. 140 , 121103 (2014) . · doi ↗
- 8Perdew et al. (1982) J. P. Perdew, R. G. Parr, M. Levy, and J. L. Balduz, Phys. Rev. Lett. 49 , 1691 (1982) . · doi ↗
