Existence and nonexistence of HOMO-LUMO excitations in Kohn-Sham density functional theory
Gero Friesecke, Benedikt Graswald

TL;DR
This paper investigates the existence of HOMO-LUMO excitations in Kohn-Sham density functional theory, showing they exist for systems with nuclear charge greater than the number of electrons, but not for neutral atoms under certain conditions.
Contribution
The paper provides a rigorous mathematical analysis of HOMO-LUMO excitation existence in KS-DFT, including conditions under which they do or do not exist, with implications for electronic structure calculations.
Findings
HOMO-LUMO excitations exist for Z>N systems.
Excited states do not exist for neutral atoms Z=N under certain approximations.
Implications for ionization potential estimates from HOMO eigenvalues.
Abstract
In numerical computations of response properties of electronic systems, the standard model is Kohn-Sham density functional theory (KS-DFT). Here we investigate the mathematical status of the simplest class of excitations in KS-DFT, HOMO-LUMO excitations. We show using concentration-compactness arguments that such excitations, i.e. excited states of the Kohn-Sham Hamiltonian, exist for , where is the total nuclear charge and is the number of electrons. The result applies under realistic assumptions on the exchange-correlation functional, which we verify explicitly for the widely used PZ81 and PW92 functionals. By contrast, and somewhat surprisingly, we find using a method of Glaser, Martin, Grosse, and Thirring \cite{glaser1976} that in case of the hydrogen and helium atoms, excited states do not exist in the neutral case when the self-consistent KS ground state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Existence and nonexistence of HOMO-LUMO excitations
in Kohn-Sham density functional theory
Gero Friesecke and Benedikt Graswald
Department of Mathematics, Technische Universität München
[email protected], [email protected]
Abstract
In numerical computations of response properties of electronic systems, the standard model is Kohn-Sham density functional theory (KS-DFT). Here we investigate the mathematical status of the simplest class of excitations in KS-DFT, HOMO-LUMO excitations. We show using concentration-compactness arguments that such excitations, i.e. excited states of the Kohn-Sham Hamiltonian, exist for , where is the total nuclear charge and is the number of electrons. The result applies under realistic assumptions on the exchange-correlation functional, which we verify explicitly for the widely used PZ81 and PW92 functionals. By contrast, and somewhat surprisingly, we find using a method of Glaser, Martin, Grosse, and Thirring [11] that in case of the hydrogen and helium atoms, excited states do not exist in the neutral case when the self-consistent KS ground state density is replaced by a realistic but easier to analyze approximation (in case of hydrogen, the true Schrödinger ground state density). Implications for interpreting minus the HOMO eigenvalue as an approximation to the ionization potential are indicated.
Contents
1 Introduction
Electronic excitations play an important role in the description of molecular properties such as absorption spectra, photoexcitation, state-to-state transition probabilities, reactivity, charge transfer processes, and reaction kinetics [5, 6, 12]. In numerical computations of these response properties, the standard model is Kohn-Sham density functional theory (KS-DFT), because of its good compromise between accuracy and feasibility for large systems (see [19] for a textbook account and [3] for a recent review). It is then of interest to investigate the mathematical status of excitations in KS-DFT.
In this paper we analyze the simplest such excitations, HOMO-LUMO transitions, in the setting of the local density approximation (LDA). For a systematic comparison of HOMO-LUMO excitations with experimental data see e.g. [2, 23]. Even in this case we are not aware of previous rigorous results. Our findings are the following (see Figure 1).
For positively charged systems (i.e., total nuclear charge greater than the number of electrons) such excitations – mathematically, excited states of the KS Hamiltonian – are rigorously proven to exist, under realistic assumptions on the exchange-correlation functional which we verify explicitly for the widely used PZ81 and PW92 functionals. See Theorem 1 in Section 3. As a corollary we also establish existence of optimal excitations with respect to suitable control goals recently introduced in [10], without requiring the simplifying assumption in [10] of bounded domains. See Section 4.
By contrast, the neutral case holds a surprise. In the case of the hydrogen and helium atoms, we prove that excited states do not exist when the self-consistent KS ground state density is replaced by a realistic but easier to analyze closed-form approximation (in case of hydrogen, the true Schrödinger ground state density). See Theorem 4 in Section 5.
Mathematically, the existence result relies on concentration-compactness arguments, and should not come as a surprise to experts. The nonexistence result uses a not widely known method by Glaser, Martin, Grosse, and Thirring (GMGT) [11]. The latter method could, in principle, also be applied to numerical KS ground state densities; we expect that for some atoms and molecules, including hydrogen and helium, the GMGT nonexistence criterion (that a certain integral associated with the effective KS potential lies below a threshold value) would be satisfied.
Physically, these results indicate a significant artefact of KS-DFT. In the full -electron Schrödinger equation, neutral systems (and even systems with ) are known to possess infinitely many excited states below the bottom of the continuous spectrum. This is a celebrated result by Zhislin [24]; for a modern variational proof see [9]. The analogous result also holds in Hartree-Fock theory: for the Fock operator associated with the Hartree-Fock ground state density possess infinitely many bound states below the continuous spectrum [18, Lemma II.3], the latter being the interval . (It is also known [15] that the Hartree-Fock enegery functional possess infinitely many critical points below 0.) Our results suggest that in KS-DFT, the threshold for existence of infinitely many excited states is shifted from to . This is a previously unnoticed but important qualitative consequence of the (well known) incomplete cancellation of the self-interaction energy in KS-DFT.
It is interesting to interpret the nonexistence of excitations from the point of view of numerical computations in finite basis sets, or mathematical analysis (as in [10]) in bounded domains. Consider a neutral system for which (exact) excitations do not exist. In a finite basis set, or a bounded domain, the spectrum of the KS Hamiltonian is purely discrete and therefore excited states exist. In the limit as the basis set approaches completeness, or the domain approaches the whole of ,
(i) the LUMO energy (i.e., the lowest unoccupied eigenvalue of the KS Hamiltonian) will remain well-defined, and approaches the bottom of the continuous spectrum (which equals [math], see Theorem 2 and Theorem 4)
(ii) the LUMO (i.e., the lowest unoccupied eigenstate) will become more and more delocalized, failing to converge to a bound state.
Thus in contrast to common (explicit or implicit) belief, restriction to finite basis sets or bounded domains may be not just a negligible technicality, but significantly alters the physical nature of LUMO excitations, from stable bound state (i.e., invariant under the dynamics of the KS ground state Hamiltonian) to a delocalized, dispersing state associated with the continuous spectrum.
(ii) makes it very tempting to physically interpret the HOMO-LUMO excitation in the nonexistence case as an (approximation to an) ionization process. This interpretation together with (i) yields ionization potential (where is the HOMO energy, i.e. the highest occupied eigenvalue of the KS Hamiltonian), lending new theoretical support to the famous semi-empirical formula
[TABLE]
which often agrees quite well with experimental data [2, 23].
2 Mathematical setting
We start by recalling well-known mathematical facts about Kohn-Sham density functional theory (KS-DFT) [14, 13, 19]. Readers familiar with these facts might want to skip this part. After that we give a variational definition of HOMO-LUMO excitations as introduced recently in [10], which works irrespective of degeneracies and is convenient for the mathematical analysis of excitations.
2.1 Kohn-Sham equations
We consider a system of non-relativistic electrons in in the electrostatic potential generated by nuclei of charges located at positions ,
[TABLE]
In fact, for our analysis it is not essential that the nuclei are point particles. It suffices to assume the nuclear charge distribution is given by a nonnegative Radon measure with total mass supported on a compact set , i.e. we consider any belonging to
[TABLE]
where denotes the space of signed Radon measures on , and
[TABLE]
For simplicity we look at a spin-unpolarized system, so the number of electrons is even, i.e. for some . In this case Kohn-Sham DFT describes the electrons by orbitals , each occupied by two electrons of opposite spin. They are -orthonormal, i.e.
[TABLE]
and we denote . Note that in the following will always denote the inner product. Then the corresponding Kohn-Sham energy functional is given by
[TABLE]
where gives the exchange-correlation energy per unit volume and is the total electron density, that is,
[TABLE]
The KS energy hence consists of the following terms: is the kinetic energy of the electrons, is the potential energy from the electron-nuclei interaction with being the electrostatic potential of the nuclei (2.3); (the Hartree energy) describes the energy corresponding to the interelectron repulsion if the electrons were mutually independent; is the exchange-correlation energy which accounts for correlation effects correcting the simple independent ansatz of .
Note here that the Coulomb potential over the whole is not in any -space, but it is in and we will be using the splitting , where lie in , respectively.
Precise assumptions on which are sufficient for our mathematical results and cover standard local density approximation (LDA) exchange-correlation functionals used in practice are given in Section 3.1. A basic example derived from the homogeneous electron gas is the Dirac exchange energy
[TABLE]
The ground state of the system is given by
[TABLE]
Any ground state satisfies the Euler-Lagrange equations of the system, the Kohn-Sham equations
[TABLE]
where the Lagrange multipliers arise due to the orthonormality condition (2.4). The Hartree and exchange-correlation potentials are given by
[TABLE]
Since the effective one-body operator in (2.9) (the Kohn-Sham Hamiltonian) is invariant under unitary transformations, the KS equations can be brought into their canonical form
[TABLE]
2.2 Excitations
Following [10] we confine ourselves here to the simplest model for electronic excitations, the HOMO-LUMO transition. In this transition an electron pair migrates from the highest occupied molecular orbital (HOMO) to the lowest unoccupied molecular orbital (LUMO). For the KS-orbitals ordered by the size of their eigenvalue in (2.11) this means
[TABLE]
where is the HOMO and - the eigenstate corresponding to the next higher eigenvalue of - is the LUMO.
To define HOMO and LUMO in a variational way, we consider the excitation energy functional [10] given by the quadratic form associated with KS Hamiltonian (2.11),
[TABLE]
Now define a HOMO by
[TABLE]
and a LUMO by
[TABLE]
If they exist, HOMO and LUMO clearly satisfy the KS equations
[TABLE]
for some eigenvalue (the HOMO energy) and (the LUMO energy).
3 Existence of HOMO-LUMO excitations
In this section we show that for positively charged systems there always exist HOMO-LUMO excitations. This generalizes a corresponding result in [10] to unbounded domains, except that in bounded domains no restriction on are needed. In the latter case such a result is not straightforward due to the possibility of “mass escaping to infinity”, and requires concentration-compactness arguments [16, 17]. The reader may wonder whether the assumption , which is essential in our proof, is really necessary. The authors of course asked themselves the same question. For counterexamples to existence in the case see Section 5.
3.1 Assumptions
Assumptions on the exchange-correlation energy
We assume that with
[TABLE]
for some exponent with and constant .
Furthermore we need as in [1]:
[TABLE]
Remark**.**
These assumptions are trivially satisfied by the Dirac exchange (2.7) with . In our appendix we check explicitly that these assumptions are also satisfied for the two most popular LDA exchange-correlation functionals: Perdew-Zunger (PZ81) [21] and Perdew-Wang (PW92) [20]
Admissible sets of orbitals In order to write these definitions in a more compact way we introduce the following sets: The KS admissible set
[TABLE]
the HOMO admissible set
[TABLE]
and the LUMO admissible set
[TABLE]
The governing variational principles for the occupied KS orbitals, HOMO and LUMO can now be summarized as
[TABLE]
We start with some estimates for the KS energy functional (2.5).
Lemma 1** (Lower bounds on KS energy functional).**
The terms in the KS energy functionals have the following properties
* and*
[TABLE] 2. 2.
V[\rho]\geq-||\mu||_{\mathcal{M}}\big{(}||v_{\infty}||_{\infty}||\rho||_{1}+||v_{2}||_{2}||\rho||^{\nicefrac{{1}}{{4}}}_{1}||\rho||_{3}^{\nicefrac{{3}}{{4}}}\big{)}* and*
[TABLE] 3. 3.
* and*
[TABLE] 4. 4.
E_{xc}[\rho]\geq-c_{xc}\big{(}||\rho||_{1}+\frac{1}{p-1}||\rho||_{1}^{\nicefrac{{(3-p)}}{{2}}}||\rho||_{3}^{\nicefrac{{3(p-1)}}{{2}}}\big{)},* where is the exponent from assumption (3.1), and*
[TABLE]
Proof.
The first inequality is a standard result in DFT, but we include it for the sake of completeness. By a well-known result, see e.g. [4], we have , so the inequality follows by applying the Sobolev embedding to the function .
Estimate 2 follows from the duality between and and then Cauchy-Schwarz
[TABLE]
and finally bounding the -norm of by the Hölder interpolation inequality,
[TABLE]
with .
The positivity of is trivial.
Ad 4: By our assumtpion on we have
[TABLE]
The estimate again follows from Hölder interpolation (3.4) with . In all four cases the continuity results follow by pointwise continuity of the integrand and the proven bounds. ∎
The second Lemma considers the excitation functional (2.13). In the following we denote .
Lemma 2** (Lower bounds on excitation functional).**
The terms in the excitation functional have the following properties
* and*
[TABLE] 2. 2.
\int_{\mathbb{R}^{3}}v_{ext}\rho_{\psi}\geq-||\mu||_{\mathcal{M}}\big{(}||v_{\infty}||_{\infty}||\rho_{\psi}||_{1}+||v_{2}||_{2}||\rho_{\psi}||^{\nicefrac{{1}}{{4}}}_{1}||\rho_{\psi}||_{3}^{\nicefrac{{3}}{{4}}}\big{)}* and*
[TABLE] 3. 3.
* and*
[TABLE] 4. 4.
\int_{\mathbb{R}^{3}}v_{xc}(\rho)\rho_{\psi}\geq-c_{xc}\big{(}||\rho_{\psi}||_{1}+||\rho||_{p}^{p-1}||\rho_{\psi}||_{1}^{\nicefrac{{(3-p)}}{{(2p)}}}||\rho_{\psi}||_{3}^{\nicefrac{{3(p-1)}}{{(2p)}}}\big{)}* where is again the exponent from our assumptions on and*
[TABLE]
In particular, the map is weak strong weak∗ continuous and weak weak weak∗ lower semicontinuous on .
Proof.
Statements 1-3 follow by the same line of reasoning as in Lemma 2 above. The fourth assertion follows by the same argument given in [10], but we include it for the sake of completeness. By our assumption on the exchange-correlation potential we have
[TABLE]
The asserted bound now follows from the Hölder interpolation inequality (3.4) with . The continuity follows from the pointwise continuity of together with the bounds (3.1). ∎
Remark**.**
Our existence results do not require to be the KS ground state, but only to satisfy a fast enough decay property, like
[TABLE]
By the results of [1], whenever and satisfies our assumptions (3.1), then there exists a KS ground state and any such ground state fulfills the additional property (3.5).
Theorem 1** (Existence of HOMO-LUMO excitations).**
For any admissible nuclear charge distribution and any set of orbitals the excitation functional possesses a maximizer on (i.e., a HOMO).
If additionally we have , i.e. a positively charged system, and satisfies the decay property (3.5), then there exists also a minimizer on (i.e., a LUMO).
Proof.
Existence of a HOMO is elementary since is continuous on (see Lemma 2) and the admissible set is compact since it is a closed and bounded subset of a finite dimensional space. The case of a LUMO is more difficult and will be treated in the next section. ∎
3.2 Existence proof
For a given , define , the projection onto the span of the . Then we define the translation invariant excitation functional by
[TABLE]
Note depends on but to keep notation simple and distinguish from we do not explicitly stress this via the notation.
Now is again defined over the whole and if we can prove the existence of a minimizer with
[TABLE]
then we have the existence of a LUMO given by .
As usual in a concentration compactness argument we embed our problem in a one-parameter family of minimization problems
[TABLE]
parametrized by .
We can now state the first central Lemma of this section:
Lemma 3** (Fundamental properties of the infimum).**
The map given by has the following properties
- (i)
* and for every ,* 2. (ii)
* is continuous and strictly decreasing, i.e. for the inequality holds*
Proof.
The fact that is clear and since is a bounded operator from into itself follows by the bounds in Lemma 2.
Usually the strict subadditivity condition is the hardest part in a concentration compactness proof, but fortunately here it is quite easy.
The only thing we have to prove is actually for . Since by the structure of our problem we have for all . Hence, for we choose such that and obtain
[TABLE]
Also continuity is clear since is just a linear function. The inequality is also important because otherwise we would be considering the constant zero-function.
Take a radially symmetric testfunction with supp for some radius , . Here denotes the complement of the ball around the origin of radius . Then define
[TABLE]
Note that
[TABLE]
where we used in the second to last equality that supp and in the last step the exponential decay of the KS-orbitals together with the estimate
[TABLE]
Hence decays exponentially for , so it can be neglected up to higher orders, i.e.
[TABLE]
With this at hand we can estimate the energy of
[TABLE]
Since we want to estimate the energy from above we do not need to consider the exchange-correlation term since it only gives a negative contribution. The kinetic energy is easily estimated
[TABLE]
Our next step will be to estimate the Hartree and the nuclei term and bring them in a similar form
[TABLE]
where we used the radial symmetry of in the last step.
Exactly the same steps transform the external potential term into
[TABLE]
Since the support of is compact and again due to supp, we have for small enough in the last integral .
Putting everything together we obtain for small enough
[TABLE]
In this last inequality the assumption of a positively charged system is crucial.
∎
Our last Lemma is just a standard lower semicontinuity result.
Lemma 4**.**
Let be bounded in converging weakly to . If then strongly in for and furthermore
[TABLE]
Proof.
The weak convergence in together with the convergence of the -norms gives strong convergence in which in turn gives strong convergence in , due to Hölder interpolation inequality and the fact that is bounded in for by Sobolev embedding.
The inequality follows by weak lower semicontinuity of established in Lemma 2 and the fact that is a continuous operator from into itself. ∎
With all these Lemmata at our disposal, we are now able to prove the existence of a minimizer to (3.7) and hence the existence of a LUMO.
Take a minimizing sequence for . Then w.l.o.g. , i.e. lies in the orthorgonal complement of . Therefore the sequence is bounded in by Lemma 2 and hence (up to taking a subsequence) weakly in for some .
If , then by Lemma 4 we get
[TABLE]
and since also lies in the (convex and closed, hence weakly closed) orthorgonal complement of we have so is a minimizer.
The rest of the proof consists in ruling out .
For this purpose, assume first . Then and in for . In particular,
[TABLE]
for any fixed .
The first term converges to 0 and the second one can be made arbitrarily small because vanishes at infinity and also vanishes since decays exponentially.
So in conclusion we would get , a contradiction to Lemma 3.
So we only have to rule out . Here we use a standard argument. Following e.g. [8], define a quadratic partition of unity with being smooth, radially non-increasing and with
[TABLE]
Define . Then for all the map is continuous, non-decreaing, vanishes for , and converges to for .
Hence we can choose a sequence such that . Furthermore we claim that this sequence goes to infinity as .
Assume for contradiction that this were not the case. Then up to a subsequence we have for some . But this would give us
[TABLE]
which is not possible since for . Hence .
Next we define and such that and (by definition of the ) .
Note that those two sequences are also bounded in , since is bounded in and belongs to .
Furthermore the cutoff gives us that converges weakly to and hence as a by-product converges to 0 in . In order to see this, we take a and consider
[TABLE]
where we used that converges weakly to and the fact that converges pointwise to 1 and stays uniformly bounded. So we proved weak convergence on a dense subset and by boundedness we conclude.
Now we can apply Lemma 4 to the sequence since by construction , and obtain and in for .
Our next step is proving
[TABLE]
where is the usual Landau notation meaning terms which vanish for .
First note that and also , hence
[TABLE]
Even more, remembering for , we have . Therefore we obtain
[TABLE]
This already gives us .
For the potential energy we start by proving .
Due to the fact that is bounded and converges pointwise to zero we have in and hence , which gives us . Consequently
[TABLE]
So up until now we have
[TABLE]
The only fact that remains to be proven is . This follows from the standard argument that the only negative contributions come from and , i.e.
[TABLE]
where the first term vanishes because in , and the second one also tends to [math] because was arbitrary and and decay at infinity.
In conclusion we proved which gives us after passing to the limit
[TABLE]
where we used in the last inequality that lies in the orthogonal complement of the occupied orbitals and hence so .
But since we assumed , this contradicts statement in Lemma 3. This completes the proof of Theorem 1.
3.3 Higher excitations
We now complete the rigorous justification of the picture on the left-hand side of Figure 1, i.e. we fully characterize the spectrum of the KS Hamiltonian in the positively charged case.
Theorem 2** (Spectrum of KS Hamiltonian in the case ).**
Consider , i.e. a positively charged system. Then the KS Hamiltonian given in (2.11) has infinitely many negative eigenvalues of finite multiplicity below the bottom of its essential spectrum,
[TABLE]
Proof.
Due to assumption (3.1) the potential belongs to the space , consisting of potentials which for any given , can be decomposed as with , . Therefore is relatively compact w.r.t. the Laplacian. Hence by Weyl’s Theorem (see e.g. Chapter XIII.4 of [22]) is self-adjoint and we have \sigma_{ess}\big{(}h_{\mu,\rho}\big{)}=\sigma_{ess}\big{(}-\Delta\big{)}=[0,\infty). Furthermore, is bounded from below. To see this, write with and . Then for every with we have
[TABLE]
Now the Sobolev embedding and Young’s inequality give the asserted lower bound. Now note that
[TABLE]
and since by assumption we can apply Lemma II.1 of [18] which gives us that the right-hand side operator of (3.10) is negative on an infinite-dimensional subspace. Hence so is . The min-max principle now gives that it has infinitely many negative eigenvalues. This completes the proof. ∎
4 Optimal HOMO-LUMO excitations
In [10], motivated by the design of photovoltaic materials, results are given for optimal HOMO-LUMO excitations with respect to some control goals. The control is the nuclear charge distribution (e.g., the doping profile or the heteroatom substitutions); typical control goals are the spatial electron-hole charge separation or the size of the HOMO-LUMO gap. The analysis in [10] relies on the simplyfing assumption of bounded domains. Here we generalize this analysis to unbounded domains. The main difficulty again consists in handling loss of mass at infinity.
Lemma 5** (Analytic properties of the set of HOMO-LUMO excitations).**
Consider a positively charged system, i.e. , then the joint solution set to the governing variational pronciples for occupied KS orbitals, HOMO and LUMO parametrized by the set of nuclear charge distributions ,
[TABLE]
has the following properties:
- (a)
* is weak weak weak weak*∗-closed in 2. (b)
* is strong strong strong weak*∗-compact in
Proof.
Part (a) Let in and in , then we need to prove
[TABLE]
Ad (i): Since all measures are supported on the compact set , the constant functions are in the predual of , that is the space of the continuous functions on , and hence , so
**Ad (ii): ** For any admissible we have by the variational definition of and the weak∗ continuity of
[TABLE]
which implies
[TABLE]
since was arbitrary.
Using the weak weak∗ lower semicontinuity of on gives
[TABLE]
but unlike in the bounded domain case this does not directly give us the result since is not weakly closed.
Fortunately, the are not arbitrary elements of . Since the energy functional is invariant under unitary transformations, we can always assume the orbitals to be orthogonal. So the only thing that could go wrong is loss of mass due to weak convergence.
Assume for contradiction so we lose mass in at least one orbital . In order to see that this is not possible, we will place some small mass at a large but finite distance, and obtain a state with lower energy.
We take with and consider the test function .
Then
[TABLE]
Using the assumption (3.2) and the fact that , we obtain that for small enough there exists a constant such that
[TABLE]
Since , the negative term dominates in the limit (if we let both go to [math] at the same speed). Hence by choosing the parameters and small enough we ensure .
Now since we assume loss of mass in the k-th orbital , let us consider , where is some unit vector in . Denoting the orbitals with replaced by by , we get
[TABLE]
So for large enough we obtain and .
We can now repeat these steps until we have constructed a with (after a suitable unitary transformation), and arrive at the contradiction
[TABLE]
Hence there is no loss of mass, , and (ii) holds.
Before we move on to (iii) and (iv) we mention that due to the upper and lower bound above, . But this gives us since the other terms in the energy functional, namely , are continuous on , and similarly to the proof of Lemma 4 we infer strong convergence in for , since there is no loss of mass.
Hence the kinetic energy converges as well, which means , giving us in , so in .
Ad (iii): As in [10] the statements (iii) and (iv) are more difficult since the HOMO and LUMO orbitals and are not defined via universal but -dependent sets and hence an admissible trial function for the limiting HOMO and LUMO orbitals and may not be admissible for the variational principle for the approximating orbitals. In short, our argument in (4.1) is not valid anymore.
Hence we need to look at the -projector of the sequence , i.e. . For any given the mapping is strongly continuous from to , i.e.
[TABLE]
Furthermore by definition we have and for any . So by (4.4) we have for all large enough and therefore by the variational principle for the HOMO (3.3)
[TABLE]
Here we used the strong convergence of (4.4) and the fact that the map is weak strong weak∗ continuous on due to Lemma 2.
Since (4.5) holds for any , we have
[TABLE]
Next we prove that . Since by definition the lie in the span of , we obtain by the weak convergence of in and the strong convergence of in that
[TABLE]
So taking the limit yields and hence strongly, so .
Furthermore by the continuity properties proven in Lemma 2 we obtain
[TABLE]
Combining (4.6) and (4.7) yields (iii).
Ad (iv):
The corresponding proof for the LUMO starts similarly. Take any , i.e. and . Then again by (4.4) and the variational principle for the LUMO (3.3) we obtain
[TABLE]
Minimizing over gives
[TABLE]
For the lower bound we use the weak weak weak∗ lower semicontinuity of the map by Lemma 2
[TABLE]
Unfortunately we are not done since the are not known to converge strongly but just weakly, so we still need to prove that is admissible. The limit lies in the orthogonal complement of the , since converges strongly and therefore
[TABLE]
So we only need to prove . Assume . Then by (4.9) and (4.10)
[TABLE]
But we proved in Lemma 3 that the mass–to–LUMO–energy map is strictly decreasing. Hence , so is admissible and together with (4.9) and (4.10) this establishes (iv).
Before we prove part b) let us make some remarks. Since there is no loss of mass, converges strongly in and hence strongly in for . But since and all terms except the kinetic energy converge due to the continuity results in Lemma 2, we must also have , i.e. converges strongly in .
In conclusion, we know that \big{(}\Phi^{(\nu)},\varphi_{H}^{(\nu)},\varphi_{L}^{(\nu)}\big{)} converges strongly in .
Part (b)
To prove sequential compactness of the set is now quite easy since by part a) and Banach-Alaoglu we just need to prove that any sequence \big{(}\Phi^{(\nu)},\varphi_{H}^{(\nu)},\varphi_{L}^{(\nu)},\mu^{(\nu)}\big{)}\in\mathcal{B} is bounded in . For and this follows from the bounds in Lemma 1 and 2, respectively, noting that the exponent in the assumption on has the property that the exponents and of are strictly less than 1. Since the stay bounded, so do the . Lastly, since we have , which concludes the proof. ∎
As an example of a control goal we consider bandgap tuning as introduced in [10]. Here the quantity which one wants to influence by a suitable choice of the nuclear charge distribution is the HOMO-LUMO bandgap , where and stand for the HOMO and LUMO eigenvalues of the KS Hamiltonian (2.11). Any bandgap tuning functional promoting a desired target value has to reach its minimum when . A simple choice suggested in [10] is
[TABLE]
Theorem 3**.**
For any and for , there exists a nuclear charge distribution which minimizes the bandgap tuning functional (4.12) over subject to the constraints (3.3).
Proof.
The bandgap functional in (4.12) is, due to Lemma 2, weak strong strong weak*∗* continuous on . Hence by the compactness of the set proven in Lemma 5 it attains its minimum over this set. ∎
5 Nonexistence of HOMO-LUMO excitations in the neutral case
We now introduce carefully chosen and realistic model densities and prove that the excitation functional admits no excited states, i.e. no bound states other than the ground state. See the picture on the right in Figure 1.
This finding suggests that also for the true KS ground state density , it may happen that there are no exact HOMO-LUMO excitations. Of course, the Hamiltonian possesses continuous spectrum above the ground state energy and therefore “metastable” excitations (suitable square-integrable superpositions of continuous eigenstates) still exist.
From a mathematical point of view, the results in this section show that the assumption in our existence result (Theorem 1) was in fact sharp and cannot be weakend to . Note that the model densities considered here satisfy the assumptions of Theorem 1, in particular (3.5).
H-Atom ground state density
Let and let be the hydrogen atom ground state for the Schrödinger equation, i.e. the lowest eigenfunction of . Its density is
[TABLE]
We expect this to be a good approximation for the KS density, hence the KS operator should be a good approximation to the self-consistent hydrogen KS operator.
In this case the Hartree-potential can be explicitly computed. The well known result is
[TABLE]
The first term cancels the external potential , hence the excitation functional becomes
[TABLE]
with corresponding Hamiltonian
[TABLE]
Model ground state density for the He-Atom
In order to construct a model density for Helium, we make the following ansatz.
We take a dilated version of the hydrogen orbital, i.e.
[TABLE]
and – following Hans Bethe – determine the parameter by
[TABLE]
The last term describing the electron-electron interaction comes from
[TABLE]
with the spinors and
The energy is easily computed as
[TABLE]
which implies
[TABLE]
In Figure 2 one sees that this value corresponds to the fact that the second electron does not see the full Coulomb potential of the nucleus but a screened one.
With this density the Hartree-potential can again be computed explicitly and plugging this density into our excitation functional gives us the Hamiltonian
[TABLE]
With these two densities at hand we can state the main result of this section and complete the picture given in Figure 1.
Theorem 4** (Spectrum of the KS Hamiltonian in the case ).**
Consider either the hydrogen atom ( ) with the density given by (5.1) or the helium atom () with the density given by (5.2). Furthermore let the exchange-correlation energy be given by either Dirac exchange, PW81 or PZ92. Then the spectrum of the KS Hamiltonian has the form
[TABLE]
In particular, the Hamiltonian possesses exactly one bound state (up to spin in the hydrogen case) and no excited states, i.e. no bound states above the ground state.
The result of Theorem 4 is quite significant from a computational point of view. In numerical methods one, of course, obtains excited states, but in the limit of complete basis sets in infinite volume these might dissolve into metastable states associated with the continuous spectrum.
Proof.
As in the proof of Theorem 2 the potential of the KS Hamiltonian is in , hence in both cases \sigma_{ess}\big{(}h_{\mu,\rho}\big{)}=[0,\infty).
Next, we prove that there is at least one bound state with eigenvalue . By the Rayleigh–Ritz method it suffices to find a with and . Since for any LDA functional we have , it suffices to prove the inequality for Dirac exchange. As a test function we choose the corresponding orbitals we used in the construction of our densities. These terms are easily computed to give for the hydrogen case
[TABLE]
So for both atoms we have at least one bound state.
Now we use the upper bound on the number of bound states given in [11] which says that the number of bound states with angular momentum satisfies
[TABLE]
where the functional is given by
[TABLE]
For a radially symmetric potential this reduces to
[TABLE]
We call the Glaser-Martin-Grosse-Thirring (GMGT) functional. Here denotes the negative part of the potential, and the parameter for the radially symmetric case while for the general case we have the restriction .
If one calculates this integral numerically for our hydrogen density (5.1) with Dirac exchange, one obtains that the minimum value is attained at with . This gives us the upper bound
[TABLE]
which means and all other for . So up to the degeneracy with respect to spin there is only the ground state and there are no excited states.
For the helium density (5.2) we computed the GMGT functional for the Dirac [7], the Perdew-Zunger [21] and the Perdew-Wang [20] exchange-correlation functional and obtained
[TABLE]
Here we denote by the potential V_{-}(x)=2e^{-2\alpha|x|}\big{(}\frac{1}{|x|}+\frac{1}{\alpha}\big{)}+v_{xc}(\rho_{\alpha}(x)), so in e.g. the Dirac case we have
[TABLE]
As before, it now follows from (5.6) and (5.8) that no bound state other than the ground state orbital exists.
Hence for both hydrogen and helium we have exactly one bound state – the ground state itself – which corresponds to an eigenfunction with eigenvalue . ∎
In Figure 3 the values of as a function of are given in the case of helium for the three LDAs mentioned above. The figure shows that the minimum value of the GMGT-functional is always attained at some , i.e. in the interval , where the upper bound is also valid for non-symmetric potentials. Hence, as long as the real KS-density is only a small perturbation of our model density , even if it were not symmetric, our results would hold.
The overall conclusion of this section is that there are no LUMO excitation for the model densities (5.1) and (5.2).
Our findings raise the following interesting questions which lie beyond the scope of the present paper. First, is the GMGT nonexistence criterion satisfied for numerically obtained ground state densities of hydrogen and helium, or more complex atoms and molecules? A second question is whether the absence of exact excitations persists for more advanced excitation models like the Casida ansatz.
Acknowledgements. This project was supported by the DFG through the IGDK 1754 Optimization and Numerical Analysis for Partial Differential Equations with Nonsmooth Structures.
6 Appendix: Analytical properties of PZ81 and PW92
In the following we verify explicitly that the correlation functionals PZ81 and PW92 satisfy the assumptions (3.1) and (3.2) of this paper. In the physics literature the exchange-correlation functional is usually specified by the energy per particle at the density , denoted . We work with the mathematically convenient energy per unit volume . The exchange part was already discussed in the Remark Remark, so we only need to deal with the correlation part.
Furthermore, recall the Wigner-Seitz radius
[TABLE]
which is a standard parameter in physics to describe the local electron density of a system. Lastly, we remark that in the following will describe a generic constant, which may have different values at each appearance, but is independent of and .
Perdew-Wang (PW92)
In this paper we consider only spin-unpolarized systems, so we have . So (in the notation of the original paper) we only need to check the assumptions (3.1) and (3.2) for . The PW92 correlation functional is given by
[TABLE]
In order to improve readability of the arguments below, we define the following functions
[TABLE]
So returning to the notation of our paper we need to check (3.1) and (3.2) for the function
[TABLE]
This function is clearly continuously differentiable for , so we only check the limit . Since and for , repectively , we get
[TABLE]
hence with , is continuous.
Next we calculate the derivative
[TABLE]
Applying the inequality for , using and taking the limit gives now
[TABLE]
So also is continuous and hence it suffices to prove for , i.e. . Using , and we obtain
[TABLE]
so (3.1)holds with in.
For (3.2) we can choose (any value between and will do). Writing the condition in terms of then gives us
[TABLE]
Realizing that for and that implies for the -term , transforms this into
[TABLE]
which is true if .
Perdew-Zunger (PZ81)
Here we again consider spin-unpolarized systems, i.e. , as in the case of PW92. The precise value of the PZ81 constants is important for continuity and continuous differentiability; they were chosen in such a way that is continuously differentiable.
The PZ81 correlation functional is given by
[TABLE]
Here we used the notation of the original paper and hence are negative. For our analysis we now need to consider . Hence, we calculate to be
[TABLE]
The continuity for is now checked easily:
[TABLE]
and
[TABLE]
The continuity of and at the value follows from the choice of constants in the original paper [21], since
[TABLE]
and
[TABLE]
Since is continuous, we only need to check the bound (3.1) for , i.e. . But here the only term which is not bounded is for which we again use a standard -bound, for , so |v_{c}(r_{s})|\leq C\big{(}1+\tfrac{1}{r_{s}}\big{)} for small enough. In terms of this means |v_{c}(\rho)|=C\big{(}1+\rho^{\nicefrac{{1}}{{3}}}\big{)}, so (3.1) holds with .
Also, (3.2) holds with , since then plugging in the relation (6.1) between and (6.1) yields
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Anantharaman and E. Cancès. Existence of minimizers for Kohn-Sham models in quantum chemistry. Ann. Inst. H. Poincaré Anal. Non Linéaire , 26(6):2425–2455, 2009.
- 2[2] E.J. Baerends, O.V. Gritsenko, and R. Van Meer. The kohn-sham gap, the fundamental gap and the optical gap: The physical meaning of occupied and virtual kohn-sham orbital energies. Physical Chemistry Chemical Physics , 15(39):16408–16425, 2013.
- 3[3] A.D. Becke. Perspective: Fifty years of density-functional theory in chemical physics. Journal of Chemical Physics , 140(18), 2014.
- 4[4] I. Catto, C. Le Bris, and P.-L. Lions. On the thermodynamic limit for Hartree-Fock type models. Ann. Inst. H. Poincaré Anal. Non Linéaire , 18(6):687–760, 2001.
- 5[5] C.J. Cramer. Essentials of Computational Chemistry: Theories and Models . Wiley, 2002.
- 6[6] H.-L. Dai and W. Ho. Laser Spectroscopy and Photochemistry on Metal Surfaces . World Scientific Publishing Company, 1995.
- 7[7] P. A. M. Dirac. Note on exchange phenomena in the thomas atom. Mathematical Proceedings of the Cambridge Philosophical Society , 26(3):376–385, 1930.
- 8[8] R. L. Frank, E. H. Lieb, R. Seiringer, and H. Siedentop. Müller’s exchange-correlation energy in density-matrix-functional theory. Physical Review A , 76(5):052517, 2007.
