Theoretical X-Ray Spectroscopy of Transition Metal Compounds
Sergey I. Bokarev, Oliver K\"uhn

TL;DR
This paper reviews the theoretical approaches, especially MCSCF, used to interpret complex X-ray spectra of transition metal compounds, emphasizing the challenges and details involved in understanding experimental data.
Contribution
It provides a comprehensive overview of first principles methods, focusing on the application and limitations of MCSCF theory for transition metal L-edge X-ray spectroscopy.
Findings
MCSCF can elucidate detailed spectral features of transition metals.
Interpretation of spectra requires advanced theoretical understanding.
Challenges include near-degeneracy and spin-orbit coupling effects.
Abstract
X-ray spectroscopy is one of the most powerful tools to access structure and properties of matter in different states of aggregation as it allows to trace atomic and molecular energy levels in course of various physical and chemical processes. X-ray spectroscopic techniques probe the local electronic structure of a particular atom in its environment, in contrast to UV/Vis spectroscopy, where transitions generally occur between delocalized molecular orbitals. Complementary information is provided by using a combination of different absorption, emission, scattering as well as photo- and autoionization X-ray methods. However, interpretation of the complex experimental spectra and verification of experimental hypotheses is a non-trivial task and powerful first principles theoretical approaches that allow for a systematic investigation of a broad class of systems are needed. Focussing on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Theoretical X-Ray Spectroscopy of Transition Metal Compounds
Sergey I. Bokarev, Oliver Kühn Institut für Physik, Universität Rostock, Albert-Einstein-Str. 23-24, 18059 Rostock, GermanyInstitut für Physik, Universität Rostock, Albert-Einstein-Str. 23-24, 18059 Rostock, Germany
ADC Algebraic Diagrammatic Construction AES Auger Electron Spectrum AMFI Atomic Mean-Field Integrals AO Atomic Orbital AS Active Space BO Born-Oppenheimer CAS Complete Active Space CASPT2 Complete Active Space second order Perturbation Theory (PT) CASSCF Complete Active Space Self-Consistent Field (SCF) CC Coupled Cluster CI Configuration Interaction CIS Configuration Interaction (CI) Singles CISD
CI Singles Doubles CK Coster-Kronig CSF Configuration State Function CT Charge Transfer CVS Core-Valence Separation DCL Dynamical Classical Limit DFT Density Functional Theory DFT-CI Configuration Interaction on Kohn-Sham Density Functional Theory (KS-DFT) orbitals DMRG Density Matrix Renormalization Group DO Dyson Orbital DOF Degree Of Freedom EOM Equation Of Motion EOM-CC Equation-Of-Motion Coupled Clusters FEL Free Electron Laser FOA Frozen Orbital Approximation FCI Full CI
HERFD-XAS High Energy Resolution Fluorescence Detection X-ray Absorption Spectrum GASSCF Generalized Active Space SCF
HF Hartree-Fock HHG High Harmonic Generation IOTC Infinite-Order Two-Component IP Ionization Potential ICD Interatomic Coulombic Decay KS-DFT Kohn-Sham Density Functional Theory LFM Ligand Field Multiplet LR Linear Response LR-CC Linear-Response Coupled Cluster (CC) LR-TDDFT Linear-Response TDDFT MCSCF Multi-Configurational Self-Consistent Field MCTDHF Multi-Configurational Time-Dependent Hartree-Fock (HF) MD Molecular Dynamics MO Molecular Orbital LCAO Linear Combinations of Atomic Orbitals MR Multi-Reference NEVPT2
-electron Valence second order PT
NXES Non-resonant X-ray Emission Spectrum OEA One-Electron Approximation PES Photoelectron Spectrum PFY Partial Fluorescence Yield PP Polarization Propagator PT Perturbation Theory QC Quantum Chemistry RASPT2 Restricted Active Space Perturbation Theory (second order) RASSCF Restricted Active Space Self-Consistent Field RASSI Restricted Active Space State Interaction REW Restricted Excitation Window REW-TDDFT Restricted Excitation Window Time-Dependent Density Functional Theory RIXS Resonant Inelastic X-ray Scattering ROHF Restricted Orbital HF ROCIS Restricted Open-shell Configuration Interaction Singles RPES Resonant Photoelectron Spectrum RPE Resonant Photoelectron RT Real-Time RXES Resonant X-ray Emission Spectroscopy SA Sudden Approximation SCF Self-Consistent Field SCL Static Classical Limit SOC Spin-Orbit Coupling SD Slater Determinant TCF Time-Correlation Function TD-CI Time-Dependent CI
TDDFT Time-Dependent Density Functional Theory TD-MCSCF Time-Dependent Multi-Configurational Self-Consistent Field (MCSCF) TISE Time-Independent Schödinger Equation TD-RASCI Time-Dependent Restricted Active Space TDSE Time-Dependent Schödinger Equation TFY Total Fluorescence Yield TM Transition Metal TP-DFT Transition Potential DFT UPS Ultraviolet Photoelectron Spectrum UV/Vis UltraViolet/Visible photon energy range XAS X-ray Absorption Spectrum XC eXchange-Correlation XES X-ray Emission Spectrum XFEL X-ray Free Electron Laser XPS X-ray Photoelectron Spectrum XUV eXtreme UltraViolet ZORA Zero-Order Regular Approximation
Article Type:
Advanced Review
Abstract
X-ray spectroscopy is one of the most powerful tools to access structure and properties of matter in different states of aggregation as it allows to trace atomic and molecular energy levels in course of various physical and chemical processes. X-ray spectroscopic techniques probe the local electronic structure of a particular atom in its environment, in contrast to UV/Vis spectroscopy, where transitions generally occur between delocalized molecular orbitals. Complementary information is provided by using a combination of different absorption, emission, scattering as well as photo- and autoionization X-ray methods. However, interpretation of the complex experimental spectra and verification of experimental hypotheses is a non-trivial task and powerful first principles theoretical approaches that allow for a systematic investigation of a broad class of systems are needed. Focussing on transition metal compounds, -edge spectra are of particular relevance as they probe the frontier -orbitals involved in metal-ligand bonding. Here, near-degeneracy effects in combination with spin-orbit coupling lead to a complicated multiplet energy level structure, which poses a serious challenge to quantum chemical methods. MCSCF theory has been shown to be capable of providing a rather detailed understanding of experimental X-ray spectroscopy. However, it cannot be considered as a ’blackbox’ tool and its application requires not only a command of formal theoretical aspects, but also a broad knowledge of already existing applications. Both aspects are covered in this overview.
1 INTRODUCTION
Spectroscopy is one of the most powerful tools in physics and chemistry and, in particular, X-ray spectroscopy attracts special attention not at least due to the ongoing developments at X-ray Free Electron Laser (XFEL) facilities and High Harmonic Generation (HHG) sources [1, 2]. As a consequence of the localized nature of the core orbitals, the transition operator acts locally and that is why X-ray excitation probes the local electronic structure of a particular atom embedded in its chemical environment. This is in contrast to, e.g., UltraViolet/Visible photon energy range (UV/Vis) spectroscopy, where transitions usually occur between delocalized valence Molecular Orbitals. The combination of different absorption, emission, scattering, photoionization, and diffraction X-ray techniques allows addressing various aspects of static properties as well as photoinduced and chemical dynamics in steady-state and time-resolved X-ray spectroscopy [3, 4, 5, 1, 6]. Sharpening the experimental probe does not guarantee, however, an increase of the acquired knowledge as the complexity of the detected signal increases as well when addressing more and more intricate effects. Thus, for interpretation of the experimental data, theoretical modeling is mandatory.
This review presents an overview on theoretical approaches with a focus on first principles electronic structure methods and in particular on multi-reference wave function techniques. Applications to the interpretation of experimental data will be shown to provide a means for dissecting different structural and dynamical problems. We will restrict ourselves to isolated molecules and complexes as they appear in the gas or solution phases. Extended periodic systems like crystals are explicitly excluded from consideration as respective theoretical models represent a huge self-standing body of methods which are reviewed elsewhere. [7, 5, 8] Here, we address only discrete the pre-edge structure leaving XANES and EXAFS parts of the absorption edge aside since especially the latter provides more information on the geometric rather than electronic structure. [9]
Although computational X-ray methods have been developing apace with quantum chemistry since its infancy, the 21st century heralds an explosive growth of theoretical studies of X-ray spectra. These studies are mainly devoted to the -edge () spectra of second period and heavier elements using single reference methods. It is barely possible and not aimed in the current review to cover them all, including the numerous applications (for reviews see References 10, 11, 5, 12). Therefore, mostly the soft X-ray photon energy range (0.1-1.0 keV) and 4th period Transition Metal (TM) complexes will be considered. Thus, mainly metal -edges will be discussed corresponding to excitation or ionization from core orbitals.
2 X-RAY SPECTROSCOPY
2.1 What makes it special?
Spectroscopy in the UV/Vis spectral range is a standard tool for probing electronic transitions and related processes, taking place within the valence state manifold. Here, participating electronic states often exhibit rather delocalized electron densities. This makes an interpretation in terms of local changes difficult, e.g., with respect to specific bonds. In contrast, X-ray transitions involve at least one orbital, which is localized on an atom, thus providing a local probe of the electronic structure. Moreover, core level energies vary vastly between different atoms what provides element specificity. Further, core orbitals with non-zero angular momentum lead to pronounced relativistic effects, e.g., caused by Spin-Orbit Coupling (SOC). Conventional X-ray spectroscopy is plagued by the rather short lifetime of the core-hole (4-10 fs), which limits the resolution (a possible way to overcome this problem is High Energy Resolution Fluorescence Detection X-ray Absorption Spectrum (HERFD-XAS) [13]). However, this can be turned into an advantage by using the core-hole lifetime to clock ultrafast dynamics (“core-hole clock”) [14]. Finally, X-ray wavelengths are in the range of relevant molecular length scales, enabling better spatial and time resolution.
The list of problems, which X-ray spectroscopy typically addresses includes probing the chemical environment and bond distances [15, 9, 16, 17], the nature of chemical bonds [18, 19, 20] as well as oxidation, spin, and solvation states. [21, 22, 23, 24] Further, it is indispensable for the investigation of solids and surfaces. [3, 4, 25, 26] While steady-state X-ray spectroscopy is well-established, emerging time-resolved experiments are pushing the limits to enter the regime of even the fastest nuclear and electron dynamics. This becomes possible due to the fact that the optical cycle of X-rays is in the few attosecond range. Among others [1, 6], recent examples include, e.g., the study of the (photo)reaction dynamics of acetylene isomerization upon core-hole formation [27], photodissociation of simple flourides [28], ligand exchange [22] and transient photodynamics [29, 30, 31] in metal complexes, photocatalytic processes on metal surfaces [32, 33], nanosecond dynamics of dissociation and reassociation of insulin [34], and ultrafast electron and nuclear wave packet dynamics using time-resolved and transient photoelectron spectroscopy [35, 36].
The advantages of X-ray as compared to UV/Vis spectroscopy come at the price of requiring large scale facilities like synchrotrons and XFELs. However, recent progress in HHG sources scales down the size of the setups to the table-top. Moreover, prominent recent developments include low-power X-ray (bremsstrahlung) tubes [37] and plasma-based X-ray pulse generation [38]. It is anticipated that within a few years a table-top intense and stable source of isolated ultrafast XUV and X-ray pulses will appear what will herald the “golden age” of frequency- and time-resolved X-ray spectroscopy giving rise to a plethora of new techniques and perspective directions [2].
2.2 Types of X-ray probes
For TM compounds, -edge spectra enjoy great popularity [4, 5]. They are due to excitation or ionization from the and orbitals (-shell) of the target atom to bound or continuum states, respectively. Being interested in frontier -orbitals containing information on the nature of metal-ligand or metal ion-solvent interaction, dipole-allowed excitation from orbitals is addressed more frequently. Traditionally, solid state samples have been investigated. However, recent developments in the vacuum liquid microjet technique have broadened the applicability of soft X-ray spectroscopy to highly volatile liquid solutions. [39, 40]
Excitation of a system from its electronic ground state to a high-energy and highly non-equilibrium core-excited state triggers a number of relaxation processes which either represent losses or can be used as a spectroscopic probe of material properties. Let us consider them on the example of -edge spectroscopy as illustrated in Figure 1. In general there are two pictures with respect to the electronic states that can be used for the discussion. First, the intuitive and therefore widely used “single electron” MO picture (panel (a)). Second, the more appropriate “state picture”, which is based in the many-particle eigenstates of the electronic Hamiltonian (panel (b)).
Upon absorption of a soft X-ray photon (0.1-1.0 keV), a core hole is created at the metal center accompanied by -edge ionization of the ligands. This process is depicted on the very left of Figure 1 and is subject of X-ray absorption spectroscopy [9]. In this case, an electron is excited from a localized core MO to delocalized bound one. Within the state picture, the X-ray Absorption Spectrum (XAS) is due to transitions within the -electron manifold, i.e. between the ground and core-excited states of the (“neutral”) system.
The core-electron can also be excited into the continuum and the photocurrent of the outgoing electrons can be measured yielding a PES. In the state picture this corresponds to a transition between the initial -electron (“neutral”) system and the -electron ionized system as illustrated in Figure 1b. The proper total final state is formed by the antisymmetrized product of the state of the electron manifold and the outgoing free electron state.
The created core-hole can decay either radiatively or non-radiatively. Radiative decay channels comprise and pathways, which represent about 1% of the total decay probability for the first-row TMs [41]. Depending on whether the initial excitation has been into a bound or continuum state one speaks about resonant (RXES) or RIXS [42] or the NXES, respectively. The major decay channel is due to Auger, CK decay, and Interatomic Coulombic Decay (ICD) autoionization processes [43] (Figure 1a). In the state picture these processes involve a transition between the and -electron manifolds and are reflected in the AES.
In general XAS and RIXS are called photon-in/photon-out spectroscopies, whereas PES and AES are photon-in/electron-out techniques. The combination of X-ray absorption and emission together with photoelectron studies provides a very powerful suite of tools to address electronic structure and specific solute-solvent interactions [5, 44].
2.2.1 Absorption spectroscopy
Absorption spectroscopy provides the most simple type of spectra considered in the context of X-ray excitation as long as the discrete pre-edge structure is considered only. Viewed in the MO picture, XAS probes the unoccupied valence orbitals. Conventional X-ray experiments operate in the weak-field regime [9] and the respective absorption amplitude, proportional to the absorption cross section at frequency , can be expressed in terms of first-order time-dependent perturbation theory (Fermi’s Golden Rule) [45]:
[TABLE]
Here, the and states are ground and core-excited final states with respective energies and . Due to finite temperature , there can be several initial states populated according to the Boltzmann distribution . This especially applies to TM complexes, where the initial high degeneracy of the electronic states can be lifted due to Jahn-Teller effect and SOC, leading to several levels accessible by thermal excitation.
Further, and in Eq. 1 are electric dipole operator and polarization vector of the incoming light. The lineshape function, , characterizes the density of the final states and, thus, the form of the absorption bands. Effects to be considered here include the width of the incoming excitation pulse, homogeneous (lifetime) broadening due to Auger and radiative decay, and inhomogeneous broadening due to influence of environment, e.g., solvent or phonon broadening in solid state.[46] These broadening parameters can be often fitted to best reproduce the experiment.
To address XAS theoretically in dipole approximation, one thus needs to calculate energies of the core-excited electronic states with respect to the ground one and the ground-to-core-excited transition dipole moments. Note that for hard X-ray radiation the dipole approximation breaks down and higher order multipole moments have to be considered.[47]
For -edge spectra, additional complexity enters due to strong SOC, triggered upon core-hole formation and within the states themselves [4]. This leads to a characteristic shape of the spectrum containing two groups of bands, so-called and edges (see Figure 4 below).
2.2.2 Resonant emission spectroscopy (RIXS)
Staying in the weak-field perturbational regime, the radiative decay of the core-excited state prepared upon absorption of X-ray light can be described within second-order perturbation theory. The respective Kramers-Heisenberg expression for the emission amplitude reads [45]
[TABLE]
It contains the same ingredients as the XAS expression, Equation (1), where in addition and denote the energy and polarization of the emitted photon, is the inverse lifetime of core-excited state and state represents the final valence state. Notice that due to the fact that the sum over appears under the square, the radiative channels ending up in the same final state interfere with each other. This photon-in/photon-out process represents an electronic analogue of the well-known vibrational Raman spectroscopy. The only difference is that one addresses transitions not between vibrational levels but between different electronic states. Formally, RIXS corresponds to a non-linear process and therefore is analogous to four-wave mixing but with the last two interactions being of spontaneous rather than stimulated nature [48]. Thus, it is only linearly proportional to the intensity of the incoming light. Drawing on this analogy the overall spontaneous light emission signal can be separated into two contributions: sequential incoherent fluorescence where a population in the excited state is created followed by radiative decay and a direct “coherent” Raman process. While in the former case resonant excitation is required, RIXS also works for off-resonant excitation, i.e. involving only coherences between the participating states.
In the following we will focus on RIXS only. The RIXS intensity can be recorded as a function of incoming, , and outgoing, , photon energy. Thus, it provides a two-dimensional spectrum, resolving both absorption and emission processes. Often one discusses one dimensional cuts at fixed excitation energy of this spectrum only. Alternatively, the so-called Partial Fluorescence Yield (PFY) spectrum at a given excitation energy can be obtained by integrating with respect to the emission energies in some energy window
[TABLE]
providing a spectrum analogous to XAS. Viewed in the orbital picture (Figure 1a), RIXS gives access to the unoccupied MOs as well as to the occupied valence MOs. In state picture (Figure 1b), absorption and emission processes become coupled because of orbital relaxation due to core-hole and valence excitation relaxation. Thus, RIXS approximately maps the valence excited states. Calculating RIXS is a more elaborate task as compared to XAS since these core-hole and valence excitation states may have different requirements to the computational protocol.
2.2.3 Photon-in/electron-out spectroscopy
Photon-in/electron-out spectroscopy represents a powerful suit of methods [25, 49, 50]. Depending on whether the X-ray photon energy exceeds the binding energy of the core-electron or is below the core ionization threshold one distinguishes between non-resonant PES and Resonant Photoelectron Spectrum (RPES), cf. Figure 1a. PES provides direct information on the energies of core orbitals, e.g., the value of their SOC splitting, if viewed from the orbital picture perspective. In the state picture, it corresponds to a transition from e.g. ground state of non-ionized system to the core-excited states of ionized one. Moreover, the shake-up excitations, where ionization is accompanied by valence excitation, can provide valuable insights into the electronic and geometric structure. [51]
The resonant counterpart, RPES, contains two processes – direct ionization from the valence level and autoionization of the core-excited state. The former one represents a usual PES where electrons from valence MOs are ejected having kinetic energies comparable to that of the absorbed X-ray photon. The latter one is an Auger process, i.e. a transition between core-excited state of the non-ionized manifold and the valence state of the ionized manifold accompanied by emission of an electron as depicted in Figure 1b. In terms of orbital picture, it corresponds to core-hole refill by a valence electron and simultaneous emission of another valence electron with high kinetic energy.
The total RPES cross section assuming integration over all directions of the outgoing photoelectron reads
[TABLE]
The structure of this expression is a hybrid of first-order Fermi’s Golden rule for direct photoionization (first term under square) and second order Kramers-Heisenberg-like term for autoionization. The system starts from thermally populated initial states . The incoming X-ray light either ionizes the system by transition to a final state , which is the properly antisymmetrized product of the bound -electron ionic remainder state and the state of the free electron , having kinetic energy . Alternatively, the same state can be obtained by excitation to the intermediate bound core-excited state , which then decays non-radiatively, mediated by the Hamiltonian of the system. This decay is essentially due to the Coulomb coupling which is dominating in the respective matrix element.
There are some more comments needed at this point. First of all, due to the coinciding final state of both processes there is an interference between the direct and Auger parts of the signal. This interference is present also for decay channels from different intermediate states in the Auger part of the signal. Second, Equation (4) sheds light on the nature of the parameters entering also the RIXS expression, Equation (2). In fact, , representing the total Auger decay rate of core-excited state . In principle, radiative decay would add to the total rate, but for TMs autoionization is the dominating process. Next, we note that here we assumed the simple product form of the final state and thus correlations between the ionic remainder and the outgoing electron are neglected. A more elaborate scheme taking interaction of ionization channels requires a scattering theory formulation [52]. Finally, RPES can be also viewed as a 2D type of spectrum [44] as it depends on the incoming photon energy and the photoelectron kinetic energy .
As compared to XAS and RIXS, PES is analogous to the former and AES to the latter. The difference is that PES contains information on the core-excited states of the ionic remainder, whereas AES contains that of the core states of the non-ionized system and valence manifold of ionized system (see Figure 1b). RPES calculations are more elaborate due to the need to compute integrals between bound and continuum states to obtain matrix elements of the of the dipole moment and molecular Hamiltonian .
In the weak field regime, PES cross sections can be calculated in the framework of the Dyson Orbital (DO) formalism [53, 54, 55]. For illustration, we leave the second term in Equation (4) aside and consider only the direct photoionization represented by the first term. Assuming the strong orthogonality between the state and MOs of the initial system, the expression can be rewritten in terms of quasi-one-electron states which are called DOs. It represents an -electron integral over initial and final ionic states . Using the DO, the transition matrix element (neglecting polarization dependence) can be expressed as
[TABLE]
Here, the subscripts and 1 denote -electron and one-electron integration, respectively. This formal reorganization allows to separate system and electronic structure method dependence of the intensity from the actual representation of the photoelectron. Once the DO is computed for a particular pair of bound states, and , it can be further used to estimate transition dipoles for different kinetic energies and even different forms of . Commonly used free electron functions include the plane wave expansion in terms of partial waves or Coulomb waves [54]. The former is appropriate when photodetachment from a negatively charged species is considered, whereas the latter accounts for spherical Coulomb potential of the ionic remainder. In case of TM complexes, one needs many terms in the expansion to realistically approximate the free electron function [53]. This approach has facilitated interpretation of experimental XUV and X-ray PESs of transition metal complexes in solution. [44, 56, 57, 58]
Finally, we would like to mention the so-called Sudden Approximation (SA) [59]. It assumes that the cross section for a pair of states, and , is proportional to . Thereby the kinetic energy dependence of the cross section is neglected such that no bound-to-continuum integrals in Equation (5) need to be calculated. This approximation is usually considered to be justified if the nature of the transitions and thus of the DOs is similar and kinetic energies are large.
3 THEORETICAL METHODS
3.1 General remarks
Theoretical studies of X-ray spectra can be divided into two classes: 1) investigations of closed-shell molecules containing mainly the second-period atoms addressed with single-reference methods and 2) works reporting on open-shell systems, primarily TM complexes. In general, first principles quantum chemical predictions for excited electronic states of TM compounds are significantly more difficult than for compounds containing main group elements. [60] TM chemistry is closely connected to near degeneracy effects, also called static correlation, making methods based on a single SCF-like reference configurations unreliable. Thus, computationally more demanding multi-reference approaches are often required to obtain quantitative accuracy. In addition, the treatment of -electrons requires the inclusion of dynamic electron correlation. Finally, especially for core-excited states, relativistic effects, e.g., SOC, have to be taken into account.
An overview of available methods is provided in Figure 2. In general, corresponding methods can be formulated in time (E,F,G,H) and energy (A,B,C,D) domain, which amounts to solving either the time-dependent or stationary Schrödinger equation. The spectroscopic observable (Equations (1), (2), and (4)) are formulated in terms of the eigenvalues and eigenstates of the stationary Schrödinger equation, . However, the spectra could be equivalently expressed in as Fourier transforms of multi-time dipole correlation functions.[45] For instance for XAS, these are two-point and for RIXS four-point dipole correlation functions [61, 62, 63, 64]. Beyond this perturbational regime, i.e. for strong field excitation, or in cases of (shaped) ultrashort laser pulses one has to resort to explicit solution of the time-dependent Schrödinger equation including the field-matter interaction. Compared to the Quantum Chemistry (QC) methods, Real-Time (RT) propagation approaches have received less attention so far.
Another, “dimension” of the classification in Figure 2 is with respect to the type of MOs that are used to express the many-electron states. In fact, every calculation starts from an MO optimization; the question is whether it is done within the actual excited state method as well (B,D,F,H) or the MOs are fixed during excited state calculations (A,C,E). Finally, QC methods can be distinguished according to the used reference space into single (A,B,E,F) and multi-configuration (C,D,H) approaches. Note that we are not aware of the methods which fall into category G.
In the following, we will review ab initio QC methods, focussing on MCSCF-based methods in frequency domain (Figure 2D and C) as applied to X-ray spectroscopy of TM complexes. However, other methods mentioned in Figure 2 will be briefly discussed as well.
3.2 Basic concepts: SCF
3.2.1 HF theory
The HF method is the most basic MO approach discussed here. Although it is not directly applied to core-level spectroscopies it is central for understanding other methods. It can be characterized by the following points: (i) The -electron wave function , describing the ground state of the system, is represented as a single Slater determinant of one-electron MOs (for inclusion of spin, see Section 3.6). (ii) The optimal set of is searched variationally to obtain giving the best possible estimate of the exact ground state energy under the restriction of (i), see Eq. (6). (iii) The electronic Hamiltonian contains one- and two-electron terms . HF theory is a mean-field approach where the latter are approximated by a one-electron potential . Matrix elements of this one-electron operator are expressed via Coulomb, , and exchange, , two-electron integrals as . (iv) The problem is reformulated in terms of a one-electron eigenvalue equation , yielding orbitals and their energies . Since the Fock operator, , depends on the orbitals the equation is solved iteratively (SCF scheme). Essentially, this is done via a search for unitary transformations of orbitals which give a minimal ground state energy
[TABLE]
where is an anti-hermitian one-electron operator. Thus, the orbital variations are parametrized in terms of the elements of the skew-hermitan matrix , which represents in the basis of MOs; one may think of the elements of as corresponding to pairwise orbital rotations.
In practice, one uses the Linear Combinations of Atomic Orbitals (LCAO) form, where the MOs are represented in an atomic basis, ,
[TABLE]
Thus the coefficients are variationally optimized.
According to its mean-field nature, HF theory does not take into account correlations due to the electron-electron interaction. Electron correlation is at least a two-body effect and should be distinguished from the one-electron orbital relaxation accounted for via the SCF procedure. Different types of correlations can be identified if we adopt for the moment a classical point of view. Two electrons in an atom will try to “occupy” orbits with different radii (radial correlation) or reside on different sides of the nucleus (angular correlation). This type of correlation is called dynamic (or weak) and can be accounted for by allowing to have in addition to the leading HF term a small admixture of other determinants built on the same set but with different integer occupation numbers (excited determinants). This approach will be considered later in the context of the CI method in Section 3.3.1. There are cases, when a single leading term for cannot be found since several of determinants have comparable weights. This more extreme case leads to complete break-down of HF approximation and is referred to as static (or strong) correlation. It will be considered in the framework of MCSCF theory in Section 3.3.2. Finally, one may distinguish differential correlation, when the correlation energy is different in different electronic states.
3.2.2 KS-DFT
The KS-DFT method belongs to the same class as HF, see Figure 2B. The approach is rather different though. Backed by the Hohenberg-Kohn theorems it formulates the energy as a functional of electron density subject to the variational principle. Since is a three-dimensional quantity and thus simpler than the -dimensional wave function, it offers distinct benefits from the viewpoint of computational efficiency. In practice, one considers the Kohn-Sham scheme, where a single determinant is postulated for the ground state. The density is determined as , with being the occupied Kohn-Sham MOs. This approach aims at the exact description of a system, adopting a formally non-interacting electron ansatz for the wave function. The “non-classical” terms which constitute the difference between this HF-like solution and the exact one are thus transferred to the one-electron potential . The energy functional of the electron density which is variationally minimized reads , containing kinetic energy of non-interacting system, “classical” Coulomb repulsion energy, electron-nuclear attraction energy, and the eXchange-Correlation (XC) energy comprising the difference between exact and approximate () kinetic energies, the effect of electron correlation, and exchange interaction. Various approximate exist and numerous techniques for the construction of this functional have been devised.
All shortcomings of Density Functional Theory (DFT) result from the approximate nature of . Most notable is the so-called self-interaction error. In HF, the interaction of an electron with itself is canceled by the exact relation . This is, however, not true anymore if approximate XC functionals are considered. This appears to be critical in many respects [65] and especially for the core-levels of atoms (see Section 3.5 below).
3.2.3 (Quasi)-one-electron methods for X-ray spectroscopy
The KS-DFT method has replaced HF in modern applications. Although being also of one-electron nature, it describes independent quasi-particles, implicitly accounting for (dynamic) electron correlation. Both methods can be used to generate a suitable basis of orbitals for subsequent electron correlation treatment by single-reference methods (Figure 2A), such as CI, PT, or CC approaches as discussed below. The choice of a proper orbital basis is more important for core-excited states than for valence-excited ones. The reason is that removing an electron from the core region where the density of electrons is very high significantly influences electrons occupying other shells due substantial changes in the screening of the nuclear charge, which can be associated with an additional polarization potential [66]. Thus, most of the orbitals will strongly relax to become more tightly bound in energy and more confined in space. Naturally, this type of orbital relaxation is very important for core ionization or the description of the subsequent NXES; although there might be exception, see Reference 67. This may require to perform an SCF orbital optimization to account for the core-hole excited state to accurately predict X-ray spectra. The simplest way to do so is to obtain relaxed orbitals where the core-excited atom is replaced by the next element from the periodic table. Within the so-called or equivalent core approach [68] these orbitals are used for calculations of the parent molecule. Important applications include the interpretation of PESs and thermodynamic data [69].
Although both HF and DFT techniques were described in the previous section as purely ground state methods, in principle, they can be applied to excited states as well. In this case, the excited state is also represented as a single Slater/Kohn-Sham determinant analogously to the ground state; this is called SCF method since the transition energy is be obtained as the difference between respective SCF energies. Its application to core-excited states has been first suggested in Reference 70. Because some of the features of the SCF method will be important for the later discussion of MCSCF calculations of core-excited states, more details will be provided in the following.
A major issue for all methods which involve orbital optimization (cf. Figure 2B,D) is the so-called variational collapse. Let us assume that we have setup a wave function with a hole in some core orbital and an additional electron in a virtual orbital and naively perform a variational SCF optimization (note that no spatial symmetry is assumed and the spin coincides with that of the ground state). For a randomly chosen initial orbital guess, the virtual orbital hosting an excited electron will be transformed to the core one in a series of unitary transformations at every SCF iteration. In other words, without additional tricks the wave function is likely to collapse to the ground state and core-excited states cannot be accessed. The only reason why such calculations of deep core-hole excitations might converge to the desired local minimum solution is that usually these deep-lying core orbitals have a negligible overlap and couplings with other MOs. For larger molecules and shallow holes this, however, might not work leading to oscillations, divergence, or to collapse to the ground state.
To circumvent this problem, several schemes have been suggested (Figure 2B). In approach I, one searches consecutively for higher-lying variational solutions, e.g., under the constraint that each next solution should be orthogonal to all previously found, although this additional condition is not always rigorously applied, see below. [71] This might be a tedious procedure if more than a few core-excited states are needed. Alternatively, in approach II, a one-electron orbital basis is constructed in such a way, that the differences of orbital energies give good estimates of the X-ray transition energies ( approach).
Several different schemes along the lines of approach I have been suggested to perform a constrained search. They differ in the construction of occupied and virtual MOs involved in the excitation and in setting their occupation numbers, see Reference 71 and references therein. Exemplarily, one should mention the application of a multiple hole/particle algorithm within orthogonality constrained DFT to the calculation of -edge spectra [72]. Moreover, state-following algorithms can be applied to avoid the variational collapse without explicit setting the orthogonality constraint. Here, one specifies the desired occupation number pattern for the initial guess and attempts to maintain it during SCF iterations. This can be accomplished by applying different kinds of projectors to separate core orbitals from others at every iteration, see Reference 73 and references therein. Alternatively, one can utilize a maximum overlap criterion [74, 75] to identify the core-hole orbital at each SCF step. In principle, one can also apply conventional SCF subject to a special construction of the initial guess as discussed in References 76, 77. Here, in the first step the singly-occupied core orbital is frozen and thus it cannot mix with other MOs via unitary transformations , i.e. one gets a constrained SCF minimum. Next, one uses this partially relaxed wave function to converge to the local energy minimum, corresponding to a fully relaxed core, making use of quadratically converging algorithms.
In approach II, one aims at obtaining orbitals partially relaxed due to presence of the core hole such that differences of orbital energies, , are a good approximation of transition energies . Several versions have been suggested, differing in the included fraction of the core hole and excited electron causing orbital relaxation, e.g., full or half of the core hole and full neglect or inclusion of the excited electron [78, 79]. One of the prominent representatives of approach II is the * Transition Potential DFT (TP-DFT)* method [80] reviewed in Reference 81. Here one chooses a half-occupied core orbital and neglects the effect of the excited electron. Such a choice of a partial occupation can be viewed from a more general viewpoint of ensemble DFT [82, 83], where the number of electrons in the system varies from to . For the exact XC potential, the energy between points with different integer particle numbers and changes linearly with fractional occupation and the derivative of the energy exactly amounts to the orbital energy at any value of [84]. This is however not the case for approximate XC functionals as the energy behaves nonlinearly. [85] According to Lagrange’s mean value theorem the exact relation between the energy derivative and is restored at some point between and electron numbers. In a sense, TP-DFT is pretending to search for this optimal point approximating it by . More generally, fractional occupation can be considered as a “fitting” parameter to generate optimal orbitals for X-ray calculations.[86, 87]
Note that in a number of works an even more simplified protocol belonging to approach II, the so-called * Frozen Orbital Approximation (FOA)*, has been used. Here, only ground state SCF is carried out and intensities and energies are fully based on transition matrix elements and energy differences between ground state Kohn-Sham orbitals. FOA is thus completely neglecting excited state orbital relaxation and in this respect it is similar to Koopmans’ theorem approach for HF [88] or its analogues for DFT [84, 89, 90] as applied to X-ray PES [66]. However, it gives reasonably good results for -edge XAS and NXES of TM compounds [91, 92].
For approach I, the calculation of transition matrix elements is quite complicated in case of non-orthogonal wave functions and can lead to spurious results. Within approach II they are usually estimated on a quite approximate basis as . Here, and are SCF solutions for ground and th core-excited states and and are orbitals corresponding to the respective one-electron excitation. Thus, the transition matrix element between -electron states is replaced by a one-electron integral. The factor is the -electron overlap which accounts for the relaxation of other orbitals not involved in the excitation. It is usually assumed to be constant, but can be calculated in a more elaborate way to improve computed intensities [93].
The main advantage of the SCF method is that one can use the well-developed machinery of ground state SCF theory for obtaining excited state properties without much of new development. One should keep in mind, however, that in case of DFT-based techniques the accuracy of the underlying density functional is of vital importance. SCF-like methods have been widely applied to the -edges of the second period elements from small molecules [94, 95, 75] to macromolecules [96, 97] and condensed phases [98, 79, 81, 93]. It has been even attempted to calculate energies of -edge transitions of Si, P, S, Cl, and Cu containing systems [75], although without proper treatment of relativistic effects like SOC. Finally, SCF can also be used to optimize electronic states with multiple holes residing on the core or valence MOs as relevant for Auger spectroscopy and shake-up XAS transitions. [99, 100]
3.3 Methods including correlation
3.3.1 Configuration interaction
Apart from SCF-like approaches adapting the HF or KS-DFT ground state to compute higher-lying states, there is a plethora of single-reference energy-domain excited state methods; cf. Figure 2A,B. The respective excited states can be represented in the general form
[TABLE]
where is the ground state obtained with any suitable method and is an excitation operator. In single-reference methods, is usually taken to be a single ground state Slater (or Kohn-Sham) determinant, , and orbitals are not further optimized. We start the discussion of this class of methods with the CI technique, being conceptually the simplest although not computationally most efficient representative. The CI framework facilitates understanding of other (even conceptually different) methods as it provides a very pictorial tool of wave function composition.
CI is one of the most logical ways to improve on the SCF description [101, 102]. First it provides a systematic way for inclusion of electron correlation and second it can be naturally extended to the calculation of excited states. Since ground state and excited Slater (or Kohn-Sham) determinants form a complete basis, a CI state can be written as
[TABLE]
Here, the determinants are built from the fixed SCF orbitals , obtained prior to the CI procedure, e.g., by ground state HF or KS-DFT. In practice, the coefficient matrix is obtained by writing the electronic Hamiltonian in the basis of determinants which is iteratively diagonalized using the Davidson algorithm [103].
Using all possible determinants in the expansion Equation (9) corresponds to the exact Full (FCI) solution for a given basis set. Its computational complexity grows exponentially with the number of basis functions and truncation schemes are usually applied. Starting point is the classification of the terms in Equation (9) as being unexcited reference, singly, doubly, triply, and so on excited electronic configurations:
[TABLE]
In this notation, indices and denote occupied orbitals from where an electron is excited and virtual orbitals to which electrons are excited in the determinant , respectively. To make the problem computationally tractable, one truncates the expansion Equation (10) including, e.g., all determinants up to a given excitation class. This gives rise to Singles (CIS), Singles Doubles (CISD) methods, and so on. Since electron correlation is at least a two-body effect, doubles are necessary to correct for it on top of a HF reference. Singles are important for description of charge density and thus for properties like transition dipole moments. If only singles are included, it might provide a good tool for spectroscopy with near-HF quality for the excited states.
From the viewpoint of quantum chemistry the core-excited electronic states are not any different from the valence-excited ones. But, the problem is that one would need to calculate thousands or even millions of states to reach the highly lying levels relevant for X-ray spectroscopy. This problem stems from the fact that, if one is interested in the th eigenvalue, the efficient Davidson algorithm [103] standardly used for this purpose, diagonalizes at least an matrix and thus obtains non-interesting eigenvalues below . Since matrix diagonalization is the main computational bottleneck such a brute force approach wastes computational resources if this is possible at all. Let us consider, for instance, two systems H2O (-edge) and [Fe(H2O)6]2+ (-edge). If one takes a quite moderate triple-exponent TZP basis and accounts for singly and doubly excited configurations which is the minimal reasonable choice to include electron correlation within the CI approach, the total amount of valence states is 46 872 and 136 370 360, respectively. This means that to reach the first - or -core-excited state one would need to explicitly calculate a huge number of states, where in case of XAS often only few low-lying ones are of relevance for the spectrum (due to the factor in Equation (1)).
There are several ways to circumvent this difficulty. First of all, one can stay within the conventional Davidson scheme and project onto a smaller subspace of trial functions excluding most (or even all apart from the ground state) valence states (Figure 3). In practice one makes use of the concept of an Active Space (AS) of MOs, where certain orbitals (i.e. the core orbitals for XAS calculations) are allowed to change their occupations, while for the others electronic excitations are forbidden (also called Core-Valence Separation (CVS) [104, 105, 106]). Equivalently, the subspace of electronic configurations can be limited according to simple Koopmans’ estimates of the state energy based on the orbital energies as realized within Restricted Excitation Window Time-Dependent Density Functional Theory (REW-TDDFT) [107, 108].
This “reduced subspace” concept is illustrated in Figure 3a. It shows a pictorial view of the CI-like Hamiltonian matrix expressed in the basis of electronic configurations resulting from valence, single core-hole (1h-core), and double core-hole (2h-core) excitations. The overall dimensions of the Val–Val block are those thousands or millions mentioned before. In fact, in XAS one is usually interested in relatively small sub-block (marked with blue color) of the 1h-core–1h-core block including only few tens, hundreds or even thousands of states if only the discrete pre-edge region is of interest. In case of RIXS, one needs in addition a number of low-lying valence-excited states also denoted as a blue sub-block of the Val–Val block. The 2h-core–2h-core block is usually completely neglected. The same holds for the coupling Val–1h-core and 1h-core–2h-core blocks. The motivation for neglecting off-diagonal couplings between two configurations from (valence) and (core) is provided by PT. Here, the error is estimated to be of the order of in the wave function and in energy, where is the off-diagonal Hamiltonian matrix element and is the usually large difference between energies of these configurations standing on the matrix diagonal. In fact, the restricted subspace approach is equivalent to neglecting specific type of two-electron integrals of the form , where three of the orbitals are valence and one is core orbital or vice versa and where orbitals in the bra are both core and in the ket both valence [104, 105, 106]. Hence the diagonalization of the enormously large CI matrix in Figure 3a can be replaced by a Davidson diagonalization of the small (blue) sub-blocks. The uniform error introduced by CVS was estimated to be of the order of 0.5-1.0 eV for -edges of the second period elements [109, 110].
Another way to access core-states is to modify the Davidson algorithm itself. For instance, in Reference 111, a modification of the standard algorithm [103] employing the so-called “root-homing” technique has been suggested. It utilizes the maximum overlap criterion (similar to SCF) to identify the desired state during each Davidson iteration step. A more recent modification [112] builds on the idea of Reference 113 and introduces an intermediate projection of the matrix to be diagonalized onto the subspace of the trial vectors having energies in a predefined window. After diagonalization the eigenvectors are projected back onto the full space. The approximate energy of the trial vector is estimated as the orbital energy difference. This approach is superior to the simple index fixing used in REW-TDDFT [107] as it systematically includes contributions out of the initially predefined reduced space in a consistent way. This gives a good approximation for the high-lying states in the full space of trial vectors without explicit calculation of the lower-lying ones. For the high density of states typical for X-ray spectra, a more advanced adaptive hybrid algorithm can be applied to improve the convergence [114]. Apart from CI [111, 115], so far such approaches were realized for Linear-Response TDDFT (LR-TDDFT) [112, 116, 114] and Equation-Of-Motion Coupled Clusters (EOM-CC) [117] methods discussed below.
In the truncated CI ansatz, Equations (9) and (10), the underlying one-electron orbitals determine accuracy and convergence of the CI expansion. That is why the choice of the orbital basis is of considerable importance for the overall efficiency. It is known that virtual MOs from ground state HF calculations are too diffuse especially for large basis sets and thus represent a poor choice. The suggested remedies are to use improved virtual orbitals [118], natural orbitals [119], or even KS-DFT orbitals [120]. In Reference 87, different types of the relaxed initial orbitals (restricted, unrestricted, and restricted open-shell HF and KS-DFT) have been considered and their performance for core-excitations within CIS is deduced by comparison to experiment. It was found that a HF reference where orbitals are relaxed to the core-hole is most suitable. Ground state frozen orbitals are not that good and produce errors of order of 10 eV in core excitation energies.
CI has been traditionally used for predicting X-ray spectra due to -edge transitions in small molecules. [111, 115, 121, 122] For -edge spectra of TM complexes, the * Configuration Interaction on orbitals (DFT-CI)* approach enjoys great popularity. In this method, the standard CI operator is applied to the ground state KS-DFT determinant (). [123] The idea is to take the dynamical correlation effects at the level of DFT into account and to incorporate stronger (static) correlation effects by CI. As a result, the number of important determinants and thus the size of the Hamiltonian matrix can be substantially reduced. This method largely relies on the better suitability of Kohn-Sham orbital energies as estimates of the electronic transition energies. Since these orbital energy differences are dominating the diagonal of the CI Hamiltonian matrix in the basis of determinants, it makes the major contribution to the final CI energies [124]. However, since the CI is built on top of quasi-particle orbitals which already implicitly include dynamic electron correlation a number of points have to be considered. First, double counting of correlation effects should be excluded which can be done by energy-dependent exponential scaling of the CI couplings between determinants [125]. Moreover, the two-electron integrals calculated on KS-DFT MOs are overestimated and need to be scaled down even in case of CIS, where no double counting occurs due to its “uncorrelated” character. [124, 126] The need to parametrize this scaling makes the method semi-empirical in nature although the number of parameters is kept as small and universal/transferable as possible. The parameters determined, e.g., by least-squares fitting to experiment, depend on many aspects: XC functional, multiplicity and character of the target excited states, and so on. Thus, the variants usually applied to valence TM spectroscopy [127, 123] need to be adjusted to address core spectra, e.g., by changes of the virtual orbital energies. [124]
The hybrid nature of DFT-CI provides another distinct advantage making use of the correlated origin of the Kohn-Sham MOs. For TM complexes with covalently bound ligands it reproduces the effect of bonding better than HF which predicts bonds to be in general too ionic [60]. There are two variants of DFT-CI which were applied to X-ray spectra of TMs. In their approach, Ikeno et al. study crystalline metal oxide phases within a cluster approach.[128, 129, 130, 131, 132, 133] No corrections for double-counting or scaling of two-electron integrals is performed. As an XC functional the simplest local density approximation is applied, thus containing no exact exchange. Note that the CI problem is solved only in small subspace of MOs containing few electrons (essentially , where , and also ligand orbitals to account for charge-transfer effects) and the other electrons and their interaction with active ones are taken in an averaged approximate way [134]. This makes this approach being strongly related to the Ligand Field Multiplet (LFM) semi-empirical method described below, although no fitting of parameters beyond usual DFT level is performed. Despite of the absence of fitting parameters and the approximations introduced, this method demonstrates a remarkable level of agreement with experimental XAS and other types of spectra. A similar CI approach based on state-averaged relativistic Dirac-Hartree-Fock orbitals (no DFT) has been applied to X-ray and PESs of UO2 [135] and MnO [136], respectively.
The second DFT-CI method specifically designed to calculate TM -edge spectra is DFT-based Restricted Open-shell Configuration Interaction Singles (ROCIS) [126]. The restricted open-shell SCF calculation ensures the proper reference ground state configuration with unpaired electrons which is typical for TM complexes. Being a DFT-CI technique, it needs rescaling of two-electron integrals to additionally reduce electron-electron interaction similar to LFM. In practice, three parameters which are universal across the periodic table are introduced: two scaling Coulomb and exchange terms on the diagonal of the CI matrix and one common scaling parameter for all off-diagonal elements. With this it puts emphasis on the diagonal of the CI matrix containing basically the excitation energy for a given orbital pair. These scaling coefficients are optimized only for two density functionals: B3LYP and BHLYP in a specific def2-TZVP(-f) atomic basis. Thus, this method is semi-empirical making use of a training set containing experimental spectra. From the viewpoint of CI it accounts only for single electron excitations thus being intrinsically of SCF quality for the excited electronic states and not including any additional electron correlation on top of KS-DFT MOs. Although being a relatively crude approximation, it allows to keep the configuration space very compact at the price of neglecting shake up excitations. Therefore, it can significantly underestimate the density of states in energy regions where such satellite bands are occurring. On the other hand, this approach is computationally very efficient, especially in connection with a local correlation treatment [137]. It has been applied to models including up to several tens of TM atoms [138]. This method is one-component (see Section 3.6) but scalar relativistic effects and SOC are included what is prerequisite to reproduce -edge X-ray spectra. Overall, this approach provides fairly good agreement with XAS experiments at least for highly-symmetric systems [126].
Finally, we briefly mention LFM* theory* as another semi-empirical CI-like method. It has played an important role in understanding the complex structure of X-ray spectra during the last three decades. Within the MO paradigm, the covalent bonding situation is usually described as a one-electron effect resulting in mixing of orbitals of bound atoms. LFM follows more the valence bond theory route, where these effects are taken on the level of many-electron CI between configurations with localized electrons.
LFM theory has its origin in ligand field theory developed back in 1950s [139], but there are several modifications to accommodate the finer spectroscopic features of TM complexes. It is an atomic theory focussing on an isolated metal ion with, e.g., a configuration (corresponding to an electronic state ). The in general complicated chemical environment is taken into account as a perturbation via an effective electric field that disturbs the spherical symmetry of an atom [140, 141]. Since the effect of ligands goes beyond effective electrostatic interaction especially in case of highly covalent metal-ligand chemical bond, this model needs to be significantly extended. The major extension is to incorporate sophisticated bonding contributions in terms of charge-transfer between metal and ligand [142]. For this purpose in the framework of X-ray spectroscopy, one needs to consider not only and electronic configurations with “neutral” ligand () and either fully filled -shell () or having one core hole (), but also states where an electron is transferred, e.g., from the ligand to the -shell – and .
If one extends the number of basis functions in which the LFM Hamiltonian is written to the size of a few or even tens of thousands electronic configurations, upon diagonalization one obtains a complex structure of eigenvalues called multiplets. These multiplets originate from the manifold of atomic levels, which are additionally split in energy due to the ligand field and charge-transfer effect.
Often, LFM theory serves as a model for semi-empirical fitting of a number of parameters to reproduce experiments (for an overview see Reference 4). The Coulomb and exchange interactions within core and valence manifold as well as between them are parametrized via Slater-Condon integrals precalculated at the HF level and tabulated for each element. Additionally, their numerical values are often reduced to 80% or even smaller, to roughly account for electron correlation. The valence and the core SOC is also introduced semi-empirically as a parameter within or coupling limits (see below) depending on the TM ion considered. Another parametrized input for the Hamiltonian is the ligand field. In case of high symmetry TM complexes, e.g., octahedral or tetrahedral, only one effective parameter is used. However, the number of ligand-field parameters grows when symmetry is lowered. Finally, the charge-transfer parameters are adjusted to fit experimental results. The overall number of adjustable parameters can become quite large, thus complicating the fitting procedure. Due to its semi-empirical nature this theory represents a valuable tool to postanalysis of experiments but has only limited predictive power. The transition energies are usually calculated with an accuracy not better than 1.0 eV but in some cases an impressive agreement with experiment has been achieved.[4]
To parametrize more complicated ligand fields the necessary information can be taken from ab initio calculations. The tandem of the simplicity of LFM theory with the versatility of first principles approaches gave a rebirth to the well-established concept. [132] Moreover, band structure codes can be utilized for such a parametrization in case of solids [143, 144].
3.3.2 Multi-Configurational Self-Consistent Field method
Multi-configurational methods have been introduced to describe bond dissociation, distorted geometries, near-degenerate ground states, and electronically excited states. During decades of application, these methods have acquired a reputation of being the “golden standard” for this type of problems. [145, 146, 147, 148, 149, 150, 151, 152, 153] From the 1980’s on, when the first efficient and rapidly converging algorithms for MCSCF have been developed, also -edge core-level spectra for diatomics and small polyatomic molecules have been calculated. [76, 154, 155] During the last years investigations of -edge X-ray spectra of TM compounds emerged, where it has been shown to be able to cope with highly excited states in multiconfigurational situations [156, 157, 158].
The MCSCF method is a hybrid of the HF and CI approaches. It employs the CI multi-determinantal ansatz, Equation (10), for the many-body state,
[TABLE]
with the important difference that not only the CI coefficients, , are variationally minimized, but also the set of coefficients , within the LCAO ansatz
[TABLE]
The MCSCF wave function can be parametrized as
[TABLE]
where operator describes an orthogonal transformation of a set of MOs which are specific to state [150]. The energy functional can be expressed as
[TABLE]
Upon joint optimization of CI and MO coefficients, the one-electron MO basis adapts to the description of electron correlation effects. This allows to quite reliably account for static correlation (multiconfigurational character) and (near-)degeneracies of the electronic states, which are typical for TM compounds.
However, this simultaneous treatment of orbital relaxation and electron correlation leads to a considerable increase in complexity of the actual simulations. The number of CI configurations and MO coefficients quickly increases with the number of atoms and the number of basis functions per atom. The straightforward application of Equation (11) considering all orbitals on equal footing is rather demanding if not impossible already for medium-sized systems. There have been attempts to individually select the most important configurations, e.g., via perturbation theory. [159, 160] A more robust and popular solution is provided by the concept of the AS. At the most general level it corresponds to the disjoint subsets of MOs for which different levels of CI expansions are performed. Historically, the most popular variant is the Complete Active Space (CASSCF) approach [161], cf. Figure 3b. Here, the orbitals are classified in three groups: (i) those which stay doubly occupied are called inactive, (ii) those which are unoccupied, virtual or secondary, and (iii) active ones, forming a subspace where all possible distributions of some number of active electrons are allowed, thus corresponding to a subspace FCI calculation. It is important to note that inactive orbitals are treated at the HF level and that is why the scaling with the physical size of the system is not prohibitive. The exponential growth of the computational demands can be thus mitigated or even avoided by treating only the local correlated subsystem within the AS. For fine tuning with respect to important configurations, one can introduce additional subspaces restricting the number of electrons which can be excited from or to them. This leads to the RASSCF method [162, 163] if the number of active subspaces is equal to three or, in general, to the Generalized Active Space (GASSCF) method [164, 165] or similar approaches [166].
To sum up, MCSCF unites the benefits of the SCF and CI allowing not only for orbital relaxation upon core-hole formation but also for strong correlation effects. Therefore, it optimally suits the treatment of TM systems which are well-known for static correlations. Moreover, as we have seen for the case of CI for core-level calculations, the most pragmatic and robust approach to core-excited states employs restricted subspaces of orbitals and excitations between this subspaces. This naturally leads to the idea of using CASSCF or RASSCF for X-ray spectroscopy. It should also suit to probe with X-rays the excited state dynamics involving bond cleavage or conical intersections, being a native habitat of MCSCF-based methods.
The choice of included orbitals in the AS fixes electron configurations considered within CI treatment. This has to be done in a balanced way as in general correlation effects do not naturally split into strong and weak ones but continuously fill the range of importance. Thus, the selection of the AS has been for a good while considered to be an art. As of now there is a consensus on how to select a more or less optimal AS in general non-pathological cases [150, 167, 168, 169, 170]. The AS should comprise: (i) orbitals which are involved in the excitation to a particular state , (ii) all orbitals hosting unpaired electrons, (iii) strongly correlating orbitals – usually respective bonding and antibonding MO pairs, (iv) MOs which are responsible for near-degeneracy, e.g., and if the chemical bond is dissociating or degenerate -orbitals for TMs or -obritals for lanthanides and actinides, (v) more specifically in case of -TMs, and orbitals might be included for the elements in the beginning of a row to include semicore correlation. For the later elements of a row, additional set of orbitals is required to recover radial dynamic correlation of the electrons in the non-bonding -orbitals already in the AS (double-shell effect). Note that some of the rules might be redundant since, for instance, electrons are strongly correlating and at the same time respective orbitals can be singly occupied and degenerate. Naturally, to perform a computationally feasible calculation, the number of active MOs as well as active electrons needs to be kept as small as possible.
On the one hand, core-excited states can be considered as quite difficult from the viewpoint of selection of AS since the number of -electrons changes in course of excitation/deexcitation or (auto)ionization. On the other hand, in practice this does not represent a problem. Exemplarily, let us consider, how to choose the AS for a core state RASSCF calculation of the Fe(CO)5 model system [18], see Figure 3b. Although Fe(CO)5 has a high symmetry, it is not obeying atomic selection rules. However, X-ray excitation involves a very localized part of the electronic system, having predominantly atomic nature. Thus, we can still use approximate angular momentum () selection rules. Therefore, to study dipole allowed -edge transitions one needs to include at least and orbitals into the AS as illustrated in Figure 3b. The AS is further subdivided into RAS1, RAS2, and RAS3 subspaces. Usually, one is interested in singly core-excited states whereas doubly core-hole states are not relevant (Figure 3a). To account for this and simplify the calculation, orbitals are put in a separate subspace RAS1. Further, only one electron is allowed to be excited from RAS1 to other active orbitals, meaning that RAS1 contains 6 (for valence states) or 5 (for core states) electrons distributed in the three orbitals. The electrons are usually highly correlated what especially applies to Fe(CO)5 being a covalent complex [169]. Therefore, one would prefer to include them in a separate subspace, RAS2, and construct FCI-type configurations. Apart from the doubly-occupied non-bonding orbitals which reside almost exclusively on the Fe atom, there is a pair of bonding/antibonding and orbitals which are essential for correlation treatment. Moreover, one of the two main transitions in XAS is due to the excitation and in RIXS the relaxation plays an important role. [18] Additionally, one can include ligand orbitals, e.g., in the RAS3 subspace and limit the number of electrons which are allowed to be excited to them from RAS1 and RAS2, e.g., to one. It is reasonable to include orbitals which have a mixed and non-bonding character. These orbitals are involved in the transitions giving rise to an intense absorption feature [18]. These empty ligand-dominated orbitals can be included to describe effects of backdonation and explicit charge-transfer excitations as analyzed for the Fe(CO) [18, 22] and [Fe(CN)6]. [19, 158] Remarkably, the radial nodal structure of these MOs resembles that of the “double-shell” orbitals. Thus, being correlated, or in other words allowing for higher than single excitations to RAS3, they might help to describe radial correlation. However, if one focuses on reproducing transition strengths somewhat sacrificing accuracy in energy, single excitations should suffice [18]. If one is interested in other charge-transfer effects, one might add additional orbitals to RAS1 or RAS3 spaces. All other occupied orbitals are set inactive and stay always doubly occupied.
The complexity of the selection of the AS for TM compounds strongly varies with the covalence of the metal–ligand bond. One can state that covalent complexes correspond to particularly strong static correlation, whereas ionic complexes can be essentially singly-configurational [168, 169]. In case of ionic complexes [M(H2O)6]n+, the AS can be kept even more compact including just three and five orbitals ( non-bonding and two anti-bonding orbitals in octahedral symmetry) [157, 171, 53, 21, 44]. Other filled orbitals with character, for instance, the bonding counterpart, can be added to describe further shake-up and charge-transfer transitions as exemplified for, e.g. [FeCl6] [158] and [Mn(H2O)6]2+/3+ complexes [21]. Inclusion of orbitals into the AS accounts for allowed excitation, having, however, lower intensity without notable influence on -edge spectra [157]. It might be still important for accurate reproduction of experimental spectra [172].
As a byproduct of having both filled and unfilled (or half-filled) valence MOs in the AS one obtains a fraction of valence electronic states. In fact, states with valence character can be used to calculate RIXS and RPES. That is why, one should always take care that not only core but also bright valence states (in a sense of corevalence radiative relaxation) are represented appropriately. This condition is automatically fulfilled for the AS shown in Figure 3b. Finally, by choosing the AS one can switch on and off various effects, e.g., in addition to back relaxation one can account for dipole-allowed decay by simply adding orbitals to the AS [44].
As a note in caution: Imposing constraints on orbital subspaces does not in general case solve the problem of variational collapse. The matrix in Equations (13) and (14) contains terms which are interchanging active MOs with inactive and virtual ones. Effectively, those orbitals, which correspond to the largest coupling elements in the Hamiltonian, will stay in or enter the AS. Thus, there is always a probability that the calculation will not converge to the desired local minimum of the energy (with core orbitals in the AS) but to the global one, where core MOs are rotated out of AS. As mentioned already for the case of single-reference SCF, if the initial orbital guess is reasonable then, due to small overlap of orbitals with the valence ones and using at least a second order algorithm, the initial orbitals will tend to stay active and the system will fall into the closest local minimum of energy. In our experience, this procedure works quite well for high-spin states if the orbitals optimized for the lowest valence excited states are used as a starting guess. Nevertheless, a systematic convergence to the desired local minimum cannot be ensured and often fails for medium- and low-spin electronic states of TM complexes. A possible solution might be to use a core-hole relaxed initial guess as suggested by Ågren et al. [76, 77].
Non-desired rotations can also be suppressed by restricting the elements of mixing orbitals with others to zero, i.e. to perform a constrained SCF search. This approach is justified by the findings in References 66, 173, i.e. that a hole in some deep-core shell does not influence much other orbitals in the same shell and thus relaxation of orbitals can be neglected (see also References 76, 77). In addition, one can utilize symmetry arguments in case of centrally symmetric systems. [174, 175].
Another “brute force” solution is to freeze all occupied orbitals in the inactive subspace, while still allowing for rotations between RAS subspaces and virtual orbitals.[157] This approach still includes some orbital relaxation due to possibility to “borrow” some orbital contributions from the virtual space and redistribute them among active MOs. Although it sounds quite restrictive, in fact, it produces results which agree with experiments fairly well as illustrated in Section 4. This approach allows to efficiently compute a few thousands of the electronic states which are needed to cover all relevant valence and core levels of the neutral and ionized systems as depicted in Figure 1b.
In fact, doing an MCSCF calculation produces non-orthogonal electronic states since MOs for different states are not orthogonal with all the negative implications for the calculation of the transition properties. It is connected to notable numerical effort as the () set needs to be determined for each electronic state separately, whose number for core-excited states can easily reach few thousands. To circumvent this and improve convergence properties, the state-averaging procedure is routinely employed [177]. Within this approach the averaged ensemble density and energy over several electronic states are obtained via variational minimization of the functional
[TABLE]
with respect to a common set of orbital rotations. According to the most general formulation of the respective variational theorem by Gross-Oliveira-Kohn [178], the weighting coefficients should not increase with the state number. In practice, the weights are usually taken to be equal. Thus, the index for the set of orbitals in Equation (13) can be omitted as they are the same for all states in the ensemble. Having a common set of MO coefficients, a more balanced description of several states is achieved and the difference in energies is resembled by the CI. Such a procedure allows to obtain a set of orthogonal non-interacting electronic states. However, via averaging, the orbitals loose their optimality for the CI-expansion. Nevertheless, averaging allows to attain quite good agreement with experiment. The state averaging for core states has been suggested by McWeeny [179, 173] and used to interpret results of X-ray Photoelectron Spectrum (XPS) and Auger spectroscopies.
On the one hand, due to the large number of core-excited states which usually need to be accounted for, state averaging is a highly advantageous procedure as it reduces the computational effort. On the other hand, if the optimal orbitals for different states are substantially different, this state-averaging procedure becomes not eligible. One might expect such a situation if a number of valence and core states are optimized simultaneously since the former require no relaxation upon the influence of the core hole and the latter do. (A separate averaging of valence and core states has been also suggested [180].) The unexpected practical solution to this problem is that even if one does democratic averaging over all the states obtained within a given AS, the results agree well with experiments. The democratic average can be viewed as accounting for non-integer core-hole occupation somewhat similar to the TP-DFT approach. Thus, the orbitals represent some compromise between needs of valence and core states [19]. In passing we note that this reminds on the procedure where all the irrelevant lower-lying states are averaged with the interesting higher-lying ones to improve convergence and mitigate variational collapse [177].
3.3.3 Restricted Active Space Perturbation Theory (second order) (RASPT2)
The description of electron correlation within MCSCF can be considered as unbalanced since it accounts for near-degeneracy effects (static correlation) but in addition also partially for the dynamic correlation within the AS. The MOs outside the AS are treated at the HF level with the respective moderate costs, see Figure 3b. A more balanced description requires a more complete account for the dynamic correlation contribution and can be achieved via PT applied on top of MCSCF. Here, one considers the correlation in an approximate way due to, for instance, single and double excitations between inactive, active, and virtual orbital subspaces in case of PT up to second order. The second order energy correction reads
[TABLE]
where is the MCSCF state, Equation (13), to be corrected by PT, and configurations with respective energies correspond to the single and double excitations mentioned above, is the difference between the true Hamiltonian and its zero-order approximation, and is the level shift. Among different flavors of Multi-Reference (MR)-PT [181, 182, 183, 184, 185] differing in the definition of the zero-order approximation, Complete Active Space second order (CASPT2) [186, 187] and its restricted variant RASPT2 [163] have enjoyed popularity for excited states of TM complexes in general [188, 167, 169] as well as for TM -edge X-ray spectra. A concise introduction to this method listing main peculiarities and inherent problems can be found in Reference 189; for a more detailed discussion see also References 167, 188.
At this point we would like to highlight two issues: (i) CASPT2 or RASPT2 cannot correct for the effect of a too small AS. In fact, usually a larger AS than the minimal one is required to get accurate PT results. This is reflected in the AS selection rules listed above, for instance, the necessity to include “double shell” -orbitals. In principle, this effect can be mitigated by using the multi-state version of RASPT2 [190]. Here, the effective Hamiltonian is constructed in the basis of single-state RASPT2 wave functions and subsequently diagonalized giving a more precise description. However, it might be impractical for core states due to their large number.
(ii) The energy expression in Equation (16) can be subject to singularities due to the vanishing denominators, giving rise to the so-called intruder state problem. This is especially severe for core-excited states which can easily get degenerate with some configuration. The natural solution is to increase the AS if possible. Another solution is to apply a real-valued level shift (Equation (16)) [191, 192], which artificially lifts the degeneracy with the unwanted weakly-interacting intruder states. An even more elegant solution is the introduction of purely imaginary level shifts [193], which is in fact utilized for the core-level computations. Nevertheless, choosing a universal level shift which is good for a large number of states at the same time might be problematic and one has to stay at the RASSCF level. Perhaps the best solution is known as -electron Valence second order (NEVPT2)[183, 184]. It explicitly includes two-electron terms in the zero-order Hamiltonian and is manifested to be intruder state-free. However, it has not yet been widely applied to X-ray spectra calculations, except the work in Reference 172.
Finally, we note that the effect of RASPT2 depends on the covalence of the complex [169], with relatively small effect for aqueous metal ions [156, 21] but large impact for systems, like metal hexacyanides, where the affordable AS is not big enough to include all important ligand-dominated MOs [174].
Recently, the RASSCF/RASPT2-based multi-reference approach to X-ray spectroscopy of metal complexes was shown to be quite efficient in unraveling the nature of different transitions in static XAS [157, 158, 44, 174, 180, 194, 24, 172], RIXS [156, 18, 171, 19, 21, 196, 197], and core PES [198, 53, 199, 44, 51] as well as dynamics studies [22, 56, 200, 31]. Selected examples will be presented in the Section 4.
3.4 Time-domain approaches
A conceptually different approach is the direct solution of the Time-Dependent Schödinger Equation (TDSE) (for a review, see Reference 201) employing a time-dependent Hamiltonian, , with the field-free molecular Hamiltonian and the molecule-external field interaction . The state can be written in some time-independent basis, e.g., , with the temporal evolution recast in the form of time-dependent coefficients. This expansion can be done on the level of orbitals, leading to the time-dependence in MO coefficients. It results in the TD-SCF method, e.g., Real-Time Time-Dependent Density Functional Theory RT-TDDFT in case of the KS-DFT ansatz for the wave function. If one assumes time-independent electronic configurations (e.g., Slater determinants) built on time-independent MOs allowing CI coefficients evolving in time one gets the Time-Dependent (TD-CI) method. Uniting both approaches results in the Time-Dependent (TD-MCSCF) technique [202], which in the most general form can be also viewed as an Multi-Configurational Time-Dependent (MCTDHF) method [203].
The advantage of solving the TDSE is that core-excited states can be accessed right away by choosing the proper frequency of the oscillating external field. Further, for pulsed excitation one can tune the width of the frequency window to be investigated. [204] Besides focusing on the absorption of light, shaping the envelope of the pulse opens a way to discuss the excitation of intricate superpositions of eigenstates and quantum control of the system’s dynamics in real time.
Various real time methods have been applied for simulations of the core spectra and dynamics in the core-excited states such as RT-TDDFT [204, 205, 206], TD-CI [207, 208, 209], non-Dyson Algebraic Diagrammatic Construction (ADC) [210], real-time time-dependent Equation Of Motion (EOM)-CC [211], and -TD-RASCI.[200, 212, 213]
3.5 Response function methods
While real time-domain propagation methods are emerging, perturbative response function approaches enjoyed considerable popularity over the last decades. Since it falls outside the main scope of the review, we will briefly discuss non-SCF single-reference methods only (for an extensive review, see Reference 12). Response approaches are based on a perturbation expansion of the time-dependent observables in frequency domain [214], , where an external monochromatic field is assumed. In lowest order only the term linear in the field is retained and the limit provides the basis for Linear Response (LR) methods. For instance, the absorption spectrum is obtained from the Fourier-transformed first-order response or correlation function where the operator is the dipole moment operator. Moreover, by using correlation functions involving different operators one can generate the full realm of various propagator methods [215]. The main advantage of the so-called polarization propagator methods and closely related Green’s function approaches [216] is that the spectrum is determined directly from poles and residues of the propagator. This allows to avoid explicit calculation of the initial and final states and thus to minimize the error in differential correlation.
Among the propagator methods the ADC scheme [217] and in particular its second-order variant ADC(2) has been actively developed recently. As far as core level calculations are concerned, including CVS, it goes back to the 1980’s [104, 105] with early applications to X-ray spectra of small molecules reported in References 109, 218, 110. More complex recent applications have been reported for XAS [219, 220], RIXS [221], PES [222, 223, 224] and in combination with the Stieltjes method or a grid-based representation of the ionized electron to AES and ICD [225, 226].
The most widely used response function approach is LR-TDDFT whose versatility in predicting -edge spectra of molecules containing main group elements is well documented, for a review see Reference 10. In the -edge case, the relevant electronic states are essentially single-configurational and thus can be safely treated by a single-reference method. Being related to the CIS method [227], LR-TDDFT needs to incorporate the same tricks as discussed for the CI method. These include configuration space or energy window restriction [107, 108] or modification of the diagonalisation procedure itself leading to an energy specific solver.[112, 116, 114]
LR-TDDFT is prone to problems due to the so-called self-interaction error. It correlates with the overlap between the occupied and unoccupied orbitals, corresponding to an electronic transition [228]. This overlap is very small for core-to-valence and especially for core-to-Rydberg transitions [229] due to different radial sizes of very confined core and notably more delocalized virtual valence and Rydberg orbitals. Consequently, the energies of core excitations are usually substantially underestimated by LR-TDDFT with standard XC-functionals [230]. Similar to charge-transfer valence-valence transitions [231], energies of core transitions were found to be sensitive to the amount of exact exchange in the XC-functional [230, 116]. Here, exact exchange refers to orbital-dependent exchange integrals, analogous to the HF term, but calculated using Kohn-Sham orbitals. [65, 232] As a remedy, functionals with an adjusted amount of exact exchange have been suggested, such as BH0.57LYP for carbon -edge transitions [230] and the CV-B3LYP for -edges of second period elements, where core regions are described by BHHLYP (50% HF exchange) and valence with B3LYP (20% HF exchange). [233] Another, more gentle way is to split the Coulomb operator into short-range and long-range parts depending on the interelectron distance and switch exact exchange between different values with a smooth function. Most important for core spectroscopy is the short-range correction as introduced in References 234, 229, leading to a remarkable improvement of -edge transition energies. For other correction methods designed for prediction of X-ray spectra, see also References 235, 236, 237.
LR-TDDFT has been successfully applied to main group element -edges [10, 112, 116, 238] demonstrating good accuracy and predictive power. Attempts have also been made to calculate -edges of TMs. [107, 239, 108, 204] In particular, the REW-TDDFT method has been used to demonstrate different kinds of non-linear X-ray spectroscopies such as stimulated RIXS to probe the dynamics of electronic and nuclear wave packets in the excited states. [240, 241]
Another way to treat correlation on top of SCF is provided by the Coupled Cluster (CC) technique. In contrast to the linear form of the CI expansion, it employs an exponential ansatz for the excitation operator such that the ground state wave function can be obtained, e.g., from the HF solution as . The operator generates singly, doubly, and so on excited configurations similar to the CI method. One also does a hierarchical truncation according to excitation level as in the truncated CI discussed above. Its immense success for ground state studies [242] suggests an extension to the excited electronic states.
The CC approach can be generalized to treat excited states within the EOM [243] or LR [244] formalisms, giving the same energies but different transition moments [245]. The advantage over conventional CI is that the excited state wave function is not a linear combination of the determinants but has an exponential counterpart inherited from the reference state. Essentially, it can be obtained from Equation (8), where and has the same structure as in the CI case. The operator effectively accounts for higher-order excitations due to its exponential form and treats differential correlation with respect to the ground state.
Both EOM and LR-CC have been applied to -edge core spectra recently. For example, EOM-CC with singles and doubles excitations has been applied on top of the HF reference with a core-hole obtained with maximum overlap method for emission spectroscopy of water [246]. Alternatively, the energy specific algorithm, as discussed in Section 3.3.1, was used together with EOM-CC for a set of molecules containing main group atoms [117]. Different approaches have also been used to get core-ionized potential energy surfaces in case of the ClF molecule [247]. In Reference 248, a two-step approach is utilized, describing separately the effect of core-hole relaxation and an excited bound electron by means of electron-attachment variant of EOM-CC. A simpler reduction of the excitation space is introduced for the LR-CC method.[249, 250]
The EOM-CC and LR-CC methods fail if the ground state cannot be described by a single-reference CC wave function. In contrast to CI, the extension of the CC idea to the multi-reference case is non-trivial due to the problem with definition of a proper vacuum state leading to a number of technical complications, for review see Reference 251. However, various flavors of multi-reference CC have been applied to core-spectra of small main group molecules recently. [252, 253, 254]
3.6 Relativistic effects
Relativistic effects are traditionally considered for the valence chemistry of heavy elements. [255, 256] However, they also play an important role for core electrons as they experience the (almost) unscreened Coulomb potential of the nucleus. For instance, these effects make and core electrons even more tightly bound to nuclei resulting in an additional stabilization of the and levels and destabilization of the , , and ones. [257] This adds different energetic shifts, e.g., to -edge () and -edge () transitions. Further, SOC is determining the characteristic shape of -edge spectra.
A rigorous treatment of relativistic effects is provided by the Dirac equation, for reviews see References 258, 259, 260, 261, 262, 263. Here, the wave functions are four-component vectors, corresponding to positive (electronic) and negative (positronic) energy states. Four-component theory, equipped with a proper electron correlation treatment, is without doubt the “golden standard” of accuracy in relativistic calculations. But, the considerable computational costs limit their applicability to atoms or small (highly-symmetric) molecules. Examples of four-component core calculations include (i) DFT-CI based calculations of MO (M=Ti, Fe, Ni, etc) [128, 129, 130, 131, 132] and UO[135] clusters, (ii) LR-TDDFT calculations of - and -edges of H2S [264] and Si, Ge and S halides [265], and (iii) ADC-based investigations of XeFn -edge spectra [266] as well as of photoionization of (HI)2 and (LiI)2 [267] and of MnO. [136]
Accounting for negative energy states contributes little to the physical and chemical processes most relevant for molecules containing no very heavy elements. This situation is reflected by the fact that in case of low and medium nuclear charge there are solutions of the Dirac equation having dominant contributions from positive energy states. Various techniques have been designed to decouple both components such that the small component is implicitly included in calculation of the large component. This can be done in principle within any predetermined level of accuracy yielding so-called exact two-component methods.[268] Examples include the Zero-Order Regular Approximation (ZORA) [269] and Infinite-Order Two-Component (IOTC) methods [270], which are often used for core-level calculations. The transition from four- to two-component theory can also be done via the Douglas-Kroll-Hess protocol [271, 272] which applies a sequence of unitary transformations eliminating the coupling order-by-order and being exact in the limit of an infinite number of transformations. This yields a block-diagonal Hamiltonian represented as a perturbational sum, where in addition to block diagonalization the spin-dependent and spin-free parts can be separated, thus building a basis for distinguishing scalar and, e.g., SOC relativistic effects. In practice, however, one transforms only one-electron terms as the transformation of two-electron integrals is very involved and is often sacrificed for the sake of efficiency. This might lead to inaccuracies in splittings as discussed in Reference 206.
Such a block-diagonalization of the Dirac Hamiltonian with subsequent neglect of the negative energy block leads to two-component schemes. Here, SOC is usually taken into account at the stage of the variational SCF orbital optimization leading to the so-called -coupling limit. The justification behind is that for high- elements SOC becomes comparable to the largest two-electron integrals and hence significantly influences the one-particle MO basis. Examples of 2-component calculations include (i) ZORA DFT-CI based calculations of oxides [133], (ii) ZORA LR-TDDFT calculations of - and -edges of Ti complexes [273, 274], of chemical shifts in emission lines of YbFn [275] and Nb oxide [276], XAS of gas phase TM oxochlorides [277], short-chain oligothiophenes XAS and XPS [238], (iii) real-time DFT calculations of XAS for TM chlorides and oxochlorides [206], (iv) RASSCF IOTC calculations of XPS of noble gases. [278]
For compounds containing low and medium elements the most pragmatic approach is to neglect the spin-dependent part of the Hamiltonian in the two-component scheme, resulting in a one-component approach. It corresponds to the usual solution of the Schrödinger equation with the difference that scalar relativistic effects are taken into account. The latter have been shown to be crucial for predicting the correct energetics of X-ray spectra. [264, 126, 279] Typically, scalar relativistic effects are treated at the second-order Douglas-Kroll-Hess level.
For -edge spectra the magnitude of SOC is larger than typical energy level spacings and thus it must be taken into account. Within the one-component protocol SOC is included as a perturbation of the spin-free problem in the framework of the so-called quasi-degenerate perturbation theory.[280] Here, the SOC Hamiltonian, , is represented in the basis of spin-free spatial wave functions multiplied by an appropriate spin part. The SOC-coupled eigenenergies and eigenstates are obtained via matrix diagonalization and have the form of linear combination of spin-free states, , with complex coefficients
[TABLE]
In practice for core-level calculations, it is advisable to use as many states for the zero-order state basis set as are available for a given AS.[174] Note that since in this approach not the total momenta of the individual electrons are coupled as in the -limit, but rather the total angular momenta and spins of many-body electronic states, it is referred to as -coupling limit.
The expectation values and transition matrix elements are then calculated for the SOC-coupled eigenstates . Therefore, formally spin-forbidden transitions may gain notable intensity in X-ray spectra which is borrowed from the allowed ones via SOC-mixing [157, 21, 44]. In passing we note that this mixed nature of SOC eigenstates is also reflected in the structure of the DOs, which become complex orbitals having spin-up and spin-down components due to mixing of different multiplicities. [53]
For such an a posteriori treatment of SOC the operator in the Breit-Pauli form [281] is commonly used. It can be represented in terms of one-, , and two-electron, , operators As usual, multi-center two-electron terms require the most effort in the calculation of matrix elements in the basis of spin-free electronic states. However, they cannot be neglected (especially for light elements) without notable decrease in accuracy.
Let us consider a matrix element of between two Slater determinants differing in the occupation of the th and th orbitals, whereas occupations of other orbitals, , are the same, i.e.
[TABLE]
An efficient mean-field type approximation is to fix the occupation numbers to, e.g., ground state HF values [282]. Further, a speedup is achieved when the molecular mean-field is approximated by the superposition of the mean-fields of the constituent atoms. Accounting for the locality of SOC, decaying much faster with distance than the Coulomb interaction, and the fact that the largest couplings are expected for core orbitals which are close to the respective nucleus, one can stay with one-center two-electron terms only. This completely separates molecular SOC into additive atomic contributions. The are standardly calculated by the HF method for neutral atoms. This approximation is called Atomic Mean-Field Integrals (AMFI) [283] and appeared to be quite accurate [280]. Numerous applications in the field of X-ray spectroscopy confirmed that this technique provides remarkable accuracy not only for valence but also for core-electrons as is evidenced by the prediction of energy separations and general shape of XAS, XPS, and RIXS. It is the basis of the SOC treatment in the ROCIS-DFT method [126] and of all multi-reference calculations discussed in Section 4.
Finally, we comment on the atomic basis sets which can be employed for core-level calculations. The usual way of implicit inclusion of relativistic effects via core-potentials [284] is naturally not suitable for this purpose. In this approach the core electrons are removed from the respective atoms, essentially simplifying the nodal structure of the MOs, and their effect is accounted by an effective pseudopotential and only transitions between valence levels can be rigorously calculated. However, there have been reports of two-component calculations using pseudopotentials with an a posteriori restoration of the core nodal structure. [275, 276] However, in general utilization of all-electron basis sets is prerequisite for explicit calculations of X-ray spectra. Moreover, basis sets need to contain very tight functions to describe the core region; to increase the accuracy one might even de-contract otherwise contracted core functions. [156] For this purpose, basis sets specifically designed for correlation and at the same time relativistic treatment from the families cc-pwCVXZ-DK [285] and ANO-RCC [286] are usually utilized. For calculations of core spectra of compounds containing atoms from the first three periods the IGLO bases have also been developed. [287]
4 APPLICATIONS
4.1 Unraveling the electronic structure at the metal-solvent interface
Solvated transition metal ions have been shown to provide a model system for metal-ligand interactions and in particular for the electronic structure at the metal-solvent interface [157, 171, 44]. In Figure 4 XAS and RIXS spectra for the prototypical ferrous [Fe(H2O)6]2+ complex are shown. This weak-field high-spin (quintet) complex has configuration and a Jahn-Teller distorted tetragonal symmetry. The and orbitals of the iron ion form bonding and antibonding orbitals of symmetry with the orbitals of the water molecules. The other orbitals of symmetry form -type MOs with the water and orbitals, where the mixing is notably weaker as compared to the orbitals. That is why they can be denoted as non-bonding MOs. Note that symmetry labels assume octahedral symmetry for simplicity.
Calculated and measured absorption spectra are in rather good agreement as shown in Figure 4a [171, 44]. Focusing on the edge, analysis of the core-excitations reveals that almost all intense transitions are of type, but with substantial admixture of character. Notice that this mixing of and orbitals strongly depends on the considered ion. For Fe3+, for instance, it has been found that the low-energy part of the edge is dominated by -type transitions [157]. The multiplicity of the core-excited states strongly depends on energy [200, 212, 213]. For energies up to about the maximum spin-conserving quintet-quintet transition dominate. The region up to 715 eV is characterized by strongly mixed quintet-triplet transitions. The -edge is comprised essentially of spin-forbidden quintet-triplet transitions, which borrow intensity by mixing with spin-allowed ones.
RIXS spectra of [Fe(H2O)6]2+ shown in Figure 4b can be used to characterize the electronic structure in more detail. [171] For excitation across the -edge up to its maximum, most prominent is the elastic Rayleigh feature at the energy loss of 0 eV. Inelastic peaks at about -1.2 eV energy loss are due to emission from the doubly occupied orbital and the respective shake-off satellites. With increasing excitation energy the intensity of inelastic peaks also increases as compared to the elastic ones. This evidences the fact that below -1.2 eV valence excited states are of triplet character. Essentially, the excitation occurs from the ground quintet state with spin to a spin-mixed core-excited state (see also Section 4.3 and Figure 6b). The features with energy losses from 0 to -1.2 eV correspond to transitions. Due to the mixing in the intermediate state, the new formally spin-forbidden radiative channels are opened. Since with increasing energy core-excitations are characterized by an increasing quintet-triplet mixing, the elastic peak decreases and spin-forbidden features grow in intensity. Different spin states of iron complexes have been also studied in Reference 288.
The spectra shown in Figure 4b are essentially due to local metal-centered transitions. RIXS features due to metal to ligand charge transfer are showing up around -15 eV energy loss, but have small intensity. Hence, the mixing between the iron and water orbitals at the -edge is of minor importance, evidencing the predominant ionic character of the metal-ligand bond. Respective results obtained for probing the oxygen -edge, yield rather similar spectra for [Fe(H2O)6]2+ and for pure water, thus supporting the weak mixing (ionic) picture. [171]
Recording a true XAS in transmission mode is a difficult task for optically-thick condensed phase samples, such as solutions, even if microjets are utilized. Therefore, experimentally one usually obtains it in the PFY detection mode, Equation (3) [26]. The natural question arising in this respect is how large are the distortions of the PFY relative to the true XAS and how they can be reduced. [289, 290] Figure 4a shows the two experimental PFY spectra resulting from (PVFY) and (PCFY) dipole-allowed radiative relaxation of the initially created core hole, [44] see panel (c). The comparison of the calculated XAS and PFY spectra stemming from these two channels evidences that the core variant PCFY should be a more reliable probe of XAS in contrast to PVFY. The reason for the distortions of the latter has been identified as being the opening of new radiative relaxation channels via strong SOC in the core-excited state [44], cf. discussion in previous paragraph for RIXS.
The RIXS spectra record photon-in/photon-out events and thus are determined by dipole selection rules as is evident from the Kramers-Heisenberg Equation (2). In Reference 44, it has been demonstrated for [Fe(H2O)6]2+ that a back-to-back analysis of RIXS and photon-in/electron-out RPES, which is free from such selection rules, provides complementary information on the electronic states. In particular, comparing intensities, one can obtain information on the competition between radiative and non-radiative decay channels of the core-excitation.
4.2 A new look at metal-ligand bonding
XAS and RIXS spectroscopy applied to metal-ligand complexes can provide an atom-specific, chemical state selective, crystal field symmetry and orbital symmetry resolved description of the electronic structure [18, 19, 20, 175, 31]. In References 18, 19, 176, it has been shown that this allows for scrutinizing traditional chemical concepts of metal ligand bonding. Within the valence bond structure theory, metal-ligand bonding is described by the so-called -donor/-acceptor mechanism. For example, the formation of the covalent Fe–CO bonds in Fe(CO)5 is accompanied by charge donation from the orbital of CO to the orbital of the Fe and back donation to the orbital of CO. In terms of the MOs of the metal-ligand complex, the donation/back donation is expressed by the mixing of the respective Fe and CO orbitals. The extent of this mixing can be addressed by identification of charge-transfer transitions in RIXS spectra.
For illustration, let us consider the RIXS amplitude for the Fe -edge in the simplified picture considering only MOs. The dipole operator can be written as Suppose that the character of the absorbing state is given by the MO coefficient of a virtual orbital, to which the core electron is excited, i.e. . Thus, core excitation projects out all MO contributions apart from the due to locality and relatively strict dipole selection rules. The character of the respective state for emission shall be given by the coefficient of the occupied orbital from which the core hole is refilled, i.e. . Hence, the RIXS amplitude in Equation (2) becomes . This suggests that comparing the intensities of different RIXS channels provides access to the character of the involved MOs. This has been demonstrated in References 18 and 19 for Fe(CO)5 and [Fe(CN), respectively. Needless to say that this reasoning applies to other absorption edges as well.
In Figure 5a, exemplary RIXS spectra for the Fe -edge and the O -edge of Fe(CO)5 are shown. In case of the Fe -edge, there are, on the one hand side, local type radiative relaxation transitions of ) to character (see density difference plots in panel (b)). On the other hand side, inelastic peaks of ) to character are clearly discernible for different excitation energies. From the density difference plots (note especially the negative part shown in blue) in Figure 5b the ligand to metal charge-transfer character of these core-hole refill transitions becomes apparent. Based on the simple model outlined before the intensity ratio of these two types of transitions can be used to characterize the degree of orbital mixing. The same argument applied to the O -edge leads to the identification of the ) to 1s transition as marker bands for the metal-to-ligand charge-transfer character. Taking both results together gives a means for quantifying the MO composition in terms of atomic orbitals or equivalently -donor/-acceptor character of the metal-ligand bonding. In order to establish the general nature of this argument, the [Fe(CN) complex has been investigated using RIXS spectroscopy [19]. Comparing band intensities between [Fe(CN) and Fe(CO)5 it was verified that CN- is a stronger -donor but weaker acceptor than CO, which is in accord with chemical intuition.
4.3 Tracing catalytic activity
Following the building principles of natural water-oxidation systems, manganese-based nanostructured catalysts have attracted considerable attention.[291, 292, 293] X-ray spectroscopy is sensitive to the oxidation state of metal atoms. In Reference 21, this has been shown in a spectroscopic investigation of the redox evolution of manganese systems relevant for water photo-electro-oxidation. [294, 295] In the experiment, Mn precursors with different oxidation states have been doped into a nafion matrix, where all converted into MnII. These species can be oxidized upon application of a bias and reduced under visible light illumination. To follow this reaction, XAS as well as RIXS spectra have been calculated for the model compounds , , and (cf. Figure 6) [21]. Although the actual experimental systems are much more complex, the local nature of the soft X-ray probe enables comparison with these model complexes. It turned out that for the present systems XAS is much more sensitive to the oxidation state than RIXS, hence only XAS spectra are shown in Figure 6. Comparison between experiment and simulation is hampered by the fact, that the former are done for mixtures of species having different oxidation states. Nevertheless, both spectra not only show strong similarities, but also a distinct dependence on the oxidation state.
The XAS spectra show the typical splitting between the and bands which is about 11 eV, almost independent on the oxidation state. It follows that the SOC constant is about 7.3 eV. This large SOC constant leads to considerable coupling of core excited states of different spin, i.e. if denotes the spin of the electronic ground state, core-excited states are mixtures of , , and spin states. Any interpretation of the XAS spectra has to account for this fact. The decomposition of the spin-orbit coupled wave functions into contributions is given in Figure 6b for the different species. Whereas the valence excited states show little mixing, i.e. they are either of or character, the situation is considerably more complex for core-excited states, with the relative stability of spin-states being strongly affected by the core hole; for a more detailed discussion of the dependence on the actual species, see Reference 21.
The XAS spectra can be assigned as follows (in the following we discuss the band only, the band has a similar assignment, although the spin mixing is even stronger). (i) MnII: Here, the low-energy shoulder (a) is of dominantly character, comprising transitions. The main band (b) is shaped by intense transitions with . The peaks denoted by (c) are due to mostly spin-forbidden transitions of mixed character. (ii) MnIII: The spectrum is dominated by transitions, but also contains shake-up transitions in region (b). Starting from the main peak (b) spin-mixing starts to play a role until the essentially spin-forbidden region (c). (iii) MnIV: the spectrum is similar to MnIII, but also contains notable contribution from transitions in regions (b,d,e). The influence of the PT2 correction to the RASSCF is also illustrated in Figure 6. Apart from an overstabilization of the ground state which needs to be compensated by a constant shift of the spectrum, its shape hardly changes as can be seen in Figure 6 what is an evidence of mostly ionic metal-ligand bond character.
Analysis of inelastic RIXS spectra revealed that for all species most features are due to spin-forbidden transitions, mediated by core-excited states [21] similar to what has been discussed for iron complex in Section 4.1. Note that intricate spin mixing of the core-excited state gives rise to the exciting ultrafast spin-flip dynamics which can be triggered by XFEL and HHG X-ray pulses [200, 212, 213].
4.4 Aggregation and the challenge of multiple metal centers
Straightforward extension of the RASSCF-based approach to the calculation of core-excited states of multi-center metal compounds would require active spaces that go beyond the capabilities of current hard- and software. Even if one would be able to perform such a calculation, the spectra would consist of hundreds of thousands of transitions, which would render any detailed interpretation of such spectra a challenging task. In order to address this problem, we have recently developed an approach which follows the logic of the so-called Frenkel exciton model. [195, 196] The latter is used to describe the coupling of low-lying valence excitations in molecular aggregates and crystals, i.e. in situations where monomers assemble as a consequence of the van der Waals interaction. Since all what is needed is a localized excitation, the Frenkel exciton picture might apply to the present case of core-holes even if multiple metal centers in covalently bound complexes are considered.
The principal idea is to introduce metal center specific excitations which are in turn coupled by the Coulomb interaction. Thus, the Frenkel exciton Hamiltonian reads
[TABLE]
where the states are eigenfunctions of the monomeric Hamiltonian, i.e. , and the Coulomb coupling can be expressed, e.g., in dipole approximation
[TABLE]
where is the transition dipole matrix element and the vector connects monomers and .
The approach is illustrated in Figure 7c for the example of a dimer. The different monomeric state manifolds will be denoted as ground , valence-excited, , and core-excited, , states. Equation (20) contains couplings between all possible transitions in the dimer system. In the following, we will make use of the fact that the actual observables of interest, namely XAS and RIXS spectral amplitudes, are of first and second order with respect to the interaction with the external field. This suggests to employ either a one- or two-particle basis as shown in Figure 7c. In the former, states of the type and () are incorporated, while the latter includes in addition states of type () and is in principle exact for the dimer. Note that this effectively corresponds to a CISD-like treatment of the interaction between monomers in the composite system with X-ray specific preselection of configurations, while monomers are treated within the usual RASSCF procedure. In order to make this approach computationally feasible, a number of approximations can be introduced as detailed in Reference 196. Most important in this respect is the preselection of core-excited states to construct the monomeric basis functions, which is based on an energy window criterion for the excitation light pulse.
In a proof-of-concept study, this approach has been applied to investigate the signatures of aggregation of hemin in solution. [196] Hemin shows a solvent dependent aggregation behavior, staying monomeric in polar solvents like ethanol or DMSO, while it forms dimers in water solution. Experimentally, the effect of aggregation was addressed by means of soft X-ray Fe -edge XAS as well as by NXES and RIXS on the example of DMSO and aqueous solutions. [296, 297] The general shape of the spectra for both cases was quite similar and the pronounced difference in broadenings for RIXS as well as a 1.3 eV energy shift in off-resonant NXES was attributed to aggregation. However, judging the effect of aggregation one should account for the labile equilibrium between different species in solution that also could be a source of spectral changes.
In Figure 7a, XAS as well as selected RIXS spectra are shown for the bare hemin molecule ([Heme B-Cl]0) as well as for the case where the axial Cl- ligand has been exchanged by a water molecule ([Heme B-H2O]+). The XAS spectrum shows a distinct sensitivity of the band to the type of ligand. In the present case, changing the ligand from Cl- to H2O causes a splitting of the peak in two components at 710.0 eV and 710.9 eV. While the main peak is due to and excitations, the splitting results from the energetic lowering of with respect to transitions.
The RIXS spectra of the monomer show a trend similar to the one discussed in Section 4.1. For excitation energies above 720 eV, the core-excited states are mostly of quartet type. Here, the most intense emission goes to valence-excited states with high quartet contributions. The RIXS spectra for both species differ mostly in the inelastic peaks, whereas the elastic peak has a comparable intensity in both spectra.
The comparison of [Heme B-Cl]0 dimer and monomer spectra is shown in Figure 7b. The XAS spectra are essentially identical, i.e. there is almost no effect due to dimerization. This can be attributed to the intensities of the metal and transitions relevant for -edge X-ray spectra that are lower (due to smaller radial overlap and dipole selection rules) than those of the and transitions usually discussed in the case of aggregates of organic dyes. However, distinct fingerprints of dimerization can be seen in RIXS, in both the inelastic and elastic features. Overall, the magnitude of the effect of dimerization is comparable to the differences observed upon ligand exchange in the monomer. Hence, the unequivocal identification of spectral features due to solvent dependent dimerization is not a straightforward task.
5 CONCLUSIONS
There is a plethora of methods which have been developed to calculate core-excited states for prediction of various X-ray spectroscopic signals. For TM compounds beyond the -edge, multi-reference approaches provide the methods of choice as they can deal with the intricate multiplet structure shaped by static and dynamic correlations and substantial SOC. However, limitations are due to the methods’ complexity and associated computational effort. Here, active space approaches and in particular RASSCF are playing a prominent role as they are, in principle, exact and systematically improvable by increasing the size of the active space. In practice, however, methods like RASSCF cannot be considered as a “black box” due to the strong influence of the choice of the AS. Recent progress in the field of Density Matrix Renormalization Group (DMRG) and automated AS selection certainly mitigates this problem. [298] For polymetallic complexes, merging established ideas of electronic structure theory with models developed in other areas such as the Frenkel exciton Hamiltonian could broaden the range of systems for theoretical X-ray spectroscopic studies.
To date, there is a considerable record of theoretical and experimental applications of frequency-domain X-ray techniques to study TM complexes in various environments. With the emerging explicit time-domain techniques, either using XFEL or HHG facilities, nonlinear spectroscopy in the X-ray domain will play a role for unraveling electron dynamics, comparable to that of UV/Vis in case of (non-)Born-Oppenheimer electron-nuclear dynamics. [241]
ACKNOWLEDGEMENTS
We would like to acknowledge the financial support from the Deutsche Forschungsgemeinschaft via grant No. BO 4915/1-1 (S.I.B.) and Ku952/10-1 (O.K.) and the Deanship of Scientific Research (DSR), King Abdulaziz University, Jeddah, (Grant No. D-003-435).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Chergui M. Time-Resolved X-Ray Spectroscopies of Chemical Systems: New Perspectives. Struct Dyn. 2016;3:031001.
- 2[2] Young L, Ueda K, Gühr M, Bucksbaum PH, Simon M, Mukamel S, et al. Roadmap of Ultrafast X-Ray Atomic and Molecular Physics. J Phys B. 2018;51:032003.
- 3[3] Stöhr J. NEXAFS Spectroscopy. Berlin, New York: Springer-Verlag; 1992.
- 4[4] de Groot F, Kotani A. Core Level Spectroscopy of Solids. Boca Raton: CRC Press; 2008.
- 5[5] Milne CJ, Penfold TJ, Chergui M. Recent Experimental and Theoretical Developments in Time-Resolved X-Ray Spectroscopies. Coord Chem Rev. 2014;277-278:44–68.
- 6[6] Chergui M, Collet E. Photoinduced Structural Dynamics of Molecular Systems Mapped by Time-Resolved X-Ray Methods. Chem Rev. 2017;117:11025–11065.
- 7[7] Rehr JJ, Ankudinov AL. Progress in the Theory and Interpretation of XANES. Coord Chem Rev. 2005;249:131–140.
- 8[8] Leng X, Jin F, Wei M, Ma Y. GW Method and Bethe-Salpeter Equation for Calculating Electronic Excitations: GW Method and Bethe-Salpeter Equation. Wiley Interdiscip Rev: Comput Mol Sci. 2016;6:532–550.
