Bayesian Hamiltonian Selection in X-ray Photoelectron Spectroscopy
Yoh-ichi Mototake, Masaichiro Mizumaki, Ichiro Akai, Masato Okada

TL;DR
This paper introduces an automated Bayesian approach for selecting and parameterizing effective Hamiltonians in core-level XPS spectra analysis, improving accuracy and providing uncertainty quantification.
Contribution
The paper presents a novel Bayesian model selection framework using exchange Monte Carlo sampling for automatic Hamiltonian selection and parameter estimation in XPS spectra analysis.
Findings
Successfully applied to Ce and La compounds' 3d XPS spectra
Confirmed the method's selections align with physical knowledge
Enabled uncertainty evaluation of the estimated parameters
Abstract
Core-level X-ray photoelectron spectroscopy (XPS) is a useful measurement technique for investigating the electronic states of a strongly correlated electron system. Usually, to extract physical information of a target object from a core-level XPS spectrum, we need to set an effective Hamiltonian by physical consideration so as to express complicated electron-to-electron interactions in the transition of core-level XPS, and manually tune the physical parameters of the effective Hamiltonian so as to represent the XPS spectrum. Then, we can extract physical information from the tuned parameters. In this paper, we propose an automated method for analyzing core-level XPS spectra based on the Bayesian model selection framework, which selects the effective Hamiltonian and estimates its parameters automatically. The Bayesian model selection, which often has a large computational cost, was…
| Parameter | |||||
|---|---|---|---|---|---|
| La2O3 | 12.5 | 0.57 | 10.5 | 12.7 | 0.5 |
| CeO2 | 1.6 | 0.76 | 10.5 | 12.5 | 0.7 |
| Number of peaks | |||||||
|---|---|---|---|---|---|---|---|
| 12.5 | 2 | -2.8 | 2.72 | 12.48 | 0.3850 | 0.2512 | 0.0005 |
| 10.08 | 2-3 | -4.53 | 1.94 | 8.15 | 0.2824 | 0.3509 | 0.0033 |
| 7.66 | 3 | -6.67 | 1.25 | 4.3746 | 0.2330 | 0.3581 | 0.0456 |
| 5.23 | 3 | -9.03 | 2.74 | -0.92 | 0.2298 | 0.2414 | 0.1654 |
| 2.81 | 3 | 2.91 | -11.5 | -3.93 | 0.2372 | 0.2623 | 0.1372 |
| 1.6 | 3 | 3.29 | -5.22 | -12.71 | 0.1937 | 0.1575 | 0.2854 |
| b | |||||||
|---|---|---|---|---|---|---|---|
| Min | 0.0 | -20.0 | 0.0 | 0.0 | 0.0 | 0.01 | -5.0 |
| Max | 20.0 | 20.0 | 4.0 | 20.0 | 20.0 | 1.0 | 5.0 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 12.5 | 14.68 | 13.6534.53 | 18.56 | 18.9794.66 | 8.81 | 10.7730.62 | 16.07 | 18.1677.65 | 18.71 | 17.8977.3 | 15.68 | 16.3258.92 |
| 10.08 | 10.08 | 8.85 | 8.930.28 | 8.72 | 8.418.92 | 7.94 | 7.2127.91 | 4.64 | 6.6533.98 | 5.91 | 9.4224.6 | 2.33 | 6.239.8 |
| 7.66 | 7.66 | 7.62 | 7.640.0 | 7.58 | 7.620.0 | 7.62 | 7.580.01 | 7.65 | 7.590.03 | 7.32 | 7.560.05 | 7.48 | 7.580.07 |
| 5.23 | 5.23 | 5.22 | 5.210.0 | 5.15 | 5.170.0 | 5.1 | 5.140.01 | 5.13 | 5.10.01 | 5.04 | 5.050.02 | 5.02 | 5.020.03 |
| 2.81 | 2.81 | 2.82 | 2.810.0 | 2.8 | 2.810.0 | 2.79 | 2.80.0 | 2.82 | 2.80.0 | 2.75 | 2.810.01 | 2.71 | 2.810.01 |
| 1.6 | 1.6 | 1.6 | 1.60.0 | 1.62 | 1.590.0 | 1.6 | 1.60.0 | 1.56 | 1.60.0 | 1.61 | 1.60.0 | 1.63 | 1.590.0 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 12.5 | 14.31 | 5.8253.4 | 18.0 | 5.7855.52 | 9.68 | 5.8152.92 | 15.92 | 5.9356.29 | 17.91 | 5.9254.12 | 15.33 | 5.9951.92 |
| 10.08 | 12.5 | 11.61 | 11.660.14 | 11.53 | 11.325.58 | 10.9 | 10.6817.29 | 8.79 | 9.8523.41 | 9.54 | 9.6625.75 | 7.6 | 9.7325.67 |
| 7.66 | 12.5 | 12.46 | 12.480.0 | 12.43 | 12.460.0 | 12.48 | 12.430.01 | 12.46 | 12.440.02 | 12.19 | 12.40.03 | 12.35 | 12.390.05 |
| 5.23 | 12.5 | 12.48 | 12.490.0 | 12.45 | 12.470.0 | 12.42 | 12.450.0 | 12.42 | 12.430.01 | 12.41 | 12.40.01 | 12.39 | 12.390.01 |
| 2.81 | 12.5 | 12.5 | 12.50.0 | 12.49 | 12.50.0 | 12.47 | 12.490.0 | 12.49 | 12.50.0 | 12.46 | 12.490.01 | 12.51 | 12.480.01 |
| 1.6 | 12.5 | 12.48 | 12.490.0 | 12.51 | 12.480.0 | 12.43 | 12.470.0 | 12.39 | 12.480.01 | 12.37 | 12.430.01 | 12.45 | 12.420.01 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 10.5 | 9.75 | 13.4311.0 | 12.41 | 14.9110.46 | 11.83 | 18.6631.3 | 15.57 | 14.5213.12 | 8.93 | 14.7115.13 | 11.71 | 13.3714.95 |
| 10.08 | 10.5 | 11.13 | 11.050.13 | 11.32 | 11.417.87 | 10.68 | 15.178.03 | 12.34 | 17.3811.87 | 11.1 | 18.7320.31 | 13.67 | 18.9322.6 |
| 7.66 | 10.5 | 10.52 | 10.520.0 | 10.6 | 10.540.01 | 10.62 | 10.550.01 | 10.5 | 10.560.03 | 10.64 | 10.580.05 | 10.58 | 10.610.08 |
| 5.23 | 10.5 | 10.51 | 10.520.0 | 10.57 | 10.560.0 | 10.59 | 10.580.01 | 10.56 | 10.620.01 | 10.69 | 10.660.02 | 10.71 | 10.680.03 |
| 2.81 | 10.5 | 10.5 | 10.50.0 | 10.51 | 10.490.01 | 10.51 | 10.530.02 | 10.47 | 10.490.04 | 10.58 | 10.540.06 | 10.83 | 10.510.09 |
| 1.6 | 10.5 | 10.46 | 10.480.0 | 10.47 | 10.460.01 | 10.3 | 10.390.02 | 10.28 | 10.380.03 | 10.09 | 10.390.05 | 10.25 | 10.280.08 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 0.76 | 0.77 | 0.750.02 | 0.76 | 0.750.02 | 0.72 | 0.750.02 | 0.75 | 0.750.01 | 0.78 | 0.750.02 | 0.76 | 0.750.02 |
| 10.08 | 0.76 | 0.73 | 0.730.00 | 0.72 | 0.720.01 | 0.71 | 0.80.03 | 0.58 | 0.80.03 | 0.64 | 0.80.03 | 0.42 | 0.80.03 |
| 7.66 | 0.76 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.75 | 0.760.00 |
| 5.23 | 0.76 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.75 | 0.750.00 | 0.75 | 0.750.00 |
| 2.81 | 0.76 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 |
| 1.6 | 0.76 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 | 0.76 | 0.760.00 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.51 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.510.00 | 0.5 | 0.510.00 |
| 10.08 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.49 | 0.50.00 | 0.48 | 0.490.00 | 0.49 | 0.490.00 | 0.5 | 0.490.00 |
| 7.66 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 |
| 5.23 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.51 | 0.50.00 | 0.51 | 0.50.00 | 0.5 | 0.50.00 |
| 2.81 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.52 | 0.510.00 | 0.51 | 0.510.00 | 0.52 | 0.510.00 | 0.5 | 0.520.00 |
| 1.6 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.51 | 0.50.00 | 0.51 | 0.50.00 | 0.5 | 0.50.00 |
| \ | 0.001 | 0.0028 | 0.0046 | 0.0064 | 0.0082 | 0.01 | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| True | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | MAP | MPM | |
| 12.5 | 0.5 | 0.23 | 0.130.21 | 0.06 | 0.340.11 | 0.34 | 0.630.1 | 0.95 | 0.20.18 | 0.15 | 0.810.17 | 0.84 | 0.690.11 |
| 10.08 | 0.5 | 0.3 | 0.320.01 | 0.21 | 0.250.1 | 0.66 | 0.710.12 | 0.05 | 0.930.25 | 0.05 | 0.930.25 | 0.99 | 0.480.08 |
| 7.66 | 0.5 | 0.5 | 0.50.00 | 0.48 | 0.50.00 | 0.49 | 0.490.00 | 0.51 | 0.490.00 | 0.46 | 0.490.00 | 0.47 | 0.490.01 |
| 5.23 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.51 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.510.00 | 0.51 | 0.510.00 |
| 2.81 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.48 | 0.50.00 | 0.5 | 0.50.00 |
| 1.6 | 0.5 | 0.5 | 0.50.00 | 0.5 | 0.50.00 | 0.49 | 0.50.00 | 0.49 | 0.50.00 | 0.5 | 0.490.00 | 0.51 | 0.490.00 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectron and X-Ray Spectroscopy Techniques · Advanced Chemical Physics Studies · Machine Learning in Materials Science
Bayesian Hamiltonian Selection in X-ray Photoelectron Spectroscopy
Yoh-ichi Mototake1
Masaichiro Mizumaki2
Ichiro Akai3,4
Masato Okada1,5
Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8561, Japan1
Japan Synchrotron Radiation Research Institute (JASRI), 1-1-1, Kouto, Sayo-cho, Sayo-gun, Hyogo 679-5198, Japan2
Institute of Pulsed Power Science, Kumamoto University, Kumamoto 860-8555, Japan3
Kyushu Synchrotron Light Research Center, Tosu, Saga 841-0005, Japan4
Research and Services Division of Materials Data and Integrated Systems, National Institute for Materials Science, Sengen, Tsukuba, Ibaraki 305-0047, Japan5
Abstract
Core-level X-ray photoelectron spectroscopy (XPS) is a useful measurement technique for investigating the electronic states of a strongly correlated electron system. Usually, to extract physical information of a target object from a core-level XPS spectrum, we need to set an effective Hamiltonian by physical consideration so as to express complicated electron-to-electron interactions in the transition of core-level XPS, and manually tune the physical parameters of the effective Hamiltonian so as to represent the XPS spectrum. Then, we can extract physical information from the tuned parameters. In this paper, we propose an automated method for analyzing core-level XPS spectra based on the Bayesian model selection framework, which selects the effective Hamiltonian and estimates its parameters automatically. The Bayesian model selection, which often has a large computational cost, was carried out by the exchange Monte Carlo sampling method. By applying our proposed method to the 3 core-level XPS spectra of Ce and La compounds, we confirmed that our proposed method selected an effective Hamiltonian and estimated its parameters appropriately; these results were consistent with conventional knowledge obtained from physical studies. Moreover, using our proposed method, we can also evaluate the uncertainty of its estimation values and clarify why the effective Hamiltonian was selected. Such information is difficult to obtain by the conventional analysis method.
pacs:
Valid PACS appear here
††preprint: APS/123-QED
I Introduction
Core-level X-ray photoelectron spectroscopy (XPS) is a useful means of investigating the electronic state of a strongly correlated electron systemkanamori87 . The XPS spectral distribution is reproduced from its state transition probability. The state transition probability is calculated on the basis of an effective Hamiltonian, which explains complex electron-to-electron interactions in the transition of XPS. The parameters of the effective Hamiltonian are the physical parameters of the measurement target. Therefore, by adjusting the parameters of the effective Hamiltonian so as to reproduce the measured XPS spectrum, the physical parameters of the object material are estimated as the tuned parametersKotani74 ; Kotani85 ; Kotani87 ; Groot08 . Normally, this parameter adjustment is performed manually, and the effective Hamiltonian, which is the premise of the analysis, is set on the basis of the physical consideration of the XPS transition process by researchersKotani74 ; Kotani85 ; Kotani87 ; Groot08 .
There is an information science methodology called spectral deconvolution that regresses a spectrum as a linear sum of unimodal basis functions, such as the Gaussian function. Nagata et al.Nagata12 introduced Bayesian inference to spectral deconvolution by the exchange Monte Carlo method,Hukushima96 which is an efficient sampling method. By Bayesian spectral deconvolution, the quantitative selection of the spectral model, such as the selection of the number of peaks, has been realized. Bayesian spectral deconvolution has been further developed, and the efficient estimation of noise intensitytokuda16 , the analysis of time series spectramurata16 , and a fast Bayesian spectral deconvolution algorithmmototake18 applicable to a high-dimensional spectrum have been realized. Many studies have used Bayesian spectral deconvolution to analyze spectra measured in various scientific fieldskasai16 ; iwamitsu16 ; hong16 . Normally, in spectral deconvolution, the parameters of the unimodal basis, such as the position, intensity, and variance of basis functions, are estimated. Therefore, if the parameters of the physical model do not directly correspond to the parameters of the unimodal basis function, we need an indirect estimation of the parameters. In the analysis of the core-level XPS spectrum, the parameters of the effective Hamiltonian also do not directly correspond to the peak positions or peak intensities. To realize the direct estimation of physical parameters, Bayesian spectral deconvolution methods that build an internal model on the spectrum model have been developedkasai16 ; murata16 . Here, the internal model is defined as a model whose parameters are physical parameters themselves, unlike the parameters of the unimodal basis function such as the mean, intensity, and variance. For instance, Kasai et al. applied the relationship between the protein species and the peak intensity ratio as an internal model, and estimated the protein species directlykasai16 . Also, Murata et al. applied a probability differential equation representing the time evolution of peak positions of the spectrum, and estimated the parameters of latent dynamics directlymurata16 .
In this study, we aim to achieve the automatic selection of an effective Hamiltonian and the estimation of its parameters based on the support of statistical science. By incorporationg the effective Hamiltonian of core-level XPS into the Bayesian spectral deconvolution method as an internal model, we propose a Bayesian spectral deconvolution method that enables the automatic analysis of core-level XPS spectra. Our proposed method is not merely an automated spectral analysis method for estimating the parameters and selecting a model. In Bayesian inference, a physical parameter is treated as a statistical variable. Therefore, the physical parameter is estimated as the probability distribution of the values that the parameter can possibly take, which is called posterior distribution. As a result, we can estimate the parameter and, at the same time, determine its estimation accuracy. This makes it possible to obtain the information necessary for developing measurement plans, such as the number of measurements and the measurement method, to satisfy the required estimation accuracy. In addition, by analyzing the shape of the distribution, we can also discuss the nature of the effective model. For example, we can discuss the uncertainty of the model parameters when fitting the model to the data. This provides information such as the relationship between the expressive ability of the model and the data complexity. It is difficult to obtain this information from simple parameter fitting. Thus, our proposed method can extract useful information for physical discussion from XPS spectra.
In the current methods of analyzing core-level XPS spectra, researchers use effective Hamiltonians such as the cluster modelgroot90 and the impurity Anderson modelgunnarsson83 ; jo88 . The cluster model contains the whole process of core-level XPS spectrum generation, such as the spin-orbit interaction, multiplet effect, and crystal field effect. In the impurity Anderson model, the effect of the conduction band structure is considered on the hybridization interaction between 4 electrons and conduction electrons. In contrast, as the internal model, our proposed method adopts two of the simplest effective Hamiltonians that can represent the XPS spectra of La2O3 and CeO2. The analysis of the core-level XPS spectra started from these effective HamiltoniansKotani74 ; Kotani85 . Since these effective Hamiltonians do not contain many factors, such as the effects of the spin-orbit interaction, multiplet, and crystal field, they cannot be applied to the general core-level XPS spectra of a rare-earth insulator compound. However, the framework of the proposed method does not limit the inner model to these effective Hamiltonians. Even if the effective Hamiltonian is replaced with an arbitrary one in the framework of the proposed method, this simply increases the number of parameters to be estimated and does not change the principle of the framework. This suggests that our proposed method is applicable to general core-level XPS spectra.
Finally, we applied the proposed method to emulated measurement data of La2O3 and CeO2 and their intermediate electron states. As a result, it was confirmed that the two effective Hamiltonians, which are also regarded as effective Hamiltonians in conventional physical studiesKotani74 ; Kotani85 , were selected for the spectrum data of La2O3 and CeO2. Furthermore, from the analysis of the posterior distribution, it was shown that the effective Hamiltonian of CeO2 has a too large degree of freedom to express the spectrum data of La2O3. In addition, we also applied our proposed method to the spectra, which are the intermediate states of a seamless transition from the CeO2 spectrum (three peaks) to the La2O3 spectrum (two peaks). As a result, it was confirmed that the proposed method selects an effective Hamiltonian on the basis of not only the information about the peak number but also other information contained in the effective Hamiltonian. This cannot be realized by existing spectral deconvolution methods applicable to XPS spectra, which select the model on the basis of the peak numberNagata12 ; tokuda16 .
In this way, it was shown that the proposed method can be used to select an effective Hamiltonian that is consistent with physical knowledge. It was also shown that, unlike the simple parameter fitting, the proposed method gives us the knowledge necessary for the discussion of physics.
II Generative Model of Spectrum
In this study, we analyzed spectra that were simulated on the basis of an effective model that is applicable to emulating all types of simplified 4 electron derived 3 core-level XPS spectra of rare-earth insulating compounds. We refer to this effective model as a generative model. The generative model was proposed by Kotani et al.Kotani85 It employs one of the simplest cluster models as the effective Hamiltonian. In the effective Hamiltonian, the effect of the spin-orbit interaction, the multiplet effect, and the crystal field effect are ignored. Moreover, the effect of hybridization between 4 electrons and conduction electrons is simplified because it assumes that there is only one ligand state. Note here that the generative model contains two effective models, which we applied later as the inner model of the spectral deconvolution method. We refer to these effective models as recognition models, which are explained later.
There are three eigenstates of electrons: , and . In the state, there is no electron in the orbital, whereas there is one electron in the state and two in the state. In such a state space of 4 electrons, the effective Hamiltonian of the generative model is given as
[TABLE]
where , , and are the energies of the conducting electron of rare-earth metals (, electrons), the electron, and the core electron, respectively. The index represents the quantum number of the spin and -orbital. , , and are the energies of the hybridization interaction between 4 electrons and conduction electrons, the Coulomb interaction between electrons, and the core-hole Coulomb potential for electrons, respectively. In both the initial and final states, we set the energy level of to 0 as the standard energy level. Then, the number of parameters of the effective Hamiltonian is reduced to five: , and . For later use, we define the parameter set of the effective Hamiltonian as .
The conceptual diagram of the electron state from the initial state to the final state is shown in Fig. 1. The initial state is defined as the state before X-rays are irradiated, and the final state is defined as the state after X-rays are irradiated and a core hole is generated. Furthermore, we define the initial eigenstate of the minimum energy as and the final eigenstates of the three energy levels as (Fig. 1). Using Fermi’s golden rule, we define the transition probability from the initial ground state to the final state as above
[TABLE]
By convoluting with the Lorentz function and by adding the Gaussian noise , we can obtain the intensity distribution of the spectrum as
[TABLE]
where is one-half the width of the Lorentz function. We define the data generated following as “emulated measurement data”. Because the generative model is uniquely defined by the effective Hamiltonian , we label this model as .
Following Groot and KotaniGroot08 , the emulated parameters of the XPS spectrum of La2O3 and CeO2 are set as illustrated in Table 1. The generated data based on Table 1 are shown in Fig. 2. The two peaks in Fig. 2(a) correspond to La2O3 and the three peaks in Fig. 2(b) correspond to CeO2.
In the parameters of the two-peak spectrum, the peak intensity of the highest energy becomes almost zero. This is caused by the decreased interaction between the state and the state, originating from the increased energy of the state. This suggests that is essential for changing the property of the peak number, because controls the energy of the initial and final states. In fact, the most significant change between the parameters of La2O3 and CeO2 is (Table 1). Thus, when we discuss the properties of the La2O3 and CeO2 spectra, we can ignore the difference in parameters between La2O3 and CeO2, excluding .
III Recognition Model of Emulated Spectrum
In this section, we describe the two effective models for recognizing the emulated measurement data generated following the generative model . Hereinafter, we refer to these effective models as the recognition models. In this section, we introduce two recognition models referred to as the two-state Hamiltonian model and the three-state Hamiltonian model.
III.1 Two-state Hamiltonian model
Kotani and ToyozawaKotani74 proposed a two-state Hamiltonian model as an effective model of La2O3 XPS spectra. The two-state Hamiltonian model is defined using an effective Hamiltonian on two eigenstates of electrons and . The effective Hamiltonian is given as
[TABLE]
where, as in the generative model Hamiltonian , , , and are the energies of the conducting electron of rare-earth metals (, electrons), the electron, and the core electron, respectively. The index – represents the quantum number of the spin and -orbital. and are the energies of the hybridization interaction and the core-hole Coulomb potential for the electrons, respectively.
We define the initial eigenstate of the minimum energy as and the final eigenstates of the two energy levels as . In the two-state Hamiltonian model, the initial eigenstate is set to be equal to the electron eigenstate . By using Fermi’s golden rule, convoluting the transition probability with the Lorentz function, and adding the Gaussian noise , we obtained the spectral distribution
[TABLE]
where, as in the generative model , we set the energy level of the initial and final states to 0 as the standard energy level. Thus, the number of parameters of the two-state Hamiltonian model is five: , and . The energy shift parameter is added to the model to compensate for the difference in the standard energy level between the models. The energy shift parameter is also added to the model in the conventional analysis of the XPS spectra. Because the two-state Hamiltonian model is uniquely defined by the effective Hamiltonian , we label this model as .
It was reported that the XPS spectra of La2O3 can be reproduced by the two-state Hamiltonian model Kotani74 . As we mentioned, the emulated measurement data of the La2O3 XPS spectrum is generated on the basis of the generative model , which is more complex than the two-state Hamiltonian model . One purpose of this study is to confirm that this physical knowledge is reproduced by the proposed method.
III.2 Three-state Hamiltonian model
Kotani et al.Kotani85 proposed a three-state Hamiltonian model as an effective model of CeO2 XPS spectra. The effective Hamiltonian of the three-state Hamiltonian model is defined as
[TABLE]
Then, as in the previously described model, the XPS spectral model is derived as
[TABLE]
where is the initial eigenstate of the minimum energy , and as are the final eigenstates of the three energy levels. We set the energy level of the initial and final states to 0 as the standard energy level. Thus, the number of parameters of the three-state Hamiltonian model is eight: , and . The three-state Hamiltonian model and the generative model are similar except for and the degree of freedom of .
As we mentioned earlier, it was reported that the XPS spectra of CeO2 can be reproduced by the three-state Hamiltonian model Kotani85 . Because the three-state Hamiltonian model is uniquely defined by the effective Hamiltonian , we label this model as .
IV Method
IV.1 Bayesian model selection
We evaluate the recognition models and in terms of their capability to represent the spectrum data generated by the generative model . The likelihood of the recognition model for the dataset is defined as
[TABLE]
where is a normalization constant. In this study, we assume that there is no prior knowledge about the likelihood of the model. Thus, we set the prior probability as a uniform distribution; in this study, it is equal to . We also assume that in the dataset is given deterministically, that is, non-probabilistically. Then, the likelihood of the model is transformed as
[TABLE]
where is the parameter set of the recognition model described in Sect. III.
The conditional probability of Eq. (9) is a stochastic generative model of the recognition model . When the additive noise of the XPS spectra is given as an independent and identically distributed Gaussian with average 0 and standard deviation ,
[TABLE]
The probability in Eq. (9) simulates the prior knowledge about the model parameters as a probability distribution. By substituting Eq. (LABEL:noise_model) into Eq. (9), we obtain the following:
[TABLE]
where
[TABLE]
The probability is often referred to as the marginal likelihood and is proportionally related to the likelihood of the recognition model . The negative log-likelihood,
[TABLE]
is often referred to as the Bayesian free energy (). The effective model with the smallest value represents the best model.
IV.2 Exchange Monte Carlo method
To obtain the value of , we need to execute the integration in Eq. (11). However, it is difficult to analytically execute the integration owing to the complicated relationship between and . We overcame this difficulty by numerical integration using the exchange Monte Carlo methodHukushima96 .
Markov chain Monte Carlo (MCMC) methodsmetropolis53 are efficient for sampling from a probability distribution in a high-dimensional space, such as . To apply an MCMC method to execute an integration, we need to transform the integration to a mean value calculation. When applying an MCMC method to the calculation of , we introduce an auxiliary variable and transform Eq. (13) into
[TABLE]
where represents an average and
[TABLE]
When is regarded as energy, this equation suggests that corresponds to the Boltzmann distribution in statistical physics. In the same way, the auxiliary variable corresponds to the inverse temperature in statistical physics. Equation (LABEL:integ_beta) is approximated to a quadrature by parts,
[TABLE]
where is given as a sequence of inverse temperatures obtained by dividing to into pieces, and is obtained by -independent MCMC sampling. However, the MCMC sampling is often trapped at local minima.
The exchange Monte Carlo method (EMC) is an algorithm of an MCMC method used to avoid local trapping at minima. This method simulates multiple samplings from multiple densities with different inverse temperatures . The EMC takes samples from the joint density
[TABLE]
where the probability density is defined in Eq. (15). The EMC algorithm is based on the following updates, in which the joint density is invariant.
- 1
Sampling from each density
Sampling from by a conventional MCMC method, such as
the Metropolis–Hastings algorithmhastings70 .
- 2
Exchange process between two densities corresponding to adjacent inverse temperatures
The exchanges between the configurations and correspond to adjacent inverse temperatures following the probability , where
[TABLE]
Sampling from a distribution with a smaller corresponds to sampling from a distribution with a larger intensity of noise; thus, the distribution tends not to have a local minimum. On the other hand, sampling from a distribution with a larger corresponds to sampling from a distribution with local minima. Hence, sampling from the joint density overcomes the local minimum and enables the fast convergence of sampling.
Using the sampling result of the state, we obtained a posterior density of the parameter [Eq. (15)]. From the posterior density of , we can estimate the model parameters of and the related information, such as estimation accuracy.
V Results
We applied our proposed method to the emulated measurement data and estimated the likelihood of the effective models and . As described in Sect. III, from the physical knowledge, two recognition models and are expected to be selected for the emulated measurement data of La2O3 and CeO2 XPS spectra, respectively. The spectrum of CeO2 has a three-peak structure, and La2O3 has a two-peak structure. If the selection of an effective model is based only on the peak number, the effective model could also be selected indirectly by the existing spectral deconvolution method which has no internal modelNagata12 ; tokuda16 . On the other hand, because our proposed method builds an effective model into spectral deconvolution, various information about peak structure, such as the peak position or the order of peak intensity, suppose to be used to select the effective model. To confirm this, we applied our proposed method not only to spectra of CeO2 and La2O3, but also to spectra that are the intermediate states of a seamless transition from the CeO2 spectrum to the La2O3 spectrum. As we mentioned in Sect. II, is an important parameter for controlling the properties of XPS spectra from La2O3 to CeO2. The generated parameters and of the La2O3 spectrum are also slightly different from the generated parameters of the CeO2 spectrum (Table 1). However, the emulated measurement data of the La2O3 spectrum generated by replacing the and values with the parameter values of the CeO2 spectrum have almost the same peak position and peak intensity as those in the La2O3 spectrum [Figs. 3(a)-(1), 3(a)-(2), and 2(a)]. Therefore, we refer to this parameter-replaced emulated spectrum as the La2O3 spectrum. That is, by shifting from 1.6 to 12.5 and fixing the other parameters as the parameters of CeO2, we can generate spectra that have the intermediate structure of the La2O3 and CeO2 spectra. The increase in the parameter from CeO2 to La2O3 induces the decrease in the transition probability from the initial ground state to the final maximum eigenenergy state . Because the square of the transition probability is the peak intensity, the decrease in the transition probability transforms the three peak CeO2 spectrum to a two-peak spectrum. To be precise, the emulated measurement data of La2O3 has three peaks, but the peak intensity corresponding to the largest eigenvalue is very small. Hence, in this study, we define the peak number of the emulated measurement data as the number of peaks whose intensity is larger than the noise intensity . In this way, we generated emulated measurement data by shifting the parameter and applying our proposed method to it. More specifically, all parameters except are the same, , for all applied emulated measurement data. As mentioned above, the peak number is defined using the noise intensity and the peak intensity. Therefore, we evaluated the effect of not only , but also the noise intensity . Hence, we also generated the emulated measurement data by setting the noise standard deviation to . The peak number, peak position, and peak intensity of the emulated measurement data are described in Table 2 and some examples of emulated measurement data are described in Fig. 3(a). In the emulated measurement data of , if the noise intensity is smaller than 0.0052, the peak number is three, whereas if the noise intensity is larger than 0.0052, the peak number is two (Table 2). In this study, each emulated measurement data consists of samples.
To apply Bayesian estimation, we set the prior density to a uniform distribution in the range described in Table 3. In the execution of EMC sampling, we adopted the Metropolis–Hastings algorithmhastings70 to sample each state of inverse temperature. The states of inverse temperature were determined following the exponential functionNagata08 :
[TABLE]
where and . We abandoned the first 10,000 steps and sampled the next 1,000,000 steps.
On the basis of the obtained Bayesian free energies and , which correspond to the two-state Hamiltonian model and the three-state Hamiltonian model , respectively, the log likelihood of , , was calculated as
[TABLE]
If , then is a more plausible model than . Otherwise, if , then is a more plausible model than . From the phase diagram of (Fig. 4), the three-state Hamiltonian model was selected for the emulated measurement data of , corresponding to the CeO2 spectra, and the two-state Hamiltonian model was selected for the emulated measurement data of , corresponding to the La2O3 spectra. Furthermore, the proposed method selected the three-state Hamiltonian model for all intermediated emulated measurement data from to . It included the emulated measurement data of and , whose peak intensity of the largest energy was smaller than the noise intensity. For further analysis, we evaluated the ratio of the s, , which corresponds to the difference in likelihoods, , in logarithm space. means that is a better model than , and means that is a better model than . The value of tends to gradually increase as the data generated by the parameters and increase [Fig. 4(b)].
To estimate the model parameters and their estimation uncertainty on the basis of the effective model , we evaluated the posterior density of parameters . The posterior density must consist of independent samples. However, the sampling time series of the EMC has a time correlation. Therefore, we extracted samples from sufficiently separated intervals with a correlation coefficient of 0.5 or lower, and calculated the posterior distribution. Furthermore, to visualize a posterior distribution with more than three dimensions, we calculated the following marginal posterior density, which marginalizes the posterior distribution of the parameters except for the parameter of interest :
[TABLE]
Assuming that the sampling data of one parameter , , are extracted from the sampling result of the EMC, the marginalized posterior distribution can be estimated from sampling result by kernel density estimation method using Gaussian kernels. We determined the bandwidth of Gaussian kernels using Scott’s rulescott 15 .
From the evaluation of the marginal posterior density of the emulated measurement data of equal to 7.66 or less, we found a sharp peak in posterior distributions around the true parameter, which we used to generate the emulated measurement data (Figs. 6 – 10 ). From the evaluation of the marginal posterior density with the emulated measurement data of , whose value is the model estimation switching boundary from to , we found that the width of the posterior distribution of , , and increased as the noise intensity (Figs. 6, 6, and 8) increased. On the other hand, the marginal posterior densities of , and for the emulated measurement data of have almost uniform distributions.
In this study, model parameters were estimated from such posterior distributions using the following two methods. The first method was the maximum a posteriori (MAP) method. The MAP method estimates parameters on the basis of the following equation:
[TABLE]
The second method was the maximizer of the posterior marginal (MPM) method. The MPM method estimates parameters on the basis of the following equation:
[TABLE]
where, as with the marginal posterior density, is the index of a certain parameter included in the parameter set . The MPM corresponds to using the maximum marginal posterior density as an estimation value. To evaluate the estimation uncertainty of a parameter, we defined the variation of sampling data from the MPM as
[TABLE]
Here, we focused on the parameters of the effective model , which are , , , , , and (Tables 4, 5, 6, 7, 8, and 9, respectively).
In the emulated measurement data of 10.08, all parameters were estimated correctly by both the MAP and MPM methods. In greater detail, the gaps between the estimated parameters and the true parameters increased as the noise variance increased, and the variation also increased as the noise intensity increased. In the emulated measurement data of , the gaps between the estimated parameters and the true parameters were much larger than the others, except for the emulated measurement data of and and 0.0028. We evaluated the uncertainty of parameter estimation from the variation of the marginal posterior density. As a result, at and , a large transition of the variation of , , and greater than one order was observed (Tables 4, 5, and 6). Also, at , a large was observed regardless of the noise intensity (Tables 4, 5, and 6).
VI Discussion
By applying our proposed method to emulated measurement data, we determined that the two-state Hamiltonian model should be applied to emulated measurement data corresponding to the La2O3 spectra. On the other hand, the three-state Hamiltonian model should be applied to emulated measurement data corresponding to the CeO2 spectra. These results are consistent with those of previous studiesKotani74 ; Kotani85 . For the emulated measurement data of and , our proposed method selected the three-state Hamiltonian model . These spectral distributions, the same as the La2O3 spectra, are two-peak spectra as described in Fig. 3[a-(2)] and Table 2. Here, we consider applying the existing method to the emulated measurement data of . The existing Bayesian spectral deconvolution methods that are applicable to the analysis of the core-level 3 XPS spectrum have no internal modelNagata12 ; tokuda16 . Such existing methods simply select the model whose number of peaks is the same as the appearance of peak numbernumber of peaks appearing in the spectratokuda_Dthesis16 . Therefore, if the existing methods are applied to the emulated measurement data of , a model that has a two peak structure should be selected. Such differences in model selection results between our proposed method and existing Bayesian spectral deconvolution methods will depend on whether the internal model, the effective Hamiltonian, was built in the spectral deconvolution model. This suggests that our proposed method should be applied to the analysis of core-shell XPS spectra when its candidates of the effective Hamiltonian are given.
From the posterior distribution, we can estimate the uncertainties of estimated parameters. Actually, from the analysis of the posterior distribution of the model, it is confirmed that the estimation uncertainty, which corresponds to the variation , is increased as the noise intensity increased. The marginalized posterior distributions, except for and , of and were broad and had no sharp peak structures, whereas the marginalized posterior distributions of and had sharp peak structures. These properties suggest that the estimation uncertainty significantly decreases around and , where the peak number of the spectrum is changed. This suggests that, to obtain high estimation accuracy, the noise intensity must be reduced to less than tokuda_Dthesis16 . Through such an analysis of the posterior distribution, it is possible to make a measurement plan, such as the number of measurements and the measurement method, to satisfy the required estimation accuracytokuda_Dthesis16 . Such an expansion of the variation of the marginalized posterior distribution is presumed to occur via the indefinite estimation parameter as a result of the effective model having an excessive expression power. In particular, the fact that the marginalized posterior density began to spread at and , where the peak number was changed, suggests that the effective model has excessive expression capability for the two-peak spectrum. Information about the estimation accuracy of parameters or the expression capability of the effective model is difficult to obtain by the conventional methods of analysis such as the core-level XPS analysis method using manual tuning and spectral deconvolution using a simple fitting method.
In this study, we adopted the simplified cluster model as the effective model of core-level 3 XPS spectra. This effective model does not take into account the spin-orbit interaction, the multiplet effect, or the crystal field effect. It is generally too simplistic to explain the actual measurement spectra of core-level XPS. However, if the spectrum intensity model is generated by Fermi’s golden rule, our proposed method can easily replace the internal model with another model. For example, except for the difficulty related to increasing the number of parameters, the effective Hamiltonian used in this study can be easily replaced with a model that takes into account the spin-orbit interaction, the multiplet effect, and the crystal field effect. This capability means that we can apply our proposed method to a wider range of actual observed XPS spectra by replacing the effective model with a cluster model that focuses more on interactions or with the impurity Anderson model. In the impurity Anderson model, the effect of the band structure of conducting electrons is concerned with the band structure of conducting electrons. Thus, it is suggested that the framework of the proposed method has wide applicability to actual measurement data of core-level XPS. Also, the analysis of emulated measurement data in the intermediate state can also be realized by the analysis of the actual spectra which have the same kind of spectral structure for each other.
VII Summary
By incorporating the effective Hamiltonian into the stochastic model of spectral deconvolution, we developed a Bayesian spectral deconvolution method for core-level XPS to realize the automatic analysis of core-level XPS spectra. By applying our proposed method to the emulated 3 core-level XPS spectra of La2O3 and CeO2, it was confirmed that our proposed method selects effective Hamiltonians that are consistent with knowledge obtained from the conventional study of physics. We also applied our proposed method to spectra which are the intermediate states of a seamless transition from the CeO2 spectrum (three peaks) to the La2O3 spectrum (two peaks). As a result, it was confirmed that the proposed method selects an effective Hamiltonian on the basis of not only the information about the peak number but also other information contained in the effective Hamiltonian. This cannot be realized by existing Bayesian spectral deconvolution methods applicable to XPS spectra, which select the model on the basis of the peak numberNagata12 ; tokuda16 . Our proposed method also enables the parameter estimation of the effective model using the posterior distribution of its parameter. Using the MAP or MPM methods, we were able to estimate the true parameters of the generative model from the posterior distributions. Furthermore, using the posterior distribution, we were able to evaluate the parameter estimation accuracy or obtain information about the properties of the effective model for the spectrum. This capability of our proposed method can yield information for scientific discussion, e.g., detection limittokuda_Dthesis16 , or the improvement of the effective Hamiltonian using the observed data. In conventional analysis methods, such as those using manual tuning or a simple fitting technique, such information cannot be obtained. It is also suggested through our discussion that the framework of the proposed method has wide applicability to actual measurement data of core-level XPS. We expect that our proposed method will pave the way for the highly quantitative analysis of core-level XPS spectra.
Acknowledgments This work was supported by the Cross-ministerial Strategic Innovation Promotion Program (SIP), “Structural Materials for Innovation” (Funding agency: JST) and JST CREST (JPMJCR1761, JPMJCR1861).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J. Kanamori and A. Kotani, Proc. 10th Taniguchi Int. Symp., 1987, p. 81.
- 2(2) A. Kotani and Y. Toyozawa, J. Phys. Soc. Jpn. 37 , 912 (1974).
- 3(3) A. Kotani, H. Mizuta, T. Jo, and J. C. Parlebas, Solid State Commun. 53 , 805 (1985).
- 4(4) A. Kotani, M. Okada, T. Jo, A. Bianconi, A. Marcelli, and J. C. Parlebas, J. Phys. Soc. Jpn. 56 , 798 (1987).
- 5(5) F. Groot and A. Kotani, Core Level Spectroscopy of Solids (CRC Press, London, 2008).
- 6(6) K. Nagata, S. Sugita, and M. Okada, Neural Networks 28 , 82 (2012).
- 7(7) K. Hukushima and K. Nemoto, J. Phys. Soc. Jpn. 65 , 1604 (1996).
- 8(8) S. Tokuda, K. Nagata, and M. Okada, J. Phys. Soc. Jpn. 86 , 2 (2016).
