# Bayesian optimization of chemical composition: a comprehensive framework   and its application to $R$Fe$_{12}$-type magnet compounds

**Authors:** Taro Fukazawa, Yosuke Harashima, Zhufeng Hou, and Takashi Miyake

arXiv: 1903.09385 · 2019-06-05

## TL;DR

This paper introduces a machine learning-based framework using Bayesian optimization and first-principles calculations to efficiently optimize the chemical composition of complex magnetic compounds, improving material properties prediction.

## Contribution

It presents a novel integration method for datasets to correct systematic errors and applies Bayesian optimization to design $R$Fe$_{12}$-type magnets, demonstrating enhanced efficiency with proper descriptors.

## Key findings

- Optimization efficiency depends on descriptor choice.
- Variables $eta$, $	ext{γ}$, and electron count are key factors.
- Bayesian optimization outperforms random sampling when descriptors are well-chosen.

## Abstract

We propose a framework for optimization of the chemical composition of multinary compounds with the aid of machine learning. The scheme is based on first-principles calculation using the Korringa-Kohn-Rostoker method and the coherent potential approximation (KKR-CPA). We introduce a method for integrating datasets to reduce systematic errors in a dataset, where the data are corrected using a smaller and more accurate dataset. We apply this method to values of the formation energy calculated by KKR-CPA for nonstoichiometric systems to improve them using a small dataset for stoichiometric systems obtained by the projector-augmented-wave (PAW) method. We apply our framework to optimization of $R$Fe$_{12}$-type magnet compounds (R$_{1-\alpha}$Z$_{\alpha}$)(Fe$_{1-\beta}$Co$_{\beta}$)$_{12-\gamma}$Ti$_{\gamma}$, and benchmark the efficiency in determination of the optimal choice of elements (R and Z) and ratio ($\alpha$, $\beta$ and $\gamma$) with respect to magnetization, Curie temperature and formation energy. We find that the optimization efficiency depends on descriptors significantly. The variable $\beta$, $\gamma$ and the number of electrons from the R and Z elements per cell are important in improving the efficiency. When the descriptor is appropriately chosen, the Bayesian optimization becomes much more efficient than random sampling.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.09385/full.md

## Figures

28 figures with captions in the complete paper: https://tomesphere.com/paper/1903.09385/full.md

## References

70 references — full list in the complete paper: https://tomesphere.com/paper/1903.09385/full.md

---
Source: https://tomesphere.com/paper/1903.09385