Machine learning of kinetic energy densities with target and feature averaging: better results with fewer training data
Sergei Manzhos, Johann L\"uder, Manabu Ihara

TL;DR
This paper demonstrates that averaging density-dependent variables improves machine learning of kinetic energy densities, enabling accurate models with significantly fewer training data for materials like Al, Mg, and Si.
Contribution
The study introduces a novel averaging approach for density variables that reduces data requirements and enhances the accuracy of ML kinetic energy models in OF-DFT.
Findings
Accurate ML models achieved with only 2000 samples per material.
Averaging improves data distribution and model stability.
Achieved ~1% accuracy in energy-volume relation for multiple materials.
Abstract
Machine learning of kinetic energy functionals (KEF), in particular kinetic energy density (KED) functionals, has recently attracted attention as a promising way to construct KEFs for orbital-free density functional theory (OF-DFT). Neural networks (NN) and kernel methods including Gaussian process regression (GPR) have been used to learn Kohn-Sham (KS) KED from density-based descriptors derived from KS DFT calculations. The descriptors are typically expressed as functions of different powers and derivatives of the electron density. This can generate large and extremely unevenly distributed datasets, which complicates effective application of machine learning techniques. Very uneven data distributions require many training data points, can cause overfitting, and ultimately lower the quality of a ML KED model. We show that one can produce more accurate ML models from fewer data by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · X-ray Diffraction in Crystallography · Computational Drug Discovery Methods
