Generalized convolutional many body distribution functional representations
Danish Khan, O. Anatole von Lilienfeld

TL;DR
The paper introduces generalized convolutional many-body distribution functionals (cMBDF), a highly efficient atomic representation that improves accuracy and reduces computational costs in machine learning models for chemical systems, especially in low-data regimes.
Contribution
It presents a novel, compact, and accurate atomic representation framework, cMBDF, that generalizes MBDF and leverages convolutional evaluations for enhanced efficiency and applicability across diverse chemical datasets.
Findings
cMBDF is up to two orders of magnitude more compact than existing representations.
cMBDF achieves higher accuracy in predicting quantum properties.
Model training time is reduced from 23 hours to 8 minutes.
Abstract
Modern machine learning (ML) models of chemical and materials systems with billions of parameters require vast training datasets and considerable computational efforts. Lightweight kernel or decision tree based methods, however, can be rapidly trained, leading to a considerably lower carbon footprint. We introduce generalized convolutional many-body distribution functionals (cMBDF) as highly compute and data efficient atomic representations for accurate kernels that excel in low-data regimes. Generalizing the MBDF framework, cMBDF encodes local chemical environments in a compact fashion using translationally and rotationally invariant functionals of smooth atom centered Gaussian electron density proxy distributions weighted by interaction potentials. The functional values can be efficiently evaluated by expressing them in terms of convolutions which are calculated via fast Fourier…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMorphological variations and asymmetry
