Embedding parameters in ab initio theory to develop approximations based on molecular similarity
Matteus Tanha, Haichen Li, Shiva Kaul, Alexander Cappiello, Geoffrey, J. Gordon, David J. Yaron

TL;DR
This paper introduces a method to embed adjustable parameters into low-cost ab initio models, leveraging molecular similarity to significantly improve their accuracy in predicting electronic properties while reducing computational costs.
Contribution
The authors develop a parametrized low-level ab initio model that uses scaling factors based on molecular features, demonstrating systematic improvements over traditional models.
Findings
Reduced RMS energy prediction errors by over 85% on training data
Achieved over 75% error reduction on test molecules
Model flexibility allows systematic improvement with additional data
Abstract
A means to take advantage of molecular similarity to lower the computational cost of electronic structure theory is explored, in which parameters are embedded into a low-cost, low-level (LL) ab initio model and adjusted to obtain agreement with results from a higher-level (HL) ab initio model. A parametrized LL (pLL) model is created by multiplying selected matrix elements of the Hamiltonian operators by scaling factors that depend on element types. Various schemes for applying the scaling factors are compared, along with the impact of making the scaling factors linear functions of variables related to bond lengths, atomic charges, and bond orders. The models are trained on ethane and ethylene, substituted with -NH2, -OH and -F, and tested on substituted propane, propylene and t-butane. Training and test datasets are created by distorting the molecular geometries and applying uniform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Chemical Physics Studies · Computational Drug Discovery Methods
