Data-driven approach for benchmarking DFTB-approximate excited state methods
Andr\'es I. Bertoni, Cristi\'an G. S\'anchez

TL;DR
This paper introduces a data-driven benchmarking approach for DFTB excited state methods using a large ML dataset, revealing their limitations and chemical dependencies in excitation energy predictions.
Contribution
It presents a chemically-informed benchmarking framework leveraging ML data to evaluate and understand the limitations of DFTB excited state methods.
Findings
Identification of error trends related to chemical identity.
Insights into the limitations of DFTB methods compared to higher-level theories.
Recommendations for improving DFTB excited state predictions.
Abstract
In this work we propose a chemically-informed data-driven approach to benchmark the approximate density-functional tight-binding (DFTB) excited state (ES) methods that are currently available within the DFTB+ suite. By taking advantage of the large volume of low-detail ES-data in the machine learning (ML) dataset, QM8, we were able to extract valuable insights regarding the limitations of the benchmarked methods in terms of the approximations made to the parent formalism, density-functional theory (DFT), while providing recommendations on how to overcome them. For this benchmark, we compared the first singlet-singlet vertical excitation energies () predicted by the DFTB-approximate methods with predictions of less approximate methods from the reference ML-dataset. For the nearly 21,800 organic molecules in the GDB-8 chemical space, we were able to identify clear trends in the …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Chemical Physics Studies · Spectroscopy and Quantum Chemical Studies
