$\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials
Kuzma Khrabrov, Anton Ber, Artem Tsypin, Konstantin Ushenin, Egor, Rumiantsev, Alexander Telepov, Dmitry Protasov, Ilya Shenbin, Anton Alekseev,, Mikhail Shirokikh, Sergey Nikolenko, Elena Tutubalina, Artur Kadurin

TL;DR
The paper introduces $ abla^2$DFT, a comprehensive quantum chemistry dataset with diverse molecular data and a benchmark for neural network potentials, enhancing drug discovery research and NNP evaluation.
Contribution
It presents a new, extensive dataset and benchmark for neural network potentials, including relaxation trajectories and multiple molecular properties, advancing the development and assessment of NNPs.
Findings
Dataset contains twice as many structures as previous datasets.
Includes relaxation trajectories for drug-like molecules.
Provides a framework and 10 models for training NNPs.
Abstract
Methods of computational quantum chemistry provide accurate approximations of molecular properties crucial for computer-aided drug discovery and other areas of chemical science. However, high computational complexity limits the scalability of their applications. Neural network potentials (NNPs) are a promising alternative to quantum chemistry methods, but they require large and diverse datasets for training. This work presents a new dataset and benchmark called DFT that is based on the nablaDFT. It contains twice as much molecular structures, three times more conformations, new data types and tasks, and state-of-the-art models. The dataset includes energies, forces, 17 molecular properties, Hamiltonian and overlap matrices, and a wavefunction object. All calculations were performed at the DFT level (B97X-D/def2-SVP) for each conformation. Moreover, DFT is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational Drug Discovery Methods · Various Chemistry Research Topics
