An Open Quantum Chemistry Property Database of 120 Kilo Molecules with 20 Million Conformers
Weiqi Liu, Xi Ai, Zhijian Zhou, Chao Qu, Junyi An, Zhipeng Zhou, Yuan, Cheng, Yinghui Xu, Fenglei Cao, Alan Qi

TL;DR
The paper introduces QO2Mol, a large open-source quantum chemistry database with 120,000 molecules and 20 million conformers, enabling advanced AI research in organic molecular sciences.
Contribution
It provides a comprehensive, high-precision quantum chemistry dataset of diverse organic molecules and conformers, facilitating machine learning and structure-property relationship studies.
Findings
Extensive coverage of molecular structures and elements.
High-precision quantum mechanical properties computed for all conformers.
Open-source database and benchmark codes available for research.
Abstract
Artificial intelligence is revolutionizing computational chemistry, bringing unprecedented innovation and efficiency to the field. To further advance research and expedite progress, we introduce the Quantum Open Organic Molecular (QO2Mol) database -- a large-scale quantum chemistry dataset designed for professional and transformative research in organic molecular sciences under an open-source license. The database comprises 120,000 organic molecules and approximately 20 million conformers, encompassing 10 different elements (C, H, O, N, S, P, F, Cl, Br, I), with heavy atom counts exceeding 40. Utilizing the high-precision B3LYP/def2-SVP quantum mechanical level, each conformation was meticulously computed for quantum mechanical properties, including potential energy and forces. These molecules are derived from fragments of compounds in ChEMBL, ensuring their structural relevance to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHistory and advancements in chemistry
