Data-Driven Parametrization of Molecular Mechanics Force Fields for Expansive Chemical Space Coverage
Tianze Zheng, Ailun Wang, Xu Han, Yu Xia, Xingyuan Xu, Jiawei Zhan, Yu, Liu, Yang Chen, Zhi Wang, Xiaojie Wu, Sheng Gong, Wen Yan

TL;DR
This paper introduces ByteFF, a data-driven, GNN-based force field for molecular dynamics that covers broad chemical space with high accuracy, enhancing computational drug discovery.
Contribution
Developed ByteFF, a novel Amber-compatible force field trained on a large, diverse dataset, enabling accurate predictions across expansive chemical space.
Findings
ByteFF achieves state-of-the-art accuracy on benchmark datasets.
It predicts force field parameters for diverse drug-like molecules.
ByteFF demonstrates high accuracy in geometries, torsions, and energies.
Abstract
A force field is a critical component in molecular dynamics simulations for computational drug discovery. It must achieve high accuracy within the constraints of molecular mechanics' (MM) limited functional forms, which offers high computational efficiency. With the rapid expansion of synthetically accessible chemical space, traditional look-up table approaches face significant challenges. In this study, we address this issue using a modern data-driven approach, developing ByteFF, an Amber-compatible force field for drug-like molecules. To create ByteFF, we generated an expansive and highly diverse molecular dataset at the B3LYP-D3(BJ)/DZVP level of theory. This dataset includes 2.4 million optimized molecular fragment geometries with analytical Hessian matrices, along with 3.2 million torsion profiles. We then trained an edge-augmented, symmetry-preserving molecular graph neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Catalysis and Oxidation Reactions · Advanced Polymer Synthesis and Characterization
MethodsGraph Neural Network
