HORM: A Large Scale Molecular Hessian Database for Optimizing Reactive Machine Learning Interatomic Potentials
Taoyong Cui, Yunhong Han, Haojun Jia, Chenru Duan, Qiyuan Zhao

TL;DR
This paper introduces HORM, the largest Hessian database for reactive systems, and demonstrates how Hessian-informed training significantly improves the accuracy of machine learning interatomic potentials for transition state searches.
Contribution
The work provides the HORM Hessian dataset and a Hessian-informed training strategy, enabling more accurate reactive MLIPs for large-scale reaction modeling.
Findings
Up to 63% reduction in Hessian mean absolute error.
200-fold increase in transition state search efficiency.
HORM addresses key data and methodological gaps in reactive MLIPs.
Abstract
Transition state (TS) characterization is central to computational reaction modeling, yet conventional approaches depend on expensive density functional theory (DFT) calculations, limiting their scalability. Machine learning interatomic potentials (MLIPs) have emerged as a promising approach to accelerate TS searches by approximating quantum-level accuracy at a fraction of the cost. However, most MLIPs are primarily designed for energy and force prediction, thus their capacity to accurately estimate Hessians, which are crucial for TS optimization, remains constrained by limited training data and inadequate learning strategies. This work introduces the Hessian dataset for Optimizing Reactive MLIP (HORM), the largest quantum chemistry Hessian database dedicated to reactive systems, comprising 1.84 million Hessian matrices computed at the B97x/6-31G(d) level of theory. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Chemical Physics Studies · Computational Drug Discovery Methods
