# A Large Scale Molecular Hessian Database for Optimizing Reactive Machine Learning Interatomic Potentials

**Authors:** Taoyong Cui, Yunhong Han, Haojun Jia, Chenru Duan, Qiyuan Zhao

PMC · DOI: 10.1038/s41597-025-06350-5 · Scientific Data · 2025-12-04

## TL;DR

This paper introduces HORM, a large quantum-chemistry Hessian dataset that improves the accuracy and efficiency of machine-learning interatomic potentials for modeling chemical reactions.

## Contribution

The paper presents HORM, the largest Hessian dataset for reactive systems, and a novel training method that leverages second-order information for MLIPs.

## Key findings

- HORM contains 1.84 million Hessian matrices at the ωB97x/6-31G(d) level.
- HORM reduces Hessian mean absolute error by up to 63% and improves TS-search efficiency by up to 200×.
- The proposed method enables scalable and accurate reaction network exploration.

## Abstract

Transition-state (TS) characterization underpins reaction modeling but conventional DFT is costly. Machine-learning interatomic potentials (MLIPs) promise quantum-level accuracy at lower cost, yet, lacking large-scale Hessian data, most are pretrained only on energies and forces, limiting TS optimization. We present HORM, the largest quantum-chemistry Hessian dataset for reactive systems: 1.84 million matrices at the ωB97x/6-31G(d) level. To exploit second-order information efficiently, we propose Hessian-informed training with stochastic row sampling, which controls the computational overhead of incorporating Hessians. Across diverse MLIP architectures and force-learning schemes, HORM yields up to 63% lower Hessian mean absolute error and up to 200× improvement in TS-search efficiency versus counterparts trained without Hessians. HORM thus fills critical data and methodological gaps, enabling more accurate, robust reactive MLIPs and scalable exploration of reaction networks.

## Full-text entities

- **Genes:** MLIP (muscular LMNA interacting protein) [NCBI Gene 90523] {aka C6orf142, CIP, MMCKR}
- **Diseases:** HORM (MESH:D000275), TS (MESH:D008579)
- **Chemicals:** O (MESH:D010100), N (MESH:D009584), C (MESH:D002244), H (MESH:D006859), LMDB (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12808639/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12808639/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/PMC12808639/full.md

---
Source: https://tomesphere.com/paper/PMC12808639