# Automated Discovery of Reactive Events via Hypergraph Mining of Ab Initio Atomistic Simulations

**Authors:** Alexandra Stan-Bernhardt, Paolo Pellizzoni, Karsten Borgwardt, Christian Ochsenfeld

PMC · DOI: 10.1021/acs.jctc.5c01682 · Journal of Chemical Theory and Computation · 2026-02-12

## TL;DR

This paper introduces a new automated method to discover reactive events in chemical simulations using hypergraph mining and statistical analysis.

## Contribution

The novel contribution is an automated workflow for analyzing chemical reaction networks using frequent pattern mining and statistical correlation with environmental conditions.

## Key findings

- Frequent reactive patterns were identified across simulations using directed hypergraph mining.
- Statistically significant correlations between reactive events and environmental conditions were found using Fisher’s exact test.
- Minimum energy paths for key patterns were computed using the molecular double-ended growing string method.

## Abstract

The field of generative chemistry and automated exploration
of
chemical reaction space has gained much interest in recent years as
it provides a feasible alternative to performing resource-intensive
experiments by enabling important computational insights into new
molecular systems. The results are often summarized in reaction networks,
which reveal intricate relations between different key reactive events.
Although various approaches to explore the available chemical space
have been introduced, the information contained in the resulting reaction
networks has not been fully exploited so far. We propose an automated
workflow for the analysis of chemical reaction networks by applying
frequent pattern mining on the corresponding directed hypergraphs
to identify frequently occurring reactive patterns across a set of
simulations. Furthermore, we identify reactive events that are statistically
correlated with given environmental conditions by applying Fisher’s
exact test and controlling the family-wise error rate to ensure high
statistical relevance. Minimum energy paths for frequent and statistically
significant patterns are obtained with the molecular double-ended
growing string method at ωB97X-3c level of theory. We showcase
the pattern-mining-based analysis on the thermally controlled interstellar
synthesis of carbamic acid, where we retrieve results in line with
experimental data and further investigate the role of water as a protic
solvent therein.

## Linked entities

- **Chemicals:** carbamic acid (PubChem CID 277), water (PubChem CID 962)

## Full-text entities

- **Chemicals:** carbamic acid (MESH:C070766), water (MESH:D014867)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12937057/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12937057/full.md

## References

79 references — full list in the complete paper: https://tomesphere.com/paper/PMC12937057/full.md

---
Source: https://tomesphere.com/paper/PMC12937057