Recognizing molecular patterns by machine learning: an agnostic structural definition of the hydrogen bond
Piero Gasparotto, Michele Ceriotti

TL;DR
This paper presents a machine learning-based, unbiased, and adaptive method to identify hydrogen bonds solely from structural data, providing a clear and general definition that can be applied to various molecular systems.
Contribution
It introduces a novel machine learning approach to define hydrogen bonds based on structural information, avoiding arbitrary criteria and enhancing pattern recognition in molecular analysis.
Findings
The method reliably identifies hydrogen bonds in atomistic simulations.
It offers a univocal and unbiased definition of hydrogen bonding.
The approach is adaptable to other structural pattern recognition tasks.
Abstract
The concept of chemical bonding can ultimately be seen as a rationalization of the recurring structural patterns observed in molecules and solids. Chemical intuition is nothing but the ability to recognize and predict such patterns, and how they transform into one another. Here we discuss how to use a computer to identify atomic patterns automatically, so as to provide an algorithmic definition of a bond based solely on structural information. We concentrate in particular on hydrogen bonding -- a central concept to our understanding of the physical chemistry of water, biological systems and many technologically important materials. Since the hydrogen bond is a somewhat fuzzy entity that covers a broad range of energies and distances, many different criteria have been proposed and used over the years, based either on sophisticate electronic structure calculations followed by an energy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
