Accurate and efficient structure elucidation from routine one-dimensional NMR spectra using multitask machine learning
Frank Hu, Michael S. Chen, Grant M. Rotskoff, Matthew W. Kanan, Thomas, E. Markland

TL;DR
This paper presents a multitask machine learning framework combining transformer and CNN architectures to accurately predict molecular structures from 1D NMR spectra, significantly reducing the search space and outperforming traditional methods.
Contribution
The authors develop an end-to-end deep learning model that predicts molecular formulas and connectivity directly from spectra without prior chemical knowledge.
Findings
Predicts exact molecules 69.6% of the time within top 15 guesses.
Reduces the search space by up to 11 orders of magnitude.
Effective on molecules with up to 19 heavy atoms.
Abstract
Rapid determination of molecular structures can greatly accelerate workflows across many chemical disciplines. However, elucidating structure using only one-dimensional (1D) NMR spectra, the most readily accessible data, remains an extremely challenging problem because of the combinatorial explosion of the number of possible molecules as the number of constituent atoms is increased. Here, we introduce a multitask machine learning framework that predicts the molecular structure (formula and connectivity) of an unknown compound solely based on its 1D 1H and/or 13C NMR spectra. First, we show how a transformer architecture can be constructed to efficiently solve the task, traditionally performed by chemists, of assembling large numbers of molecular fragments into molecular structures. Integrating this capability with a convolutional neural network (CNN), we build an end-to-end model for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular spectroscopy and chirality
