ReactXT: Understanding Molecular "Reaction-ship" via Reaction-Contextualized Molecule-Text Pretraining
Zhiyuan Liu, Yaorui Shi, An Zhang, Sihang Li, Enzhi Zhang, Xiang Wang,, Kenji Kawaguchi, Tat-Seng Chua

TL;DR
ReactXT is a novel pretraining approach for reaction-text modeling that leverages reaction and molecule contexts, supported by a new dataset, to enhance chemical synthesis automation and understanding.
Contribution
The paper introduces ReactXT, a new pretraining method with three context-based tasks and a dataset, OpenExp, for improving reaction-text understanding and experimental procedure prediction.
Findings
ReactXT improves experimental procedure prediction accuracy.
ReactXT enhances molecule captioning performance.
ReactXT achieves competitive results in retrosynthesis tasks.
Abstract
Molecule-text modeling, which aims to facilitate molecule-relevant tasks with a textual interface and textual knowledge, is an emerging research direction. Beyond single molecules, studying reaction-text modeling holds promise for helping the synthesis of new materials and drugs. However, previous works mostly neglect reaction-text modeling: they primarily focus on modeling individual molecule-text pairs or learning chemical reactions without texts in context. Additionally, one key task of reaction-text modeling -- experimental procedure prediction -- is less explored due to the absence of an open-source dataset. The task is to predict step-by-step actions of conducting chemical experiments and is crucial to automating chemical synthesis. To resolve the challenges above, we propose a new pretraining method, ReactXT, for reaction-text modeling, and a new dataset, OpenExp, for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · History and advancements in chemistry
MethodsFocus
