Bridging the Gap between Chemical Reaction Pretraining and Conditional Molecule Generation with a Unified Model
Bo Qiang, Yiran Zhou, Yuheng Ding, Ningfeng Liu, Song Song, Liangren, Zhang, Bo Huang, Zhenming Liu

TL;DR
This paper introduces a unified deep-learning framework that combines reaction representation learning and molecule generation, leveraging chemical knowledge and inductive biases to produce high-quality, synthesizable drug-like molecules, advancing reaction-based applications.
Contribution
The paper proposes a novel pretraining framework inspired by organic chemistry mechanisms that unifies reaction understanding and molecule generation tasks, achieving state-of-the-art results.
Findings
Achieves state-of-the-art performance on downstream tasks
Generates high-quality, synthesizable drug-like molecules
Overcomes limitations of template-based molecule generation
Abstract
Chemical reactions are the fundamental building blocks of drug design and organic chemistry research. In recent years, there has been a growing need for a large-scale deep-learning framework that can efficiently capture the basic rules of chemical reactions. In this paper, we have proposed a unified framework that addresses both the reaction representation learning and molecule generation tasks, which allows for a more holistic approach. Inspired by the organic chemistry mechanism, we develop a novel pretraining framework that enables us to incorporate inductive biases into the model. Our framework achieves state-of-the-art results on challenging downstream tasks. By possessing chemical knowledge, our generative framework overcome the limitations of current molecule generation models that rely on a small number of reaction templates. In the extensive experiments, our model generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis
