Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based Reinforcement Learning Model
Yaqin Li, Lingli Li, Yongjin Xu, and Yi Yu

TL;DR
This paper introduces a fast, protein sequence-based reinforcement learning model for de novo drug design that bypasses the need for 3D structures, enabling efficient discovery of bioactive compounds.
Contribution
The study presents a novel RL model utilizing only 1D protein sequences for drug design, demonstrating its effectiveness across multiple targets including those without experimental structures.
Findings
Generated compounds showed bioactivity validated by QSAR and docking.
Performance depends on training data source for the binding predictor.
Successfully designed molecules for kinase CDK20 using predicted structures.
Abstract
De novo molecular design has facilitated the exploration of large chemical space to accelerate drug discovery. Structure-based de novo method can overcome the data scarcity of active ligands by incorporating drug-target interaction into deep generative architectures. However, these strategies are bottlenecked by the small fraction of experimentally determined protein or complex structures. In addition, the cost of molecular generation is computationally expensive due to 3D representations of both molecule and protein. Here, we demonstrate a widely used and fast protein sequence-based reinforcement learning (RL) model for drug discovery. In the generative model, one of the reward components, a binding affinity predictor, is based on 1D protein sequence and molecular SMILES. As a proof of concept, the RL model was utilized to design molecules for four targets. The generated compounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Transgenic Plants and Applications · Viral Infectious Diseases and Gene Expression in Insects
MethodsAlphaFold
