Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design
Kehan Wu, Yingce Xia, Yang Fan, Pan Deng, Haiguang Liu, Lijun Wu,, Shufang Xie, Tong Wang, Tao Qin, Tie-Yan Liu

TL;DR
This paper introduces TamGent, a Transformer-based generative model that designs novel drug molecules tailored to specific protein pockets, improving binding affinity and drugability over existing methods.
Contribution
The paper presents TamGent, a novel Transformer-based framework for structure-based drug design that directly generates candidate molecules from scratch for specific protein targets.
Findings
TamGent outperforms previous methods in effectiveness and efficiency.
Generated molecules show improved binding affinity and drugability.
Successfully rediscovered known drugs and generated novel candidates for SARS-CoV-2 and KRAS G12C.
Abstract
Structure-based drug design is drawing growing attentions in computer-aided drug discovery. Compared with the virtual screening approach where a pre-defined library of compounds are computationally screened, de novo drug design based on the structure of a target protein can provide novel drug candidates. In this paper, we present a generative solution named TamGent (Target-aware molecule generator with Transformer) that can directly generate candidate drugs from scratch for a given target, overcoming the limits imposed by existing compound libraries. Following the Transformer framework (a state-of-the-art framework in deep learning), we design a variant of Transformer encoder to process 3D geometric information of targets and pre-train the Transformer decoder on 10 million compounds from PubChem for candidate drug generation. Systematical evaluation on candidate compounds generated for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Materials Science
MethodsLib · Multi-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Softmax · Absolute Position Encodings · Dropout
