Joint Design of Protein Sequence and Structure based on Motifs
Zhenqiao Song, Yunlong Zhao, Yufei Song, Wenxian Shi, Yang Yang, Lei, Li

TL;DR
GeoPro is a novel method for jointly designing protein sequences and structures, leveraging 3D geometry to produce diverse, functional proteins that outperform existing methods and discover new biologically stable proteins.
Contribution
It introduces GeoPro, a joint design approach using equivariant encoding and geometry-guided decoding for protein backbone and sequence, enabling the discovery of novel functional proteins.
Findings
Outperforms strong baselines on metalloprotein datasets.
Discovers novel stable β-lactamases and myoglobins.
Produced proteins with natural-like folding and active sites.
Abstract
Designing novel proteins with desired functions is crucial in biology and chemistry. However, most existing work focus on protein sequence design, leaving protein sequence and structure co-design underexplored. In this paper, we propose GeoPro, a method to design protein backbone structure and sequence jointly. Our motivation is that protein sequence and its backbone structure constrain each other, and thus joint design of both can not only avoid nonfolding and misfolding but also produce more diverse candidates with desired functions. To this end, GeoPro is powered by an equivariant encoder for three-dimensional (3D) backbone structure and a protein sequence decoder guided by 3D geometry. Experimental results on two biologically significant metalloprotein datasets, including -lactamases and myoglobins, show that our proposed GeoPro outperforms several strong baselines on most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · Genomics and Phylogenetic Studies
MethodsFocus
