Functional Geometry Guided Protein Sequence and Backbone Structure Co-Design
Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Yang Yang, Lei Li

TL;DR
This paper introduces NAEPro, a novel model that jointly designs protein sequences and structures by capturing global and local correlations, leading to highly accurate and functional protein generation.
Contribution
NAEPro is a new model that integrates attention and equivariant layers for effective protein sequence and structure co-design based on functional sites.
Findings
NAEPro achieves the highest amino acid recovery rate.
NAEPro attains the best TM-score and lowest RMSD among baselines.
Generated proteins effectively bind to target metallocofactors.
Abstract
Proteins are macromolecules responsible for essential functions in almost all living organisms. Designing reasonable proteins with desired functions is crucial. A protein's sequence and structure are strongly correlated and they together determine its function. In this paper, we propose NAEPro, a model to jointly design Protein sequence and structure based on automatically detected functional sites. NAEPro is powered by an interleaving network of attention and equivariant layers, which can capture global correlation in a whole sequence and local influence from nearest amino acids in three dimensional (3D) space. Such an architecture facilitates effective yet economic message passing at two levels. We evaluate our model and several strong baselines on two protein datasets, -lactamase and myoglobin. Experimental results show that our model consistently achieves the highest amino…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Protein Structure and Dynamics · RNA and protein synthesis mechanisms
