Designing novel protein structures using sequence generator and AlphaFold2
Xeerak Agha, Nihang Fu, Jianjun Hu

TL;DR
This paper introduces a new protein design pipeline combining a sequence generator and structure prediction, enabling the creation of novel proteins with desired functions and structures more efficiently.
Contribution
The study develops a novel pipeline integrating ProteinSolver and AlphaFold2 for efficient de novo protein design, expanding the diversity of possible protein structures.
Findings
Generated 40 de novo binding sites with high precision
30 of the designed proteins are novel structures
The pipeline reduces exploration space for protein conformations
Abstract
Protein structures and functions are determined by a contiguous arrangement of amino acid sequences. Designing novel protein sequences and structures with desired geometry and functions is a complex task with large state spaces. Here we develop a novel protein design pipeline consisting of two deep learning algorithms, ProteinSolver and AlphaFold2. ProteinSolver is a deep graph neural network that generates amino acid sequences such that the forces between interacting amino acids are favorable and compatible with the fold while AlphaFold2 is a deep learning algorithm that predicts the protein structures from protein sequences. We present forty de novo designed binding sites of the PTP1B and P53 proteins with high precision, out of which thirty proteins are novel. Using ProteinSolver and AlphaFold2 in conjunction, we can trim the exploration of the large protein conformation space, thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
