EMOCPD: Efficient Attention-based Models for Computational Protein Design Using Amino Acid Microenvironment
Xiaoqi Ling, Cheng Cai, Demin Kong, Zhisheng Wei, Jing Wu, Lei Wang,, Zhaohong Deng

TL;DR
EMOCPD introduces an attention-based deep learning model that predicts amino acid categories from protein microenvironments, significantly improving accuracy and aiding in the design of more stable, expressive proteins.
Contribution
This paper presents EMOCPD, a novel attention-based neural network that effectively analyzes sparse protein microenvironments for improved computational protein design.
Findings
Achieves over 80% accuracy on training data
Surpasses existing methods by over 10% on test sets
Enhances protein stability and expression in designed mutants
Abstract
Computational protein design (CPD) refers to the use of computational methods to design proteins. Traditional methods relying on energy functions and heuristic algorithms for sequence design are inefficient and do not meet the demands of the big data era in biomolecules, with their accuracy limited by the energy functions and search algorithms. Existing deep learning methods are constrained by the learning capabilities of the networks, failing to extract effective information from sparse protein structures, which limits the accuracy of protein design. To address these shortcomings, we developed an Efficient attention-based Models for Computational Protein Design using amino acid microenvironment (EMOCPD). It aims to predict the category of each amino acid in a protein by analyzing the three-dimensional atomic environment surrounding the amino acids, and optimize the protein based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsViral Infectious Diseases and Gene Expression in Insects · RNA and protein synthesis mechanisms · Protein Structure and Dynamics
MethodsAttention Is All You Need · Softmax · Sparse Evolutionary Training · Focus · Linear Layer · Multi-Head Attention
