Pro-PRIME: A general Temperature-Guided Language model to engineer enhanced Stability and Activity in Proteins
Fan Jiang, Mingchen Li, Jiajun Dong, Yuanxi Yu, Xinyu Sun, Banghao Wu,, Jin Huang, Liqi Kang, Yufeng Pei, Liang Zhang, Shaojie Wang, Wenxue Xu,, Jingyao Xin, Wanli Ouyang, Guisheng Fan, Lirong Zheng, Yang Tan, Zhiqiang Hu,, Yi Xiong, Yan Feng, Guangyu Yang, Qian Liu, Jie Song

TL;DR
Pro-PRIME is a temperature-aware deep learning model that predicts protein mutants with enhanced stability and activity, outperforming existing models and validated across multiple proteins and properties, advancing protein engineering capabilities.
Contribution
Introduces PRIME, a novel temperature-guided language model capable of predicting improved protein mutants without prior experimental data, demonstrating broad applicability.
Findings
PRIME outperforms current state-of-the-art models on mutagenesis datasets.
Over 30% of AI-recommended mutants show improved properties.
PRIME enables rapid design of multi-site mutants with enhanced stability and activity.
Abstract
Designing protein mutants of both high stability and activity is a critical yet challenging task in protein engineering. Here, we introduce PRIME, a deep learning model, which can suggest protein mutants of improved stability and activity without any prior experimental mutagenesis data of the specified protein. Leveraging temperature-aware language modeling, PRIME demonstrated superior predictive power compared to current state-of-the-art models on the public mutagenesis dataset over 283 protein assays. Furthermore, we validated PRIME's predictions on five proteins, examining the top 30-45 single-site mutations' impact on various protein properties, including thermal stability, antigen-antibody binding affinity, and the ability to polymerize non-natural nucleic acid or resilience to extreme alkaline conditions. Remarkably, over 30% of the AI-recommended mutants exhibited superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Protein Structure and Dynamics · Machine Learning in Bioinformatics
