Computational Protein Science in the Era of Large Language Models (LLMs)

Wenqi Fan; Yi Zhou; Shijie Wang; Yuyao Yan; Hui Liu; Qian Zhao; Le; Song; and Qing Li

arXiv:2501.10282·cs.CE·January 28, 2025·6 cites

Computational Protein Science in the Era of Large Language Models (LLMs)

Wenqi Fan, Yi Zhou, Shijie Wang, Yuyao Yan, Hui Liu, Qian Zhao, Le, Song, and Qing Li

PDF

Open Access

TL;DR

This paper reviews how large language models have revolutionized computational protein science by enabling better understanding and prediction of protein structures, functions, and design, highlighting recent advances and future prospects.

Contribution

It provides a systematic overview of protein language models, categorizes existing models, and discusses their applications and future directions in the field.

Findings

01

pLMs improve protein structure prediction

02

pLMs enhance protein function annotation

03

pLMs facilitate protein design and drug discovery

Abstract

Considering the significance of proteins, computational protein science has always been a critical scientific field, dedicated to revealing knowledge and developing applications within the protein sequence-structure-function paradigm. In the last few decades, Artificial Intelligence (AI) has made significant impacts in computational protein science, leading to notable successes in specific protein modeling tasks. However, those previous AI models still meet limitations, such as the difficulty in comprehending the semantics of protein sequences, and the inability to generalize across a wide range of protein modeling tasks. Recently, LLMs have emerged as a milestone in AI due to their unprecedented language processing & generalization capability. They can promote comprehensive progress in fields rather than solving individual tasks. As a result, researchers have actively introduced LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenetics, Bioinformatics, and Biomedical Research