Transfer learning on protein language models improves antimicrobial peptide classification
Elias Georgoulis, Michaela Areti Zervou, Yannis Pantazis

TL;DR
This paper shows that using large pre-trained protein models improves the classification of antimicrobial peptides, even with limited labeled data.
Contribution
The study demonstrates that transfer learning with protein language models achieves state-of-the-art AMP classification performance with minimal effort.
Findings
Model scale significantly improves AMP classification performance.
PLM embeddings with shallow classifiers achieve state-of-the-art results.
Fine-tuning PLM parameters further enhances classification accuracy.
Abstract
Antimicrobial peptides (AMPs) are essential components of the innate immune system in humans and other organisms, exhibiting potent activity against a broad spectrum of pathogens. Their potential therapeutic applications, particularly in combating antibiotic resistance, have rendered AMP classification a vital task in computational biology. However, the scarcity of labeled AMP sequences, coupled with the diversity and complexity of AMPs, poses significant challenges for the training of standalone AMP classifiers. Self-supervised learning has emerged as a powerful paradigm in addressing such challenges across various fields, leading to the development of Protein Language Models (PLMs). These models leverage vast amounts of unlabeled protein sequences to learn biologically relevant features, providing transferable protein sequence representations (embeddings), that can be fine-tuned for…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAntimicrobial Peptides and Activities · Machine Learning in Bioinformatics · Biochemical and Structural Characterization
