A Standardized Benchmark for Multilabel Antimicrobial Peptide Classification
Sebastian Ojeda, Rafael Velasquez, Nicol\'as Aparicio, Juanita Puentes, Paula C\'ardenas, Nicol\'as Andrade, Gabriel Gonz\'alez, Sergio Rinc\'on, Carolina Mu\~noz-Camargo, Pablo Arbel\'aez

TL;DR
This paper introduces ESCAPE, a comprehensive benchmark dataset for multilabel antimicrobial peptide classification, and a transformer-based model that achieves state-of-the-art performance, facilitating AI-driven discovery of new antimicrobial agents.
Contribution
The paper presents ESCAPE, a large standardized dataset with functional annotations, and a novel transformer-based model for multilabel peptide activity prediction, advancing computational antimicrobial research.
Findings
ESCAPE dataset includes over 80,000 peptides with multilabel annotations.
The proposed model achieves up to 2.56% improvement in mean Average Precision.
State-of-the-art performance established for multilabel antimicrobial peptide classification.
Abstract
Antimicrobial peptides have emerged as promising molecules to combat antimicrobial resistance. However, fragmented datasets, inconsistent annotations, and the lack of standardized benchmarks hinder computational approaches and slow down the discovery of new candidates. To address these challenges, we present the Expanded Standardized Collection for Antimicrobial Peptide Evaluation (ESCAPE), an experimental framework integrating over 80.000 peptides from 27 validated repositories. Our dataset separates antimicrobial peptides from negative sequences and incorporates their functional annotations into a biologically coherent multilabel hierarchy, capturing activities across antibacterial, antifungal, antiviral, and antiparasitic classes. Building on ESCAPE, we propose a transformer-based model that leverages sequence and structural information to predict multiple functional activities of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAntimicrobial Peptides and Activities · vaccines and immunoinformatics approaches · Machine Learning in Bioinformatics
