Prediction of Hemolysis Tendency of Peptides using a Reliable Evaluation Method
Ali Raza, Hafiz Saud Arshad

TL;DR
This paper introduces a machine learning method for predicting the hemolytic tendency of peptides, utilizing a novel clustering-based train-test split to ensure reliable performance estimates on unseen data, aiding therapeutic peptide screening.
Contribution
The study presents a new clustering-based train-test splitting method and demonstrates its effectiveness in reliably predicting peptide hemolytic tendencies with high accuracy and AUC-ROC.
Findings
Achieved 97.79% accuracy and 0.9986 AUC-ROC on test set using traditional split.
Achieved 97.58% accuracy and 0.997 AUC-ROC with clustering-based split.
Recorded 79.5% accuracy on unseen data distribution.
Abstract
There are numerous peptides discovered through past decades, which exhibit antimicrobial and anti-cancerous tendencies. Due to these reasons, peptides are supposed to be sound therapeutic candidates. Some peptides can pose low metabolic stability, high toxicity and high hemolity of peptides. This highlights the importance for evaluating hemolytic tendencies and toxicity of peptides, before using them for therapeutics. Traditional methods for evaluation of toxicity of peptides can be time-consuming and costly. In this study, we have extracted peptides data (Hemo-DB) from Database of Antimicrobial Activity and Structure of Peptides (DBAASP) based on certain hemolity criteria and we present a machine learning based method for prediction of hemolytic tendencies of peptides (i.e. Hemolytic or Non-Hemolytic). Our model offers significant improvement on hemolity prediction benchmarks. we also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsvaccines and immunoinformatics approaches · Antimicrobial Peptides and Activities · Machine Learning in Bioinformatics
