PlasmoFAB: A Benchmark to Foster Machine Learning for Plasmodium falciparum Protein Antigen Candidate Prediction
Jonas Christian Ditz, Jacqueline Wistuba-Hamprecht, Timo Maier, and Rolf Fendel, Nico Pfeifer, Bernhard Reuter

TL;DR
PlasmoFAB is a curated benchmark dataset designed to train machine learning models for predicting Plasmodium falciparum protein antigen candidates, aiding malaria vaccine development.
Contribution
This work introduces PlasmoFAB, the first high-quality, curated dataset for Plasmodium falciparum antigen prediction, and demonstrates its effectiveness over existing general-purpose prediction tools.
Findings
Models trained on PlasmoFAB outperform general prediction services.
Available prediction services perform inadequately on Plasmodium falciparum antigen prediction.
PlasmoFAB is publicly accessible for further research and development.
Abstract
Motivation: Machine learning methods can be used to support scientific discovery in healthcare-related research fields. However, these methods can only be reliably used if they can be trained on high-quality and curated datasets. Currently, no such dataset for the exploration of Plasmodium falciparum protein antigen candidates exists. The parasite Plasmodium falciparum causes the infectious disease malaria. Thus, identifying potential antigens is of utmost importance for the development of antimalarial drugs and vaccines. Since exploring antigen candidates experimentally is an expensive and time-consuming process, applying machine learning methods to support this process has the potential to accelerate the development of drugs and vaccines, which are needed for fighting and controlling malaria. Results: We developed PlasmoFAB, a curated benchmark that can be used to train machine…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · vaccines and immunoinformatics approaches · Computational Drug Discovery Methods
