Determining the best classifier for predicting the value of a boolean field on a blood donor database using genetic algorithms
Ritabrata Maiti

TL;DR
This study compares various machine learning classifiers, including genetic algorithm-optimized pipelines, to predict boolean fields in blood donor data, aiming to identify the most effective approach.
Contribution
It introduces the use of genetic algorithms via TPOT to optimize classifier pipelines for blood donor data prediction tasks.
Findings
Genetic algorithm-optimized pipeline outperforms standard classifiers
Support Vector Machine and Decision Tree show high accuracy
TPOT effectively automates pipeline selection and tuning
Abstract
Motivation: Thanks to digitization, we often have access to large databases, consisting of various fields of information, ranging from numbers to texts and even boolean values. Such databases lend themselves especially well to machine learning, classification and big data analysis tasks. We are able to train classifiers, using already existing data and use them for predicting the values of a certain field, given that we have information regarding the other fields. Most specifically, in this study, we look at the Electronic Health Records (EHRs) that are compiled by hospitals. These EHRs are convenient means of accessing data of individual patients, but there processing as a whole still remains a task. However, EHRs that are composed of coherent, well-tabulated structures lend themselves quite well to the application to machine language, via the usage of classifiers. In this study, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Blood donation and transfusion practices · Data Stream Mining Techniques
