A Novel Machine Learning Classifier Based on Genetic Algorithms and Data Importance Reformatting
A. K. Alkhayyata, N. M. Hewahi

TL;DR
This paper introduces GADIC, a new classification approach combining data importance reformatting and genetic algorithms to enhance the performance of existing machine learning classifiers across diverse datasets.
Contribution
GADIC is a novel method that integrates data reformatting, genetic algorithm tuning, and instance averaging to improve classifier accuracy.
Findings
GADIC significantly improved classifier performance on multiple datasets.
KNN with GADIC showed the highest accuracy gain.
Maximum average improvement was 16.79%.
Abstract
In this paper, a novel classification algorithm that is based on Data Importance (DI) reformatting and Genetic Algorithms (GA) named GADIC is proposed to overcome the issues related to the nature of data which may hinder the performance of the Machine Learning (ML) classifiers. GADIC comprises three phases which are data reformatting phase which depends on DI concept, training phase where GA is applied on the reformatted training dataset, and testing phase where the instances of the reformatted testing dataset are being averaged based on similar instances in the training dataset. GADIC is an approach that utilizes the exiting ML classifiers with involvement of data reformatting, using GA to tune the inputs, and averaging the similar instances to the unknown instance. The averaging of the instances becomes the unknown instance to be classified in the stage of testing. GADIC has been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Advanced Algorithms and Applications · Advanced Sensor and Control Systems
MethodsGenetic Algorithms · Logistic Regression · Support Vector Machine
