ATwo-Stage Ensemble Feature Selection and Particle Swarm Optimization Approach for Micro-Array Data Classification in Distributed Computing Environments
Aayush Adhikari, Sandesh Bhatta, Harendra S. Jangwan, Amit Mishra, Khair Ul Nisa, Abu Taha Zamani, Aaron Sapkota, Debendra Muduli, Nikhat Parveen

TL;DR
This paper introduces a hybrid ensemble feature selection method combined with Particle Swarm Optimization and a majority voting classifier to improve microarray data classification accuracy in distributed computing environments.
Contribution
It proposes a novel two-stage ensemble feature selection approach with PSO and a majority voting classifier, enhancing microarray classification performance in local and cloud settings.
Findings
Achieved up to 99.58% accuracy on microarray datasets.
Demonstrated improved performance in cloud environments with different vCPU configurations.
Validated effectiveness across multiple datasets and computing environments.
Abstract
High dimensionality in datasets produced by microarray technology presents a challenge for Machine Learning (ML) algorithms, particularly in terms of dimensionality reduction and handling imbalanced sample sizes. To mitigate the explained problems, we have proposedhybrid ensemble feature selection techniques with majority voting classifier for micro array classi f ication. Here we have considered both filter and wrapper-based feature selection techniques including Mutual Information (MI), Chi-Square, Variance Threshold (VT), Least Absolute Shrinkage and Selection Operator (LASSO), Analysis of Variance (ANOVA), and Recursive Feature Elimination (RFE), followed by Particle Swarm Optimization (PSO) for selecting the optimal features. This Artificial Intelligence (AI) approach leverages a Majority Voting Classifier that combines multiple machine learning models, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
