Parallel bi-objective evolutionary algorithms for scalable feature subset selection via migration strategy under Spark
Yelleti Vivek, Vadlamani Ravi, P. Radha Krishna

TL;DR
This paper introduces a scalable, parallel framework for bi-objective feature subset selection using evolutionary algorithms under Spark, demonstrating improved efficiency and effectiveness on high-dimensional data.
Contribution
It develops a novel MapReduce-based parallel framework for bi-objective EAs with migration strategy, integrating non-dominated sorting and decomposition methods under Spark.
Findings
P-NSGA-II outperforms other algorithms on most datasets.
The proposed framework achieves significant speedup and scalability.
Empirical results show improved AUC and diversity in feature selection.
Abstract
Feature subset selection (FSS) for classification is inherently a bi-objective optimization problem, where the task is to obtain a feature subset which yields the maximum possible area under the receiver operator characteristic curve (AUC) with minimum cardinality of the feature subset. In todays world, a humungous amount of data is generated in all activities of humans. To mine such voluminous data, which is often high-dimensional, there is a need to develop parallel and scalable frameworks. In the first-of-its-kind study, we propose and develop an iterative MapReduce-based framework for bi-objective evolutionary algorithms (EAs) based wrappers under Apache spark with the migration strategy. In order to accomplish this, we parallelized the non-dominated sorting based algorithms namely non dominated sorting algorithm (NSGA-II), and non-dominated sorting particle swarm optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Metaheuristic Optimization Algorithms Research · Face and Expression Recognition
