Hybrid Synthetic Minority Over-sampling Technique (HSMOTE) and Ensemble Deep Dynamic Classifier Model (EDDCM) for big data analytics
Priyadharsini M, Bhawana Tyagi, Naga Priyadarsini R, Mohankumar B

TL;DR
This paper introduces a hybrid framework combining HSMOTE and EDDCM to improve big data classification by addressing class imbalance and high dimensionality.
Contribution
The novel HSMOTE and EDDCM framework integrates meta-heuristic optimization and deep learning for enhanced classification in imbalanced datasets.
Findings
HSMOTE improves minority class representation by interpolating closely located instances.
EDDCM combines DWCNN, DWBi-LSTM, and WAE with dynamic ensemble strategies for reliable predictions.
The framework outperforms conventional models in precision, recall, F-measure, and accuracy on imbalanced datasets.
Abstract
Big Data Classification (BDC) has become increasingly important across domains such as healthcare, e-commerce, and banking. However, challenges such as high dimensionality and class imbalance often degrade the performance of conventional machine learning (ML) models. This study proposes a hybrid framework that integrates meta-heuristic optimization with class imbalance handling to enhance BDC effectiveness. To address the class imbalance problem in both binary and multi-class datasets, a Hybrid Synthetic Minority Over-sampling Technique (HSMOTE) is introduced. HSMOTE generates synthetic minority samples by interpolating between closely located minority instances, improving the representation of rare classes. For robust feature selection, the Optimization Ensemble Feature Selection Model (OEFSM) is developed by combining the outputs of three algorithms: Fuzzy Weight Dragonfly Algorithm…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 10
Figure 11
Figure 12
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction · Artificial Intelligence in Healthcare
