TL;DR
ProWRAS is a novel, classifier-independent oversampling method that improves minority class sample generation by integrating multiple schemes, outperforming existing algorithms across various classifiers and datasets.
Contribution
It introduces ProWRAS, a multi-schematic oversampling approach that is independent of classifiers, enhancing performance and reducing benchmarking efforts.
Findings
ProWRAS outperforms five state-of-the-art oversampling algorithms.
ProWRAS achieves statistically significant improvements in F1-score and Kappa-score.
ProWRAS's classifier independence is validated by the I-score measure.
Abstract
Over 85 oversampling algorithms, mostly extensions of the SMOTE algorithm, have been built over the past two decades, to solve the problem of imbalanced datasets. However, it has been evident from previous studies that different oversampling algorithms have different degrees of efficiency with different classifiers. With numerous algorithms available, it is difficult to decide on an oversampling algorithm for a chosen classifier. Here, we overcome this problem with a multi-schematic and classifier-independent oversampling approach: ProWRAS(Proximity Weighted Random Affine Shadowsampling). ProWRAS integrates the Localized Random Affine Shadowsampling (LoRAS)algorithm and the Proximity Weighted Synthetic oversampling (ProWSyn) algorithm. By controlling the variance of the synthetic samples, as well as a proximity-weighted clustering system of the minority classdata, the ProWRAS algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSynthetic Minority Over-sampling Technique.
