Learning Classifiers for Imbalanced and Overlapping Data
Shivaditya Shivganesh, Nitin Narayanan N, Pranav Murali, Ajaykumar M

TL;DR
This paper investigates methods for improving classifiers on imbalanced and overlapping data, introducing a new sparsity-based approach and comparing various resampling techniques on artificial datasets.
Contribution
It presents a novel sparsity method to enhance classifier performance and evaluates multiple resampling strategies on artificially generated imbalanced datasets.
Findings
Sparsity improves classifier accuracy on imbalanced data.
Resampling methods like over-sampling and NCR under-sampling show varying effectiveness.
Artificial datasets help analyze data characteristics affecting classification.
Abstract
This study is about inducing classifiers using data that is imbalanced, with a minority class being under-represented in relation to the majority classes. The first section of this research focuses on the main characteristics of data that generate this problem. Following a study of previous, relevant research, a variety of artificial, imbalanced data sets influenced by important elements were created. These data sets were used to create decision trees and rule-based classifiers. The second section of this research looks into how to improve classifiers by pre-processing data with resampling approaches. The results of the following trials are compared to the performance of distinct pre-processing re-sampling methods: two variants of random over-sampling and focused under-sampling NCR. This paper further optimises class imbalance with a new method called Sparsity. The data is made more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction · Electricity Theft Detection Techniques
