Box Drawings for Learning with Imbalanced Data
Siong Thye Goh, Cynthia Rudin

TL;DR
This paper introduces two interpretable machine learning algorithms designed for highly imbalanced classification tasks, utilizing box-based classifiers optimized via mixed integer programming and scalable approximation methods.
Contribution
It presents novel box-based classifiers tailored for imbalanced data, combining interpretability with optimization and scalable approximation techniques.
Findings
Both methods effectively handle imbalanced data with interpretable models.
The mixed integer programming approach balances class accuracies.
The scalable method enables parallelization and focuses on relevant feature regions.
Abstract
The vast majority of real world classification problems are imbalanced, meaning there are far fewer data from the class of interest (the positive class) than from other classes. We propose two machine learning algorithms to handle highly imbalanced classification problems. The classifiers constructed by both methods are created as unions of parallel axis rectangles around the positive examples, and thus have the benefit of being interpretable. The first algorithm uses mixed integer programming to optimize a weighted balance between positive and negative class accuracies. Regularization is introduced to improve generalization performance. The second method uses an approximation in order to assist with scalability. Specifically, it follows a \textit{characterize then discriminate} approach, where the positive class is characterized first by boxes, and then each box boundary becomes a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Machine Learning and Data Classification · Electricity Theft Detection Techniques
