A Skew-Sensitive Evaluation Framework for Imbalanced Data Classification

Min Du; Nesime Tatbul; Brian Rivers; Akhilesh Kumar Gupta; Lucas Hu,; Wei Wang; Ryan Marcus; Shengtian Zhou; Insup Lee; Justin Gottschlich

arXiv:2010.05995·cs.LG·November 20, 2023·5 cites

A Skew-Sensitive Evaluation Framework for Imbalanced Data Classification

Min Du, Nesime Tatbul, Brian Rivers, Akhilesh Kumar Gupta, Lucas Hu,, Wei Wang, Ryan Marcus, Shengtian Zhou, Insup Lee, Justin Gottschlich

PDF

Open Access 1 Repo

TL;DR

This paper introduces a flexible evaluation framework for imbalanced data classification that accounts for class importance and skew, improving classifier assessment and training.

Contribution

It proposes a novel, general-purpose evaluation framework that is sensitive to class importance and skew, enhancing classifier evaluation and training.

Findings

01

Effective in evaluating and ranking classifiers

02

Improves classifier training on imbalanced data

03

Validated on real-world datasets from multiple domains

Abstract

Class distribution skews in imbalanced datasets may lead to models with prediction bias towards majority classes, making fair assessment of classifiers a challenging task. Metrics such as Balanced Accuracy are commonly used to evaluate a classifier's prediction performance under such scenarios. However, these metrics fall short when classes vary in importance. In this paper, we propose a simple and general-purpose evaluation framework for imbalanced data classification that is sensitive to arbitrary skews in class cardinalities and importances. Experiments with several state-of-the-art classifiers tested on real-world datasets from three different domains show the effectiveness of our framework - not only in evaluating and ranking classifiers, but also training them.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

2023-07-03/weighted-balanced-accuracy
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Artificial Intelligence in Healthcare · Electricity Theft Detection Techniques