AutoBalance: Optimized Loss Functions for Imbalanced Data

Mingchen Li; Xuechen Zhang; Christos Thrampoulidis; Jiasi Chen; Samet; Oymak

arXiv:2201.01212·cs.LG·January 5, 2022·27 cites

AutoBalance: Optimized Loss Functions for Imbalanced Data

Mingchen Li, Xuechen Zhang, Christos Thrampoulidis, Jiasi Chen, Samet, Oymak

PDF

Open Access 1 Repo 1 Video

TL;DR

AutoBalance is a bi-level optimization framework that automatically designs loss functions to improve accuracy and fairness in imbalanced and group-sensitive classification tasks, addressing overfitting and fairness concerns in deep learning.

Contribution

It introduces a novel bi-level optimization approach for automatic loss function design tailored for imbalanced and sensitive data scenarios.

Findings

01

AutoBalance outperforms state-of-the-art methods in imbalanced classification.

02

The approach improves fairness and accuracy simultaneously.

03

Theoretical insights support the benefits of the designed loss functions.

Abstract

Imbalanced datasets are commonplace in modern machine learning problems. The presence of under-represented classes or groups with sensitive attributes results in concerns about generalization and fairness. Such concerns are further exacerbated by the fact that large capacity deep nets can perfectly fit the training data and appear to achieve perfect accuracy and fairness during training, but perform poorly during test. To address these challenges, we propose AutoBalance, a bi-level optimization framework that automatically designs a training loss function to optimize a blend of accuracy and fairness-seeking objectives. Specifically, a lower-level problem trains the model weights, and an upper-level problem tunes the loss function by monitoring and optimizing the desired objective over the validation data. Our loss design enables personalized treatment for classes/groups by employing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucr-optml/autobalance
pytorchOfficial

Videos

AutoBalance: Optimized Loss Functions for Imbalanced Data· slideslive

Taxonomy

TopicsImbalanced Data Classification Techniques · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)