Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes; Mehryar Mohri; Yutao Zhong

arXiv:2512.23947·cs.LG·January 1, 2026

Improved Balanced Classification with Theoretically Grounded Loss Functions

Corinna Cortes, Mehryar Mohri, Yutao Zhong

PDF

Open Access

TL;DR

This paper introduces two theoretically grounded surrogate loss functions, GLA and GCA, for improved multi-class classification under class imbalance, with strong consistency guarantees and empirical performance advantages.

Contribution

It develops and analyzes GLA and GCA loss families, providing the first comprehensive theoretical consistency analysis and demonstrating their empirical effectiveness in imbalanced classification.

Findings

01

GCA losses are more robust with better consistency bounds for imbalanced data.

02

Both GLA and GCA outperform standard class-weighted losses in experiments.

03

GLA performs slightly better overall, GCA excels in highly imbalanced scenarios.

Abstract

The balanced loss is a widely adopted objective for multi-class classification under class imbalance. By assigning equal importance to all classes, regardless of their frequency, it promotes fairness and ensures that minority classes are not overlooked. However, directly minimizing the balanced classification loss is typically intractable, which makes the design of effective surrogate losses a central question. This paper introduces and studies two advanced surrogate loss families: Generalized Logit-Adjusted (GLA) loss functions and Generalized Class-Aware weighted (GCA) losses. GLA losses generalize Logit-Adjusted losses, which shift logits based on class priors, to the broader general cross-entropy loss family. GCA loss functions extend the standard class-weighted losses, which scale losses inversely by class frequency, by incorporating class-dependent confidence margins and extending…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning