Improved Balanced Classification with Theoretically Grounded Loss Functions
Corinna Cortes, Mehryar Mohri, Yutao Zhong

TL;DR
This paper introduces two theoretically grounded surrogate loss functions, GLA and GCA, for improved multi-class classification under class imbalance, with strong consistency guarantees and empirical performance advantages.
Contribution
It develops and analyzes GLA and GCA loss families, providing the first comprehensive theoretical consistency analysis and demonstrating their empirical effectiveness in imbalanced classification.
Findings
GCA losses are more robust with better consistency bounds for imbalanced data.
Both GLA and GCA outperform standard class-weighted losses in experiments.
GLA performs slightly better overall, GCA excels in highly imbalanced scenarios.
Abstract
The balanced loss is a widely adopted objective for multi-class classification under class imbalance. By assigning equal importance to all classes, regardless of their frequency, it promotes fairness and ensures that minority classes are not overlooked. However, directly minimizing the balanced classification loss is typically intractable, which makes the design of effective surrogate losses a central question. This paper introduces and studies two advanced surrogate loss families: Generalized Logit-Adjusted (GLA) loss functions and Generalized Class-Aware weighted (GCA) losses. GLA losses generalize Logit-Adjusted losses, which shift logits based on class priors, to the broader general cross-entropy loss family. GCA loss functions extend the standard class-weighted losses, which scale losses inversely by class frequency, by incorporating class-dependent confidence margins and extending…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
