Minimax Generalized Cross-Entropy

Kartheek Bondugula; Santiago Mazuelas; Aritz Pérez; Anqi Liu

arXiv:2603.19874 · stat.ML · April 29, 2026

TL;DR

This paper introduces a convex minimax formulation of generalized cross-entropy (MGCE) for supervised classification, improving robustness, convergence speed, and calibration, especially with noisy labels.

Contribution

The paper proposes a novel convex minimax formulation of GCE, enabling efficient optimization and better performance than existing non-convex approaches.

Findings

01 MGCE achieves higher accuracy on benchmark datasets.

02 MGCE converges faster than traditional GCE.

03 MGCE provides better calibration, especially with label noise.

Abstract

Loss functions play a central role in supervised classification. Cross-entropy (CE) is widely used, whereas the mean absolute error (MAE) loss can offer robustness but is difficult to optimize. Generalized cross-entropy (GCE), which interpolates between the CE and MAE losses, has recently been introduced to provide a trade-off between optimization difficulty and robustness. Existing formulations of GCE result in a non-convex optimization over classification margins that is prone to underfitting, leading to poor performance on complex datasets. In this paper, we propose a minimax formulation of generalized cross-entropy (MGCE) that results in a convex optimization over classification margins. Moreover, we show that MGCEs can provide an upper bound on the classification error. The proposed bilevel convex optimization can be efficiently implemented using stochastic gradient computed via…
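To make the interpolation between CE and MAE concrete, here is a minimal sketch of the standard GCE loss in its commonly used form, (1 - p_y^q) / q, where p_y is the predicted probability of the true class. This is the existing GCE that the abstract refers to, not the paper's minimax formulation (MGCE), which is not specified on this page. The function names, the parameter name `q`, and the example probabilities are assumptions chosen for illustration.

```python
import numpy as np

def _true_class_probs(probs, labels):
    """Pick the predicted probability of each sample's true class."""
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    return probs[np.arange(len(labels)), labels]

def gce_loss(probs, labels, q):
    """Standard GCE loss (1 - p_y^q) / q, averaged over samples, for q in (0, 1]."""
    p_true = _true_class_probs(probs, labels)
    return float(np.mean((1.0 - p_true ** q) / q))

def ce_loss(probs, labels):
    """Cross-entropy loss -log(p_y), averaged over samples."""
    p_true = _true_class_probs(probs, labels)
    return float(np.mean(-np.log(p_true)))

def mae_loss(probs, labels):
    """L1 distance between the one-hot label and the predicted distribution."""
    probs = np.asarray(probs, dtype=float)
    one_hot = np.eye(probs.shape[1])[np.asarray(labels)]
    return float(np.mean(np.abs(one_hot - probs).sum(axis=1)))

# Illustrative (made-up) predictions for two samples over three classes.
probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.6, 0.3]])
labels = np.array([0, 1])

for q in (1e-6, 0.5, 1.0):
    print(f"q={q}: GCE={gce_loss(probs, labels, q):.4f}")
print(f"CE={ce_loss(probs, labels):.4f}, MAE/2={0.5 * mae_loss(probs, labels):.4f}")
```

As q approaches 0 the GCE value approaches the CE value, and at q = 1 it equals half the MAE value, since the L1 distance to a one-hot label is 2(1 - p_y). Intermediate values of q trade the easier optimization of CE against the robustness of MAE.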
