On the Implicit Geometry of Cross-Entropy Parameterizations for   Label-Imbalanced Data

Tina Behnia; Ganesh Ramachandra Kini; Vala Vakilian; Christos; Thrampoulidis

arXiv:2303.07608·cs.LG·March 15, 2023·1 cites

On the Implicit Geometry of Cross-Entropy Parameterizations for Label-Imbalanced Data

Tina Behnia, Ganesh Ramachandra Kini, Vala Vakilian, Christos, Thrampoulidis

PDF

Open Access 1 Repo

TL;DR

This paper investigates how different cross-entropy parameterizations influence the geometry of classifiers in imbalanced data settings, extending implicit bias theory from linear to non-linear models, and provides formulas for classifier geometry.

Contribution

It characterizes the geometry of classifiers learned by various CE parameterizations in non-linear models and derives formulas linking geometry to class imbalance and hyperparameters.

Findings

01

Logit-adjusted parameterizations can be tuned to learn symmetric geometries regardless of imbalance.

02

Derived formulas relate classifier angles and norms to class imbalance and hyperparameters.

03

Experiments confirm the theoretical predictions on deep networks.

Abstract

Various logit-adjusted parameterizations of the cross-entropy (CE) loss have been proposed as alternatives to weighted CE for training large models on label-imbalanced data far beyond the zero train error regime. The driving force behind those designs has been the theory of implicit bias, which for linear(ized) models, explains why they successfully induce bias on the optimization path towards solutions that favor minorities. Aiming to extend this theory to non-linear models, we investigate the implicit geometry of classifiers and embeddings that are learned by different CE parameterizations. Our main result characterizes the global minimizers of a non-convex cost-sensitive SVM classifier for the unconstrained features model, which serves as an abstraction of deep nets. We derive closed-form formulas for the angles and norms of classifiers and embeddings as a function of the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

valavakilian/implicit_geometry
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Industrial Vision Systems and Defect Detection · Imbalanced Data Classification Techniques

MethodsSupport Vector Machine