Consistent Polyhedral Surrogates for Top-$k$ Classification and Variants
Jessie Finocchiaro, Rafael Frongillo, Emma Goodwill, Anish Thilagar

TL;DR
This paper introduces new convex polyhedral surrogates for top-$k$ classification, providing the first consistent surrogate and analyzing conditions for consistency, addressing limitations of previous hinge-like methods.
Contribution
It proposes the first consistent convex polyhedral surrogate for top-$k$ classification and analyzes conditions for surrogate consistency using an embedding framework.
Findings
Derived constraints for surrogate consistency under label distributions
Introduced the first consistent polyhedral surrogate for top-$k$
Analyzed the inconsistency of previous hinge-like surrogates
Abstract
Top- classification is a generalization of multiclass classification used widely in information retrieval, image classification, and other extreme classification settings. Several hinge-like (piecewise-linear) surrogates have been proposed for the problem, yet all are either non-convex or inconsistent. For the proposed hinge-like surrogates that are convex (i.e., polyhedral), we apply the recent embedding framework of Finocchiaro et al. (2019; 2022) to determine the prediction problem for which the surrogate is consistent. These problems can all be interpreted as variants of top- classification, which may be better aligned with some applications. We leverage this analysis to derive constraints on the conditional label distributions under which these proposed surrogates become consistent for top-. It has been further suggested that every convex hinge-like surrogate must be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Imbalanced Data Classification Techniques
