Wasserstein Distributionally Robust Multiclass Support Vector Machine
Michael Ibrahim, Heraldo Rozas, and Nagi Gebraeel

TL;DR
This paper introduces a Wasserstein distributionally robust multiclass SVM that effectively handles uncertain and imbalanced data, providing a convex reformulation and demonstrating superior performance over existing models.
Contribution
The paper develops a novel Wasserstein distributionally robust multiclass SVM with a tractable convex reformulation and a kernel extension, addressing label uncertainty and class imbalance.
Findings
Outperforms state-of-the-art OVA models on imbalanced datasets
Provides a convex reformulation of the worst-case risk problem
Demonstrates effectiveness of the kernel version and scalability with a subgradient method
Abstract
We study the problem of multiclass classification for settings where data features and their labels are uncertain. We identify that distributionally robust one-vs-all (OVA) classifiers often struggle in settings with imbalanced data. To address this issue, we use Wasserstein distributionally robust optimization to develop a robust version of the multiclass support vector machine (SVM) characterized by the Crammer-Singer (CS) loss. First, we prove that the CS loss is bounded from above by a Lipschitz continuous function for all and , then we exploit strong duality results to express the dual of the worst-case risk problem, and we show that the worst-case risk minimization problem admits a tractable convex reformulation due to the regularity of the CS loss. Moreover, we develop a kernel version of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and ELM
