Smoothly Giving up: Robustness for Simple Models

Tyler Sypherd; Nathan Stromberg; Richard Nock; Visar Berisha; Lalitha Sankar

arXiv:2302.09114 · cs.LG · February 21, 2023

Open Access

TL;DR

This paper uses the margin-based α-loss, which tunes continuously between canonical convex losses and quasi-convex losses, to make simple models such as logistic regression and boosting more robust to label noise, with results reported on the Long-Servedio and COVID-19 survey datasets.

Contribution

It applies the margin-based α-loss, whose hyperparameter α smoothly introduces non-convexity, so that simple models can "give up" on noisy training examples and become more robust to label noise.

Findings

01. Enhanced robustness of logistic regression and boosting to label noise.

02. Effective performance on COVID-19 survey and Long-Servedio datasets.

03. Smooth transition between convex and non-convex losses improves training outcomes.

Abstract

There is a growing need for models that are interpretable and have reduced energy and computational cost (e.g., in health care analytics and federated learning). Examples of algorithms to train such models include logistic regression and boosting. However, one challenge facing these algorithms is that they provably suffer from label noise; this has been attributed to the joint interaction between oft-used convex loss functions and simpler hypothesis classes, resulting in too much emphasis being placed on outliers. In this work, we use the margin-based $\alpha$-loss, which continuously tunes between canonical convex and quasi-convex losses, to robustly train simple models. We show that the $\alpha$ hyperparameter smoothly introduces non-convexity and offers the benefit of "giving up" on noisy training examples. We also provide results on the Long-Servedio dataset for boosting and a…
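The abstract does not spell out the loss itself, so the following is a minimal sketch that assumes the form of the margin-based α-loss commonly given in the α-loss literature: for a margin z and sigmoid σ, the loss is α/(α−1) · (1 − σ(z)^(1−1/α)), with the logistic loss as the α = 1 limit and the sigmoid loss as the α = ∞ limit. The function names `sigmoid` and `alpha_loss` are illustrative, not taken from the paper. The sketch is meant to show how, for α > 1, the penalty on a badly misclassified example stays bounded, which is one way to read "giving up" on noisy examples.

```python
import math


def sigmoid(z):
    """Numerically stable logistic sigmoid."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def alpha_loss(margin, alpha):
    """Margin-based alpha-loss (assumed form, see the note above).

    margin: y * f(x) for a label y in {-1, +1} and a real-valued score f(x).
    alpha:  positive hyperparameter; 1.0 and float("inf") are handled as limits.
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    if alpha == 1.0:
        # Limit alpha -> 1: logistic loss log(1 + exp(-margin)), computed stably.
        if margin >= 0:
            return math.log1p(math.exp(-margin))
        return -margin + math.log1p(math.exp(margin))
    s = sigmoid(margin)
    if math.isinf(alpha):
        # Limit alpha -> infinity: sigmoid loss 1 - sigma(margin).
        return 1.0 - s
    return alpha / (alpha - 1.0) * (1.0 - s ** (1.0 - 1.0 / alpha))


if __name__ == "__main__":
    # Compare the penalty on a confidently wrong example across alpha values.
    for a in (0.5, 1.0, 2.0, float("inf")):
        print(a, alpha_loss(-5.0, a))
```

Under this assumed form, α = 1/2 gives the exponential loss and α = 1 the logistic loss, both convex and unbounded as the margin goes to negative infinity, while any α > 1 caps the loss at α/(α−1), so a mislabeled point far on the wrong side of the boundary contributes only a limited amount to the objective.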

Taxonomy

Topics: Machine Learning and Data Classification · Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning

Methods: Logistic Regression