Minimizing the Maximal Loss: How and Why?

Shai Shalev-Shwartz; Yonatan Wexler

arXiv:1602.01690·cs.LG·May 24, 2016·41 cites

Minimizing the Maximal Loss: How and Why?

Shai Shalev-Shwartz, Yonatan Wexler

PDF

Open Access

TL;DR

This paper introduces an algorithm that transforms online learning methods to minimize the maximal loss, addressing robustness and generalization issues associated with traditional average loss minimization.

Contribution

It presents a novel algorithm for converting online algorithms into maximal loss minimizers and proposes robust variants to handle outliers.

Findings

01

The algorithm effectively minimizes maximal loss in various settings.

02

Better training accuracy can lead to improved generalization performance.

03

Robust versions handle outliers effectively.

Abstract

A commonly used learning rule is to approximately minimize the \emph{average} loss over the training set. Other learning algorithms, such as AdaBoost and hard-SVM, aim at minimizing the \emph{maximal} loss over the training set. The average loss is more popular, particularly in deep learning, due to three main reasons. First, it can be conveniently minimized using online algorithms, that process few examples at each iteration. Second, it is often argued that there is no sense to minimize the loss on the training set too much, as it will not be reflected in the generalization loss. Last, the maximal loss is not robust to outliers. In this paper we describe and analyze an algorithm that can convert any online algorithm to a minimizer of the maximal loss. We prove that in some situations better accuracy on the training set is crucial to obtain good performance on unseen examples. Last, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Imbalanced Data Classification Techniques