Implicitly Maximizing Margins with the Hinge Loss

Justin Lizama

arXiv:2006.14286·cs.LG·June 26, 2020

Implicitly Maximizing Margins with the Hinge Loss

Justin Lizama

PDF

Open Access

TL;DR

This paper introduces a modified hinge loss for neural networks that guarantees faster convergence to the maximum margin in classification tasks, outperforming traditional exponential loss functions.

Contribution

It extends the hinge loss by assigning gradients at critical points, achieving faster convergence rates for linear classifiers and demonstrating similar benefits in ReLU networks.

Findings

01

Convergence rate to max-margin is (1/t) for the new loss.

02

Empirical results show improved margin convergence in ReLU networks.

03

The method outperforms logistic loss in convergence speed.

Abstract

A new loss function is proposed for neural networks on classification tasks which extends the hinge loss by assigning gradients to its critical points. We will show that for a linear classifier on linearly separable data with fixed step size, the margin of this modified hinge loss converges to the $ℓ_{2}$ max-margin at the rate of $O (1/ t)$ . This rate is fast when compared with the $O (1/ lo g t)$ rate of exponential losses such as the logistic loss. Furthermore, empirical results suggest that this increased convergence speed carries over to ReLU networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · *Communicated@Fast*How Do I Communicate to Expedia?