A New Smooth Approximation to the Zero One Loss with a Probabilistic   Interpretation

Md Kamrul Hasan; Christopher J. Pal

arXiv:1511.05643·cs.CV·November 19, 2015

A New Smooth Approximation to the Zero One Loss with a Probabilistic Interpretation

Md Kamrul Hasan, Christopher J. Pal

PDF

TL;DR

This paper introduces a smooth, probabilistic approximation to the zero-one loss using a generalized Beta-Bernoulli formulation, improving robustness, scalability, and probabilistic outputs in classification tasks.

Contribution

It proposes a novel generalized logistic function based on a Beta-Bernoulli model, enhancing zero-one loss approximation with probabilistic interpretation and broad applicability.

Findings

01

Improved robustness to outliers compared to logistic and hinge losses.

02

Outperforms logistic and max margin models on large benchmark datasets.

03

Yields sparser solutions with Gaussian-Laplacian priors and improves structured prediction probabilities.

Abstract

We examine a new form of smooth approximation to the zero one loss in which learning is performed using a reformulation of the widely used logistic function. Our approach is based on using the posterior mean of a novel generalized Beta-Bernoulli formulation. This leads to a generalized logistic function that approximates the zero one loss, but retains a probabilistic formulation conferring a number of useful properties. The approach is easily generalized to kernel logistic regression and easily integrated into methods for structured prediction. We present experiments in which we learn such models using an optimization method consisting of a combination of gradient descent and coordinate descent using localized grid search so as to escape from local minima. Our experiments indicate that optimization quality is improved when learning meta-parameters are themselves optimized using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLogistic Regression