Learning Kernel-Based Halfspaces with the Zero-One Loss

Shai Shalev-Shwartz; Ohad Shamir; Karthik Sridharan

arXiv:1005.3681·cs.LG·August 3, 2010·4 cites

Learning Kernel-Based Halfspaces with the Zero-One Loss

Shai Shalev-Shwartz, Ohad Shamir, Karthik Sridharan

PDF

Open Access

TL;DR

This paper introduces a new algorithm for agnostically learning kernel-based halfspaces directly with zero-one loss, providing finite sample guarantees and analyzing its computational complexity.

Contribution

It presents the first finite-time algorithm for learning kernel-based halfspaces with zero-one loss and establishes computational hardness results under cryptographic assumptions.

Findings

01

Algorithm learns in polynomial time for fixed parameters.

02

Guarantees the learned classifier is within epsilon of the optimal.

03

Proves hardness results indicating limits of efficient learning.

Abstract

We describe and analyze a new algorithm for agnostically learning kernel-based halfspaces with respect to the \emph{zero-one} loss function. Unlike most previous formulations which rely on surrogate convex loss functions (e.g. hinge-loss in SVM and log-loss in logistic regression), we provide finite time/sample guarantees with respect to the more natural zero-one loss function. The proposed algorithm can learn kernel-based halfspaces in worst-case time $\poly (exp (L lo g (L / ϵ)))$ , for $any$ distribution, where $L$ is a Lipschitz constant (which can be thought of as the reciprocal of the margin), and the learned classifier is worse than the optimal halfspace by at most $ϵ$ . We also prove a hardness result, showing that under a certain cryptographic assumption, no algorithm can learn kernel-based halfspaces in time polynomial in $L$ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Face and Expression Recognition · Domain Adaptation and Few-Shot Learning