Agnostic Learnability of Halfspaces via Logistic Loss
Ziwei Ji, Kwangjun Ahn, Pranjal Awasthi, Satyen Kale, and Stefani Karp

TL;DR
This paper studies the limits of logistic regression in agnostic learning of halfspaces, establishing tight bounds on its approximation guarantees and proposing a simpler two-step convex optimization method.
Contribution
It constructs a distribution showing the tightness of existing bounds and introduces a new, simpler two-step convex optimization approach to achieve near-optimal risk.
Findings
Constructed a distribution matching the lower bound of $ ilde{ ext{OPT}}$ for logistic regression.
Proposed a two-step convex optimization algorithm that attains $ ilde{O}( ext{OPT})$ misclassification risk.
Demonstrated conditions under which logistic regression reaches near-optimal performance.
Abstract
We investigate approximation guarantees provided by logistic regression for the fundamental problem of agnostic learning of homogeneous halfspaces. Previously, for a certain broad class of "well-behaved" distributions on the examples, Diakonikolas et al. (2020) proved an lower bound, while Frei et al. (2021) proved an upper bound, where denotes the best zero-one/misclassification risk of a homogeneous halfspace. In this paper, we close this gap by constructing a well-behaved distribution such that the global minimizer of the logistic risk over this distribution only achieves misclassification risk, matching the upper bound in (Frei et al., 2021). On the other hand, we also show that if we impose a radial-Lipschitzness condition in addition to well-behaved-ness on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Non-Destructive Testing Techniques
MethodsLogistic Regression
