Reliably Learning the ReLU in Polynomial Time
Surbhi Goel, Varun Kanade, Adam Klivans, Justin Thaler

TL;DR
This paper introduces the first dimension-efficient polynomial-time algorithms for reliably learning ReLUs in the agnostic setting, enabling advances in neural network training and polynomial approximation.
Contribution
It presents the first efficient algorithms for learning ReLUs in the agnostic model with arbitrary labels, using kernel methods and polynomial approximations.
Findings
Algorithms run in polynomial time in the input dimension
Achieves a PTAS for maximizing ReLUs with error 1/ log n
First efficient algorithms for learning ReLU networks and convex piecewise-linear fitting
Abstract
We give the first dimension-efficient algorithms for learning Rectified Linear Units (ReLUs), which are functions of the form with . Our algorithm works in the challenging Reliable Agnostic learning model of Kalai, Kanade, and Mansour (2009) where the learner is given access to a distribution on labeled examples but the labeling may be arbitrary. We construct a hypothesis that simultaneously minimizes the false-positive rate and the loss on inputs given positive labels by , for any convex, bounded, and Lipschitz loss function. The algorithm runs in polynomial-time (in ) with respect to any distribution on (the unit sphere in dimensions) and for any error parameter (this yields a PTAS for a question raised by F. Bach on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms
