Hardness of Noise-Free Learning for Two-Hidden-Layer Neural Networks

Sitan Chen; Aravind Gollakota; Adam R. Klivans; Raghu Meka

arXiv:2202.05258·cs.LG·November 15, 2022·6 cites

TL;DR

This paper proves superpolynomial statistical query (SQ) lower bounds for learning two-hidden-layer ReLU networks under Gaussian inputs in the standard noise-free model, showing that hardness in this setting does not depend on label noise.

Contribution

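The paper gives the first general SQ lower bounds for noise-free learning of ReLU networks under Gaussian inputs; no such bounds were previously known at any depth. It obtains them by refining the lifting procedure of Daniely and Vardi, which reduces Boolean PAC learning problems to Gaussian ones, and by extending that technique to other learning models.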

Findings

01. Superpolynomial SQ lower bounds for two-hidden-layer ReLU networks

02. New cryptographic hardness results for PAC learning ReLU networks

03. Lower bounds for learning constant-depth ReLU networks from label queries

Abstract

We give superpolynomial statistical query (SQ) lower bounds for learning two-hidden-layer ReLU networks with respect to Gaussian inputs in the standard (noise-free) model. No general SQ lower bounds were known for learning ReLU networks of any depth in this setting: previous SQ lower bounds held only for adversarial noise models (agnostic learning) or restricted models such as correlational SQ. Prior work hinted at the impossibility of our result: Vempala and Wilmes showed that general SQ lower bounds cannot apply to any real-valued family of functions that satisfies a simple non-degeneracy condition. To circumvent their result, we refine a lifting procedure due to Daniely and Vardi that reduces Boolean PAC learning problems to Gaussian ones. We show how to extend their technique to other learning models and, in many well-studied cases, obtain a more efficient reduction. As such, we…
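As a rough illustration of the objects named in the abstract, the following sketch evaluates a two-hidden-layer ReLU network and simulates a statistical query oracle over standard Gaussian inputs with noise-free labels. It follows the textbook definition of the SQ model (a bounded query answered to within a tolerance) and is not the paper's construction or reduction. The names `two_hidden_layer_relu`, `sq_oracle`, `tau`, and `n_samples`, as well as the toy weights, are assumptions made for this example, and the Monte Carlo average only stands in for the true expectation.

```python
import math
import random


def relu(z):
    return max(0.0, z)


def two_hidden_layer_relu(x, W1, b1, W2, b2, a):
    """Evaluate a . relu(W2 relu(W1 x + b1) + b2) using plain Python lists."""
    h1 = [relu(sum(w * xi for w, xi in zip(row, x)) + b) for row, b in zip(W1, b1)]
    h2 = [relu(sum(w * hi for w, hi in zip(row, h1)) + b) for row, b in zip(W2, b2)]
    return sum(ai * hi for ai, hi in zip(a, h2))


def sq_oracle(query, target, dim, tau, n_samples=20000, rng=None):
    """Simulated SQ oracle (illustrative assumption, not from the paper).

    Estimates E[query(x, target(x))] for x ~ N(0, I_dim) by sampling, then
    perturbs the estimate by at most tau. In the SQ model the oracle may
    return any value within tau of the true expectation.
    """
    rng = rng or random.Random(0)
    total = 0.0
    for _ in range(n_samples):
        x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        value = query(x, target(x))  # labels are noise-free: y = target(x)
        total += max(-1.0, min(1.0, value))  # queries are bounded in [-1, 1]
    return total / n_samples + rng.uniform(-tau, tau)


# Hypothetical toy network on 2 inputs: f(x) = relu(relu(x1) - relu(x2)).
W1 = [[1.0, 0.0], [0.0, 1.0]]
b1 = [0.0, 0.0]
W2 = [[1.0, -1.0]]
b2 = [0.0]
a = [1.0]


def target(x):
    return two_hidden_layer_relu(x, W1, b1, W2, b2, a)


# One bounded query: the expected value of tanh(label).
answer = sq_oracle(lambda x, y: math.tanh(y), target, dim=2, tau=0.01)
```

A learner in this model sees only values like `answer`, never individual labeled examples. An SQ lower bound of the kind stated above says that any such learner needs either superpolynomially many queries or a tolerance smaller than any inverse polynomial.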

Videos

Hardness of Noise-Free Learning for Two-Hidden-Layer Neural Networks (SlidesLive)

Taxonomy

Topics: Machine Learning and Algorithms · Adversarial Robustness in Machine Learning · Neural Networks and Applications