Provable limitations of deep learning
Emmanuel Abbe, Colin Sandon

TL;DR
This paper proves that certain deep learning algorithms fail to learn specific functions due to limitations characterized by cross-predictability, highlighting fundamental constraints in training neural networks under certain conditions.
Contribution
It introduces the notion of cross-predictability to characterize when deep learning algorithms fail, providing theoretical bounds and examples such as parity functions and community detection.
Findings
Deep learning algorithms fail on functions with low cross-predictability.
Super-polynomial decay of cross-predictability leads to learning failures.
Failures occur under limited memory or high randomness initialization.
Abstract
As the success of deep learning reaches more grounds, one would like to also envision the potential limits of deep learning. This paper gives a first set of results proving that certain deep learning algorithms fail at learning certain efficiently learnable functions. The results put forward a notion of cross-predictability that characterizes when such failures take place. Parity functions provide an extreme example with a cross-predictability that decays exponentially, while a mere super-polynomial decay of the cross-predictability is shown to be sufficient to obtain failures. Examples in community detection and arithmetic learning are also discussed. Recall that it is known that the class of neural networks (NNs) with polynomial network size can express any function that can be implemented in polynomial time, and that their sample complexity scales polynomially with the network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference
MethodsStochastic Gradient Descent
