Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization
Tomaso Poggio, Andrzej Banburski, Qianli Liao

TL;DR
This paper reviews recent theoretical advances in deep learning, focusing on representation power, optimization landscapes, and generalization, especially in overparametrized networks, providing insights into their surprising effectiveness.
Contribution
It synthesizes recent theoretical results across approximation, optimization, and generalization, offering a unified understanding of deep networks' success.
Findings
Deep networks can approximate any continuous function with exponential parameters.
Deep convolutional networks can efficiently approximate compositional functions.
SGD is likely to find global minima in exponential loss landscapes.
Abstract
While deep learning is successful in a number of applications, it is not yet well understood theoretically. A satisfactory theoretical characterization of deep learning however, is beginning to emerge. It covers the following questions: 1) representation power of deep networks 2) optimization of the empirical risk 3) generalization properties of gradient descent techniques --- why the expected error does not suffer, despite the absence of explicit regularization, when the networks are overparametrized? In this review we discuss recent advances in the three areas. In approximation theory both shallow and deep networks have been shown to approximate any continuous functions on a bounded domain at the expense of an exponential number of parameters (exponential in the dimensionality of the function). However, for a subset of compositional functions, deep networks of the convolutional type…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Neural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia?
