Mathematics of Deep Learning

Rene Vidal; Joan Bruna; Raja Giryes; Stefano Soatto

arXiv:1712.04741·cs.LG·December 14, 2017·79 cites

Mathematics of Deep Learning

Rene Vidal, Joan Bruna, Raja Giryes, Stefano Soatto

PDF

Open Access

TL;DR

This paper reviews recent mathematical work explaining why deep learning architectures perform well, focusing on properties like optimality, stability, and invariance of learned representations.

Contribution

It provides a comprehensive overview of mathematical justifications for key properties of deep networks, enhancing understanding of their success.

Findings

01

Deep networks can achieve global optimality under certain conditions

02

Mathematical frameworks explain geometric stability of deep representations

03

Invariance properties of learned features are supported by recent theoretical work

Abstract

Recently there has been a dramatic increase in the performance of recognition systems due to the introduction of deep architectures for representation learning and classification. However, the mathematical reasons for this success remain elusive. This tutorial will review recent work that aims to provide a mathematical justification for several properties of deep networks, such as global optimality, geometric stability, and invariance of the learned representations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Neural Networks and Applications · Stochastic Gradient Optimization Techniques