Notes on Deep Learning Theory

Eugene A. Golikov

arXiv:2012.05760·cs.LG·December 11, 2020

Notes on Deep Learning Theory

Eugene A. Golikov

PDF

Open Access

TL;DR

This paper provides lecture notes on deep learning theory, covering topics like initialization, loss landscape, generalization, and neural tangent kernels, offering insights into the theoretical foundations of neural networks.

Contribution

It compiles and presents key theoretical aspects of deep learning, including neural tangent kernel theory, in a comprehensive lecture note format.

Findings

01

Insights into neural tangent kernel behavior

02

Analysis of loss landscape properties

03

Discussion on generalization in deep networks

Abstract

These are the notes for the lectures that I was giving during Fall 2020 at the Moscow Institute of Physics and Technology (MIPT) and at the Yandex School of Data Analysis (YSDA). The notes cover some aspects of initialization, loss landscape, generalization, and a neural tangent kernel theory. While many other topics (e.g. expressivity, a mean-field theory, a double descent phenomenon) are missing in the current version, we plan to add them in future revisions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis