Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks
Vivak Patel, Christian Varner

TL;DR
This paper reviews recent smoothness conditions for non-convex optimization in deep learning, categorizes them, and assesses their relevance to training deep linear neural networks for binary classification.
Contribution
It systematically orders and evaluates various non-convex smoothness conditions and their applicability to deep linear neural network training.
Findings
Different smoothness conditions are applicable to deep linear networks.
The paper provides criteria to determine the validity of these conditions.
Applicability varies depending on the specific smoothness assumption.
Abstract
The presence of non-convexity in smooth optimization problems arising from deep learning have sparked new smoothness conditions in the literature and corresponding convergence analyses. We discuss these smoothness conditions, order them, provide conditions for determining whether they hold, and evaluate their applicability to training a deep linear neural network for binary classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Neural Networks and Applications
