Recent Advances in Non-convex Smoothness Conditions and Applicability to   Deep Linear Neural Networks

Vivak Patel; Christian Varner

arXiv:2409.13672·cs.LG·September 23, 2024

Recent Advances in Non-convex Smoothness Conditions and Applicability to Deep Linear Neural Networks

Vivak Patel, Christian Varner

PDF

Open Access

TL;DR

This paper reviews recent smoothness conditions for non-convex optimization in deep learning, categorizes them, and assesses their relevance to training deep linear neural networks for binary classification.

Contribution

It systematically orders and evaluates various non-convex smoothness conditions and their applicability to deep linear neural network training.

Findings

01

Different smoothness conditions are applicable to deep linear networks.

02

The paper provides criteria to determine the validity of these conditions.

03

Applicability varies depending on the specific smoothness assumption.

Abstract

The presence of non-convexity in smooth optimization problems arising from deep learning have sparked new smoothness conditions in the literature and corresponding convergence analyses. We discuss these smoothness conditions, order them, provide conditions for determining whether they hold, and evaluate their applicability to training a deep linear neural network for binary classification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Neural Networks and Applications