High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

Tony Bonnaire; Davide Ghio; Kamesh Krishnamurthy; Francesca Mignacco,; Atsushi Yamamura; Giulio Biroli

arXiv:2308.03754·cond-mat.dis-nn·November 13, 2023

High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

Tony Bonnaire, Davide Ghio, Kamesh Krishnamurthy, Francesca Mignacco,, Atsushi Yamamura, Giulio Biroli

PDF

Open Access

TL;DR

This paper explores how statistical physics methods can be applied to understand gradient descent behavior in complex, high-dimensional non-convex landscapes common in machine learning, providing new insights into optimization dynamics.

Contribution

It introduces physics-inspired approaches to analyze high-dimensional non-convex optimization landscapes in machine learning, bridging physics and AI research.

Findings

01

Application of statistical physics methods to gradient descent analysis

02

Insights into the dynamics of optimization in high-dimensional spaces

03

Framework for studying non-convex landscapes in machine learning

Abstract

In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Markov Chains and Monte Carlo Methods