Curvature in the Looking-Glass: Optimal Methods to Exploit Curvature of Expectation in the Loss Landscape
Jed A. Duersch, Tommie A. Catanach, Alexander Safonov, and Jeremy, Wendt

TL;DR
This paper introduces a novel framework for understanding and exploiting the complex curvature of the loss landscape in deep learning, especially near ReLU-induced discontinuities, leading to improved optimization methods.
Contribution
It presents a new conceptual model of loss curvature near ReLU boundaries, estimates gradient variation density, and develops the Alice algorithm for enhanced optimization.
Findings
Hessian approximations are unreliable near gradient discontinuities.
The glass-like structure of the loss landscape influences gradient variations.
The Alice algorithm improves training stability and efficiency.
Abstract
Harnessing the local topography of the loss landscape is a central challenge in advanced optimization tasks. By accounting for the effect of potential parameter changes, we can alter the model more efficiently. Contrary to standard assumptions, we find that the Hessian does not always approximate loss curvature well, particularly near gradient discontinuities, which commonly arise in deep learning architectures. We present a new conceptual framework to understand how curvature of expected changes in loss emerges in architectures with many rectified linear units. Each ReLU creates a parameter boundary that, when crossed, induces a pseudorandom gradient perturbation. Our derivations show how these discontinuities combine to form a glass-like structure, similar to amorphous solids that contain microscopic domains of strong, but random, atomic alignment. By estimating the density of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Adam · Pruning
