GradVis: Visualization and Second Order Analysis of Optimization   Surfaces during the Training of Deep Neural Networks

Avraam Chatzimichailidis; Franz-Josef Pfreundt; Nicolas R. Gauger,; Janis Keuper

arXiv:1909.12108·cs.LG·September 30, 2019

GradVis: Visualization and Second Order Analysis of Optimization Surfaces during the Training of Deep Neural Networks

Avraam Chatzimichailidis, Franz-Josef Pfreundt, Nicolas R. Gauger,, Janis Keuper

PDF

1 Repo

TL;DR

GradVis is an open source toolbox that enables scalable visualization and second order analysis of deep neural network loss landscapes, helping researchers understand optimization surfaces more efficiently.

Contribution

The paper introduces GradVis, a novel, efficient, and scalable library for visualizing and analyzing deep neural network loss landscapes, including second order information.

Findings

01

Enables high-resolution visualization of large networks' loss surfaces

02

Provides efficient computation of second order gradient information

03

Facilitates better understanding of optimization dynamics

Abstract

Current training methods for deep neural networks boil down to very high dimensional and non-convex optimization problems which are usually solved by a wide range of stochastic gradient descent methods. While these approaches tend to work in practice, there are still many gaps in the theoretical understanding of key aspects like convergence and generalization guarantees, which are induced by the properties of the optimization surface (loss landscape). In order to gain deeper insights, a number of recent publications proposed methods to visualize and analyze the optimization surfaces. However, the computational cost of these methods are very high, making it hardly possible to use them on larger networks. In this paper, we present the GradVis Toolbox, an open source library for efficient and scalable visualization and analysis of deep neural network loss landscapes in Tensorflow and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cc-hpc-itwm/GradVis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.