Local Saddle Point Optimization: A Curvature Exploitation Approach

Leonard Adolphs; Hadi Daneshmand; Aurelien Lucchi; Thomas; Hofmann

arXiv:1805.05751·cs.LG·February 15, 2019·26 cites

Local Saddle Point Optimization: A Curvature Exploitation Approach

Leonard Adolphs, Hadi Daneshmand, Aurelien Lucchi, Thomas, Hofmann

PDF

Open Access 1 Repo

TL;DR

This paper introduces a curvature-based optimization method that effectively escapes undesired stationary points in saddle point problems, improving convergence to true local optima in gradient-based methods.

Contribution

The paper presents a novel curvature exploitation approach that enhances existing gradient methods to avoid non-optimal stationary points in saddle problems.

Findings

01

Curvature exploitation enables escape from undesired stationary points.

02

Gradient methods with curvature information outperform standard methods.

03

Empirical results confirm improved convergence in saddle point problems.

Abstract

Gradient-based optimization methods are the most popular choice for finding local optima for classical minimization and saddle point problems. Here, we highlight a systemic issue of gradient dynamics that arise for saddle point problems, namely the presence of undesired stable stationary points that are no local optima. We propose a novel optimization approach that exploits curvature information in order to escape from these undesired stationary points. We prove that different optimization methods, including gradient method and Adagrad, equipped with curvature exploitation can escape non-optimal stationary points. We also provide empirical results on common saddle point problems which confirm the advantage of using curvature exploitation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

limcherhang/finalreport
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research