Structured second-order methods via natural gradient descent

Wu Lin; Frank Nielsen; Mohammad Emtiyaz Khan; Mark Schmidt

arXiv:2107.10884·stat.ML·February 22, 2022·1 cites

Structured second-order methods via natural gradient descent

Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

PDF

Open Access

TL;DR

This paper introduces structured second-order and adaptive-gradient methods based on natural-gradient descent, offering invariant properties and simplicity, with demonstrated efficiency on non-convex and deep learning tasks.

Contribution

It develops new structured second-order and adaptive-gradient algorithms using natural-gradient descent with invariant properties and simple expressions.

Findings

01

Effective on deterministic non-convex problems

02

Demonstrated efficiency in deep learning tasks

03

Structured methods maintain invariance and simplicity

Abstract

In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invariance but also admit a simple expression. Finally, we test the efficiency of our proposed methods on both deterministic non-convex problems and deep learning problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Optimization Algorithms Research · Advanced Numerical Analysis Techniques