Optimizing Neural Networks via Koopman Operator Theory

Akshunna S. Dogra; William T Redman

arXiv:2006.02361·cs.NE·October 8, 2021·6 cites

Optimizing Neural Networks via Koopman Operator Theory

Akshunna S. Dogra, William T Redman

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores using Koopman operator theory to predict and accelerate neural network training, demonstrating over 10x speedups in certain scenarios by leveraging linear dynamics insights.

Contribution

It introduces a novel application of Koopman operator theory to neural network training, enabling accurate weight predictions and significant speedups over traditional gradient methods.

Findings

01

Koopman methods predict network weights over training intervals.

02

Achieved >10x faster training compared to gradient descent.

03

Potential for extending to broader network classes.

Abstract

Koopman operator theory, a powerful framework for discovering the underlying dynamics of nonlinear dynamical systems, was recently shown to be intimately connected with neural network training. In this work, we take the first steps in making use of this connection. As Koopman operator theory is a linear theory, a successful implementation of it in evolving network weights and biases offers the promise of accelerated training, especially in the context of deep networks, where optimization is inherently a non-convex problem. We show that Koopman operator theoretic methods allow for accurate predictions of weights and biases of feedforward, fully connected deep networks over a non-trivial range of training time. During this window, we find that our approach is >10x faster than various gradient descent based methods (e.g. Adam, Adadelta, Adagrad), in line with our complexity analysis. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

william-redman/Koopman-Neural-Network-Training
noneOfficial

Videos

Optimizing Neural Networks via Koopman Operator Theory· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Probabilistic and Robust Engineering Design · Lattice Boltzmann Simulation Studies

MethodsAdam