Lipschitz Recurrent Neural Networks

N.Benjamin Erichson; Omri Azencot; Alejandro Queiruga; Liam; Hodgkinson; and Michael W. Mahoney

arXiv:2006.12070·cs.LG·April 27, 2021·32 cites

Lipschitz Recurrent Neural Networks

N.Benjamin Erichson, Omri Azencot, Alejandro Queiruga, Liam, Hodgkinson, and Michael W. Mahoney

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Lipschitz RNNs, a new recurrent unit with stability guarantees and improved robustness, outperforming existing units across various benchmark tasks in vision, language, and speech domains.

Contribution

It proposes a Lipschitz continuous recurrent unit with stability analysis, a novel hidden-to-hidden matrix construction, and demonstrates superior performance and robustness.

Findings

01

Outperforms existing RNNs on benchmark tasks

02

Provides stability conditions for recurrent units

03

Shows increased robustness to perturbations

Abstract

Viewing recurrent neural networks (RNNs) as continuous-time dynamical systems, we propose a recurrent unit that describes the hidden state's evolution with two parts: a well-understood linear component plus a Lipschitz nonlinearity. This particular functional form facilitates stability analysis of the long-term behavior of the recurrent unit using tools from nonlinear systems theory. In turn, this enables architectural design decisions before experimentation. Sufficient conditions for global stability of the recurrent unit are obtained, motivating a novel scheme for constructing hidden-to-hidden matrices. Our experiments demonstrate that the Lipschitz RNN can outperform existing recurrent units on a range of benchmark tasks, including computer vision, language modeling and speech prediction tasks. Finally, through Hessian-based analysis we demonstrate that our Lipschitz recurrent unit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erichson/LipschitzRNN
pytorchOfficial

Videos

Lipschitz Recurrent Neural Networks· slideslive

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Neural Networks and Applications · Machine Learning in Healthcare