Initializing LSTM internal states via manifold learning

Felix P. Kemeth; Tom Bertalan; Nikolaos Evangelou; Tianqi Cui; Saurabh Malani; Ioannis G. Kevrekidis

arXiv:2104.13101 · stat.ML · September 29, 2021

TL;DR

This paper introduces a manifold-learning-based method for initializing LSTM internal states. It improves prediction performance on a partially observed chemical model system and allows partially observed dynamics to be transformed into fully observed ones.

Contribution

It proposes initializing LSTM internal states from a learned intrinsic data manifold, which keeps the states consistent with the initial observed input data and opens alternative identification paths for nonlinear dynamical systems.

Findings

01. Improved LSTM performance on a chemical model system

02. Learned manifold enables transformation of partially observed to fully observed dynamics

03. Method facilitates alternative nonlinear system identification paths

Abstract

We present an approach, based on learning an intrinsic data manifold, for the initialization of the internal state values of LSTM recurrent neural networks, ensuring consistency with the initial observed input data. Exploiting the generalized synchronization concept, we argue that the converged, "mature" internal states constitute a function on this learned manifold. The dimension of this manifold then dictates the length of observed input time series data required for consistent initialization. We illustrate our approach through a partially observed chemical model system, where initializing the internal LSTM states in this fashion yields visibly improved performance. Finally, we show that learning this data manifold enables the transformation of partially observed dynamics into fully observed ones, facilitating alternative identification paths for nonlinear dynamical systems.
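
The idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation. It makes three loudly stated assumptions: a small contractive tanh recurrence with random weights stands in for a trained LSTM, a rotation on the unit circle observed only through x = cos(theta) stands in for the chemical model system, and a nearest-neighbour lookup stands in for the learned map on the data manifold. All names (`observe`, `run`, `manifold_init`) are hypothetical. The sketch first checks that the mature state does not depend on its initialization (generalized synchronization), then initializes the state from a two-sample window and compares it with a zero initialization warmed up on the same window.

```python
import numpy as np

rng = np.random.default_rng(0)
DT, H = 0.1, 8

# Assumption: a contractive tanh recurrence with random weights, used here
# as a stand-in for a trained LSTM (not the paper's network).
W = rng.normal(size=(H, H))
W *= 0.9 / np.linalg.norm(W, 2)      # spectral norm 0.9, so the state map contracts
u = rng.normal(size=H)

def observe(theta0, n):
    """Partial observation x = cos(theta) of a rotation on the unit circle."""
    return np.cos(theta0 + DT * np.arange(n))

def run(x, h0):
    """Drive the recurrence with inputs x starting from h0; return all states."""
    h, hs = h0, []
    for xt in x:
        h = np.tanh(W @ h + u * xt)
        hs.append(h)
    return np.array(hs)

# 1) Generalized synchronization: the mature state forgets its initialization.
x_train = observe(0.0, 10_000)
h_a = run(x_train, np.zeros(H))
h_b = run(x_train, rng.normal(size=H))
sync_gap = np.linalg.norm(h_a[-1] - h_b[-1])

# 2) Pair each 2-sample window with the mature state reached at its last sample.
#    The data manifold here is a circle, so a window of length 2 is enough.
WARMUP = 200
windows = np.stack([x_train[WARMUP - 1:-1], x_train[WARMUP:]], axis=1)
mature = h_a[WARMUP:]

def manifold_init(window):
    """Assumption: nearest-neighbour lookup replaces the learned manifold map."""
    idx = np.argmin(np.sum((windows - window) ** 2, axis=1))
    return mature[idx]

# 3) Compare with a zero state warmed up on the same short window.
x_test = observe(1.234, 400)
h_true = run(x_test, np.zeros(H))    # mature reference states after a long warm-up
err_manifold, err_zero = [], []
for t in range(300, 350):
    window = x_test[t - 1:t + 1]
    err_manifold.append(np.linalg.norm(manifold_init(window) - h_true[t]))
    err_zero.append(np.linalg.norm(run(window, np.zeros(H))[-1] - h_true[t]))

print("sync gap:", sync_gap)
print("mean error, manifold init:", np.mean(err_manifold))
print("mean error, zero init:", np.mean(err_zero))
```

The window length of 2 is tied to this toy system, whose data manifold is one-dimensional. For a higher-dimensional manifold the window would need to be longer, which is the dependence the abstract describes.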

Taxonomy

Methods: Tanh Activation · Sigmoid Activation · Long Short-Term Memory