Learning deep dynamical models from image pixels

Niklas Wahlstr\"om; Thomas B. Sch\"on; Marc Peter Deisenroth

arXiv:1410.7550·stat.ML·October 29, 2014

Learning deep dynamical models from image pixels

Niklas Wahlstr\"om, Thomas B. Sch\"on, Marc Peter Deisenroth

PDF

TL;DR

This paper introduces a method combining deep auto-encoders and predictive models to learn dynamical systems directly from high-dimensional pixel observations, enabling effective system identification in complex, non-linear scenarios.

Contribution

The paper presents a novel approach that jointly learns low-dimensional embeddings and transition models from pixel data, addressing non-linear system identification challenges.

Findings

01

Successfully models dynamical systems from raw pixel data

02

Outperforms traditional linear system identification methods

03

Enables predictive control directly from images

Abstract

Modeling dynamical systems is important in many disciplines, e.g., control, robotics, or neurotechnology. Commonly the state of these systems is not directly observed, but only available through noisy and potentially high-dimensional observations. In these cases, system identification, i.e., finding the measurement mapping and the transition mapping (system dynamics) in latent space can be challenging. For linear system dynamics and measurement mappings efficient solutions for system identification are available. However, in practical applications, the linearity assumptions does not hold, requiring non-linear system identification techniques. If additionally the observations are high-dimensional (e.g., images), non-linear system identification is inherently hard. To address the problem of non-linear system identification from high-dimensional observations, we combine recent advances in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.