Robust Locally-Linear Controllable Embedding

Ershad Banijamali; Rui Shu; Mohammad Ghavamzadeh; Hung Bui; Ali Ghodsi

arXiv:1710.05373·cs.LG·February 23, 2018·25 cites

Robust Locally-Linear Controllable Embedding

Ershad Banijamali, Rui Shu, Mohammad Ghavamzadeh, Hung Bui, Ali Ghodsi

PDF

Open Access

TL;DR

This paper introduces a robust locally-linear controllable embedding (RCE) model that improves upon previous methods by directly modeling predictive densities and incorporating structures for noise robustness, enhancing control in noisy environments.

Contribution

The paper proposes a new RCE model with a structured generative process and a variational approximation that accounts for future observations, improving robustness over existing methods.

Findings

01

RCE outperforms E2C in noisy dynamics scenarios.

02

The model provides more accurate embeddings under noise.

03

Experimental results demonstrate significant improvements.

Abstract

Embed-to-control (E2C) is a model for solving high-dimensional optimal control problems by combining variational auto-encoders with locally-optimal controllers. However, the E2C model suffers from two major drawbacks: 1) its objective function does not correspond to the likelihood of the data sequence and 2) the variational encoder used for embedding typically has large variational approximation error, especially when there is noise in the system dynamics. In this paper, we present a new model for learning robust locally-linear controllable embedding (RCE). Our model directly estimates the predictive conditional density of the future observation given the current one, while introducing the bottleneck between the current and future observations. Although the bottleneck provides a natural embedding candidate for control, our RCE model introduces additional specific structures in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Control Systems and Identification · Reinforcement Learning in Robotics