Dream and Search to Control: Latent Space Planning for Continuous Control

Anurag Koul; Varun V. Kumar; Alan Fern; Somdeb Majumdar

arXiv:2010.09832·cs.LG·October 21, 2020·1 cites

TL;DR

This paper introduces a latent space planning method using tree search for continuous control in model-based reinforcement learning, demonstrating improved sample efficiency and performance on benchmarks.

Contribution

It extends latent-space tree search techniques from discrete to continuous actions, showing practical benefits in continuous control environments.

Findings

1. Achieves better sample efficiency than state-of-the-art methods.

2. Demonstrates successful bootstrapping benefits in continuous action spaces.

3. Improves performance on challenging continuous-control benchmarks.

Abstract

Learning and planning with latent space dynamics have been shown to improve sample efficiency in model-based reinforcement learning (MBRL) for discrete and continuous control tasks. In particular, recent work on discrete action spaces demonstrated the effectiveness of latent-space planning via Monte-Carlo Tree Search (MCTS) for bootstrapping MBRL during learning and at test time. However, the potential gains from latent-space tree search have not yet been demonstrated for environments with continuous action spaces. In this work, we propose and explore an MBRL approach for continuous action spaces based on tree-based planning over learned latent dynamics. We show that it yields the same kinds of bootstrapping benefits previously shown for discrete spaces. In particular, the approach achieves improved sample efficiency and performance on a majority of challenging…
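
The abstract does not spell out the search procedure, so the following is only a minimal sketch of what tree search over latent dynamics with continuous actions can look like. It is not the authors' implementation. Everything in it is an assumption made for illustration: the toy one-dimensional `dynamics` and `reward` functions stand in for learned latent models, `sample_action` stands in for a policy prior, and keeping a fixed number `k` of sampled actions per node is one common way to handle a continuous action space, not necessarily the one used in the paper.

```python
import math
import random

# Hypothetical stand-ins for learned latent models (toy 1-D functions).
def dynamics(z, a):
    return z + a                      # next latent state

def reward(z, a):
    return -abs(z + a)                # best when the next state is near 0

def sample_action(rng):
    return rng.uniform(-1.0, 1.0)     # stand-in for a learned policy prior

class Node:
    def __init__(self, z):
        self.z = z
        self.children = {}            # action -> (reward, child Node)
        self.visits = 0
        self.value_sum = 0.0

    def value(self):
        return self.value_sum / self.visits if self.visits else 0.0

def ucb(node, a, c, gamma):
    """Upper confidence bound for taking sampled action a at node."""
    r, child = node.children[a]
    q = r + gamma * child.value()
    return q + c * math.sqrt(math.log(node.visits + 1) / child.visits)

def simulate(node, depth, rng, k, c, gamma):
    """Run one simulation from node and return its discounted return."""
    if depth == 0:
        ret = 0.0                     # a learned value estimate would go here
    else:
        if len(node.children) < k:
            # Continuous actions cannot be enumerated, so sample a new one.
            a = sample_action(rng)
            node.children[a] = (reward(node.z, a), Node(dynamics(node.z, a)))
        else:
            a = max(node.children, key=lambda act: ucb(node, act, c, gamma))
        r, child = node.children[a]
        ret = r + gamma * simulate(child, depth - 1, rng, k, c, gamma)
    node.visits += 1
    node.value_sum += ret
    return ret

def plan(z0, n_sim=200, depth=3, k=4, c=1.0, gamma=0.95, seed=0):
    """Search from latent state z0; return the most-visited root action."""
    rng = random.Random(seed)
    root = Node(z0)
    for _ in range(n_sim):
        simulate(root, depth, rng, k, c, gamma)
    best = max(root.children, key=lambda a: root.children[a][1].visits)
    return best, root
```

Calling `plan(0.8)` returns one of the four actions sampled at the root together with the root node, whose visit counts could in principle serve as a training target for a policy, which is the kind of bootstrapping the abstract refers to.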

Code & Models

Repositories

koulanurag/dream-and-search
PyTorch · Official

Taxonomy

Topics: Reinforcement Learning in Robotics · Artificial Intelligence in Games · Model Reduction and Neural Networks

Methods: Monte-Carlo Tree Search