Guiding the search in continuous state-action spaces by learning an   action sampling distribution from off-target samples

Beomjoon Kim; Leslie Pack Kaelbling; Tomas Lozano-Perez

arXiv:1711.01391·cs.AI·November 8, 2017·2 cites

Guiding the search in continuous state-action spaces by learning an action sampling distribution from off-target samples

Beomjoon Kim, Leslie Pack Kaelbling, Tomas Lozano-Perez

PDF

Open Access

TL;DR

This paper introduces a method that learns an action sampling distribution using GANs to improve planning efficiency in high-dimensional continuous spaces, with robust generalization across problem instances.

Contribution

It proposes a novel approach that guides state-space search by learning a generalizable action sampler, addressing data efficiency with importance-ratio estimation, and providing theoretical and empirical validation.

Findings

01

Improved planning efficiency in high-dimensional spaces.

02

Effective generalization across different planning problems.

03

Theoretical guarantees for the proposed method.

Abstract

In robotics, it is essential to be able to plan efficiently in high-dimensional continuous state-action spaces for long horizons. For such complex planning problems, unguided uniform sampling of actions until a path to a goal is found is hopelessly inefficient, and gradient-based approaches often fall short when the optimization manifold of a given problem is not smooth. In this paper we present an approach that guides the search of a state-space planner, such as A*, by learning an action-sampling distribution that can generalize across different instances of a planning problem. The motivation is that, unlike typical learning approaches for planning for continuous action space that estimate a policy, an estimated action sampler is more robust to error since it has a planner to fall back on. We use a Generative Adversarial Network (GAN), and address an important issue: search experience…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729