Motion Planning as Online Learning: A Multi-Armed Bandit Approach to   Kinodynamic Sampling-Based Planning

Marco Faroni; Dmitry Berenson

arXiv:2308.13949·cs.RO·August 29, 2023·1 cites

Motion Planning as Online Learning: A Multi-Armed Bandit Approach to Kinodynamic Sampling-Based Planning

Marco Faroni, Dmitry Berenson

PDF

Open Access

TL;DR

This paper introduces a novel kinodynamic motion planning method that uses a multi-armed bandit framework to adaptively bias sampling, resulting in faster, higher-quality solutions for complex robotic tasks with dynamics constraints.

Contribution

It formulates the sampling bias problem as a non-stationary multi-armed bandit, enabling adaptive and efficient exploration in kinodynamic planning.

Findings

01

Faster convergence to high-quality solutions

02

Higher success rate in complex manipulation tasks

03

Effective in scenarios with dynamics uncertainty

Abstract

Kinodynamic motion planners allow robots to perform complex manipulation tasks under dynamics constraints or with black-box models. However, they struggle to find high-quality solutions, especially when a steering function is unavailable. This paper presents a novel approach that adaptively biases the sampling distribution to improve the planner's performance. The key contribution is to formulate the sampling bias problem as a non-stationary multi-armed bandit problem, where the arms of the bandit correspond to sets of possible transitions. High-reward regions are identified by clustering transitions from sequential runs of kinodynamic RRT and a bandit algorithm decides what region to sample at each timestep. The paper demonstrates the approach on several simulated examples as well as a 7-degree-of-freedom manipulation task with dynamics uncertainty, suggesting that the approach finds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Reinforcement Learning in Robotics · Artificial Intelligence in Games