Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation   of Articulated Objects

Aidan Curtis; Leslie Kaelbling; Siddarth Jain

arXiv:2212.04554·cs.RO·December 12, 2022

Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation of Articulated Objects

Aidan Curtis, Leslie Kaelbling, Siddarth Jain

PDF

Open Access

TL;DR

This paper introduces STRUG, an online POMDP solver designed for long-horizon planning in uncertain, partially observable environments, demonstrated on robotic manipulation tasks with improved performance over existing methods.

Contribution

The paper presents STRUG, a novel online POMDP solver that effectively manages task-relevant and irrelevant uncertainties in long-horizon planning for robotic manipulation.

Findings

01

STRUG outperforms existing POMDP solvers on several tasks.

02

Demonstrated effectiveness on robotic manipulation of articulated objects.

03

Utilizes neural perception frontend for model distribution construction.

Abstract

Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially observable environments with noisy sensors. Partially observable Markov decision processes (POMDPs) serve as a general framework for representing problems in which uncertainty is an important factor. Online sample-based POMDP methods have emerged as efficient approaches to solving large POMDPs and have been shown to extend to continuous domains. However, these solutions struggle to find long-horizon plans in problems with significant uncertainty. Exploration heuristics can help guide planning, but many real-world settings contain significant task-irrelevant uncertainty that might distract from the task objective. In this paper, we propose STRUG, an online POMDP solver capable of handling domains that require long-horizon planning with significant task-relevant and task-irrelevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Data Classification · Robot Manipulation and Learning