Robotic manipulation of multiple objects as a POMDP
Joni Pajarinen, Ville Kyrki

TL;DR
This paper models multi-object robotic manipulation under uncertainty as a POMDP, introducing an online particle filtering approach for adaptive planning in cluttered environments, validated through simulations and physical experiments.
Contribution
It presents a novel online POMDP method with particle filtering for multi-object manipulation, automatically adapting success probabilities and handling occlusions in real-time.
Findings
Multi-step POMDP planning outperforms greedy heuristics.
Online planning adapts system dynamics based on real experience.
Approach validated in simulation and physical experiments.
Abstract
This paper investigates manipulation of multiple unknown objects in a crowded environment. Because of incomplete knowledge due to unknown objects and occlusions in visual observations, object observations are imperfect and action success is uncertain, making planning challenging. We model the problem as a partially observable Markov decision process (POMDP), which allows a general reward based optimization objective and takes uncertainty in temporal evolution and partial observations into account. In addition to occlusion dependent observation and action success probabilities, our POMDP model also automatically adapts object specific action success probabilities. To cope with the changing system dynamics and performance constraints, we present a new online POMDP method based on particle filtering that produces compact policies. The approach is validated both in simulation and in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robotic Path Planning Algorithms · Robot Manipulation and Learning
