PointPatchRL -- Masked Reconstruction Improves Reinforcement Learning on Point Clouds
Bal\'azs Gyenes, Nikolai Franke, Philipp Becker, Gerhard Neumann

TL;DR
PointPatchRL introduces a transformer-based approach with masked reconstruction for reinforcement learning on point clouds, significantly enhancing performance in complex manipulation tasks involving deformable objects and geometric variations.
Contribution
The paper presents PointPatchRL, a novel transformer-based method with masked reconstruction for RL on point clouds, addressing previous limitations of simple encoder architectures.
Findings
Outperforms previous point-cloud RL architectures.
Achieves better results on complex manipulation tasks.
Effective on deformable objects and geometric variations.
Abstract
Perceiving the environment via cameras is crucial for Reinforcement Learning (RL) in robotics. While images are a convenient form of representation, they often complicate extracting important geometric details, especially with varying geometries or deformable objects. In contrast, point clouds naturally represent this geometry and easily integrate color and positional data from multiple camera views. However, while deep learning on point clouds has seen many recent successes, RL on point clouds is under-researched, with only the simplest encoder architecture considered in the literature. We introduce PointPatchRL (PPRL), a method for RL on point clouds that builds on the common paradigm of dividing point clouds into overlapping patches, tokenizing them, and processing the tokens with transformers. PPRL provides significant improvements compared with other point-cloud processing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Remote Sensing and LiDAR Applications · Image Processing and 3D Reconstruction
