Hierarchical Policies for Cluttered-Scene Grasping with Latent Plans
Lirui Wang, Xiangyun Meng, Yu Xiang, Dieter Fox

TL;DR
This paper introduces a hierarchical, end-to-end learning framework for 6D robotic grasping in cluttered scenes, leveraging latent plan representations and reinforcement learning to improve collision avoidance and generalization.
Contribution
It presents a novel hierarchical approach combining latent plan encoding, variational autoencoders, and reinforcement learning for robust cluttered-scene grasping.
Findings
Outperforms baselines in simulation
Generalizes to real-world cluttered scenes
Effective in complex, obstacle-rich environments
Abstract
6D grasping in cluttered scenes is a longstanding problem in robotic manipulation. Open-loop manipulation pipelines may fail due to inaccurate state estimation, while most end-to-end grasping methods have not yet scaled to complex scenes with obstacles. In this work, we propose a new method for end-to-end learning of 6D grasping in cluttered scenes. Our hierarchical framework learns collision-free target-driven grasping based on partial point cloud observations. We learn an embedding space to encode expert grasping plans during training and a variational autoencoder to sample diverse grasping trajectories at test time. Furthermore, we train a critic network for plan selection and an option classifier for switching to an instance grasping policy through hierarchical reinforcement learning. We evaluate our method and compare against several baselines in simulation, as well as demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Soft Robotics and Applications
