TL;DR
This paper presents a novel object-centric visual affordance approach integrated with deep reinforcement learning to enable dexterous robotic grasping, improving generalization, efficiency, and robustness over traditional methods.
Contribution
It introduces an image-based, object-centric prior within reinforcement learning for dexterous grasping, enabling better generalization and faster training on unseen objects.
Findings
Policies are 3x faster to train than baselines.
Affordance-guided policies outperform traditional methods.
System generalizes well to novel objects.
Abstract
Dexterous robotic hands are appealing for their agility and human-like morphology, yet their high degree of freedom makes learning to manipulate challenging. We introduce an approach for learning dexterous grasping. Our key idea is to embed an object-centric visual affordance model within a deep reinforcement learning loop to learn grasping policies that favor the same object regions favored by people. Unlike traditional approaches that learn from human demonstration trajectories (e.g., hand joint sequences captured with a glove), the proposed prior is object-centric and image-based, allowing the agent to anticipate useful affordance regions for objects unseen during policy learning. We demonstrate our idea with a 30-DoF five-fingered robotic hand simulator on 40 objects from two datasets, where it successfully and efficiently learns policies for stable functional grasps. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
