TL;DR
This paper introduces TOSC, a novel approach for task-oriented shape completion that improves dexterous grasping in open-world scenarios with partial data by focusing on contact regions and leveraging foundation models.
Contribution
It proposes a new task-oriented shape completion method guided by manipulation tasks, utilizing foundation models, a discriminative autoencoder, and a flow-matching grasp generator.
Findings
Achieves state-of-the-art in task-oriented grasping and shape completion.
Improves Grasp Displacement by 16.17% and Chamfer Distance by 55.26%.
Excels in grasping objects with severe missing data and generalizes across categories.
Abstract
Task-oriented dexterous grasping remains challenging in robotic manipulations of open-world objects under severe partial observation, where significant missing data invalidates generic shape completion. In this paper, to overcome this limitation, we study Task-Oriented Shape Completion, a new task that focuses on completing the potential contact regions rather than the entire shape. We argue that shape completion for grasping should be explicitly guided by the downstream manipulation task. To achieve this, we first generate multiple task-oriented shape completion candidates by leveraging the zero-shot capabilities of object functional understanding from several pre-trained foundation models. A 3D discriminative autoencoder is then proposed to evaluate the plausibility of each generated candidate and optimize the most plausible one from a global perspective. A conditional flow-matching…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
