GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation
Wenbo Cui, Chengyang Zhao, Songlin Wei, Jiazhao Zhang, Haoran Geng,, Yaran Chen, Haoran Li, He Wang

TL;DR
This paper introduces a large-scale, part-centric dataset for articulated object manipulation that enhances depth perception and interaction pose prediction, enabling more robust and generalizable manipulation in household scenarios.
Contribution
The paper presents a novel dataset with detailed part annotations and a modular framework that improves manipulation performance over existing methods.
Findings
Dataset significantly improves depth estimation accuracy.
Enhanced interaction pose prediction in simulation and real-world.
Framework achieves superior robustness and generalization.
Abstract
Effectively manipulating articulated objects in household scenarios is a crucial step toward achieving general embodied artificial intelligence. Mainstream research in 3D vision has primarily focused on manipulation through depth perception and pose detection. However, in real-world environments, these methods often face challenges due to imperfect depth perception, such as with transparent lids and reflective handles. Moreover, they generally lack the diversity in part-based interactions required for flexible and adaptable manipulation. To address these challenges, we introduced a large-scale part-centric dataset for articulated object manipulation that features both photo-realistic material randomization and detailed annotations of part-oriented, scene-level actionable interaction poses. We evaluated the effectiveness of our dataset by integrating it with several state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Industrial Vision Systems and Defect Detection · 3D Surveying and Cultural Heritage
