GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic   Articulated Object Manipulation

Wenbo Cui; Chengyang Zhao; Songlin Wei; Jiazhao Zhang; Haoran Geng,; Yaran Chen; Haoran Li; He Wang

arXiv:2411.18276·cs.RO·March 24, 2025

GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation

Wenbo Cui, Chengyang Zhao, Songlin Wei, Jiazhao Zhang, Haoran Geng,, Yaran Chen, Haoran Li, He Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a large-scale, part-centric dataset for articulated object manipulation that enhances depth perception and interaction pose prediction, enabling more robust and generalizable manipulation in household scenarios.

Contribution

The paper presents a novel dataset with detailed part annotations and a modular framework that improves manipulation performance over existing methods.

Findings

01

Dataset significantly improves depth estimation accuracy.

02

Enhanced interaction pose prediction in simulation and real-world.

03

Framework achieves superior robustness and generalization.

Abstract

Effectively manipulating articulated objects in household scenarios is a crucial step toward achieving general embodied artificial intelligence. Mainstream research in 3D vision has primarily focused on manipulation through depth perception and pose detection. However, in real-world environments, these methods often face challenges due to imperfect depth perception, such as with transparent lids and reflective handles. Moreover, they generally lack the diversity in part-based interactions required for flexible and adaptable manipulation. To address these challenges, we introduced a large-scale part-centric dataset for articulated object manipulation that features both photo-realistic material randomization and detailed annotations of part-oriented, scene-level actionable interaction poses. We evaluated the effectiveness of our dataset by integrating it with several state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

roboverseorg/roboverse
jax

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing and 3D Reconstruction · Industrial Vision Systems and Defect Detection · 3D Surveying and Cultural Heritage