Towards Real-World Category-level Articulation Pose Estimation
Liu Liu, Han Xue, Wenqiang Xu, Haoyuan Fu, Cewu Lu

TL;DR
This paper introduces a new real-world setting for category-level articulation pose estimation, along with datasets and a framework that handle multiple instances and varied kinematic structures in complex environments.
Contribution
It proposes the CAPER task setting, creates a comprehensive dataset and model repository, and develops ReArtNOCS for efficient multi-instance pose estimation in real-world scenarios.
Findings
ReArtNOCS achieves good performance on CAPER and CAPE datasets.
The datasets support varied kinematic structures and multiple instances.
The framework enables efficient part-level pose estimation in complex scenes.
Abstract
Human life is populated with articulated objects. Current Category-level Articulation Pose Estimation (CAPE) methods are studied under the single-instance setting with a fixed kinematic structure for each category. Considering these limitations, we reform this problem setting for real-world environments and suggest a CAPE-Real (CAPER) task setting. This setting allows varied kinematic structures within a semantic category, and multiple instances to co-exist in an observation of real world. To support this task, we build an articulated model repository ReArt-48 and present an efficient dataset generation pipeline, which contains Fast Articulated Object Modeling (FAOM) and Semi-Authentic MixEd Reality Technique (SAMERT). Accompanying the pipeline, we build a large-scale mixed reality dataset ReArtMix and a real world dataset ReArtVal. We also propose an effective framework ReArtNOCS that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Hand Gesture Recognition Systems
