You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping
Jingshun Huang, Haitao Lin, Tianyu Wang, Yanwei Fu, Yu-Gang Jiang, Xiangyang Xue

TL;DR
This paper introduces YOEO, a real-time, single-stage method for category-level articulated object 6D pose estimation, enabling robotic grasping with high efficiency and accuracy in real-world scenarios.
Contribution
The paper presents YOEO, a novel end-to-end network that estimates object pose and segmentation simultaneously, reducing computational costs and improving real-time performance.
Findings
Achieves real-time pose estimation at 200Hz.
Demonstrates effective pose estimation on the GAPart dataset.
Enables a robot to interact with unseen objects in real-world settings.
Abstract
This paper addresses the problem of category-level pose estimation for articulated objects in robotic manipulation tasks. Recent works have shown promising results in estimating part pose and size at the category level. However, these approaches primarily follow a complex multi-stage pipeline that first segments part instances in the point cloud and then estimates the Normalized Part Coordinate Space (NPCS) representation for 6D poses. These approaches suffer from high computational costs and low performance in real-time robotic tasks. To address these limitations, we propose YOEO, a single-stage method that simultaneously outputs instance segmentation and NPCS representations in an end-to-end manner. We use a unified network to generate point-wise semantic labels and centroid offsets, allowing points from the same part instance to vote for the same centroid. We further utilize a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Robotic Path Planning Algorithms
MethodsALIGN
