OnePose: One-Shot Object Pose Estimation without CAD Models

Jiaming Sun; Zihao Wang; Siyu Zhang; Xingyi He; Hongcheng Zhao,; Guofeng Zhang; Xiaowei Zhou

arXiv:2205.12257·cs.CV·May 25, 2022

OnePose: One-Shot Object Pose Estimation without CAD Models

Jiaming Sun, Zihao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao,, Guofeng Zhang, Xiaowei Zhou

PDF

Open Access 1 Repo

TL;DR

OnePose introduces a novel object pose estimation method that does not require CAD models, using a visual localization-inspired approach with a graph attention network for real-time, category-agnostic 6D pose tracking from RGB videos.

Contribution

It presents a CAD-model-free, category-agnostic pose estimation framework utilizing a graph attention network for efficient 6D pose tracking from RGB videos.

Findings

01

Achieves real-time 6D pose detection and tracking.

02

Handles arbitrary object categories without CAD models.

03

Introduces a large-scale dataset with 150 objects and 450 sequences.

Abstract

We propose a new method named OnePose for object pose estimation. Unlike existing instance-level or category-level methods, OnePose does not rely on CAD models and can handle objects in arbitrary categories without instance- or category-specific network training. OnePose draws the idea from visual localization and only requires a simple RGB video scan of the object to build a sparse SfM model of the object. Then, this model is registered to new query images with a generic feature matching network. To mitigate the slow runtime of existing visual localization methods, we propose a new graph attention network that directly matches 2D interest points in the query image with the 3D points in the SfM model, resulting in efficient and robust pose estimation. Combined with a feature-based pose tracker, OnePose is able to stably detect and track 6D poses of everyday household objects in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zju3dv/OnePose
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Multimodal Machine Learning Applications