Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object   Pose Estimation

Mengchen Zhang; Tong Wu; Tai Wang; Tengfei Wang; Ziwei Liu; Dahua Lin

arXiv:2409.18261·cs.CV·March 24, 2025

Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation

Mengchen Zhang, Tong Wu, Tai Wang, Tengfei Wang, Ziwei Liu, Dahua Lin

PDF

Open Access 1 Repo

TL;DR

Omni6D is a large, diverse RGBD dataset with 166 categories and over 0.8 million captures, designed to advance category-level 6D object pose estimation in realistic scenarios.

Contribution

The paper introduces Omni6D, a comprehensive dataset with extensive categories and benchmarks, and proposes a symmetry-aware metric and fine-tuning method for improved pose estimation.

Findings

01

Benchmarking of existing algorithms on Omni6D

02

Introduction of a symmetry-aware evaluation metric

03

Effective fine-tuning approach for large vocabulary adaptation

Abstract

6D object pose estimation aims at determining an object's translation, rotation, and scale, typically from a single RGBD image. Recent advancements have expanded this estimation from instance-level to category-level, allowing models to generalize across unseen instances within the same category. However, this generalization is limited by the narrow range of categories covered by existing datasets, such as NOCS, which also tend to overlook common real-world challenges like occlusion. To tackle these challenges, we introduce Omni6D, a comprehensive RGBD dataset featuring a wide range of categories and varied backgrounds, elevating the task to a more realistic context. 1) The dataset comprises an extensive spectrum of 166 categories, 4688 instances adjusted to the canonical pose, and over 0.8 million captures, significantly broadening the scope for evaluation. 2) We introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

3dtopia/omni6d
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Hand Gesture Recognition Systems