Object Gaussian for Monocular 6D Pose Estimation from Sparse Views

Luqing Luo; Shichu Sun; Jiangang Yang; Linfang Zheng; Jinwei Du; Jian; Liu

arXiv:2409.02581·cs.CV·September 5, 2024

Object Gaussian for Monocular 6D Pose Estimation from Sparse Views

Luqing Luo, Shichu Sun, Jiangang Yang, Linfang Zheng, Jinwei Du, Jian, Liu

PDF

Open Access

TL;DR

SGPose is a novel framework that estimates 6D object poses from as few as ten sparse views without relying on CAD models, using Gaussian-based representations and geometric supervision.

Contribution

It introduces a CAD-model-free, Gaussian-based approach for sparse view 6D pose estimation that outperforms existing methods on benchmark datasets.

Findings

01

Outperforms existing methods on Occlusion LM-O dataset

02

Effective with as few as ten input views

03

Does not rely on CAD models or SfM pipeline geometry

Abstract

Monocular object pose estimation, as a pivotal task in computer vision and robotics, heavily depends on accurate 2D-3D correspondences, which often demand costly CAD models that may not be readily available. Object 3D reconstruction methods offer an alternative, among which recent advancements in 3D Gaussian Splatting (3DGS) afford a compelling potential. Yet its performance still suffers and tends to overfit with fewer input views. Embracing this challenge, we introduce SGPose, a novel framework for sparse view object pose estimation using Gaussian-based methods. Given as few as ten views, SGPose generates a geometric-aware representation by starting with a random cuboid initialization, eschewing reliance on Structure-from-Motion (SfM) pipeline-derived geometry as required by traditional 3DGS methods. SGPose removes the dependence on CAD models by regressing dense 2D-3D correspondences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Object Detection Techniques · Advanced Vision and Imaging · Robotics and Sensor-Based Localization