LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level   Object Pose Estimation

Ruida Zhang; Ziqin Huang; Gu Wang; Chenyangguang Zhang; Yan Di,; Xingxing Zuo; Jiwen Tang; and Xiangyang Ji

arXiv:2409.15727·cs.CV·September 25, 2024

LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation

Ruida Zhang, Ziqin Huang, Gu Wang, Chenyangguang Zhang, Yan Di,, Xingxing Zuo, Jiwen Tang, and Xiangyang Ji

PDF

Open Access 1 Repo

TL;DR

LaPose introduces a probabilistic shape modeling framework using Laplacian mixtures to improve RGB-based category-level object pose estimation, effectively handling shape uncertainty and scale ambiguity, and achieving state-of-the-art results.

Contribution

The paper proposes LaPose, a novel Laplacian mixture model for shape representation that enhances RGB-only pose estimation by explicitly modeling shape uncertainty and scale ambiguity.

Findings

01

Achieves state-of-the-art performance on NOCS datasets.

02

Effectively models shape uncertainty with Laplacian mixtures.

03

Robustly estimates pose without depth data.

Abstract

While RGBD-based methods for category-level object pose estimation hold promise, their reliance on depth data limits their applicability in diverse scenarios. In response, recent efforts have turned to RGB-based methods; however, they face significant challenges stemming from the absence of depth information. On one hand, the lack of depth exacerbates the difficulty in handling intra-class shape variation, resulting in increased uncertainty in shape predictions. On the other hand, RGB-only inputs introduce inherent scale ambiguity, rendering the estimation of object size and translation an ill-posed problem. To tackle these challenges, we propose LaPose, a novel framework that models the object shape as the Laplacian mixture model for Pose estimation. By representing each point as a probabilistic distribution, we explicitly quantify the shape uncertainty. LaPose leverages both a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lolrudy/lapose
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Robot Manipulation and Learning

MethodsPnP