LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation
Ruida Zhang, Ziqin Huang, Gu Wang, Chenyangguang Zhang, Yan Di,, Xingxing Zuo, Jiwen Tang, and Xiangyang Ji

TL;DR
LaPose introduces a probabilistic shape modeling framework using Laplacian mixtures to improve RGB-based category-level object pose estimation, effectively handling shape uncertainty and scale ambiguity, and achieving state-of-the-art results.
Contribution
The paper proposes LaPose, a novel Laplacian mixture model for shape representation that enhances RGB-only pose estimation by explicitly modeling shape uncertainty and scale ambiguity.
Findings
Achieves state-of-the-art performance on NOCS datasets.
Effectively models shape uncertainty with Laplacian mixtures.
Robustly estimates pose without depth data.
Abstract
While RGBD-based methods for category-level object pose estimation hold promise, their reliance on depth data limits their applicability in diverse scenarios. In response, recent efforts have turned to RGB-based methods; however, they face significant challenges stemming from the absence of depth information. On one hand, the lack of depth exacerbates the difficulty in handling intra-class shape variation, resulting in increased uncertainty in shape predictions. On the other hand, RGB-only inputs introduce inherent scale ambiguity, rendering the estimation of object size and translation an ill-posed problem. To tackle these challenges, we propose LaPose, a novel framework that models the object shape as the Laplacian mixture model for Pose estimation. By representing each point as a probabilistic distribution, we explicitly quantify the shape uncertainty. LaPose leverages both a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Robot Manipulation and Learning
MethodsPnP
