TL;DR
This paper introduces SARR, a novel rotation representation tailored for symmetric objects in pose estimation, enabling standard CNNs to produce symmetry-aware 3D orientations without requiring 3D models.
Contribution
The authors propose SARR, a continuous and unique rotation representation for symmetric objects, improving pose estimation accuracy and symmetry handling in deep learning models.
Findings
SARR outperforms state-of-the-art methods on T-LESS and ITODD datasets.
Networks trained with SARR achieve higher symmetry-sensitive accuracy.
The method works with depth and texture-less RGB images without 3D models.
Abstract
Symmetric objects are common in daily life and industry, yet their inherent orientation ambiguities that impede the training of deep learning networks for pose estimation are rarely discussed in the literature. To cope with these ambiguities, existing solutions typically require the design of specific loss functions and network architectures or resort to symmetry-invariant evaluation metrics. In contrast, we focus on the numeric representation of the rotation itself, modifying trigonometric identities with the degrees of symmetry derived from the objects' shapes. We use our representation, SARR, to obtain canonic (symmetry-resolved) poses for the symmetric objects in two popular 6D pose estimation datasets, T-LESS and ITODD, where SARR is unique and continuous w.r.t. the visual appearance. This allows us to use a standard CNN for 3D orientation estimation whose performance is evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
