On Representation of 3D Rotation in the Context of Deep Learning

Vikt\'oria Pravdov\'a; Luk\'a\v{s} Gajdo\v{s}ech; Hassan Ali; Viktor; Kocur

arXiv:2410.10350·cs.CV·October 16, 2024

On Representation of 3D Rotation in the Context of Deep Learning

Vikt\'oria Pravdov\'a, Luk\'a\v{s} Gajdo\v{s}ech, Hassan Ali, Viktor, Kocur

PDF

TL;DR

This paper compares different 3D rotation representations in deep learning, showing that continuous 5D and 6D representations outperform others in rotation estimation tasks across synthetic and real datasets.

Contribution

It provides a comprehensive evaluation of rotation representations and their effects on neural network performance in 3D rotation estimation.

Findings

01

5D and 6D representations outperform discontinuous ones

02

Continuous representations improve learning stability

03

Texture and data distribution influence estimation accuracy

Abstract

This paper investigates various methods of representing 3D rotations and their impact on the learning process of deep neural networks. We evaluated the performance of ResNet18 networks for 3D rotation estimation using several rotation representations and loss functions on both synthetic and real data. The real datasets contained 3D scans of industrial bins, while the synthetic datasets included views of a simple asymmetric object rendered under different rotations. On synthetic data, we also assessed the effects of different rotation distributions within the training and test sets, as well as the impact of the object's texture. In line with previous research, we found that networks using the continuous 5D and 6D representations performed better than the discontinuous ones.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.