3D Pose Regression using Convolutional Neural Networks

Siddharth Mahendran; Haider Ali; Rene Vidal

arXiv:1708.05628·cs.CV·August 21, 2017

3D Pose Regression using Convolutional Neural Networks

Siddharth Mahendran, Haider Ali, Rene Vidal

PDF

TL;DR

This paper introduces a CNN-based regression method for 3D pose estimation that models the continuous pose space, demonstrating competitive results on PASCAL3D+ by leveraging geometry-aware representations and data augmentation.

Contribution

It proposes a novel CNN regression framework for 3D pose estimation that better captures the continuous nature of pose space, improving upon discretization-based methods.

Findings

01

Achieves competitive accuracy on PASCAL3D+ dataset.

02

Outperforms pose classification approaches in certain metrics.

03

Utilizes geometry-aware loss functions and data augmentation.

Abstract

3D pose estimation is a key component of many important computer vision tasks such as autonomous navigation and 3D scene understanding. Most state-of-the-art approaches to 3D pose estimation solve this problem as a pose-classification problem in which the pose space is discretized into bins and a CNN classifier is used to predict a pose bin. We argue that the 3D pose space is continuous and propose to solve the pose estimation problem in a CNN regression framework with a suitable representation, data augmentation and loss function that captures the geometry of the pose space. Experiments on PASCAL3D+ show that the proposed 3D pose regression approach achieves competitive performance compared to the state-of-the-art.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.