A Versatile Scene Model with Differentiable Visibility Applied to   Generative Pose Estimation

Helge Rhodin; Nadia Robertini; Christian Richardt; Hans-Peter Seidel,; Christian Theobalt

arXiv:1602.03725·cs.CV·February 12, 2016·1 cites

A Versatile Scene Model with Differentiable Visibility Applied to Generative Pose Estimation

Helge Rhodin, Nadia Robertini, Christian Richardt, Hans-Peter Seidel,, Christian Theobalt

PDF

Open Access

TL;DR

This paper introduces a novel scene representation with a differentiable visibility model that improves the accuracy and efficiency of generative pose estimation by better handling occlusions through a smooth, closed-form formulation.

Contribution

The authors propose a new scene model that provides an analytically differentiable visibility function, enabling more effective optimization in generative pose estimation tasks.

Findings

01

Enhanced convergence in pose optimization tasks

02

Improved occlusion handling in 3D reconstruction

03

Versatile application across multiple pose estimation problems

Abstract

Generative reconstruction methods compute the 3D configuration (such as pose and/or geometry) of a shape by optimizing the overlap of the projected 3D shape model with images. Proper handling of occlusions is a big challenge, since the visibility function that indicates if a surface point is seen from a camera can often not be formulated in closed form, and is in general discrete and non-differentiable at occlusion boundaries. We present a new scene representation that enables an analytically differentiable closed-form formulation of surface visibility. In contrast to previous methods, this yields smooth, analytically differentiable, and efficient to optimize pose similarity energies with rigorous occlusion handling, fewer local minima, and experimentally verified improved convergence of numerical optimization. The underlying idea is a new image formation model that represents opaque…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition