RePOSE: Fast 6D Object Pose Refinement via Deep Texture Rendering
Shun Iwase, Xingyu Liu, Rawal Khirodkar, Rio Yokota, Kris M. Kitani

TL;DR
RePOSE introduces a rapid 6D object pose refinement method using deep texture rendering and differentiable optimization, achieving high accuracy and real-time performance for multiple objects.
Contribution
It proposes a novel deep texture rendering approach combined with differentiable Levenberg-Marquardt optimization for fast, accurate pose refinement.
Findings
Runs at 92 FPS, enabling real-time applications.
Achieves 51.6% accuracy on Occlusion LineMOD, surpassing previous methods.
Maintains high accuracy on YCB-Video dataset with faster runtime.
Abstract
We present RePOSE, a fast iterative refinement method for 6D object pose estimation. Prior methods perform refinement by feeding zoomed-in input and rendered RGB images into a CNN and directly regressing an update of a refined pose. Their runtime is slow due to the computational cost of CNN, which is especially prominent in multiple-object pose refinement. To overcome this problem, RePOSE leverages image rendering for fast feature extraction using a 3D model with a learnable texture. We call this deep texture rendering, which uses a shallow multi-layer perceptron to directly regress a view-invariant image representation of an object. Furthermore, we utilize differentiable Levenberg-Marquardt (LM) optimization to refine a pose fast and accurately by minimizing the feature-metric error between the input and rendered image representations without the need of zooming in. These image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · 3D Shape Modeling and Analysis
