# Instance- and Category-level 6D Object Pose Estimation

**Authors:** Caner Sahin, Guillermo Garcia-Hernando, Juil Sock, Tae-Kyun Kim

arXiv: 1903.04229 · 2019-03-20

## TL;DR

This paper reviews advances in 6D object pose estimation, focusing on instance- and category-level methods, highlighting progress with depth sensors and neural networks, and discussing challenges like occlusion and shape variation.

## Contribution

It provides a comprehensive overview of recent methods for 6D pose estimation at both instance and category levels, emphasizing new neural network approaches and ongoing challenges.

## Key findings

- Depth sensors have significantly improved pose estimation accuracy.
- Neural network-based methods excel in instance-level pose recovery.
- Category-level pose estimation faces challenges like shape variation and domain shift.

## Abstract

6D object pose estimation is an important task that determines the 3D position and 3D rotation of an object in camera-centred coordinates. By utilizing such a task, one can propose promising solutions for various problems related to scene understanding, augmented reality, control and navigation of robotics. Recent developments on visual depth sensors and low-cost availability of depth data significantly facilitate object pose estimation. Using depth information from RGB-D sensors, substantial progress has been made in the last decade by the methods addressing the challenges such as viewpoint variability, occlusion and clutter, and similar looking distractors. Particularly, with the recent advent of convolutional neural networks, RGB-only based solutions have been presented. However, improved results have only been reported for recovering the pose of known instances, i.e., for the instance-level object pose estimation tasks. More recently, state-of-the-art approaches target to solve object pose estimation problem at the level of categories, recovering the 6D pose of unknown instances. To this end, they address the challenges of the category-level tasks such as distribution shift among source and target domains, high intra-class variations, and shape discrepancies between objects.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.04229/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/1903.04229/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/1903.04229/full.md

---
Source: https://tomesphere.com/paper/1903.04229