# Normalized Object Coordinate Space for Category-Level 6D Object Pose and   Size Estimation

**Authors:** He Wang, Srinath Sridhar, Jingwei Huang, Julien Valentin, Shuran Song,, Leonidas J. Guibas

arXiv: 1901.02970 · 2019-06-25

## TL;DR

This paper introduces a category-level 6D object pose and size estimation method using a shared normalized coordinate space, enabling pose estimation of unseen objects without requiring exact CAD models, and demonstrates state-of-the-art results.

## Contribution

The paper proposes a novel Normalized Object Coordinate Space (NOCS) for category-level pose estimation and a training technique with synthetic and real data, advancing beyond instance-specific methods.

## Key findings

- Robust estimation of pose and size for unseen objects in cluttered scenes.
- State-of-the-art performance on standard benchmarks.
- Effective training with synthetic and real annotated data.

## Abstract

The goal of this paper is to estimate the 6D pose and dimensions of unseen object instances in an RGB-D image. Contrary to "instance-level" 6D pose estimation tasks, our problem assumes that no exact object CAD models are available during either training or testing time. To handle different and unseen object instances in a given category, we introduce a Normalized Object Coordinate Space (NOCS)---a shared canonical representation for all possible object instances within a category. Our region-based neural network is then trained to directly infer the correspondence from observed pixels to this shared object representation (NOCS) along with other object information such as class label and instance mask. These predictions can be combined with the depth map to jointly estimate the metric 6D pose and dimensions of multiple objects in a cluttered scene. To train our network, we present a new context-aware technique to generate large amounts of fully annotated mixed reality data. To further improve our model and evaluate its performance on real data, we also provide a fully annotated real-world dataset with large environment and instance variation. Extensive experiments demonstrate that the proposed method is able to robustly estimate the pose and size of unseen object instances in real environments while also achieving state-of-the-art performance on standard 6D pose estimation benchmarks.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1901.02970/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1901.02970/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/1901.02970/full.md

---
Source: https://tomesphere.com/paper/1901.02970