# Transferable Semi-supervised 3D Object Detection from RGB-D Data

**Authors:** Yew Siang Tang, Gim Hee Lee

arXiv: 1904.10300 · 2019-04-24

## TL;DR

This paper introduces a semi-supervised 3D object detection method that transfers knowledge from classes with 3D labels to those with only 2D labels, improving detection accuracy on RGB-D datasets.

## Contribution

The authors propose a novel transferable semi-supervised model with specialized loss functions and a Box-to-Point Cloud Fit network for effective knowledge transfer.

## Key findings

- Outperforms baseline methods on SUN-RGBD and KITTI datasets.
- Enables 3D detection for classes with only 2D annotations.
- Enhances fully-supervised methods with the Box-to-Point Cloud Fit network.

## Abstract

We investigate the direction of training a 3D object detector for new object classes from only 2D bounding box labels of these new classes, while simultaneously transferring information from 3D bounding box labels of the existing classes. To this end, we propose a transferable semi-supervised 3D object detection model that learns a 3D object detector network from training data with two disjoint sets of object classes - a set of strong classes with both 2D and 3D box labels, and another set of weak classes with only 2D box labels. In particular, we suggest a relaxed reprojection loss, box prior loss and a Box-to-Point Cloud Fit network that allow us to effectively transfer useful 3D information from the strong classes to the weak classes during training, and consequently, enable the network to detect 3D objects in the weak classes during inference. Experimental results show that our proposed algorithm outperforms baseline approaches and achieves promising results compared to fully-supervised approaches on the SUN-RGBD and KITTI datasets. Furthermore, we show that our Box-to-Point Cloud Fit network improves performances of the fully-supervised approaches on both datasets.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.10300/full.md

## Figures

24 figures with captions in the complete paper: https://tomesphere.com/paper/1904.10300/full.md

## References

42 references — full list in the complete paper: https://tomesphere.com/paper/1904.10300/full.md

---
Source: https://tomesphere.com/paper/1904.10300