CubifAE-3D: Monocular Camera Space Cubification for Auto-Encoder based   3D Object Detection

Shubham Shrivastava; Punarjay Chakravarty

arXiv:2006.04080·cs.CV·January 27, 2021·1 cites

CubifAE-3D: Monocular Camera Space Cubification for Auto-Encoder based 3D Object Detection

Shubham Shrivastava, Punarjay Chakravarty

PDF

Open Access

TL;DR

CubifAE-3D introduces a novel monocular 3D object detection method that leverages a pre-trained auto-encoder and a cubification of 3D space, enabling effective detection with minimal real data.

Contribution

The paper presents a new approach combining auto-encoder pre-training with space cubification for monocular 3D detection, reducing reliance on dense depth data.

Findings

01

Effective 3D detection using only RGB images and sparse labels

02

Pre-training on synthetic data improves real-world performance

03

Method achieves competitive results on KITTI datasets

Abstract

We introduce a method for 3D object detection using a single monocular image. Starting from a synthetic dataset, we pre-train an RGB-to-Depth Auto-Encoder (AE). The embedding learnt from this AE is then used to train a 3D Object Detector (3DOD) CNN which is used to regress the parameters of 3D object poses after the encoder from the AE generates a latent embedding from the RGB image. We show that we can pre-train the AE using paired RGB and depth images from simulation data once and subsequently only train the 3DOD network using real data, comprising of RGB images and 3D object pose labels (without the requirement of dense depth). Our 3DOD network utilizes a particular `cubification' of 3D space around the camera, where each cuboid is tasked with predicting N object poses, along with their class and confidence values. The AE pre-training and this method of dividing the 3D space around…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Autonomous Vehicle Technology and Safety

MethodsAutoencoders