Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

Max Coenen; Franz Rottensteiner

arXiv:2102.10681·cs.CV·February 23, 2021

Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

Max Coenen, Franz Rottensteiner

PDF

TL;DR

This paper introduces a probabilistic 3D vehicle reconstruction method from stereo images that uses a multi-task CNN to predict distributions for vehicle pose and shape, improving robustness and accuracy.

Contribution

The paper presents a novel multi-task CNN that outputs probability distributions for vehicle keypoints, edges, and orientation, integrated into a probabilistic framework for improved 3D reconstruction.

Findings

01

Achieves state-of-the-art results on KITTI benchmark

02

Reduces sensitivity to illumination and keypoint localization errors

03

Demonstrates effectiveness on new Stereo-Vehicle dataset

Abstract

The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.