Probabilistic Vehicle Reconstruction Using a Multi-Task CNN
Max Coenen, Franz Rottensteiner

TL;DR
This paper introduces a probabilistic 3D vehicle reconstruction method from stereo images that uses a multi-task CNN to predict distributions for vehicle pose and shape, improving robustness and accuracy.
Contribution
The paper presents a novel multi-task CNN that outputs probability distributions for vehicle keypoints, edges, and orientation, integrated into a probabilistic framework for improved 3D reconstruction.
Findings
Achieves state-of-the-art results on KITTI benchmark
Reduces sensitivity to illumination and keypoint localization errors
Demonstrates effectiveness on new Stereo-Vehicle dataset
Abstract
The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
