Depth- and Semantics-aware Multi-modal Domain Translation: Generating 3D   Panoramic Color Images from LiDAR Point Clouds

Tiago Cortinhal; Eren Erdal Aksoy

arXiv:2302.07661·cs.CV·March 19, 2024

Depth- and Semantics-aware Multi-modal Domain Translation: Generating 3D Panoramic Color Images from LiDAR Point Clouds

Tiago Cortinhal, Eren Erdal Aksoy

PDF

Open Access 1 Repo

TL;DR

This paper introduces TITAN-Next, a novel depth- and semantics-aware generative model that translates LiDAR point clouds into RGB-D images, improving cross-modal translation for autonomous vehicle applications.

Contribution

It presents the first framework leveraging scene semantics for LiDAR-to-camera image translation, enhancing accuracy and practical utility in autonomous driving.

Findings

01

Outperforms previous models by 23.7% IoU on Semantic-KITTI dataset.

02

Leverages scene semantics as a mid-level representation.

03

Demonstrates practical applications in autonomous vehicle safety and data augmentation.

Abstract

This work presents a new depth- and semantics-aware conditional generative model, named TITAN-Next, for cross-domain image-to-image translation in a multi-modal setup between LiDAR and camera sensors. The proposed model leverages scene semantics as a mid-level representation and is able to translate raw LiDAR point clouds to RGB-D camera images by solely relying on semantic scene segments. We claim that this is the first framework of its kind and it has practical applications in autonomous vehicles such as providing a fail-safe mechanism and augmenting available data in the target image domain. The proposed model is evaluated on the large-scale and challenging Semantic-KITTI dataset, and experimental findings show that it considerably outperforms the original TITAN-Net and other strong baselines by 23.7 $%$ margin in terms of IoU.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tiagocortinhal/titan-next
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging