XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data Generation

Frank Bieder; Hendrik K\"onigshof; Haohao Hu; Fabian Immel; Yinzhe Shen; Jan-Hendrik Pauls; Christoph Stiller

arXiv:2601.14477·cs.CV·April 14, 2026

XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data Generation

Frank Bieder, Hendrik K\"onigshof, Haohao Hu, Fabian Immel, Yinzhe Shen, Jan-Hendrik Pauls, Christoph Stiller

PDF

TL;DR

XD-MAP introduces a novel cross-modal domain adaptation method that transfers semantic knowledge from images to LiDAR, significantly improving segmentation performance without manual labeling.

Contribution

The paper presents XD-MAP, a new approach that enables sensor-agnostic semantic transfer, extending perception capabilities from camera images to LiDAR without requiring sensor overlap.

Findings

01

Outperforms baseline by +19.5 mIoU in 2D semantic segmentation

02

Achieves +19.5 PQth in 2D panoptic segmentation

03

Improves 3D semantic segmentation by +32.3 mIoU

Abstract

Until open-world foundation models match the performance of specialized approaches, deep learning systems remain dependent on task- and sensor-specific data availability. To bridge the gap between available datasets and deployment domains, domain adaptation strategies are widely used. In this work, we propose XD-MAP, a novel approach to transfer sensor-specific knowledge from an image dataset to LiDAR, an entirely different sensing domain. Our method leverages detections on camera images to create a semantic parametric map. The map elements are modeled to produce pseudo labels in the target domain without any manual annotation effort. Unlike previous domain transfer approaches, our method does not require direct overlap between sensors and enables extending the angular perception range from a front-view camera to a full 360{\deg} view. On our large-scale road feature dataset, XD-MAP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.