How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation
Yining Pan, Qiongjie Cui, Xulei Yang, Na Zhao

TL;DR
This paper introduces IAL, a multi-modal 3D panoptic segmentation framework that effectively combines LiDAR and image data through synchronized augmentation, advanced fusion, and prior-based query generation, achieving state-of-the-art results.
Contribution
The paper proposes a novel multi-modal framework with synchronized data augmentation, a transformer-based decoder, and innovative fusion and query modules for improved 3D segmentation.
Findings
Achieves state-of-the-art performance on benchmark datasets.
Effectively aligns LiDAR and image data during augmentation.
Enhances segmentation accuracy with novel fusion and query strategies.
Abstract
LiDAR-based 3D panoptic segmentation often struggles with the inherent sparsity of data from LiDAR sensors, which makes it challenging to accurately recognize distant or small objects. Recently, a few studies have sought to overcome this challenge by integrating LiDAR inputs with camera images, leveraging the rich and dense texture information provided by the latter. While these approaches have shown promising results, they still face challenges, such as misalignment during data augmentation and the reliance on post-processing steps. To address these issues, we propose Image-Assists-LiDAR (IAL), a novel multi-modal 3D panoptic segmentation framework. In IAL, we first introduce a modality-synchronized data augmentation strategy, PieAug, to ensure alignment between LiDAR and image inputs from the start. Next, we adopt a transformer decoder to directly predict panoptic segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage
MethodsADaptive gradient method with the OPTimal convergence rate
