LiDAR-Camera Fusion for Video Panoptic Segmentation without Video   Training

Fardin Ayar; Ehsan Javanmardi; Manabu Tsukada; Mahdi Javanmardi,; Mohammad Rahmati

arXiv:2412.20881·cs.CV·December 31, 2024

LiDAR-Camera Fusion for Video Panoptic Segmentation without Video Training

Fardin Ayar, Ehsan Javanmardi, Manabu Tsukada, Mahdi Javanmardi,, Mohammad Rahmati

PDF

Open Access

TL;DR

This paper introduces a LiDAR-camera fusion method that improves panoptic segmentation in autonomous vehicles, achieving higher accuracy without requiring video training data.

Contribution

It proposes a novel feature fusion module and two simple modifications that enhance video panoptic segmentation without additional video training.

Findings

01

Up to 5-point improvement in segmentation metrics

02

Effective fusion of LiDAR and image data for scene understanding

03

High-quality VPS achieved without video training

Abstract

Panoptic segmentation, which combines instance and semantic segmentation, has gained a lot of attention in autonomous vehicles, due to its comprehensive representation of the scene. This task can be applied for cameras and LiDAR sensors, but there has been a limited focus on combining both sensors to enhance image panoptic segmentation (PS). Although previous research has acknowledged the benefit of 3D data on camera-based scene perception, no specific study has explored the influence of 3D data on image and video panoptic segmentation (VPS).This work seeks to introduce a feature fusion module that enhances PS and VPS by fusing LiDAR and image data for autonomous vehicles. We also illustrate that, in addition to this fusion, our proposed model, which utilizes two simple modifications, can further deliver even more high-quality VPS without being trained on video data. The results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Fusion Techniques · Remote Sensing and LiDAR Applications · Advanced Optical Sensing Technologies

MethodsSoftmax · Attention Is All You Need · Focus