PanDepth: Joint Panoptic Segmentation and Depth Completion

Juan Lagos; Esa Rahtu

arXiv:2212.14180·cs.CV·August 21, 2024

PanDepth: Joint Panoptic Segmentation and Depth Completion

Juan Lagos, Esa Rahtu

PDF

Open Access 1 Repo

TL;DR

PanDepth is a multi-task model that jointly performs panoptic segmentation and depth completion from RGB images and sparse depth data, achieving dense depth maps and segmentation with high accuracy and low computational cost.

Contribution

It introduces a novel multi-task framework that combines panoptic segmentation and depth completion, efficiently handling multiple scene understanding tasks simultaneously.

Findings

01

Successfully predicts dense depth maps from sparse inputs.

02

Performs accurate panoptic segmentation on each frame.

03

Operates with minimal increase in computational cost.

Abstract

Understanding 3D environments semantically is pivotal in autonomous driving applications where multiple computer vision tasks are involved. Multi-task models provide different types of outputs for a given scene, yielding a more holistic representation while keeping the computational cost low. We propose a multi-task model for panoptic segmentation and depth completion using RGB images and sparse depth maps. Our model successfully predicts fully dense depth maps and performs semantic segmentation, instance segmentation, and panoptic segmentation for every input frame. Extensive experiments were done on the Virtual KITTI 2 dataset and we demonstrate that our model solves multiple tasks, without a significant increase in computational cost, while keeping high accuracy performance. Code is available at https://github.com/juanb09111/PanDepth.git

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

juanb09111/pandepth
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Advanced Neural Network Applications