Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning

Junwen He, Yifan Wang, Lijun Wang, Huchuan Lu, Jun-Yan He, Jin-Peng Lan, Bin Luo, Yifeng Geng, Xuansong Xie

arXiv:2307.14786 · cs.CV · August 15, 2023 · 1 citation

TL;DR

This paper introduces a deeply unified framework for depth-aware panoptic segmentation that jointly performs segmentation and depth estimation, leveraging bi-directional guidance learning and geometric query enhancement to improve scene understanding.

Contribution

The paper presents a unified approach in which segmentation and depth estimation are both predicted per segment from the same object queries, supported by geometric query enhancement and bi-directional guidance learning.

Findings

01. Achieves new state-of-the-art results on the Cityscapes-DVPS and SemKITTI-DVPS datasets.

02. Bi-directional guidance learning improves performance even with incomplete labels.

03. Integrates scene geometry into object queries for better cross-task feature learning.

Abstract

Depth-aware panoptic segmentation is an emerging topic in computer vision which combines semantic and geometric understanding for more robust scene interpretation. Recent works pursue unified frameworks to tackle this challenge but mostly still treat it as two individual learning tasks, which limits their potential for exploring cross-domain information. We propose a deeply unified framework for depth-aware panoptic segmentation, which performs joint segmentation and depth estimation both in a per-segment manner with identical object queries. To narrow the gap between the two tasks, we further design a geometric query enhancement method, which is able to integrate scene geometry into object queries using latent representations. In addition, we propose a bi-directional guidance learning approach to facilitate cross-task feature learning by taking advantage of their mutual relations. Our…
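
The per-segment idea in the abstract (one set of object queries producing both a mask and a depth map for each segment) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function name `per_segment_predict`, the projection matrices `w_mask` and `w_depth`, the sigmoid depth mapping, the shared pixel features, and the `max_depth` value of 80 are all assumptions made for illustration. Geometric query enhancement and bi-directional guidance learning are not modelled.

```python
import numpy as np


def per_segment_predict(queries, pixel_feats, w_mask, w_depth, max_depth=80.0):
    """Hypothetical per-segment head shared by segmentation and depth.

    queries:      (Q, C) object queries, one per candidate segment
    pixel_feats:  (C, H, W) per-pixel features (shared by both tasks here,
                  which is a simplification)
    w_mask/w_depth: (C, C) task-specific projections of the same queries
    """
    # The same queries are projected into a mask embedding and a depth embedding.
    mask_embed = queries @ w_mask    # (Q, C)
    depth_embed = queries @ w_depth  # (Q, C)

    # Each query yields one mask logit map and one depth logit map.
    mask_logits = np.einsum("qc,chw->qhw", mask_embed, pixel_feats)
    depth_logits = np.einsum("qc,chw->qhw", depth_embed, pixel_feats)

    # Assumed depth mapping: sigmoid scaled to [0, max_depth].
    seg_depth = max_depth / (1.0 + np.exp(-depth_logits))  # (Q, H, W)

    # Each pixel is assigned to the query with the highest mask logit,
    # and takes its depth from that same query's depth map.
    segment_id = mask_logits.argmax(axis=0)  # (H, W)
    depth = np.take_along_axis(seg_depth, segment_id[None], axis=0)[0]
    return segment_id, depth


# Toy usage with random inputs (shapes are arbitrary).
rng = np.random.default_rng(0)
Q, C, H, W = 4, 8, 6, 10
queries = rng.normal(size=(Q, C))
pixel_feats = rng.normal(size=(C, H, W))
w_mask = rng.normal(size=(C, C))
w_depth = rng.normal(size=(C, C))

segment_id, depth = per_segment_predict(queries, pixel_feats, w_mask, w_depth)
print(segment_id.shape, depth.shape)
```

Because the segment assignment and the depth value at a pixel come from the same query, the two outputs stay consistent along segment boundaries, which is the motivation for sharing queries in this sketch.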

Code & Models

Repositories

jwh97nn/DeepDPS (PyTorch, official)

Taxonomy

Topics: Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging