Panoramic Out-of-Distribution Segmentation
Mengfei Duan, Yuheng Zhang, Yihong Cao, Fei Teng, Kai Luo, Jiaming Zhang, Kailun Yang, Zhiyong Li

TL;DR
This paper introduces PanOoS, a novel task for panoramic out-of-distribution segmentation, and proposes POS, a text-guided, prompt-based method that significantly improves outlier detection in panoramic images for safer scene understanding.
Contribution
The paper presents the first solution for Panoramic OoD segmentation, leveraging CLIP-based disentanglement and prompt learning to adapt to panoramic image distortions and clutter.
Findings
POS outperforms state-of-the-art pinhole-OoS methods in benchmarks.
Significant improvements in AuPRC and FPR95 metrics.
Establishment of two new panoramic OoD datasets for evaluation.
Abstract
Panoramic imaging enables capturing 360{\deg} images with an ultra-wide Field-of-View (FoV) for dense omnidirectional perception, which is critical to applications, such as autonomous driving and augmented reality, etc. However, current panoramic semantic segmentation methods fail to identify outliers, and pinhole Out-of-distribution Segmentation (OoS) models perform unsatisfactorily in the panoramic domain due to pixel distortions and background clutter. To address these issues, we introduce a new task, Panoramic Out-of-distribution Segmentation (PanOoS), with the aim of achieving comprehensive and safe scene understanding. Furthermore, we propose the first solution, POS, which adapts to the characteristics of panoramic images through text-guided prompt distribution learning. Specifically, POS integrates a disentanglement strategy designed to materialize the cross-domain generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing and 3D Reconstruction · Image and Object Detection Techniques · Generative Adversarial Networks and Image Synthesis
MethodsSoftmax · Attention Is All You Need · Contrastive Language-Image Pre-training
